Segmentation of Printed Urdu Scripts Using Structural Features

Character segmentation forms the basis for optical character recognition. In this paper, we have proposed a character segmentation approach for printed Urdu script. Urdu is cursive by nature and its script is written from right to left. Both these factors make the segmentation more difficult and req...

Bibliographic Details
Main Authors: Malik, H., Fahiem, M.A.
Format: Conference Proceeding
Language: eng
container_end_page 195
container_issue
container_start_page 191
container_title
container_volume
creator Malik, H.
Fahiem, M.A.
description Character segmentation forms the basis for optical character recognition. In this paper, we have proposed a character segmentation approach for printed Urdu script. Urdu is cursive by nature and its script is written from right to left. Both these factors make the segmentation more difficult and require special attention. Our approach is based on structural features and we have overcome different problems like over segmentation and under segmentation, present in previous approaches. We have achieved an accuracy rate of 99.4% which is better than others. The approach may be very useful for developing an optical character recognition system for Urdu language.
doi_str_mv 10.1109/VIZ.2009.12
format Conference Proceeding
fullrecord <record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_5230742</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>5230742</ieee_id><sourcerecordid>5230742</sourcerecordid><originalsourceid>FETCH-LOGICAL-i90t-c8f76efa8db02ac9cd97c12d532286b7bc68f95fceeab8c12f9dbe9b417953f83</originalsourceid><addsrcrecordid>eNotj0FLAzEUhANS0NaePHrJH9j1Jdkkm4sgxdpCQWFbD15KNnkpkXZbstmD_94tepqZb2BgCHlgUDIG5ulz_VVyAFMyfkOmoJWRQosKJmR6xQau8ZbM-_4bAJhRWkp1R54bPJywyzbHc0fPgX6k2GX0dJf8QBuX4iX3dNfH7kCbnAaXh2SPdIl2NNjfk0mwxx7n_zoj2-XrdrEqNu9v68XLpogGcuHqoBUGW_sWuHXGeaMd414KzmvV6tapOhgZHKJt67EJxrdo2orp8UWoxYw8_s1GRNxfUjzZ9LOXXICuuPgFWfVI9A</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Segmentation of Printed Urdu Scripts Using Structural Features</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Malik, H. ; Fahiem, M.A.</creator><creatorcontrib>Malik, H. ; Fahiem, M.A.</creatorcontrib><description>Character segmentation forms the basis for optical character recognition. In this paper, we have proposed a character segmentation approach for printed Urdu script. Urdu is cursive by nature and its script is written from right to left. Both these factors make the segmentation more difficult and require special attention. Our approach is based on structural features and we have overcome different problems like over segmentation and under segmentation, present in previous approaches. We have achieved an accuracy rate of 99.4% which is better than others. 
The approach may be very useful for developing an optical character recognition system for Urdu language.</description><identifier>ISBN: 0769537340</identifier><identifier>ISBN: 9780769537344</identifier><identifier>DOI: 10.1109/VIZ.2009.12</identifier><identifier>LCCN: 2009905373</identifier><language>eng</language><publisher>IEEE</publisher><subject>Character recognition ; Character segmentation ; Cursive scripts ; Educational institutions ; FCC ; Hidden Markov models ; Image segmentation ; Ligature ; Neural networks ; Optical character recognition software ; Pixel ; Shape ; Structural features ; Urdu alphabets ; Visualization</subject><ispartof>2009 Second International Conference in Visualisation, 2009, p.191-195</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/5230742$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,777,781,786,787,2052,27906,54901</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/5230742$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Malik, H.</creatorcontrib><creatorcontrib>Fahiem, M.A.</creatorcontrib><title>Segmentation of Printed Urdu Scripts Using Structural Features</title><title>2009 Second International Conference in Visualisation</title><addtitle>VIZ</addtitle><description>Character segmentation forms the basis for optical character recognition. In this paper, we have proposed a character segmentation approach for printed Urdu script. Urdu is cursive by nature and its script is written from right to left. Both these factors make the segmentation more difficult and require special attention. 
Our approach is based on structural features and we have overcome different problems like over segmentation and under segmentation, present in previous approaches. We have achieved an accuracy rate of 99.4% which is better than others. The approach may be very useful for developing an optical character recognition system for Urdu language.</description><subject>Character recognition</subject><subject>Character segmentation</subject><subject>Cursive scripts</subject><subject>Educational institutions</subject><subject>FCC</subject><subject>Hidden Markov models</subject><subject>Image segmentation</subject><subject>Ligature</subject><subject>Neural networks</subject><subject>Optical character recognition software</subject><subject>Pixel</subject><subject>Shape</subject><subject>Structural features</subject><subject>Urdu alphabets</subject><subject>Visualization</subject><isbn>0769537340</isbn><isbn>9780769537344</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2009</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><sourceid>RIE</sourceid><recordid>eNotj0FLAzEUhANS0NaePHrJH9j1Jdkkm4sgxdpCQWFbD15KNnkpkXZbstmD_94tepqZb2BgCHlgUDIG5ulz_VVyAFMyfkOmoJWRQosKJmR6xQau8ZbM-_4bAJhRWkp1R54bPJywyzbHc0fPgX6k2GX0dJf8QBuX4iX3dNfH7kCbnAaXh2SPdIl2NNjfk0mwxx7n_zoj2-XrdrEqNu9v68XLpogGcuHqoBUGW_sWuHXGeaMd414KzmvV6tapOhgZHKJt67EJxrdo2orp8UWoxYw8_s1GRNxfUjzZ9LOXXICuuPgFWfVI9A</recordid><startdate>200907</startdate><enddate>200907</enddate><creator>Malik, H.</creator><creator>Fahiem, M.A.</creator><general>IEEE</general><scope>6IE</scope><scope>6IL</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIL</scope></search><sort><creationdate>200907</creationdate><title>Segmentation of Printed Urdu Scripts Using Structural Features</title><author>Malik, H. 
; Fahiem, M.A.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i90t-c8f76efa8db02ac9cd97c12d532286b7bc68f95fceeab8c12f9dbe9b417953f83</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2009</creationdate><topic>Character recognition</topic><topic>Character segmentation</topic><topic>Cursive scripts</topic><topic>Educational institutions</topic><topic>FCC</topic><topic>Hidden Markov models</topic><topic>Image segmentation</topic><topic>Ligature</topic><topic>Neural networks</topic><topic>Optical character recognition software</topic><topic>Pixel</topic><topic>Shape</topic><topic>Structural features</topic><topic>Urdu alphabets</topic><topic>Visualization</topic><toplevel>online_resources</toplevel><creatorcontrib>Malik, H.</creatorcontrib><creatorcontrib>Fahiem, M.A.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP All) 1998-Present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Malik, H.</au><au>Fahiem, M.A.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Segmentation of Printed Urdu Scripts Using Structural Features</atitle><btitle>2009 Second International Conference in Visualisation</btitle><stitle>VIZ</stitle><date>2009-07</date><risdate>2009</risdate><spage>191</spage><epage>195</epage><pages>191-195</pages><isbn>0769537340</isbn><isbn>9780769537344</isbn><abstract>Character segmentation forms the basis for optical character recognition. 
In this paper, we have proposed a character segmentation approach for printed Urdu script. Urdu is cursive by nature and its script is written from right to left. Both these factors make the segmentation more difficult and require special attention. Our approach is based on structural features and we have overcome different problems like over segmentation and under segmentation, present in previous approaches. We have achieved an accuracy rate of 99.4% which is better than others. The approach may be very useful for developing an optical character recognition system for Urdu language.</abstract><pub>IEEE</pub><doi>10.1109/VIZ.2009.12</doi><tpages>5</tpages></addata></record>
fulltext fulltext_linktorsrc
identifier ISBN: 0769537340
ispartof 2009 Second International Conference in Visualisation, 2009, p.191-195
issn
language eng
recordid cdi_ieee_primary_5230742
source IEEE Electronic Library (IEL) Conference Proceedings
subjects Character recognition
Character segmentation
Cursive scripts
Educational institutions
FCC
Hidden Markov models
Image segmentation
Ligature
Neural networks
Optical character recognition software
Pixel
Shape
Structural features
Urdu alphabets
Visualization
title Segmentation of Printed Urdu Scripts Using Structural Features
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-18T05%3A43%3A57IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Segmentation%20of%20Printed%20Urdu%20Scripts%20Using%20Structural%20Features&rft.btitle=2009%20Second%20International%20Conference%20in%20Visualisation&rft.au=Malik,%20H.&rft.date=2009-07&rft.spage=191&rft.epage=195&rft.pages=191-195&rft.isbn=0769537340&rft.isbn_list=9780769537344&rft_id=info:doi/10.1109/VIZ.2009.12&rft_dat=%3Cieee_6IE%3E5230742%3C/ieee_6IE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=5230742&rfr_iscdi=true