An efficient restoration algorithm for the historic middle-age Persian (Pahlavi) manuscripts

This paper aims to provide a restoration algorithm for the Pahlavi or middle-age Persian manuscript. This is the preliminary document processing view to this area. The central idea is based on the morphological analysis and connected component concept. The proposed algorithm uses the mathematical mo...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Alirezaee, S., Aghaeinia, H., Ahmadi, M., Faez, K.
Format: Tagungsbericht
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 2120 Vol. 3
container_issue
container_start_page 2114
container_title
container_volume 3
creator Alirezaee, S.
Aghaeinia, H.
Ahmadi, M.
Faez, K.
description This paper aims to provide a restoration algorithm for the Pahlavi or middle-age Persian manuscript. This is the preliminary document processing view to this area. The central idea is based on the morphological analysis and connected component concept. The proposed algorithm uses the mathematical morphology and connected component concept to segment the line, word, and character overlapped Pahlavi documents and prepares those texts for OCR application. To evaluate the performance of the algorithm, it has been tested on 200 pages of the Pahlavi documents. The algorithm has a good success on document restoration and segmentation. Numerical results indicate that the proposed algorithm can remove the noise and destructive effects. The results also show 99.14% accuracy on the baseline detection, 97.35% accuracy on the text line extraction and removing other lines overlaps, and 99.5% accuracy for segmenting the extracted text lines to their components.
doi_str_mv 10.1109/ICSMC.2005.1571461
format Conference Proceeding
fullrecord <record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_1571461</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>1571461</ieee_id><sourcerecordid>1571461</sourcerecordid><originalsourceid>FETCH-LOGICAL-i90t-5796d84362cd5f51ead7a5a24948fe1b70d8e0a85ebc59654f64eebcbfdda0fb3</originalsourceid><addsrcrecordid>eNotkEtLAzEYRYMPsNb-Ad1kqYupSSbPZRl8FCoW7MKFUDKTL53IPEoSBf-9FQsX7oELZ3ERuqZkTikx98vq7aWaM0LEnApFuaQnaMKEUgWVQpyimVGaHFIaZjQ_QxNKJCsMY-8X6DKlT0IY4VRP0MdiwOB9aAIMGUdIeYw2h3HAttuNMeS2x36MOLeA2_C3hgb3wbkOCrsDvIaYgh3w7dq2nf0Od7i3w1dqYtjndIXOve0SzI49RZvHh031XKxen5bVYlUEQ3IhlJFO81KyxgkvKFinrLCMG6490FoRp4FYLaBuhJGCe8nhwLV3zhJfl1N0868NALDdx9Db-LM9_lL-AqN6Vz0</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>An efficient restoration algorithm for the historic middle-age Persian (Pahlavi) manuscripts</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Alirezaee, S. ; Aghaeinia, H. ; Ahmadi, M. ; Faez, K.</creator><creatorcontrib>Alirezaee, S. ; Aghaeinia, H. ; Ahmadi, M. ; Faez, K.</creatorcontrib><description>This paper aims to provide a restoration algorithm for the Pahlavi or middle-age Persian manuscript. This is the preliminary document processing view to this area. The central idea is based on the morphological analysis and connected component concept. The proposed algorithm uses the mathematical morphology and connected component concept to segment the line, word, and character overlapped Pahlavi documents and prepares those texts for OCR application. To evaluate the performance of the algorithm, it has been tested on 200 pages of the Pahlavi documents. The algorithm has a good success on document restoration and segmentation. Numerical results indicate that the proposed algorithm can remove the noise and destructive effects. The results also show 99.14% accuracy on the baseline detection, 97.35% accuracy on the text line extraction and removing other lines overlaps, and 99.5% accuracy for segmenting the extracted text lines to their components.</description><identifier>ISSN: 1062-922X</identifier><identifier>ISBN: 9780780392984</identifier><identifier>ISBN: 0780392981</identifier><identifier>EISSN: 2577-1655</identifier><identifier>DOI: 10.1109/ICSMC.2005.1571461</identifier><language>eng</language><publisher>IEEE</publisher><subject>Character recognition ; Connected component ; Document restoration ; Entropy ; Gray-scale ; Handwriting recognition ; Image restoration ; Image segmentation ; Morphology ; Optical character recognition software ; Preprocessing ; Segmentation ; Testing ; Text recognition</subject><ispartof>2005 IEEE International Conference on Systems, Man and Cybernetics, 2005, Vol.3, p.2114-2120 Vol. 3</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/1571461$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,780,784,789,790,2058,4050,4051,27925,54920</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/1571461$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Alirezaee, S.</creatorcontrib><creatorcontrib>Aghaeinia, H.</creatorcontrib><creatorcontrib>Ahmadi, M.</creatorcontrib><creatorcontrib>Faez, K.</creatorcontrib><title>An efficient restoration algorithm for the historic middle-age Persian (Pahlavi) manuscripts</title><title>2005 IEEE International Conference on Systems, Man and Cybernetics</title><addtitle>ICSMC</addtitle><description>This paper aims to provide a restoration algorithm for the Pahlavi or middle-age Persian manuscript. This is the preliminary document processing view to this area. The central idea is based on the morphological analysis and connected component concept. The proposed algorithm uses the mathematical morphology and connected component concept to segment the line, word, and character overlapped Pahlavi documents and prepares those texts for OCR application. To evaluate the performance of the algorithm, it has been tested on 200 pages of the Pahlavi documents. The algorithm has a good success on document restoration and segmentation. Numerical results indicate that the proposed algorithm can remove the noise and destructive effects. The results also show 99.14% accuracy on the baseline detection, 97.35% accuracy on the text line extraction and removing other lines overlaps, and 99.5% accuracy for segmenting the extracted text lines to their components.</description><subject>Character recognition</subject><subject>Connected component</subject><subject>Document restoration</subject><subject>Entropy</subject><subject>Gray-scale</subject><subject>Handwriting recognition</subject><subject>Image restoration</subject><subject>Image segmentation</subject><subject>Morphology</subject><subject>Optical character recognition software</subject><subject>Preprocessing</subject><subject>Segmentation</subject><subject>Testing</subject><subject>Text recognition</subject><issn>1062-922X</issn><issn>2577-1655</issn><isbn>9780780392984</isbn><isbn>0780392981</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2005</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><sourceid>RIE</sourceid><recordid>eNotkEtLAzEYRYMPsNb-Ad1kqYupSSbPZRl8FCoW7MKFUDKTL53IPEoSBf-9FQsX7oELZ3ERuqZkTikx98vq7aWaM0LEnApFuaQnaMKEUgWVQpyimVGaHFIaZjQ_QxNKJCsMY-8X6DKlT0IY4VRP0MdiwOB9aAIMGUdIeYw2h3HAttuNMeS2x36MOLeA2_C3hgb3wbkOCrsDvIaYgh3w7dq2nf0Od7i3w1dqYtjndIXOve0SzI49RZvHh031XKxen5bVYlUEQ3IhlJFO81KyxgkvKFinrLCMG6490FoRp4FYLaBuhJGCe8nhwLV3zhJfl1N0868NALDdx9Db-LM9_lL-AqN6Vz0</recordid><startdate>2005</startdate><enddate>2005</enddate><creator>Alirezaee, S.</creator><creator>Aghaeinia, H.</creator><creator>Ahmadi, M.</creator><creator>Faez, K.</creator><general>IEEE</general><scope>6IE</scope><scope>6IH</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIO</scope></search><sort><creationdate>2005</creationdate><title>An efficient restoration algorithm for the historic middle-age Persian (Pahlavi) manuscripts</title><author>Alirezaee, S. ; Aghaeinia, H. ; Ahmadi, M. ; Faez, K.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i90t-5796d84362cd5f51ead7a5a24948fe1b70d8e0a85ebc59654f64eebcbfdda0fb3</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2005</creationdate><topic>Character recognition</topic><topic>Connected component</topic><topic>Document restoration</topic><topic>Entropy</topic><topic>Gray-scale</topic><topic>Handwriting recognition</topic><topic>Image restoration</topic><topic>Image segmentation</topic><topic>Morphology</topic><topic>Optical character recognition software</topic><topic>Preprocessing</topic><topic>Segmentation</topic><topic>Testing</topic><topic>Text recognition</topic><toplevel>online_resources</toplevel><creatorcontrib>Alirezaee, S.</creatorcontrib><creatorcontrib>Aghaeinia, H.</creatorcontrib><creatorcontrib>Ahmadi, M.</creatorcontrib><creatorcontrib>Faez, K.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan (POP) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP) 1998-present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Alirezaee, S.</au><au>Aghaeinia, H.</au><au>Ahmadi, M.</au><au>Faez, K.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>An efficient restoration algorithm for the historic middle-age Persian (Pahlavi) manuscripts</atitle><btitle>2005 IEEE International Conference on Systems, Man and Cybernetics</btitle><stitle>ICSMC</stitle><date>2005</date><risdate>2005</risdate><volume>3</volume><spage>2114</spage><epage>2120 Vol. 3</epage><pages>2114-2120 Vol. 3</pages><issn>1062-922X</issn><eissn>2577-1655</eissn><isbn>9780780392984</isbn><isbn>0780392981</isbn><abstract>This paper aims to provide a restoration algorithm for the Pahlavi or middle-age Persian manuscript. This is the preliminary document processing view to this area. The central idea is based on the morphological analysis and connected component concept. The proposed algorithm uses the mathematical morphology and connected component concept to segment the line, word, and character overlapped Pahlavi documents and prepares those texts for OCR application. To evaluate the performance of the algorithm, it has been tested on 200 pages of the Pahlavi documents. The algorithm has a good success on document restoration and segmentation. Numerical results indicate that the proposed algorithm can remove the noise and destructive effects. The results also show 99.14% accuracy on the baseline detection, 97.35% accuracy on the text line extraction and removing other lines overlaps, and 99.5% accuracy for segmenting the extracted text lines to their components.</abstract><pub>IEEE</pub><doi>10.1109/ICSMC.2005.1571461</doi></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 1062-922X
ispartof 2005 IEEE International Conference on Systems, Man and Cybernetics, 2005, Vol.3, p.2114-2120 Vol. 3
issn 1062-922X
2577-1655
language eng
recordid cdi_ieee_primary_1571461
source IEEE Electronic Library (IEL) Conference Proceedings
subjects Character recognition
Connected component
Document restoration
Entropy
Gray-scale
Handwriting recognition
Image restoration
Image segmentation
Morphology
Optical character recognition software
Preprocessing
Segmentation
Testing
Text recognition
title An efficient restoration algorithm for the historic middle-age Persian (Pahlavi) manuscripts
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-04T18%3A03%3A29IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=An%20efficient%20restoration%20algorithm%20for%20the%20historic%20middle-age%20Persian%20(Pahlavi)%20manuscripts&rft.btitle=2005%20IEEE%20International%20Conference%20on%20Systems,%20Man%20and%20Cybernetics&rft.au=Alirezaee,%20S.&rft.date=2005&rft.volume=3&rft.spage=2114&rft.epage=2120%20Vol.%203&rft.pages=2114-2120%20Vol.%203&rft.issn=1062-922X&rft.eissn=2577-1655&rft.isbn=9780780392984&rft.isbn_list=0780392981&rft_id=info:doi/10.1109/ICSMC.2005.1571461&rft_dat=%3Cieee_6IE%3E1571461%3C/ieee_6IE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=1571461&rfr_iscdi=true