A Robust Modified Character Segmentation Approach for the Handwritten Archaic Modi Documents
Character segmentation is an imperative step of intelligent handwritten archaic Modi document recognition system. Numbers of challenges like inconsistent and non-uniform handwritten characters, broken or degraded characters etc. are faced in the process of character segmentation due to age, cursive...
Gespeichert in:
Veröffentlicht in: | SN computer science 2024-06, Vol.5 (6), p.667, Article 667 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | 6 |
container_start_page | 667 |
container_title | SN computer science |
container_volume | 5 |
creator | Deshmukh, Manisha S. Kolhe, Satish R. |
description | Character segmentation is an imperative step of intelligent handwritten archaic Modi document recognition system. Numbers of challenges like inconsistent and non-uniform handwritten characters, broken or degraded characters etc. are faced in the process of character segmentation due to age, cursive and stylish writing nature of Modi script. This paper presents a modified preliminary segmentation step for segmentation of Modi intact characters and overlapping/touching characters cluster to overcome problem of under/bad segmentation. Here, a zone based three steps Modi script character segmentation approach is presented. At preliminary step, column wise background and foreground pixels are scrutinized to determine leading segmentation of text line. Modi text line is segmented in two types of segments as isolated characters and cluster of overlapping/touching characters. Local zoning based method and column wise background pixel density exploration is used to segment overlapping and touching characters clusters respectively. Performance of the proposed method and comparative analysis is verified using MODI-HHDoc Modi document dataset. Successful Modi character segmentation rate is achieved as 87.70%. As compared to previous hybrid background pixel density based technique, bad segmentation rate is reduced from 0.8 to 0.2%. Comparative analysis shows that proposed modified Modi character segmentation technique is more efficient to state-of-art benchmarking techniques. |
doi_str_mv | 10.1007/s42979-024-03003-z |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_3072096837</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3072096837</sourcerecordid><originalsourceid>FETCH-LOGICAL-c115z-18316a9c462ea05d276d07d1310f621f522321cf41da710da3b28a5b0608f773</originalsourceid><addsrcrecordid>eNp9kF1LwzAUhoMoOOb-gFcBr6snSZu0l2N-TJgIukshZPlYO1wzkxRxv95uFfTKqxzI-7zn8CB0SeCaAIibmNNKVBnQPAMGwLL9CRpRzklWViBO_8znaBLjBgBoAXnOixF6m-IXv-piwk_eNK6xBs9qFZRONuBXu97aNqnU-BZPd7vgla6x8wGn2uK5as1naFKy_WfQtWr0sQTfet0duHiBzpx6j3by847R8v5uOZtni-eHx9l0kWlCin1GSka4qnTOqVVQGCq4AWEII-A4Ja6glFGiXU6MEgSMYitaqmIFHEonBBujq6G2P_CjszHJje9C22-UDASFipfskKJDSgcfY7BO7kKzVeFLEpAHj3LwKHuP8uhR7nuIDVDsw-3aht_qf6hvLxN1FA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3072096837</pqid></control><display><type>article</type><title>A Robust Modified Character Segmentation Approach for the Handwritten Archaic Modi Documents</title><source>SpringerLink Journals - AutoHoldings</source><creator>Deshmukh, Manisha S. ; Kolhe, Satish R.</creator><creatorcontrib>Deshmukh, Manisha S. ; Kolhe, Satish R.</creatorcontrib><description>Character segmentation is an imperative step of intelligent handwritten archaic Modi document recognition system. Numbers of challenges like inconsistent and non-uniform handwritten characters, broken or degraded characters etc. are faced in the process of character segmentation due to age, cursive and stylish writing nature of Modi script. This paper presents a modified preliminary segmentation step for segmentation of Modi intact characters and overlapping/touching characters cluster to overcome problem of under/bad segmentation. Here, a zone based three steps Modi script character segmentation approach is presented. At preliminary step, column wise background and foreground pixels are scrutinized to determine leading segmentation of text line. Modi text line is segmented in two types of segments as isolated characters and cluster of overlapping/touching characters. Local zoning based method and column wise background pixel density exploration is used to segment overlapping and touching characters clusters respectively. Performance of the proposed method and comparative analysis is verified using MODI-HHDoc Modi document dataset. Successful Modi character segmentation rate is achieved as 87.70%. As compared to previous hybrid background pixel density based technique, bad segmentation rate is reduced from 0.8 to 0.2%. Comparative analysis shows that proposed modified Modi character segmentation technique is more efficient to state-of-art benchmarking techniques.</description><identifier>ISSN: 2661-8907</identifier><identifier>ISSN: 2662-995X</identifier><identifier>EISSN: 2661-8907</identifier><identifier>DOI: 10.1007/s42979-024-03003-z</identifier><language>eng</language><publisher>Singapore: Springer Nature Singapore</publisher><subject>Accuracy ; Algorithms ; Clusters ; Computer Imaging ; Computer Science ; Computer Systems Organization and Communication Networks ; Data Structures and Information Theory ; Density ; Documents ; Handwriting ; Handwriting recognition ; Information retrieval ; Information Systems and Communication Service ; Literature reviews ; Original Research ; Pattern Recognition and Graphics ; Pixels ; Segments ; Software Engineering/Programming and Operating Systems ; Vision ; Writing</subject><ispartof>SN computer science, 2024-06, Vol.5 (6), p.667, Article 667</ispartof><rights>The Author(s), under exclusive licence to Springer Nature Singapore Pte Ltd. 2024. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c115z-18316a9c462ea05d276d07d1310f621f522321cf41da710da3b28a5b0608f773</cites><orcidid>0000-0002-6273-7492</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s42979-024-03003-z$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s42979-024-03003-z$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,780,784,27923,27924,41487,42556,51318</link.rule.ids></links><search><creatorcontrib>Deshmukh, Manisha S.</creatorcontrib><creatorcontrib>Kolhe, Satish R.</creatorcontrib><title>A Robust Modified Character Segmentation Approach for the Handwritten Archaic Modi Documents</title><title>SN computer science</title><addtitle>SN COMPUT. SCI</addtitle><description>Character segmentation is an imperative step of intelligent handwritten archaic Modi document recognition system. Numbers of challenges like inconsistent and non-uniform handwritten characters, broken or degraded characters etc. are faced in the process of character segmentation due to age, cursive and stylish writing nature of Modi script. This paper presents a modified preliminary segmentation step for segmentation of Modi intact characters and overlapping/touching characters cluster to overcome problem of under/bad segmentation. Here, a zone based three steps Modi script character segmentation approach is presented. At preliminary step, column wise background and foreground pixels are scrutinized to determine leading segmentation of text line. Modi text line is segmented in two types of segments as isolated characters and cluster of overlapping/touching characters. Local zoning based method and column wise background pixel density exploration is used to segment overlapping and touching characters clusters respectively. Performance of the proposed method and comparative analysis is verified using MODI-HHDoc Modi document dataset. Successful Modi character segmentation rate is achieved as 87.70%. As compared to previous hybrid background pixel density based technique, bad segmentation rate is reduced from 0.8 to 0.2%. Comparative analysis shows that proposed modified Modi character segmentation technique is more efficient to state-of-art benchmarking techniques.</description><subject>Accuracy</subject><subject>Algorithms</subject><subject>Clusters</subject><subject>Computer Imaging</subject><subject>Computer Science</subject><subject>Computer Systems Organization and Communication Networks</subject><subject>Data Structures and Information Theory</subject><subject>Density</subject><subject>Documents</subject><subject>Handwriting</subject><subject>Handwriting recognition</subject><subject>Information retrieval</subject><subject>Information Systems and Communication Service</subject><subject>Literature reviews</subject><subject>Original Research</subject><subject>Pattern Recognition and Graphics</subject><subject>Pixels</subject><subject>Segments</subject><subject>Software Engineering/Programming and Operating Systems</subject><subject>Vision</subject><subject>Writing</subject><issn>2661-8907</issn><issn>2662-995X</issn><issn>2661-8907</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNp9kF1LwzAUhoMoOOb-gFcBr6snSZu0l2N-TJgIukshZPlYO1wzkxRxv95uFfTKqxzI-7zn8CB0SeCaAIibmNNKVBnQPAMGwLL9CRpRzklWViBO_8znaBLjBgBoAXnOixF6m-IXv-piwk_eNK6xBs9qFZRONuBXu97aNqnU-BZPd7vgla6x8wGn2uK5as1naFKy_WfQtWr0sQTfet0duHiBzpx6j3by847R8v5uOZtni-eHx9l0kWlCin1GSka4qnTOqVVQGCq4AWEII-A4Ja6glFGiXU6MEgSMYitaqmIFHEonBBujq6G2P_CjszHJje9C22-UDASFipfskKJDSgcfY7BO7kKzVeFLEpAHj3LwKHuP8uhR7nuIDVDsw-3aht_qf6hvLxN1FA</recordid><startdate>20240625</startdate><enddate>20240625</enddate><creator>Deshmukh, Manisha S.</creator><creator>Kolhe, Satish R.</creator><general>Springer Nature Singapore</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>JQ2</scope><orcidid>https://orcid.org/0000-0002-6273-7492</orcidid></search><sort><creationdate>20240625</creationdate><title>A Robust Modified Character Segmentation Approach for the Handwritten Archaic Modi Documents</title><author>Deshmukh, Manisha S. ; Kolhe, Satish R.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c115z-18316a9c462ea05d276d07d1310f621f522321cf41da710da3b28a5b0608f773</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Accuracy</topic><topic>Algorithms</topic><topic>Clusters</topic><topic>Computer Imaging</topic><topic>Computer Science</topic><topic>Computer Systems Organization and Communication Networks</topic><topic>Data Structures and Information Theory</topic><topic>Density</topic><topic>Documents</topic><topic>Handwriting</topic><topic>Handwriting recognition</topic><topic>Information retrieval</topic><topic>Information Systems and Communication Service</topic><topic>Literature reviews</topic><topic>Original Research</topic><topic>Pattern Recognition and Graphics</topic><topic>Pixels</topic><topic>Segments</topic><topic>Software Engineering/Programming and Operating Systems</topic><topic>Vision</topic><topic>Writing</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Deshmukh, Manisha S.</creatorcontrib><creatorcontrib>Kolhe, Satish R.</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Computer Science Collection</collection><jtitle>SN computer science</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Deshmukh, Manisha S.</au><au>Kolhe, Satish R.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A Robust Modified Character Segmentation Approach for the Handwritten Archaic Modi Documents</atitle><jtitle>SN computer science</jtitle><stitle>SN COMPUT. SCI</stitle><date>2024-06-25</date><risdate>2024</risdate><volume>5</volume><issue>6</issue><spage>667</spage><pages>667-</pages><artnum>667</artnum><issn>2661-8907</issn><issn>2662-995X</issn><eissn>2661-8907</eissn><abstract>Character segmentation is an imperative step of intelligent handwritten archaic Modi document recognition system. Numbers of challenges like inconsistent and non-uniform handwritten characters, broken or degraded characters etc. are faced in the process of character segmentation due to age, cursive and stylish writing nature of Modi script. This paper presents a modified preliminary segmentation step for segmentation of Modi intact characters and overlapping/touching characters cluster to overcome problem of under/bad segmentation. Here, a zone based three steps Modi script character segmentation approach is presented. At preliminary step, column wise background and foreground pixels are scrutinized to determine leading segmentation of text line. Modi text line is segmented in two types of segments as isolated characters and cluster of overlapping/touching characters. Local zoning based method and column wise background pixel density exploration is used to segment overlapping and touching characters clusters respectively. Performance of the proposed method and comparative analysis is verified using MODI-HHDoc Modi document dataset. Successful Modi character segmentation rate is achieved as 87.70%. As compared to previous hybrid background pixel density based technique, bad segmentation rate is reduced from 0.8 to 0.2%. Comparative analysis shows that proposed modified Modi character segmentation technique is more efficient to state-of-art benchmarking techniques.</abstract><cop>Singapore</cop><pub>Springer Nature Singapore</pub><doi>10.1007/s42979-024-03003-z</doi><orcidid>https://orcid.org/0000-0002-6273-7492</orcidid></addata></record> |
fulltext | fulltext |
identifier | ISSN: 2661-8907 |
ispartof | SN computer science, 2024-06, Vol.5 (6), p.667, Article 667 |
issn | 2661-8907 2662-995X 2661-8907 |
language | eng |
recordid | cdi_proquest_journals_3072096837 |
source | SpringerLink Journals - AutoHoldings |
subjects | Accuracy Algorithms Clusters Computer Imaging Computer Science Computer Systems Organization and Communication Networks Data Structures and Information Theory Density Documents Handwriting Handwriting recognition Information retrieval Information Systems and Communication Service Literature reviews Original Research Pattern Recognition and Graphics Pixels Segments Software Engineering/Programming and Operating Systems Vision Writing |
title | A Robust Modified Character Segmentation Approach for the Handwritten Archaic Modi Documents |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-13T04%3A15%3A01IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20Robust%20Modified%20Character%20Segmentation%20Approach%20for%20the%20Handwritten%20Archaic%20Modi%20Documents&rft.jtitle=SN%20computer%20science&rft.au=Deshmukh,%20Manisha%20S.&rft.date=2024-06-25&rft.volume=5&rft.issue=6&rft.spage=667&rft.pages=667-&rft.artnum=667&rft.issn=2661-8907&rft.eissn=2661-8907&rft_id=info:doi/10.1007/s42979-024-03003-z&rft_dat=%3Cproquest_cross%3E3072096837%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3072096837&rft_id=info:pmid/&rfr_iscdi=true |