A Robust Modified Character Segmentation Approach for the Handwritten Archaic Modi Documents

Character segmentation is an imperative step of intelligent handwritten archaic Modi document recognition system. Numbers of challenges like inconsistent and non-uniform handwritten characters, broken or degraded characters etc. are faced in the process of character segmentation due to age, cursive...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:SN computer science 2024-06, Vol.5 (6), p.667, Article 667
Hauptverfasser: Deshmukh, Manisha S., Kolhe, Satish R.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue 6
container_start_page 667
container_title SN computer science
container_volume 5
creator Deshmukh, Manisha S.
Kolhe, Satish R.
description Character segmentation is an imperative step of intelligent handwritten archaic Modi document recognition system. Numbers of challenges like inconsistent and non-uniform handwritten characters, broken or degraded characters etc. are faced in the process of character segmentation due to age, cursive and stylish writing nature of Modi script. This paper presents a modified preliminary segmentation step for segmentation of Modi intact characters and overlapping/touching characters cluster to overcome problem of under/bad segmentation. Here, a zone based three steps Modi script character segmentation approach is presented. At preliminary step, column wise background and foreground pixels are scrutinized to determine leading segmentation of text line. Modi text line is segmented in two types of segments as isolated characters and cluster of overlapping/touching characters. Local zoning based method and column wise background pixel density exploration is used to segment overlapping and touching characters clusters respectively. Performance of the proposed method and comparative analysis is verified using MODI-HHDoc Modi document dataset. Successful Modi character segmentation rate is achieved as 87.70%. As compared to previous hybrid background pixel density based technique, bad segmentation rate is reduced from 0.8 to 0.2%. Comparative analysis shows that proposed modified Modi character segmentation technique is more efficient to state-of-art benchmarking techniques.
doi_str_mv 10.1007/s42979-024-03003-z
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_3072096837</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3072096837</sourcerecordid><originalsourceid>FETCH-LOGICAL-c115z-18316a9c462ea05d276d07d1310f621f522321cf41da710da3b28a5b0608f773</originalsourceid><addsrcrecordid>eNp9kF1LwzAUhoMoOOb-gFcBr6snSZu0l2N-TJgIukshZPlYO1wzkxRxv95uFfTKqxzI-7zn8CB0SeCaAIibmNNKVBnQPAMGwLL9CRpRzklWViBO_8znaBLjBgBoAXnOixF6m-IXv-piwk_eNK6xBs9qFZRONuBXu97aNqnU-BZPd7vgla6x8wGn2uK5as1naFKy_WfQtWr0sQTfet0duHiBzpx6j3by847R8v5uOZtni-eHx9l0kWlCin1GSka4qnTOqVVQGCq4AWEII-A4Ja6glFGiXU6MEgSMYitaqmIFHEonBBujq6G2P_CjszHJje9C22-UDASFipfskKJDSgcfY7BO7kKzVeFLEpAHj3LwKHuP8uhR7nuIDVDsw-3aht_qf6hvLxN1FA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3072096837</pqid></control><display><type>article</type><title>A Robust Modified Character Segmentation Approach for the Handwritten Archaic Modi Documents</title><source>SpringerLink Journals - AutoHoldings</source><creator>Deshmukh, Manisha S. ; Kolhe, Satish R.</creator><creatorcontrib>Deshmukh, Manisha S. ; Kolhe, Satish R.</creatorcontrib><description>Character segmentation is an imperative step of intelligent handwritten archaic Modi document recognition system. Numbers of challenges like inconsistent and non-uniform handwritten characters, broken or degraded characters etc. are faced in the process of character segmentation due to age, cursive and stylish writing nature of Modi script. This paper presents a modified preliminary segmentation step for segmentation of Modi intact characters and overlapping/touching characters cluster to overcome problem of under/bad segmentation. Here, a zone based three steps Modi script character segmentation approach is presented. At preliminary step, column wise background and foreground pixels are scrutinized to determine leading segmentation of text line. Modi text line is segmented in two types of segments as isolated characters and cluster of overlapping/touching characters. Local zoning based method and column wise background pixel density exploration is used to segment overlapping and touching characters clusters respectively. Performance of the proposed method and comparative analysis is verified using MODI-HHDoc Modi document dataset. Successful Modi character segmentation rate is achieved as 87.70%. As compared to previous hybrid background pixel density based technique, bad segmentation rate is reduced from 0.8 to 0.2%. Comparative analysis shows that proposed modified Modi character segmentation technique is more efficient to state-of-art benchmarking techniques.</description><identifier>ISSN: 2661-8907</identifier><identifier>ISSN: 2662-995X</identifier><identifier>EISSN: 2661-8907</identifier><identifier>DOI: 10.1007/s42979-024-03003-z</identifier><language>eng</language><publisher>Singapore: Springer Nature Singapore</publisher><subject>Accuracy ; Algorithms ; Clusters ; Computer Imaging ; Computer Science ; Computer Systems Organization and Communication Networks ; Data Structures and Information Theory ; Density ; Documents ; Handwriting ; Handwriting recognition ; Information retrieval ; Information Systems and Communication Service ; Literature reviews ; Original Research ; Pattern Recognition and Graphics ; Pixels ; Segments ; Software Engineering/Programming and Operating Systems ; Vision ; Writing</subject><ispartof>SN computer science, 2024-06, Vol.5 (6), p.667, Article 667</ispartof><rights>The Author(s), under exclusive licence to Springer Nature Singapore Pte Ltd. 2024. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c115z-18316a9c462ea05d276d07d1310f621f522321cf41da710da3b28a5b0608f773</cites><orcidid>0000-0002-6273-7492</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s42979-024-03003-z$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s42979-024-03003-z$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,780,784,27923,27924,41487,42556,51318</link.rule.ids></links><search><creatorcontrib>Deshmukh, Manisha S.</creatorcontrib><creatorcontrib>Kolhe, Satish R.</creatorcontrib><title>A Robust Modified Character Segmentation Approach for the Handwritten Archaic Modi Documents</title><title>SN computer science</title><addtitle>SN COMPUT. SCI</addtitle><description>Character segmentation is an imperative step of intelligent handwritten archaic Modi document recognition system. Numbers of challenges like inconsistent and non-uniform handwritten characters, broken or degraded characters etc. are faced in the process of character segmentation due to age, cursive and stylish writing nature of Modi script. This paper presents a modified preliminary segmentation step for segmentation of Modi intact characters and overlapping/touching characters cluster to overcome problem of under/bad segmentation. Here, a zone based three steps Modi script character segmentation approach is presented. At preliminary step, column wise background and foreground pixels are scrutinized to determine leading segmentation of text line. Modi text line is segmented in two types of segments as isolated characters and cluster of overlapping/touching characters. Local zoning based method and column wise background pixel density exploration is used to segment overlapping and touching characters clusters respectively. Performance of the proposed method and comparative analysis is verified using MODI-HHDoc Modi document dataset. Successful Modi character segmentation rate is achieved as 87.70%. As compared to previous hybrid background pixel density based technique, bad segmentation rate is reduced from 0.8 to 0.2%. Comparative analysis shows that proposed modified Modi character segmentation technique is more efficient to state-of-art benchmarking techniques.</description><subject>Accuracy</subject><subject>Algorithms</subject><subject>Clusters</subject><subject>Computer Imaging</subject><subject>Computer Science</subject><subject>Computer Systems Organization and Communication Networks</subject><subject>Data Structures and Information Theory</subject><subject>Density</subject><subject>Documents</subject><subject>Handwriting</subject><subject>Handwriting recognition</subject><subject>Information retrieval</subject><subject>Information Systems and Communication Service</subject><subject>Literature reviews</subject><subject>Original Research</subject><subject>Pattern Recognition and Graphics</subject><subject>Pixels</subject><subject>Segments</subject><subject>Software Engineering/Programming and Operating Systems</subject><subject>Vision</subject><subject>Writing</subject><issn>2661-8907</issn><issn>2662-995X</issn><issn>2661-8907</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNp9kF1LwzAUhoMoOOb-gFcBr6snSZu0l2N-TJgIukshZPlYO1wzkxRxv95uFfTKqxzI-7zn8CB0SeCaAIibmNNKVBnQPAMGwLL9CRpRzklWViBO_8znaBLjBgBoAXnOixF6m-IXv-piwk_eNK6xBs9qFZRONuBXu97aNqnU-BZPd7vgla6x8wGn2uK5as1naFKy_WfQtWr0sQTfet0duHiBzpx6j3by847R8v5uOZtni-eHx9l0kWlCin1GSka4qnTOqVVQGCq4AWEII-A4Ja6glFGiXU6MEgSMYitaqmIFHEonBBujq6G2P_CjszHJje9C22-UDASFipfskKJDSgcfY7BO7kKzVeFLEpAHj3LwKHuP8uhR7nuIDVDsw-3aht_qf6hvLxN1FA</recordid><startdate>20240625</startdate><enddate>20240625</enddate><creator>Deshmukh, Manisha S.</creator><creator>Kolhe, Satish R.</creator><general>Springer Nature Singapore</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>JQ2</scope><orcidid>https://orcid.org/0000-0002-6273-7492</orcidid></search><sort><creationdate>20240625</creationdate><title>A Robust Modified Character Segmentation Approach for the Handwritten Archaic Modi Documents</title><author>Deshmukh, Manisha S. ; Kolhe, Satish R.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c115z-18316a9c462ea05d276d07d1310f621f522321cf41da710da3b28a5b0608f773</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Accuracy</topic><topic>Algorithms</topic><topic>Clusters</topic><topic>Computer Imaging</topic><topic>Computer Science</topic><topic>Computer Systems Organization and Communication Networks</topic><topic>Data Structures and Information Theory</topic><topic>Density</topic><topic>Documents</topic><topic>Handwriting</topic><topic>Handwriting recognition</topic><topic>Information retrieval</topic><topic>Information Systems and Communication Service</topic><topic>Literature reviews</topic><topic>Original Research</topic><topic>Pattern Recognition and Graphics</topic><topic>Pixels</topic><topic>Segments</topic><topic>Software Engineering/Programming and Operating Systems</topic><topic>Vision</topic><topic>Writing</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Deshmukh, Manisha S.</creatorcontrib><creatorcontrib>Kolhe, Satish R.</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Computer Science Collection</collection><jtitle>SN computer science</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Deshmukh, Manisha S.</au><au>Kolhe, Satish R.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A Robust Modified Character Segmentation Approach for the Handwritten Archaic Modi Documents</atitle><jtitle>SN computer science</jtitle><stitle>SN COMPUT. SCI</stitle><date>2024-06-25</date><risdate>2024</risdate><volume>5</volume><issue>6</issue><spage>667</spage><pages>667-</pages><artnum>667</artnum><issn>2661-8907</issn><issn>2662-995X</issn><eissn>2661-8907</eissn><abstract>Character segmentation is an imperative step of intelligent handwritten archaic Modi document recognition system. Numbers of challenges like inconsistent and non-uniform handwritten characters, broken or degraded characters etc. are faced in the process of character segmentation due to age, cursive and stylish writing nature of Modi script. This paper presents a modified preliminary segmentation step for segmentation of Modi intact characters and overlapping/touching characters cluster to overcome problem of under/bad segmentation. Here, a zone based three steps Modi script character segmentation approach is presented. At preliminary step, column wise background and foreground pixels are scrutinized to determine leading segmentation of text line. Modi text line is segmented in two types of segments as isolated characters and cluster of overlapping/touching characters. Local zoning based method and column wise background pixel density exploration is used to segment overlapping and touching characters clusters respectively. Performance of the proposed method and comparative analysis is verified using MODI-HHDoc Modi document dataset. Successful Modi character segmentation rate is achieved as 87.70%. As compared to previous hybrid background pixel density based technique, bad segmentation rate is reduced from 0.8 to 0.2%. Comparative analysis shows that proposed modified Modi character segmentation technique is more efficient to state-of-art benchmarking techniques.</abstract><cop>Singapore</cop><pub>Springer Nature Singapore</pub><doi>10.1007/s42979-024-03003-z</doi><orcidid>https://orcid.org/0000-0002-6273-7492</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 2661-8907
ispartof SN computer science, 2024-06, Vol.5 (6), p.667, Article 667
issn 2661-8907
2662-995X
2661-8907
language eng
recordid cdi_proquest_journals_3072096837
source SpringerLink Journals - AutoHoldings
subjects Accuracy
Algorithms
Clusters
Computer Imaging
Computer Science
Computer Systems Organization and Communication Networks
Data Structures and Information Theory
Density
Documents
Handwriting
Handwriting recognition
Information retrieval
Information Systems and Communication Service
Literature reviews
Original Research
Pattern Recognition and Graphics
Pixels
Segments
Software Engineering/Programming and Operating Systems
Vision
Writing
title A Robust Modified Character Segmentation Approach for the Handwritten Archaic Modi Documents
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-13T04%3A15%3A01IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20Robust%20Modified%20Character%20Segmentation%20Approach%20for%20the%20Handwritten%20Archaic%20Modi%20Documents&rft.jtitle=SN%20computer%20science&rft.au=Deshmukh,%20Manisha%20S.&rft.date=2024-06-25&rft.volume=5&rft.issue=6&rft.spage=667&rft.pages=667-&rft.artnum=667&rft.issn=2661-8907&rft.eissn=2661-8907&rft_id=info:doi/10.1007/s42979-024-03003-z&rft_dat=%3Cproquest_cross%3E3072096837%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3072096837&rft_id=info:pmid/&rfr_iscdi=true