Improved linear density technique for segmentation in Arabic handwritten text recognition
The challenge in handwriting recognition, especially in the segmentation process, took the researchers’ attention. These Arabic handwritten text processes are a challenging job because their characters are generally both cursive and unconstrained. In this paper, a new segmentation technique is propo...
Published in: | Multimedia tools and applications 2022-08, Vol.81 (20), p.28531-28558 |
---|---|
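Main authors: | Al Hamad, Husam Ahmed; Abualigah, Laith; Shehab, Mohammad; Al-Shqeerat, Khalil H. A.; Otair, Mohammad |
Format: | Article |
Language: | eng |
Subjects: | Arabic language; Classification; Computer Communication Networks; Computer Science; Data Structures and Information Theory; Density; Handwriting; Handwriting recognition; Histograms; Image segmentation; Multimedia Information Systems; Special Purpose and Application-Based Systems; Support vector machines |
Online access: | Full text |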
container_end_page | 28558 |
---|---|
container_issue | 20 |
container_start_page | 28531 |
container_title | Multimedia tools and applications |
container_volume | 81 |
creator | Al Hamad, Husam Ahmed; Abualigah, Laith; Shehab, Mohammad; Al-Shqeerat, Khalil H. A.; Otair, Mohammad |
description | The challenge in handwriting recognition, especially in the segmentation process, took the researchers’ attention. These Arabic handwritten text processes are a challenging job because their characters are generally both cursive and unconstrained. In this paper, a new segmentation technique is proposed for solving the problem of Arabic handwritten scripts, called ILDT. The proposed technique’s main objective is to use the word image’s vertical linear density for clarifying character boundaries and districting between characters. In the proposed method, three pre-processing steps are applied: fill close and open holes (missing circle), remove punctuation to clarify the area of ligature points and avoid characters overlapping, and crop the word image to remove excess white space. The goal of filling close and open holes is to increase the character’s pixel density and then apply the vertical linear density. The proposed technique calculates the distance histogram of vertical linear, aiming to discover local minima points to precisely determine the segmentation points. Several experiments were conducted, including elapsed CPU times and accuracies values. All comparative techniques are examined on a local benchmark database. The proposed method (ILDT) got almost all the best segmentation and recognition accuracy compared with other comparative methods. |
doi_str_mv | 10.1007/s11042-022-12717-2 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2693179323</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2693179323</sourcerecordid><originalsourceid>FETCH-LOGICAL-c319t-82a1441fb321e045b1e13117bbf9b000b1744895ce4ba67647810bdff366a6103</originalsourceid><addsrcrecordid>eNp9kD1PwzAQhiMEEqXwB5gsMRvubCdOxqrio1IlFhiYLDtxWletU2wX6L_HpUhsTHfD-9zHUxTXCLcIIO8iIghGgTGKTKKk7KQYYSk5lZLhae55DVSWgOfFRYwrAKxKJkbF22yzDcOH7cjaeasD6ayPLu1Jsu3Su_edJf0QSLSLjfVJJzd44jyZBG1cS5bad5_BpWR9Br4SCbYdFt4dYpfFWa_X0V791nHx-nD_Mn2i8-fH2XQypy3HJtGaaRQCe8MZWhClQYscURrTNwYADEoh6qZsrTC6kpWQNYLp-p5Xla4Q-Li4Oc7Nf-RzY1KrYRd8XqlY1XCUDWc8p9gx1YYhxmB7tQ1uo8NeIaiDQnVUqLJC9aNQsQzxIxRz2C9s-Bv9D_UNMOl0Lg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2693179323</pqid></control><display><type>article</type><title>Improved linear density technique for segmentation in Arabic handwritten text recognition</title><source>SpringerNature Complete Journals</source><creator>Al Hamad, Husam Ahmed ; Abualigah, Laith ; Shehab, Mohammad ; Al-Shqeerat, Khalil H. A. ; Otair, Mohammad</creator><creatorcontrib>Al Hamad, Husam Ahmed ; Abualigah, Laith ; Shehab, Mohammad ; Al-Shqeerat, Khalil H. A. ; Otair, Mohammad</creatorcontrib><description>The challenge in handwriting recognition, especially in the segmentation process, took the researchers’ attention. These Arabic handwritten text processes are a challenging job because their characters are generally both cursive and unconstrained. In this paper, a new segmentation technique is proposed for solving the problem of Arabic handwritten scripts, called ILDT. The proposed technique’s main objective is to use the word image’s vertical linear density for clarifying character boundaries and districting between characters. 
In the proposed method, three pre-processing steps are applied: fill close and open holes (missing circle), remove punctuation to clarify the area of ligature points and avoid characters overlapping, and crop the word image to remove excess white space. The goal of filling close and open holes is to increase the character’s pixel density and then apply the vertical linear density. The proposed technique calculates the distance histogram of vertical linear, aiming to discover local minima points to precisely determine the segmentation points. Several experiments were conducted, including elapsed CPU times and accuracies values. All comparative techniques are examined on a local benchmark database. The proposed method (ILDT) got almost all the best segmentation and recognition accuracy compared with other comparative methods.</description><identifier>ISSN: 1380-7501</identifier><identifier>EISSN: 1573-7721</identifier><identifier>DOI: 10.1007/s11042-022-12717-2</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Arabic language ; Classification ; Computer Communication Networks ; Computer Science ; Data Structures and Information Theory ; Density ; Handwriting ; Handwriting recognition ; Histograms ; Image segmentation ; Multimedia Information Systems ; Special Purpose and Application-Based Systems ; Support vector machines</subject><ispartof>Multimedia tools and applications, 2022-08, Vol.81 (20), p.28531-28558</ispartof><rights>The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022. corrected publication 2022</rights><rights>The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022. 
corrected publication 2022.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c319t-82a1441fb321e045b1e13117bbf9b000b1744895ce4ba67647810bdff366a6103</citedby><cites>FETCH-LOGICAL-c319t-82a1441fb321e045b1e13117bbf9b000b1744895ce4ba67647810bdff366a6103</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s11042-022-12717-2$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s11042-022-12717-2$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,780,784,27924,27925,41488,42557,51319</link.rule.ids></links><search><creatorcontrib>Al Hamad, Husam Ahmed</creatorcontrib><creatorcontrib>Abualigah, Laith</creatorcontrib><creatorcontrib>Shehab, Mohammad</creatorcontrib><creatorcontrib>Al-Shqeerat, Khalil H. A.</creatorcontrib><creatorcontrib>Otair, Mohammad</creatorcontrib><title>Improved linear density technique for segmentation in Arabic handwritten text recognition</title><title>Multimedia tools and applications</title><addtitle>Multimed Tools Appl</addtitle><description>The challenge in handwriting recognition, especially in the segmentation process, took the researchers’ attention. These Arabic handwritten text processes are a challenging job because their characters are generally both cursive and unconstrained. In this paper, a new segmentation technique is proposed for solving the problem of Arabic handwritten scripts, called ILDT. The proposed technique’s main objective is to use the word image’s vertical linear density for clarifying character boundaries and districting between characters. 
In the proposed method, three pre-processing steps are applied: fill close and open holes (missing circle), remove punctuation to clarify the area of ligature points and avoid characters overlapping, and crop the word image to remove excess white space. The goal of filling close and open holes is to increase the character’s pixel density and then apply the vertical linear density. The proposed technique calculates the distance histogram of vertical linear, aiming to discover local minima points to precisely determine the segmentation points. Several experiments were conducted, including elapsed CPU times and accuracies values. All comparative techniques are examined on a local benchmark database. The proposed method (ILDT) got almost all the best segmentation and recognition accuracy compared with other comparative methods.</description><subject>Arabic language</subject><subject>Classification</subject><subject>Computer Communication Networks</subject><subject>Computer Science</subject><subject>Data Structures and Information Theory</subject><subject>Density</subject><subject>Handwriting</subject><subject>Handwriting recognition</subject><subject>Histograms</subject><subject>Image segmentation</subject><subject>Multimedia Information Systems</subject><subject>Special Purpose and Application-Based Systems</subject><subject>Support vector 
machines</subject><issn>1380-7501</issn><issn>1573-7721</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>8G5</sourceid><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><sourceid>GUQSH</sourceid><sourceid>M2O</sourceid><recordid>eNp9kD1PwzAQhiMEEqXwB5gsMRvubCdOxqrio1IlFhiYLDtxWletU2wX6L_HpUhsTHfD-9zHUxTXCLcIIO8iIghGgTGKTKKk7KQYYSk5lZLhae55DVSWgOfFRYwrAKxKJkbF22yzDcOH7cjaeasD6ayPLu1Jsu3Su_edJf0QSLSLjfVJJzd44jyZBG1cS5bad5_BpWR9Br4SCbYdFt4dYpfFWa_X0V791nHx-nD_Mn2i8-fH2XQypy3HJtGaaRQCe8MZWhClQYscURrTNwYADEoh6qZsrTC6kpWQNYLp-p5Xla4Q-Li4Oc7Nf-RzY1KrYRd8XqlY1XCUDWc8p9gx1YYhxmB7tQ1uo8NeIaiDQnVUqLJC9aNQsQzxIxRz2C9s-Bv9D_UNMOl0Lg</recordid><startdate>20220801</startdate><enddate>20220801</enddate><creator>Al Hamad, Husam Ahmed</creator><creator>Abualigah, Laith</creator><creator>Shehab, Mohammad</creator><creator>Al-Shqeerat, Khalil H. 
A.</creator><creator>Otair, Mohammad</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7SC</scope><scope>7WY</scope><scope>7WZ</scope><scope>7XB</scope><scope>87Z</scope><scope>8AL</scope><scope>8AO</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FK</scope><scope>8FL</scope><scope>8G5</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BEZIV</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FRNLG</scope><scope>F~G</scope><scope>GNUQQ</scope><scope>GUQSH</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K60</scope><scope>K6~</scope><scope>K7-</scope><scope>L.-</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>M0C</scope><scope>M0N</scope><scope>M2O</scope><scope>MBDVC</scope><scope>P5Z</scope><scope>P62</scope><scope>PQBIZ</scope><scope>PQBZA</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>Q9U</scope></search><sort><creationdate>20220801</creationdate><title>Improved linear density technique for segmentation in Arabic handwritten text recognition</title><author>Al Hamad, Husam Ahmed ; Abualigah, Laith ; Shehab, Mohammad ; Al-Shqeerat, Khalil H. A. 
; Otair, Mohammad</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c319t-82a1441fb321e045b1e13117bbf9b000b1744895ce4ba67647810bdff366a6103</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Arabic language</topic><topic>Classification</topic><topic>Computer Communication Networks</topic><topic>Computer Science</topic><topic>Data Structures and Information Theory</topic><topic>Density</topic><topic>Handwriting</topic><topic>Handwriting recognition</topic><topic>Histograms</topic><topic>Image segmentation</topic><topic>Multimedia Information Systems</topic><topic>Special Purpose and Application-Based Systems</topic><topic>Support vector machines</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Al Hamad, Husam Ahmed</creatorcontrib><creatorcontrib>Abualigah, Laith</creatorcontrib><creatorcontrib>Shehab, Mohammad</creatorcontrib><creatorcontrib>Al-Shqeerat, Khalil H. 
A.</creatorcontrib><creatorcontrib>Otair, Mohammad</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Computer and Information Systems Abstracts</collection><collection>ABI/INFORM Collection</collection><collection>ABI/INFORM Global (PDF only)</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>ABI/INFORM Collection</collection><collection>Computing Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ABI/INFORM Collection (Alumni Edition)</collection><collection>Research Library (Alumni Edition)</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>Advanced Technologies & Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Business Premium Collection</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>Business Premium Collection (Alumni)</collection><collection>ABI/INFORM Global (Corporate)</collection><collection>ProQuest Central Student</collection><collection>Research Library Prep</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>ProQuest Business Collection (Alumni Edition)</collection><collection>ProQuest Business Collection</collection><collection>Computer science database</collection><collection>ABI/INFORM Professional Advanced</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and 
Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>ABI/INFORM Global</collection><collection>Computing Database</collection><collection>ProQuest research library</collection><collection>Research Library (Corporate)</collection><collection>ProQuest advanced technologies & aerospace journals</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>ProQuest One Business</collection><collection>ProQuest One Business (Alumni)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central Basic</collection><jtitle>Multimedia tools and applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Al Hamad, Husam Ahmed</au><au>Abualigah, Laith</au><au>Shehab, Mohammad</au><au>Al-Shqeerat, Khalil H. A.</au><au>Otair, Mohammad</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Improved linear density technique for segmentation in Arabic handwritten text recognition</atitle><jtitle>Multimedia tools and applications</jtitle><stitle>Multimed Tools Appl</stitle><date>2022-08-01</date><risdate>2022</risdate><volume>81</volume><issue>20</issue><spage>28531</spage><epage>28558</epage><pages>28531-28558</pages><issn>1380-7501</issn><eissn>1573-7721</eissn><abstract>The challenge in handwriting recognition, especially in the segmentation process, took the researchers’ attention. These Arabic handwritten text processes are a challenging job because their characters are generally both cursive and unconstrained. In this paper, a new segmentation technique is proposed for solving the problem of Arabic handwritten scripts, called ILDT. 
The proposed technique’s main objective is to use the word image’s vertical linear density for clarifying character boundaries and districting between characters. In the proposed method, three pre-processing steps are applied: fill close and open holes (missing circle), remove punctuation to clarify the area of ligature points and avoid characters overlapping, and crop the word image to remove excess white space. The goal of filling close and open holes is to increase the character’s pixel density and then apply the vertical linear density. The proposed technique calculates the distance histogram of vertical linear, aiming to discover local minima points to precisely determine the segmentation points. Several experiments were conducted, including elapsed CPU times and accuracies values. All comparative techniques are examined on a local benchmark database. The proposed method (ILDT) got almost all the best segmentation and recognition accuracy compared with other comparative methods.</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s11042-022-12717-2</doi><tpages>28</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1380-7501 |
ispartof | Multimedia tools and applications, 2022-08, Vol.81 (20), p.28531-28558 |
issn | 1380-7501; 1573-7721 |
language | eng |
recordid | cdi_proquest_journals_2693179323 |
source | SpringerNature Complete Journals |
subjects | Arabic language; Classification; Computer Communication Networks; Computer Science; Data Structures and Information Theory; Density; Handwriting; Handwriting recognition; Histograms; Image segmentation; Multimedia Information Systems; Special Purpose and Application-Based Systems; Support vector machines |
title | Improved linear density technique for segmentation in Arabic handwritten text recognition |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-01T06%3A12%3A56IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Improved%20linear%20density%20technique%20for%20segmentation%20in%20Arabic%20handwritten%20text%20recognition&rft.jtitle=Multimedia%20tools%20and%20applications&rft.au=Al%20Hamad,%20Husam%20Ahmed&rft.date=2022-08-01&rft.volume=81&rft.issue=20&rft.spage=28531&rft.epage=28558&rft.pages=28531-28558&rft.issn=1380-7501&rft.eissn=1573-7721&rft_id=info:doi/10.1007/s11042-022-12717-2&rft_dat=%3Cproquest_cross%3E2693179323%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2693179323&rft_id=info:pmid/&rfr_iscdi=true |
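
The abstract in this record describes finding segmentation points at local minima of a word image's vertical linear density. The sketch below illustrates only that general idea on a toy binary image; it is not the authors' ILDT implementation. The function names `vertical_density` and `candidate_cuts` are hypothetical, the image is assumed to be a list of rows with 1 for ink and 0 for background, and the pre-processing steps named in the abstract (hole filling, punctuation removal, cropping) are omitted.

```python
def vertical_density(image):
    """Count ink pixels (value 1) in each column of a binary word image."""
    if not image:
        return []
    width = len(image[0])
    return [sum(row[x] for row in image) for x in range(width)]


def candidate_cuts(density):
    """Return column indices of interior local minima of the density profile.

    A flat valley (several equal neighbouring columns) yields one cut at
    its middle column. This is an illustrative rule, not the paper's.
    """
    cuts = []
    n = len(density)
    x = 1
    while x < n - 1:
        if density[x] < density[x - 1]:
            # Walk to the right-hand end of a possible plateau.
            end = x
            while end + 1 < n and density[end + 1] == density[x]:
                end += 1
            # Keep it only if the profile rises again after the plateau.
            if end + 1 < n and density[end + 1] > density[x]:
                cuts.append((x + end) // 2)
            x = end + 1
        else:
            x += 1
    return cuts


# Toy word image: three strokes joined by a thin baseline in the middle row.
word = [
    [1, 1, 0, 1, 1, 1, 0, 0, 1],
    [1, 1, 1, 1, 1, 1, 1, 1, 1],
    [1, 0, 0, 0, 1, 1, 0, 0, 1],
]
profile = vertical_density(word)
cuts = candidate_cuts(profile)
print(profile, cuts)
```

On this toy input the thin connecting stroke produces the low-density columns, which is why minima of the profile are plausible places to separate cursive characters. A real pipeline would likely also need thresholds to suppress spurious minima.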