Handwritten English word recognition using a deep learning based object detection architecture

Handwriting is used to distribute information among people. To access this information for further analysis the page needs to be optically scanned and converted to machine recognizable form. Due to unconstrained writing styles along with connected and overlapping characters, handwriting recognition...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Multimedia tools and applications 2022, Vol.81 (1), p.975-1000
Hauptverfasser: Mondal, Riktim, Malakar, Samir, Barney Smith, Elisa H., Sarkar, Ram
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 1000
container_issue 1
container_start_page 975
container_title Multimedia tools and applications
container_volume 81
creator Mondal, Riktim
Malakar, Samir
Barney Smith, Elisa H.
Sarkar, Ram
description Handwriting is used to distribute information among people. To access this information for further analysis the page needs to be optically scanned and converted to machine recognizable form. Due to unconstrained writing styles along with connected and overlapping characters, handwriting recognition remains a challenging task. Most of the methods in the literature use lexicon-based approaches and train their models on large datasets having near 50 K word samples to achieve good results. This results in high computational requirements. While these models use around 50 K words in their dictionary when recognizing handwritten English text, the actual number of words in the dictionary is much higher than this. To this end, we propose a handwriting recognition technique to recognize handwritten English text based on a YOLOv3 object recognition model that is lexicon-free and that performs sequential character detection and identification with a low number of training samples (only 1200 word images). This model works well without any dependency on writers’ style of writing. This is tested on the IAM dataset and it is able to achieve 29.21% Word Error Rate and 9.53% Character Error Rate without a predefined vocabulary, which is on par with the state-of-the-art lexicon-based word recognition models.
doi_str_mv 10.1007/s11042-021-11425-7
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2621426055</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2621426055</sourcerecordid><originalsourceid>FETCH-LOGICAL-c319t-bdf402bbb2230f712ef80a0f91ac215a5cd126e303929598183351162b20fa343</originalsourceid><addsrcrecordid>eNp9kEFLxDAQhYMouK7-AU8Bz9WZSdO0R1lWVxC86NWQtmm3y5quScvivze1gjdPb4Z57w18jF0j3CKAuguIkFIChAliSjJRJ2yBUolEKcLTOIscEiUBz9lFCDsAzCSlC_a-Ma4--m4YrONr1-67sOXH3tfc26pvXTd0veNj6FzLDa-tPfC9Nd5Ne2mCrXlf7mw1xNMQZTIbX227aRm9vWRnjdkHe_WrS_b2sH5dbZLnl8en1f1zUgkshqSsmxSoLEsiAY1Csk0OBpoCTUUojaxqpMwKEAUVssgxF0IiZlQSNEakYslu5t6D7z9HGwa960fv4ktNGUUiGUgZXTS7Kt-H4G2jD777MP5LI-iJo5456shR_3DUKobEHArR7Frr_6r_SX0DkpF1lQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2621426055</pqid></control><display><type>article</type><title>Handwritten English word recognition using a deep learning based object detection architecture</title><source>Springer Nature - Complete Springer Journals</source><creator>Mondal, Riktim ; Malakar, Samir ; Barney Smith, Elisa H. ; Sarkar, Ram</creator><creatorcontrib>Mondal, Riktim ; Malakar, Samir ; Barney Smith, Elisa H. ; Sarkar, Ram</creatorcontrib><description>Handwriting is used to distribute information among people. To access this information for further analysis the page needs to be optically scanned and converted to machine recognizable form. Due to unconstrained writing styles along with connected and overlapping characters, handwriting recognition remains a challenging task. Most of the methods in the literature use lexicon-based approaches and train their models on large datasets having near 50 K word samples to achieve good results. This results in high computational requirements. While these models use around 50 K words in their dictionary when recognizing handwritten English text, the actual number of words in the dictionary is much higher than this. To this end, we propose a handwriting recognition technique to recognize handwritten English text based on a YOLOv3 object recognition model that is lexicon-free and that performs sequential character detection and identification with a low number of training samples (only 1200 word images). This model works well without any dependency on writers’ style of writing. This is tested on the IAM dataset and it is able to achieve 29.21% Word Error Rate and 9.53% Character Error Rate without a predefined vocabulary, which is on par with the state-of-the-art lexicon-based word recognition models.</description><identifier>ISSN: 1380-7501</identifier><identifier>EISSN: 1573-7721</identifier><identifier>DOI: 10.1007/s11042-021-11425-7</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Character recognition ; Computer Communication Networks ; Computer Science ; Data Structures and Information Theory ; Datasets ; Deep learning ; Dictionaries ; Handwriting ; Handwriting recognition ; Machine learning ; Multimedia Information Systems ; Object recognition ; Special Purpose and Application-Based Systems</subject><ispartof>Multimedia tools and applications, 2022, Vol.81 (1), p.975-1000</ispartof><rights>The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2021</rights><rights>The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2021.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c319t-bdf402bbb2230f712ef80a0f91ac215a5cd126e303929598183351162b20fa343</citedby><cites>FETCH-LOGICAL-c319t-bdf402bbb2230f712ef80a0f91ac215a5cd126e303929598183351162b20fa343</cites><orcidid>0000-0003-4217-2372</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s11042-021-11425-7$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s11042-021-11425-7$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,776,780,27901,27902,41464,42533,51294</link.rule.ids></links><search><creatorcontrib>Mondal, Riktim</creatorcontrib><creatorcontrib>Malakar, Samir</creatorcontrib><creatorcontrib>Barney Smith, Elisa H.</creatorcontrib><creatorcontrib>Sarkar, Ram</creatorcontrib><title>Handwritten English word recognition using a deep learning based object detection architecture</title><title>Multimedia tools and applications</title><addtitle>Multimed Tools Appl</addtitle><description>Handwriting is used to distribute information among people. To access this information for further analysis the page needs to be optically scanned and converted to machine recognizable form. Due to unconstrained writing styles along with connected and overlapping characters, handwriting recognition remains a challenging task. Most of the methods in the literature use lexicon-based approaches and train their models on large datasets having near 50 K word samples to achieve good results. This results in high computational requirements. While these models use around 50 K words in their dictionary when recognizing handwritten English text, the actual number of words in the dictionary is much higher than this. To this end, we propose a handwriting recognition technique to recognize handwritten English text based on a YOLOv3 object recognition model that is lexicon-free and that performs sequential character detection and identification with a low number of training samples (only 1200 word images). This model works well without any dependency on writers’ style of writing. This is tested on the IAM dataset and it is able to achieve 29.21% Word Error Rate and 9.53% Character Error Rate without a predefined vocabulary, which is on par with the state-of-the-art lexicon-based word recognition models.</description><subject>Character recognition</subject><subject>Computer Communication Networks</subject><subject>Computer Science</subject><subject>Data Structures and Information Theory</subject><subject>Datasets</subject><subject>Deep learning</subject><subject>Dictionaries</subject><subject>Handwriting</subject><subject>Handwriting recognition</subject><subject>Machine learning</subject><subject>Multimedia Information Systems</subject><subject>Object recognition</subject><subject>Special Purpose and Application-Based Systems</subject><issn>1380-7501</issn><issn>1573-7721</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>8G5</sourceid><sourceid>BENPR</sourceid><sourceid>GUQSH</sourceid><sourceid>M2O</sourceid><recordid>eNp9kEFLxDAQhYMouK7-AU8Bz9WZSdO0R1lWVxC86NWQtmm3y5quScvivze1gjdPb4Z57w18jF0j3CKAuguIkFIChAliSjJRJ2yBUolEKcLTOIscEiUBz9lFCDsAzCSlC_a-Ma4--m4YrONr1-67sOXH3tfc26pvXTd0veNj6FzLDa-tPfC9Nd5Ne2mCrXlf7mw1xNMQZTIbX227aRm9vWRnjdkHe_WrS_b2sH5dbZLnl8en1f1zUgkshqSsmxSoLEsiAY1Csk0OBpoCTUUojaxqpMwKEAUVssgxF0IiZlQSNEakYslu5t6D7z9HGwa960fv4ktNGUUiGUgZXTS7Kt-H4G2jD777MP5LI-iJo5456shR_3DUKobEHArR7Frr_6r_SX0DkpF1lQ</recordid><startdate>2022</startdate><enddate>2022</enddate><creator>Mondal, Riktim</creator><creator>Malakar, Samir</creator><creator>Barney Smith, Elisa H.</creator><creator>Sarkar, Ram</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7SC</scope><scope>7WY</scope><scope>7WZ</scope><scope>7XB</scope><scope>87Z</scope><scope>8AL</scope><scope>8AO</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FK</scope><scope>8FL</scope><scope>8G5</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BEZIV</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FRNLG</scope><scope>F~G</scope><scope>GNUQQ</scope><scope>GUQSH</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K60</scope><scope>K6~</scope><scope>K7-</scope><scope>L.-</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>M0C</scope><scope>M0N</scope><scope>M2O</scope><scope>MBDVC</scope><scope>P5Z</scope><scope>P62</scope><scope>PHGZM</scope><scope>PHGZT</scope><scope>PKEHL</scope><scope>PQBIZ</scope><scope>PQBZA</scope><scope>PQEST</scope><scope>PQGLB</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>Q9U</scope><orcidid>https://orcid.org/0000-0003-4217-2372</orcidid></search><sort><creationdate>2022</creationdate><title>Handwritten English word recognition using a deep learning based object detection architecture</title><author>Mondal, Riktim ; Malakar, Samir ; Barney Smith, Elisa H. ; Sarkar, Ram</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c319t-bdf402bbb2230f712ef80a0f91ac215a5cd126e303929598183351162b20fa343</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Character recognition</topic><topic>Computer Communication Networks</topic><topic>Computer Science</topic><topic>Data Structures and Information Theory</topic><topic>Datasets</topic><topic>Deep learning</topic><topic>Dictionaries</topic><topic>Handwriting</topic><topic>Handwriting recognition</topic><topic>Machine learning</topic><topic>Multimedia Information Systems</topic><topic>Object recognition</topic><topic>Special Purpose and Application-Based Systems</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Mondal, Riktim</creatorcontrib><creatorcontrib>Malakar, Samir</creatorcontrib><creatorcontrib>Barney Smith, Elisa H.</creatorcontrib><creatorcontrib>Sarkar, Ram</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Computer and Information Systems Abstracts</collection><collection>ABI/INFORM Collection</collection><collection>ABI/INFORM Global (PDF only)</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>ABI/INFORM Global (Alumni Edition)</collection><collection>Computing Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ABI/INFORM Collection (Alumni Edition)</collection><collection>Research Library (Alumni Edition)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Business Premium Collection</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Business Premium Collection (Alumni)</collection><collection>ABI/INFORM Global (Corporate)</collection><collection>ProQuest Central Student</collection><collection>Research Library Prep</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>ProQuest Business Collection (Alumni Edition)</collection><collection>ProQuest Business Collection</collection><collection>Computer Science Database</collection><collection>ABI/INFORM Professional Advanced</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>ABI/INFORM Global</collection><collection>Computing Database</collection><collection>Research Library</collection><collection>Research Library (Corporate)</collection><collection>Advanced Technologies &amp; Aerospace Database</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest Central (New)</collection><collection>ProQuest One Academic (New)</collection><collection>ProQuest One Academic Middle East (New)</collection><collection>ProQuest One Business</collection><collection>ProQuest One Business (Alumni)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Applied &amp; Life Sciences</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ProQuest Central Basic</collection><jtitle>Multimedia tools and applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Mondal, Riktim</au><au>Malakar, Samir</au><au>Barney Smith, Elisa H.</au><au>Sarkar, Ram</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Handwritten English word recognition using a deep learning based object detection architecture</atitle><jtitle>Multimedia tools and applications</jtitle><stitle>Multimed Tools Appl</stitle><date>2022</date><risdate>2022</risdate><volume>81</volume><issue>1</issue><spage>975</spage><epage>1000</epage><pages>975-1000</pages><issn>1380-7501</issn><eissn>1573-7721</eissn><abstract>Handwriting is used to distribute information among people. To access this information for further analysis the page needs to be optically scanned and converted to machine recognizable form. Due to unconstrained writing styles along with connected and overlapping characters, handwriting recognition remains a challenging task. Most of the methods in the literature use lexicon-based approaches and train their models on large datasets having near 50 K word samples to achieve good results. This results in high computational requirements. While these models use around 50 K words in their dictionary when recognizing handwritten English text, the actual number of words in the dictionary is much higher than this. To this end, we propose a handwriting recognition technique to recognize handwritten English text based on a YOLOv3 object recognition model that is lexicon-free and that performs sequential character detection and identification with a low number of training samples (only 1200 word images). This model works well without any dependency on writers’ style of writing. This is tested on the IAM dataset and it is able to achieve 29.21% Word Error Rate and 9.53% Character Error Rate without a predefined vocabulary, which is on par with the state-of-the-art lexicon-based word recognition models.</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s11042-021-11425-7</doi><tpages>26</tpages><orcidid>https://orcid.org/0000-0003-4217-2372</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 1380-7501
ispartof Multimedia tools and applications, 2022, Vol.81 (1), p.975-1000
issn 1380-7501
1573-7721
language eng
recordid cdi_proquest_journals_2621426055
source Springer Nature - Complete Springer Journals
subjects Character recognition
Computer Communication Networks
Computer Science
Data Structures and Information Theory
Datasets
Deep learning
Dictionaries
Handwriting
Handwriting recognition
Machine learning
Multimedia Information Systems
Object recognition
Special Purpose and Application-Based Systems
title Handwritten English word recognition using a deep learning based object detection architecture
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-20T21%3A12%3A32IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Handwritten%20English%20word%20recognition%20using%20a%20deep%20learning%20based%20object%20detection%20architecture&rft.jtitle=Multimedia%20tools%20and%20applications&rft.au=Mondal,%20Riktim&rft.date=2022&rft.volume=81&rft.issue=1&rft.spage=975&rft.epage=1000&rft.pages=975-1000&rft.issn=1380-7501&rft.eissn=1573-7721&rft_id=info:doi/10.1007/s11042-021-11425-7&rft_dat=%3Cproquest_cross%3E2621426055%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2621426055&rft_id=info:pmid/&rfr_iscdi=true