An Efficient Deep Learning Model with Interrelated Tagging Prototype with Segmentation for Telugu Optical Character Recognition

More than 66 million people in India speak Telugu, a language that dates back thousands of years and is widely spoken in South India. There has not been much progress reported on the advancement of Telugu text Optical Character Recognition (OCR) systems. Telugu characters can be composed of many sym...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Scientific programming 2022-08, Vol.2022, p.1-10
Hauptverfasser: Dhanikonda, Srinivasa Rao, Sowjanya, Ponnuru, Ramanaiah, M. Laxmidevi, Joshi, Rahul, Krishna Mohan, B. H., Dhabliya, Dharmesh, Raja, N. Kannaiya
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 10
container_issue
container_start_page 1
container_title Scientific programming
container_volume 2022
creator Dhanikonda, Srinivasa Rao
Sowjanya, Ponnuru
Ramanaiah, M. Laxmidevi
Joshi, Rahul
Krishna Mohan, B. H.
Dhabliya, Dharmesh
Raja, N. Kannaiya
description More than 66 million people in India speak Telugu, a language that dates back thousands of years and is widely spoken in South India. There has not been much progress reported on the advancement of Telugu text Optical Character Recognition (OCR) systems. Telugu characters can be composed of many symbols joined together. OCR is the process of turning a document image into a text-editable one that may be used in other applications. It saves a great deal of time and effort by not having to start from scratch each time. There are hundreds of thousands of different combinations of modifiers and consonants when writing compound letters. Symbols joined to one another form a compound character. Since there are so many output classes in Telugu, there’s a lot of interclass variation. Additionally, there are not any Telugu OCR systems that take use of recent breakthroughs in deep learning, which prompted us to create our own. When used in conjunction with a word processor, an OCR system has a significant impact on real-world applications. In a Telugu OCR system, we offer two ways to improve symbol or glyph segmentation. When it comes to Telugu OCR, the ability to recognise that Telugu text is crucial. In a picture, connected components are collections of identical pixels that are connected to one another by either 4- or 8-pixel connectivity. These connected components are known as glyphs in Telugu. In the proposed research, an efficient deep learning model with Interrelated Tagging Prototype with Segmentation for Telugu Text Recognition (ITP-STTR) is introduced. The proposed model is compared with the existing model and the results exhibit that the proposed model’s performance in text recognition is high.
doi_str_mv 10.1155/2022/1059004
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2712661054</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2712661054</sourcerecordid><originalsourceid>FETCH-LOGICAL-c294t-ba6229cf0a42b0a0e6b4431822023bb66c64f579e409f310484a99c3b0c38bbf3</originalsourceid><addsrcrecordid>eNp9kEFLwzAUx4soOKc3P0DAo9a9pGnXHMecOphMdIK3kmYvXUaX1DRj7ORXt2M7e3p_eD_-j_eLolsKj5Sm6YABYwMKqQDgZ1GP5sM0FlR8n3cZ0jwWjPPL6Kpt1wA0pwC96HdkyURrowzaQJ4QGzJD6a2xFXlzS6zJzoQVmdqA3mMtAy7JQlbVYf_uXXBh3-CR-cRq05XIYJwl2nmywHpbbcm8CUbJmoxX0kvV9ZAPVK6y5gBeRxda1i3enGY_-nqeLMav8Wz-Mh2PZrFigoe4lBljQmmQnJUgAbOS84TmrHs5KcssUxnX6VAgB6ETCjznUgiVlKCSvCx10o_ujr2Ndz9bbEOxdltvu5MFG1KWZZ0g3lEPR0p517YeddF4s5F-X1AoDoqLg-LipLjD74_4ytil3Jn_6T-VyHvn</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2712661054</pqid></control><display><type>article</type><title>An Efficient Deep Learning Model with Interrelated Tagging Prototype with Segmentation for Telugu Optical Character Recognition</title><source>EZB-FREE-00999 freely available EZB journals</source><source>Wiley Online Library (Open Access Collection)</source><source>Alma/SFX Local Collection</source><creator>Dhanikonda, Srinivasa Rao ; Sowjanya, Ponnuru ; Ramanaiah, M. Laxmidevi ; Joshi, Rahul ; Krishna Mohan, B. H. ; Dhabliya, Dharmesh ; Raja, N. Kannaiya</creator><contributor>Gupta, Punit ; Punit Gupta</contributor><creatorcontrib>Dhanikonda, Srinivasa Rao ; Sowjanya, Ponnuru ; Ramanaiah, M. Laxmidevi ; Joshi, Rahul ; Krishna Mohan, B. H. ; Dhabliya, Dharmesh ; Raja, N. Kannaiya ; Gupta, Punit ; Punit Gupta</creatorcontrib><description>More than 66 million people in India speak Telugu, a language that dates back thousands of years and is widely spoken in South India. There has not been much progress reported on the advancement of Telugu text Optical Character Recognition (OCR) systems. Telugu characters can be composed of many symbols joined together. OCR is the process of turning a document image into a text-editable one that may be used in other applications. It saves a great deal of time and effort by not having to start from scratch each time. There are hundreds of thousands of different combinations of modifiers and consonants when writing compound letters. Symbols joined to one another form a compound character. Since there are so many output classes in Telugu, there’s a lot of interclass variation. Additionally, there are not any Telugu OCR systems that take use of recent breakthroughs in deep learning, which prompted us to create our own. When used in conjunction with a word processor, an OCR system has a significant impact on real-world applications. In a Telugu OCR system, we offer two ways to improve symbol or glyph segmentation. When it comes to Telugu OCR, the ability to recognise that Telugu text is crucial. In a picture, connected components are collections of identical pixels that are connected to one another by either 4- or 8-pixel connectivity. These connected components are known as glyphs in Telugu. In the proposed research, an efficient deep learning model with Interrelated Tagging Prototype with Segmentation for Telugu Text Recognition (ITP-STTR) is introduced. The proposed model is compared with the existing model and the results exhibit that the proposed model’s performance in text recognition is high.</description><identifier>ISSN: 1058-9244</identifier><identifier>EISSN: 1875-919X</identifier><identifier>DOI: 10.1155/2022/1059004</identifier><language>eng</language><publisher>New York: Hindawi</publisher><subject>Automation ; Classification ; Consonants (speech) ; Deep learning ; Image segmentation ; Machine learning ; Marking ; Microprocessors ; Neural networks ; Optical character recognition ; Pattern recognition ; Pixels ; Prototypes ; Reading ; Software ; Symbols ; Word processors</subject><ispartof>Scientific programming, 2022-08, Vol.2022, p.1-10</ispartof><rights>Copyright © 2022 Srinivasa Rao Dhanikonda et al.</rights><rights>Copyright © 2022 Srinivasa Rao Dhanikonda et al. This is an open access article distributed under the Creative Commons Attribution License (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. https://creativecommons.org/licenses/by/4.0</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c294t-ba6229cf0a42b0a0e6b4431822023bb66c64f579e409f310484a99c3b0c38bbf3</cites><orcidid>0000-0002-5871-890X ; 0000-0002-4302-8252</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids></links><search><contributor>Gupta, Punit</contributor><contributor>Punit Gupta</contributor><creatorcontrib>Dhanikonda, Srinivasa Rao</creatorcontrib><creatorcontrib>Sowjanya, Ponnuru</creatorcontrib><creatorcontrib>Ramanaiah, M. Laxmidevi</creatorcontrib><creatorcontrib>Joshi, Rahul</creatorcontrib><creatorcontrib>Krishna Mohan, B. H.</creatorcontrib><creatorcontrib>Dhabliya, Dharmesh</creatorcontrib><creatorcontrib>Raja, N. Kannaiya</creatorcontrib><title>An Efficient Deep Learning Model with Interrelated Tagging Prototype with Segmentation for Telugu Optical Character Recognition</title><title>Scientific programming</title><description>More than 66 million people in India speak Telugu, a language that dates back thousands of years and is widely spoken in South India. There has not been much progress reported on the advancement of Telugu text Optical Character Recognition (OCR) systems. Telugu characters can be composed of many symbols joined together. OCR is the process of turning a document image into a text-editable one that may be used in other applications. It saves a great deal of time and effort by not having to start from scratch each time. There are hundreds of thousands of different combinations of modifiers and consonants when writing compound letters. Symbols joined to one another form a compound character. Since there are so many output classes in Telugu, there’s a lot of interclass variation. Additionally, there are not any Telugu OCR systems that take use of recent breakthroughs in deep learning, which prompted us to create our own. When used in conjunction with a word processor, an OCR system has a significant impact on real-world applications. In a Telugu OCR system, we offer two ways to improve symbol or glyph segmentation. When it comes to Telugu OCR, the ability to recognise that Telugu text is crucial. In a picture, connected components are collections of identical pixels that are connected to one another by either 4- or 8-pixel connectivity. These connected components are known as glyphs in Telugu. In the proposed research, an efficient deep learning model with Interrelated Tagging Prototype with Segmentation for Telugu Text Recognition (ITP-STTR) is introduced. The proposed model is compared with the existing model and the results exhibit that the proposed model’s performance in text recognition is high.</description><subject>Automation</subject><subject>Classification</subject><subject>Consonants (speech)</subject><subject>Deep learning</subject><subject>Image segmentation</subject><subject>Machine learning</subject><subject>Marking</subject><subject>Microprocessors</subject><subject>Neural networks</subject><subject>Optical character recognition</subject><subject>Pattern recognition</subject><subject>Pixels</subject><subject>Prototypes</subject><subject>Reading</subject><subject>Software</subject><subject>Symbols</subject><subject>Word processors</subject><issn>1058-9244</issn><issn>1875-919X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>RHX</sourceid><recordid>eNp9kEFLwzAUx4soOKc3P0DAo9a9pGnXHMecOphMdIK3kmYvXUaX1DRj7ORXt2M7e3p_eD_-j_eLolsKj5Sm6YABYwMKqQDgZ1GP5sM0FlR8n3cZ0jwWjPPL6Kpt1wA0pwC96HdkyURrowzaQJ4QGzJD6a2xFXlzS6zJzoQVmdqA3mMtAy7JQlbVYf_uXXBh3-CR-cRq05XIYJwl2nmywHpbbcm8CUbJmoxX0kvV9ZAPVK6y5gBeRxda1i3enGY_-nqeLMav8Wz-Mh2PZrFigoe4lBljQmmQnJUgAbOS84TmrHs5KcssUxnX6VAgB6ETCjznUgiVlKCSvCx10o_ujr2Ndz9bbEOxdltvu5MFG1KWZZ0g3lEPR0p517YeddF4s5F-X1AoDoqLg-LipLjD74_4ytil3Jn_6T-VyHvn</recordid><startdate>20220829</startdate><enddate>20220829</enddate><creator>Dhanikonda, Srinivasa Rao</creator><creator>Sowjanya, Ponnuru</creator><creator>Ramanaiah, M. Laxmidevi</creator><creator>Joshi, Rahul</creator><creator>Krishna Mohan, B. H.</creator><creator>Dhabliya, Dharmesh</creator><creator>Raja, N. Kannaiya</creator><general>Hindawi</general><general>Hindawi Limited</general><scope>RHU</scope><scope>RHW</scope><scope>RHX</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0002-5871-890X</orcidid><orcidid>https://orcid.org/0000-0002-4302-8252</orcidid></search><sort><creationdate>20220829</creationdate><title>An Efficient Deep Learning Model with Interrelated Tagging Prototype with Segmentation for Telugu Optical Character Recognition</title><author>Dhanikonda, Srinivasa Rao ; Sowjanya, Ponnuru ; Ramanaiah, M. Laxmidevi ; Joshi, Rahul ; Krishna Mohan, B. H. ; Dhabliya, Dharmesh ; Raja, N. Kannaiya</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c294t-ba6229cf0a42b0a0e6b4431822023bb66c64f579e409f310484a99c3b0c38bbf3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Automation</topic><topic>Classification</topic><topic>Consonants (speech)</topic><topic>Deep learning</topic><topic>Image segmentation</topic><topic>Machine learning</topic><topic>Marking</topic><topic>Microprocessors</topic><topic>Neural networks</topic><topic>Optical character recognition</topic><topic>Pattern recognition</topic><topic>Pixels</topic><topic>Prototypes</topic><topic>Reading</topic><topic>Software</topic><topic>Symbols</topic><topic>Word processors</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Dhanikonda, Srinivasa Rao</creatorcontrib><creatorcontrib>Sowjanya, Ponnuru</creatorcontrib><creatorcontrib>Ramanaiah, M. Laxmidevi</creatorcontrib><creatorcontrib>Joshi, Rahul</creatorcontrib><creatorcontrib>Krishna Mohan, B. H.</creatorcontrib><creatorcontrib>Dhabliya, Dharmesh</creatorcontrib><creatorcontrib>Raja, N. Kannaiya</creatorcontrib><collection>Hindawi Publishing Complete</collection><collection>Hindawi Publishing Subscription Journals</collection><collection>Hindawi Publishing Open Access</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Scientific programming</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Dhanikonda, Srinivasa Rao</au><au>Sowjanya, Ponnuru</au><au>Ramanaiah, M. Laxmidevi</au><au>Joshi, Rahul</au><au>Krishna Mohan, B. H.</au><au>Dhabliya, Dharmesh</au><au>Raja, N. Kannaiya</au><au>Gupta, Punit</au><au>Punit Gupta</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>An Efficient Deep Learning Model with Interrelated Tagging Prototype with Segmentation for Telugu Optical Character Recognition</atitle><jtitle>Scientific programming</jtitle><date>2022-08-29</date><risdate>2022</risdate><volume>2022</volume><spage>1</spage><epage>10</epage><pages>1-10</pages><issn>1058-9244</issn><eissn>1875-919X</eissn><abstract>More than 66 million people in India speak Telugu, a language that dates back thousands of years and is widely spoken in South India. There has not been much progress reported on the advancement of Telugu text Optical Character Recognition (OCR) systems. Telugu characters can be composed of many symbols joined together. OCR is the process of turning a document image into a text-editable one that may be used in other applications. It saves a great deal of time and effort by not having to start from scratch each time. There are hundreds of thousands of different combinations of modifiers and consonants when writing compound letters. Symbols joined to one another form a compound character. Since there are so many output classes in Telugu, there’s a lot of interclass variation. Additionally, there are not any Telugu OCR systems that take use of recent breakthroughs in deep learning, which prompted us to create our own. When used in conjunction with a word processor, an OCR system has a significant impact on real-world applications. In a Telugu OCR system, we offer two ways to improve symbol or glyph segmentation. When it comes to Telugu OCR, the ability to recognise that Telugu text is crucial. In a picture, connected components are collections of identical pixels that are connected to one another by either 4- or 8-pixel connectivity. These connected components are known as glyphs in Telugu. In the proposed research, an efficient deep learning model with Interrelated Tagging Prototype with Segmentation for Telugu Text Recognition (ITP-STTR) is introduced. The proposed model is compared with the existing model and the results exhibit that the proposed model’s performance in text recognition is high.</abstract><cop>New York</cop><pub>Hindawi</pub><doi>10.1155/2022/1059004</doi><tpages>10</tpages><orcidid>https://orcid.org/0000-0002-5871-890X</orcidid><orcidid>https://orcid.org/0000-0002-4302-8252</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1058-9244
ispartof Scientific programming, 2022-08, Vol.2022, p.1-10
issn 1058-9244
1875-919X
language eng
recordid cdi_proquest_journals_2712661054
source EZB-FREE-00999 freely available EZB journals; Wiley Online Library (Open Access Collection); Alma/SFX Local Collection
subjects Automation
Classification
Consonants (speech)
Deep learning
Image segmentation
Machine learning
Marking
Microprocessors
Neural networks
Optical character recognition
Pattern recognition
Pixels
Prototypes
Reading
Software
Symbols
Word processors
title An Efficient Deep Learning Model with Interrelated Tagging Prototype with Segmentation for Telugu Optical Character Recognition
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-26T05%3A27%3A27IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=An%20Efficient%20Deep%20Learning%20Model%20with%20Interrelated%20Tagging%20Prototype%20with%20Segmentation%20for%20Telugu%20Optical%20Character%20Recognition&rft.jtitle=Scientific%20programming&rft.au=Dhanikonda,%20Srinivasa%20Rao&rft.date=2022-08-29&rft.volume=2022&rft.spage=1&rft.epage=10&rft.pages=1-10&rft.issn=1058-9244&rft.eissn=1875-919X&rft_id=info:doi/10.1155/2022/1059004&rft_dat=%3Cproquest_cross%3E2712661054%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2712661054&rft_id=info:pmid/&rfr_iscdi=true