An Efficient Deep Learning Model with Interrelated Tagging Prototype with Segmentation for Telugu Optical Character Recognition
More than 66 million people in India speak Telugu, a language that dates back thousands of years and is widely spoken in South India. There has not been much progress reported on the advancement of Telugu text Optical Character Recognition (OCR) systems. Telugu characters can be composed of many sym...
Gespeichert in:
Veröffentlicht in: | Scientific programming 2022-08, Vol.2022, p.1-10 |
---|---|
Hauptverfasser: | , , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 10 |
---|---|
container_issue | |
container_start_page | 1 |
container_title | Scientific programming |
container_volume | 2022 |
creator | Dhanikonda, Srinivasa Rao Sowjanya, Ponnuru Ramanaiah, M. Laxmidevi Joshi, Rahul Krishna Mohan, B. H. Dhabliya, Dharmesh Raja, N. Kannaiya |
description | More than 66 million people in India speak Telugu, a language that dates back thousands of years and is widely spoken in South India. There has not been much progress reported on the advancement of Telugu text Optical Character Recognition (OCR) systems. Telugu characters can be composed of many symbols joined together. OCR is the process of turning a document image into a text-editable one that may be used in other applications. It saves a great deal of time and effort by not having to start from scratch each time. There are hundreds of thousands of different combinations of modifiers and consonants when writing compound letters. Symbols joined to one another form a compound character. Since there are so many output classes in Telugu, there’s a lot of interclass variation. Additionally, there are not any Telugu OCR systems that take use of recent breakthroughs in deep learning, which prompted us to create our own. When used in conjunction with a word processor, an OCR system has a significant impact on real-world applications. In a Telugu OCR system, we offer two ways to improve symbol or glyph segmentation. When it comes to Telugu OCR, the ability to recognise that Telugu text is crucial. In a picture, connected components are collections of identical pixels that are connected to one another by either 4- or 8-pixel connectivity. These connected components are known as glyphs in Telugu. In the proposed research, an efficient deep learning model with Interrelated Tagging Prototype with Segmentation for Telugu Text Recognition (ITP-STTR) is introduced. The proposed model is compared with the existing model and the results exhibit that the proposed model’s performance in text recognition is high. |
doi_str_mv | 10.1155/2022/1059004 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2712661054</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2712661054</sourcerecordid><originalsourceid>FETCH-LOGICAL-c294t-ba6229cf0a42b0a0e6b4431822023bb66c64f579e409f310484a99c3b0c38bbf3</originalsourceid><addsrcrecordid>eNp9kEFLwzAUx4soOKc3P0DAo9a9pGnXHMecOphMdIK3kmYvXUaX1DRj7ORXt2M7e3p_eD_-j_eLolsKj5Sm6YABYwMKqQDgZ1GP5sM0FlR8n3cZ0jwWjPPL6Kpt1wA0pwC96HdkyURrowzaQJ4QGzJD6a2xFXlzS6zJzoQVmdqA3mMtAy7JQlbVYf_uXXBh3-CR-cRq05XIYJwl2nmywHpbbcm8CUbJmoxX0kvV9ZAPVK6y5gBeRxda1i3enGY_-nqeLMav8Wz-Mh2PZrFigoe4lBljQmmQnJUgAbOS84TmrHs5KcssUxnX6VAgB6ETCjznUgiVlKCSvCx10o_ujr2Ndz9bbEOxdltvu5MFG1KWZZ0g3lEPR0p517YeddF4s5F-X1AoDoqLg-LipLjD74_4ytil3Jn_6T-VyHvn</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2712661054</pqid></control><display><type>article</type><title>An Efficient Deep Learning Model with Interrelated Tagging Prototype with Segmentation for Telugu Optical Character Recognition</title><source>EZB-FREE-00999 freely available EZB journals</source><source>Wiley Online Library (Open Access Collection)</source><source>Alma/SFX Local Collection</source><creator>Dhanikonda, Srinivasa Rao ; Sowjanya, Ponnuru ; Ramanaiah, M. Laxmidevi ; Joshi, Rahul ; Krishna Mohan, B. H. ; Dhabliya, Dharmesh ; Raja, N. Kannaiya</creator><contributor>Gupta, Punit ; Punit Gupta</contributor><creatorcontrib>Dhanikonda, Srinivasa Rao ; Sowjanya, Ponnuru ; Ramanaiah, M. Laxmidevi ; Joshi, Rahul ; Krishna Mohan, B. H. ; Dhabliya, Dharmesh ; Raja, N. Kannaiya ; Gupta, Punit ; Punit Gupta</creatorcontrib><description>More than 66 million people in India speak Telugu, a language that dates back thousands of years and is widely spoken in South India. There has not been much progress reported on the advancement of Telugu text Optical Character Recognition (OCR) systems. Telugu characters can be composed of many symbols joined together. OCR is the process of turning a document image into a text-editable one that may be used in other applications. It saves a great deal of time and effort by not having to start from scratch each time. There are hundreds of thousands of different combinations of modifiers and consonants when writing compound letters. Symbols joined to one another form a compound character. Since there are so many output classes in Telugu, there’s a lot of interclass variation. Additionally, there are not any Telugu OCR systems that take use of recent breakthroughs in deep learning, which prompted us to create our own. When used in conjunction with a word processor, an OCR system has a significant impact on real-world applications. In a Telugu OCR system, we offer two ways to improve symbol or glyph segmentation. When it comes to Telugu OCR, the ability to recognise that Telugu text is crucial. In a picture, connected components are collections of identical pixels that are connected to one another by either 4- or 8-pixel connectivity. These connected components are known as glyphs in Telugu. In the proposed research, an efficient deep learning model with Interrelated Tagging Prototype with Segmentation for Telugu Text Recognition (ITP-STTR) is introduced. The proposed model is compared with the existing model and the results exhibit that the proposed model’s performance in text recognition is high.</description><identifier>ISSN: 1058-9244</identifier><identifier>EISSN: 1875-919X</identifier><identifier>DOI: 10.1155/2022/1059004</identifier><language>eng</language><publisher>New York: Hindawi</publisher><subject>Automation ; Classification ; Consonants (speech) ; Deep learning ; Image segmentation ; Machine learning ; Marking ; Microprocessors ; Neural networks ; Optical character recognition ; Pattern recognition ; Pixels ; Prototypes ; Reading ; Software ; Symbols ; Word processors</subject><ispartof>Scientific programming, 2022-08, Vol.2022, p.1-10</ispartof><rights>Copyright © 2022 Srinivasa Rao Dhanikonda et al.</rights><rights>Copyright © 2022 Srinivasa Rao Dhanikonda et al. This is an open access article distributed under the Creative Commons Attribution License (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. https://creativecommons.org/licenses/by/4.0</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c294t-ba6229cf0a42b0a0e6b4431822023bb66c64f579e409f310484a99c3b0c38bbf3</cites><orcidid>0000-0002-5871-890X ; 0000-0002-4302-8252</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids></links><search><contributor>Gupta, Punit</contributor><contributor>Punit Gupta</contributor><creatorcontrib>Dhanikonda, Srinivasa Rao</creatorcontrib><creatorcontrib>Sowjanya, Ponnuru</creatorcontrib><creatorcontrib>Ramanaiah, M. Laxmidevi</creatorcontrib><creatorcontrib>Joshi, Rahul</creatorcontrib><creatorcontrib>Krishna Mohan, B. H.</creatorcontrib><creatorcontrib>Dhabliya, Dharmesh</creatorcontrib><creatorcontrib>Raja, N. Kannaiya</creatorcontrib><title>An Efficient Deep Learning Model with Interrelated Tagging Prototype with Segmentation for Telugu Optical Character Recognition</title><title>Scientific programming</title><description>More than 66 million people in India speak Telugu, a language that dates back thousands of years and is widely spoken in South India. There has not been much progress reported on the advancement of Telugu text Optical Character Recognition (OCR) systems. Telugu characters can be composed of many symbols joined together. OCR is the process of turning a document image into a text-editable one that may be used in other applications. It saves a great deal of time and effort by not having to start from scratch each time. There are hundreds of thousands of different combinations of modifiers and consonants when writing compound letters. Symbols joined to one another form a compound character. Since there are so many output classes in Telugu, there’s a lot of interclass variation. Additionally, there are not any Telugu OCR systems that take use of recent breakthroughs in deep learning, which prompted us to create our own. When used in conjunction with a word processor, an OCR system has a significant impact on real-world applications. In a Telugu OCR system, we offer two ways to improve symbol or glyph segmentation. When it comes to Telugu OCR, the ability to recognise that Telugu text is crucial. In a picture, connected components are collections of identical pixels that are connected to one another by either 4- or 8-pixel connectivity. These connected components are known as glyphs in Telugu. In the proposed research, an efficient deep learning model with Interrelated Tagging Prototype with Segmentation for Telugu Text Recognition (ITP-STTR) is introduced. The proposed model is compared with the existing model and the results exhibit that the proposed model’s performance in text recognition is high.</description><subject>Automation</subject><subject>Classification</subject><subject>Consonants (speech)</subject><subject>Deep learning</subject><subject>Image segmentation</subject><subject>Machine learning</subject><subject>Marking</subject><subject>Microprocessors</subject><subject>Neural networks</subject><subject>Optical character recognition</subject><subject>Pattern recognition</subject><subject>Pixels</subject><subject>Prototypes</subject><subject>Reading</subject><subject>Software</subject><subject>Symbols</subject><subject>Word processors</subject><issn>1058-9244</issn><issn>1875-919X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>RHX</sourceid><recordid>eNp9kEFLwzAUx4soOKc3P0DAo9a9pGnXHMecOphMdIK3kmYvXUaX1DRj7ORXt2M7e3p_eD_-j_eLolsKj5Sm6YABYwMKqQDgZ1GP5sM0FlR8n3cZ0jwWjPPL6Kpt1wA0pwC96HdkyURrowzaQJ4QGzJD6a2xFXlzS6zJzoQVmdqA3mMtAy7JQlbVYf_uXXBh3-CR-cRq05XIYJwl2nmywHpbbcm8CUbJmoxX0kvV9ZAPVK6y5gBeRxda1i3enGY_-nqeLMav8Wz-Mh2PZrFigoe4lBljQmmQnJUgAbOS84TmrHs5KcssUxnX6VAgB6ETCjznUgiVlKCSvCx10o_ujr2Ndz9bbEOxdltvu5MFG1KWZZ0g3lEPR0p517YeddF4s5F-X1AoDoqLg-LipLjD74_4ytil3Jn_6T-VyHvn</recordid><startdate>20220829</startdate><enddate>20220829</enddate><creator>Dhanikonda, Srinivasa Rao</creator><creator>Sowjanya, Ponnuru</creator><creator>Ramanaiah, M. Laxmidevi</creator><creator>Joshi, Rahul</creator><creator>Krishna Mohan, B. H.</creator><creator>Dhabliya, Dharmesh</creator><creator>Raja, N. Kannaiya</creator><general>Hindawi</general><general>Hindawi Limited</general><scope>RHU</scope><scope>RHW</scope><scope>RHX</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0002-5871-890X</orcidid><orcidid>https://orcid.org/0000-0002-4302-8252</orcidid></search><sort><creationdate>20220829</creationdate><title>An Efficient Deep Learning Model with Interrelated Tagging Prototype with Segmentation for Telugu Optical Character Recognition</title><author>Dhanikonda, Srinivasa Rao ; Sowjanya, Ponnuru ; Ramanaiah, M. Laxmidevi ; Joshi, Rahul ; Krishna Mohan, B. H. ; Dhabliya, Dharmesh ; Raja, N. Kannaiya</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c294t-ba6229cf0a42b0a0e6b4431822023bb66c64f579e409f310484a99c3b0c38bbf3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Automation</topic><topic>Classification</topic><topic>Consonants (speech)</topic><topic>Deep learning</topic><topic>Image segmentation</topic><topic>Machine learning</topic><topic>Marking</topic><topic>Microprocessors</topic><topic>Neural networks</topic><topic>Optical character recognition</topic><topic>Pattern recognition</topic><topic>Pixels</topic><topic>Prototypes</topic><topic>Reading</topic><topic>Software</topic><topic>Symbols</topic><topic>Word processors</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Dhanikonda, Srinivasa Rao</creatorcontrib><creatorcontrib>Sowjanya, Ponnuru</creatorcontrib><creatorcontrib>Ramanaiah, M. Laxmidevi</creatorcontrib><creatorcontrib>Joshi, Rahul</creatorcontrib><creatorcontrib>Krishna Mohan, B. H.</creatorcontrib><creatorcontrib>Dhabliya, Dharmesh</creatorcontrib><creatorcontrib>Raja, N. Kannaiya</creatorcontrib><collection>Hindawi Publishing Complete</collection><collection>Hindawi Publishing Subscription Journals</collection><collection>Hindawi Publishing Open Access</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics & Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Scientific programming</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Dhanikonda, Srinivasa Rao</au><au>Sowjanya, Ponnuru</au><au>Ramanaiah, M. Laxmidevi</au><au>Joshi, Rahul</au><au>Krishna Mohan, B. H.</au><au>Dhabliya, Dharmesh</au><au>Raja, N. Kannaiya</au><au>Gupta, Punit</au><au>Punit Gupta</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>An Efficient Deep Learning Model with Interrelated Tagging Prototype with Segmentation for Telugu Optical Character Recognition</atitle><jtitle>Scientific programming</jtitle><date>2022-08-29</date><risdate>2022</risdate><volume>2022</volume><spage>1</spage><epage>10</epage><pages>1-10</pages><issn>1058-9244</issn><eissn>1875-919X</eissn><abstract>More than 66 million people in India speak Telugu, a language that dates back thousands of years and is widely spoken in South India. There has not been much progress reported on the advancement of Telugu text Optical Character Recognition (OCR) systems. Telugu characters can be composed of many symbols joined together. OCR is the process of turning a document image into a text-editable one that may be used in other applications. It saves a great deal of time and effort by not having to start from scratch each time. There are hundreds of thousands of different combinations of modifiers and consonants when writing compound letters. Symbols joined to one another form a compound character. Since there are so many output classes in Telugu, there’s a lot of interclass variation. Additionally, there are not any Telugu OCR systems that take use of recent breakthroughs in deep learning, which prompted us to create our own. When used in conjunction with a word processor, an OCR system has a significant impact on real-world applications. In a Telugu OCR system, we offer two ways to improve symbol or glyph segmentation. When it comes to Telugu OCR, the ability to recognise that Telugu text is crucial. In a picture, connected components are collections of identical pixels that are connected to one another by either 4- or 8-pixel connectivity. These connected components are known as glyphs in Telugu. In the proposed research, an efficient deep learning model with Interrelated Tagging Prototype with Segmentation for Telugu Text Recognition (ITP-STTR) is introduced. The proposed model is compared with the existing model and the results exhibit that the proposed model’s performance in text recognition is high.</abstract><cop>New York</cop><pub>Hindawi</pub><doi>10.1155/2022/1059004</doi><tpages>10</tpages><orcidid>https://orcid.org/0000-0002-5871-890X</orcidid><orcidid>https://orcid.org/0000-0002-4302-8252</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1058-9244 |
ispartof | Scientific programming, 2022-08, Vol.2022, p.1-10 |
issn | 1058-9244 1875-919X |
language | eng |
recordid | cdi_proquest_journals_2712661054 |
source | EZB-FREE-00999 freely available EZB journals; Wiley Online Library (Open Access Collection); Alma/SFX Local Collection |
subjects | Automation Classification Consonants (speech) Deep learning Image segmentation Machine learning Marking Microprocessors Neural networks Optical character recognition Pattern recognition Pixels Prototypes Reading Software Symbols Word processors |
title | An Efficient Deep Learning Model with Interrelated Tagging Prototype with Segmentation for Telugu Optical Character Recognition |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-26T05%3A27%3A27IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=An%20Efficient%20Deep%20Learning%20Model%20with%20Interrelated%20Tagging%20Prototype%20with%20Segmentation%20for%20Telugu%20Optical%20Character%20Recognition&rft.jtitle=Scientific%20programming&rft.au=Dhanikonda,%20Srinivasa%20Rao&rft.date=2022-08-29&rft.volume=2022&rft.spage=1&rft.epage=10&rft.pages=1-10&rft.issn=1058-9244&rft.eissn=1875-919X&rft_id=info:doi/10.1155/2022/1059004&rft_dat=%3Cproquest_cross%3E2712661054%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2712661054&rft_id=info:pmid/&rfr_iscdi=true |