Comparative study of feature extraction and classification techniques for printed bilingual Gujarati-English text
Accuracy of any Optical Character Recognition system (OCR) depends on extraction of optimal features from the text object. To extract optimal features, an appropriate feature selection method needs to be selected from available methods. The character images can be represented perfectly by employing...
Gespeichert in:
1. Verfasser: | |
---|---|
Format: | Tagungsbericht |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | 1 |
container_start_page | |
container_title | |
container_volume | 2842 |
creator | Chaudhari, Shailesh |
description | Accuracy of any Optical Character Recognition system (OCR) depends on extraction of optimal features from the text object. To extract optimal features, an appropriate feature selection method needs to be selected from available methods. The character images can be represented perfectly by employing the appropriate feature selection method. This article presents the comparison of two feature extraction techniques along with widely used classifiers. Here, the performance is evaluated based on feature extraction techniques regarding character image classifications. The purpose of the current research work is to identify and show the most optimal feature extraction technique for printed bilingual documents. Here, two feature extraction namely Discrete Cosine Transform (DCT) feature and zone based pixel density feature are used. The classification accuracy is compared and evaluated with each feature extraction method for different classifiers. From the experiments and analysis, it is observed that energy based Discrete Cosine Transform (DCT) feature is outperformed as compared to zone based pixel density feature. |
doi_str_mv | 10.1063/5.0175651 |
format | Conference Proceeding |
fullrecord | <record><control><sourceid>proquest_scita</sourceid><recordid>TN_cdi_proquest_journals_2876143946</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2876143946</sourcerecordid><originalsourceid>FETCH-LOGICAL-p133t-48c70a4795bb9efe1d7216f1a4f89662e83be29ffa01dd497e4497a78025bf783</originalsourceid><addsrcrecordid>eNotkE9LAzEQxYMoWKsHv0HAm7A1_7M5SqlVKHhR8Bayu0mbss1uk6y0395t62UGhjdv5v0AeMRohpGgL3yGsOSC4yswwZzjQgosrsEEIcUKwujPLbhLaYsQUVKWE7Cfd7veRJP9r4UpD80Rdg46a_IQLbSHHE2dfRegCQ2sW5OSd74251G29Sb4_WATdF2EffQh2wZWvvVhPZgWLoft2bpYhHXr02bcOOR7cONMm-zDf5-C77fF1_y9WH0uP-avq6LHlOaClbVEhknFq0pZZ3EjCRYOG-ZKJQSxJa0sUc4ZhJuGKWnZWIwsEeGVkyWdgqeLbx-7049Zb7shhvGkJuVIhVHFxKh6vqhS7fM5lh5z7Ew8aoz0Canm-h8p_QOXf2qz</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype><pqid>2876143946</pqid></control><display><type>conference_proceeding</type><title>Comparative study of feature extraction and classification techniques for printed bilingual Gujarati-English text</title><source>AIP Journals Complete</source><creator>Chaudhari, Shailesh</creator><contributor>Alegavi, Sujata ; Mishra, B. K. ; Kasturiwale, Hemant P.</contributor><creatorcontrib>Chaudhari, Shailesh ; Alegavi, Sujata ; Mishra, B. K. ; Kasturiwale, Hemant P.</creatorcontrib><description>Accuracy of any Optical Character Recognition system (OCR) depends on extraction of optimal features from the text object. To extract optimal features, an appropriate feature selection method needs to be selected from available methods. The character images can be represented perfectly by employing the appropriate feature selection method. This article presents the comparison of two feature extraction techniques along with widely used classifiers. Here, the performance is evaluated based on feature extraction techniques regarding character image classifications. The purpose of the current research work is to identify and show the most optimal feature extraction technique for printed bilingual documents. Here, two feature extraction namely Discrete Cosine Transform (DCT) feature and zone based pixel density feature are used. The classification accuracy is compared and evaluated with each feature extraction method for different classifiers. From the experiments and analysis, it is observed that energy based Discrete Cosine Transform (DCT) feature is outperformed as compared to zone based pixel density feature.</description><identifier>ISSN: 0094-243X</identifier><identifier>EISSN: 1551-7616</identifier><identifier>DOI: 10.1063/5.0175651</identifier><identifier>CODEN: APCPCS</identifier><language>eng</language><publisher>Melville: American Institute of Physics</publisher><subject>Bilingualism ; Classifiers ; Comparative studies ; Density ; Discrete cosine transform ; Feature extraction ; Image classification ; Optical character recognition ; Performance evaluation ; Pixels</subject><ispartof>AIP Conference Proceedings, 2023, Vol.2842 (1)</ispartof><rights>Author(s)</rights><rights>2023 Author(s). Published by AIP Publishing.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://pubs.aip.org/acp/article-lookup/doi/10.1063/5.0175651$$EHTML$$P50$$Gscitation$$H</linktohtml><link.rule.ids>310,311,315,781,785,790,791,795,4513,23932,23933,25142,27926,27927,76386</link.rule.ids></links><search><contributor>Alegavi, Sujata</contributor><contributor>Mishra, B. K.</contributor><contributor>Kasturiwale, Hemant P.</contributor><creatorcontrib>Chaudhari, Shailesh</creatorcontrib><title>Comparative study of feature extraction and classification techniques for printed bilingual Gujarati-English text</title><title>AIP Conference Proceedings</title><description>Accuracy of any Optical Character Recognition system (OCR) depends on extraction of optimal features from the text object. To extract optimal features, an appropriate feature selection method needs to be selected from available methods. The character images can be represented perfectly by employing the appropriate feature selection method. This article presents the comparison of two feature extraction techniques along with widely used classifiers. Here, the performance is evaluated based on feature extraction techniques regarding character image classifications. The purpose of the current research work is to identify and show the most optimal feature extraction technique for printed bilingual documents. Here, two feature extraction namely Discrete Cosine Transform (DCT) feature and zone based pixel density feature are used. The classification accuracy is compared and evaluated with each feature extraction method for different classifiers. From the experiments and analysis, it is observed that energy based Discrete Cosine Transform (DCT) feature is outperformed as compared to zone based pixel density feature.</description><subject>Bilingualism</subject><subject>Classifiers</subject><subject>Comparative studies</subject><subject>Density</subject><subject>Discrete cosine transform</subject><subject>Feature extraction</subject><subject>Image classification</subject><subject>Optical character recognition</subject><subject>Performance evaluation</subject><subject>Pixels</subject><issn>0094-243X</issn><issn>1551-7616</issn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2023</creationdate><recordtype>conference_proceeding</recordtype><recordid>eNotkE9LAzEQxYMoWKsHv0HAm7A1_7M5SqlVKHhR8Bayu0mbss1uk6y0395t62UGhjdv5v0AeMRohpGgL3yGsOSC4yswwZzjQgosrsEEIcUKwujPLbhLaYsQUVKWE7Cfd7veRJP9r4UpD80Rdg46a_IQLbSHHE2dfRegCQ2sW5OSd74251G29Sb4_WATdF2EffQh2wZWvvVhPZgWLoft2bpYhHXr02bcOOR7cONMm-zDf5-C77fF1_y9WH0uP-avq6LHlOaClbVEhknFq0pZZ3EjCRYOG-ZKJQSxJa0sUc4ZhJuGKWnZWIwsEeGVkyWdgqeLbx-7049Zb7shhvGkJuVIhVHFxKh6vqhS7fM5lh5z7Ew8aoz0Canm-h8p_QOXf2qz</recordid><startdate>20231012</startdate><enddate>20231012</enddate><creator>Chaudhari, Shailesh</creator><general>American Institute of Physics</general><scope>8FD</scope><scope>H8D</scope><scope>L7M</scope></search><sort><creationdate>20231012</creationdate><title>Comparative study of feature extraction and classification techniques for printed bilingual Gujarati-English text</title><author>Chaudhari, Shailesh</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-p133t-48c70a4795bb9efe1d7216f1a4f89662e83be29ffa01dd497e4497a78025bf783</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Bilingualism</topic><topic>Classifiers</topic><topic>Comparative studies</topic><topic>Density</topic><topic>Discrete cosine transform</topic><topic>Feature extraction</topic><topic>Image classification</topic><topic>Optical character recognition</topic><topic>Performance evaluation</topic><topic>Pixels</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Chaudhari, Shailesh</creatorcontrib><collection>Technology Research Database</collection><collection>Aerospace Database</collection><collection>Advanced Technologies Database with Aerospace</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Chaudhari, Shailesh</au><au>Alegavi, Sujata</au><au>Mishra, B. K.</au><au>Kasturiwale, Hemant P.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Comparative study of feature extraction and classification techniques for printed bilingual Gujarati-English text</atitle><btitle>AIP Conference Proceedings</btitle><date>2023-10-12</date><risdate>2023</risdate><volume>2842</volume><issue>1</issue><issn>0094-243X</issn><eissn>1551-7616</eissn><coden>APCPCS</coden><abstract>Accuracy of any Optical Character Recognition system (OCR) depends on extraction of optimal features from the text object. To extract optimal features, an appropriate feature selection method needs to be selected from available methods. The character images can be represented perfectly by employing the appropriate feature selection method. This article presents the comparison of two feature extraction techniques along with widely used classifiers. Here, the performance is evaluated based on feature extraction techniques regarding character image classifications. The purpose of the current research work is to identify and show the most optimal feature extraction technique for printed bilingual documents. Here, two feature extraction namely Discrete Cosine Transform (DCT) feature and zone based pixel density feature are used. The classification accuracy is compared and evaluated with each feature extraction method for different classifiers. From the experiments and analysis, it is observed that energy based Discrete Cosine Transform (DCT) feature is outperformed as compared to zone based pixel density feature.</abstract><cop>Melville</cop><pub>American Institute of Physics</pub><doi>10.1063/5.0175651</doi><tpages>8</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0094-243X |
ispartof | AIP Conference Proceedings, 2023, Vol.2842 (1) |
issn | 0094-243X 1551-7616 |
language | eng |
recordid | cdi_proquest_journals_2876143946 |
source | AIP Journals Complete |
subjects | Bilingualism Classifiers Comparative studies Density Discrete cosine transform Feature extraction Image classification Optical character recognition Performance evaluation Pixels |
title | Comparative study of feature extraction and classification techniques for printed bilingual Gujarati-English text |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-18T10%3A05%3A46IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_scita&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Comparative%20study%20of%20feature%20extraction%20and%20classification%20techniques%20for%20printed%20bilingual%20Gujarati-English%20text&rft.btitle=AIP%20Conference%20Proceedings&rft.au=Chaudhari,%20Shailesh&rft.date=2023-10-12&rft.volume=2842&rft.issue=1&rft.issn=0094-243X&rft.eissn=1551-7616&rft.coden=APCPCS&rft_id=info:doi/10.1063/5.0175651&rft_dat=%3Cproquest_scita%3E2876143946%3C/proquest_scita%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2876143946&rft_id=info:pmid/&rfr_iscdi=true |