Scene text detection using sparse stroke information and MLP
In this article, we present a novel set of features for detection of text in images of natural scenes using a multi-layer perceptron (MLP) classifier. An estimate of the uniformity in stroke thickness is one of our features and we obtain the same using only a subset of the distance transform values...
Gespeichert in:
Hauptverfasser: | , , |
---|---|
Format: | Tagungsbericht |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 297 |
---|---|
container_issue | |
container_start_page | 294 |
container_title | |
container_volume | |
creator | Chowdhury, A. R. Bhattacharya, U. Parui, S. K. |
description | In this article, we present a novel set of features for detection of text in images of natural scenes using a multi-layer perceptron (MLP) classifier. An estimate of the uniformity in stroke thickness is one of our features and we obtain the same using only a subset of the distance transform values of the concerned region. Estimation of the uniformity in stroke thickness on the basis of sparse sampling of the distance transform values is a novel approach. Another feature is the distance between the foreground and background colors computed in a perceptually uniform and illumination-invariant color space. Remaining features include two ratios of anti-parallel edge gradient orientations, a regularity measure between the skeletal representation and Canny edgemap of the object, average edge gradient magnitude, variation in the foreground gray levels and five others. Here, we present the results of the proposed approach on the ICDAR 2003 database and another database of scene images consisting of text of Indian scripts. |
format | Conference Proceeding |
fullrecord | <record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_6460130</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>6460130</ieee_id><sourcerecordid>6460130</sourcerecordid><originalsourceid>FETCH-LOGICAL-i175t-fea4b5ada4a808084a9b0c569593e92db5780f3a854e26c92cc7982c48cfc69a3</originalsourceid><addsrcrecordid>eNotjMtKxDAUQOMLrON8gZv8QCGPmxe4kUEdoaKgrofb9FaiTjo0EfTvlVHO4iwOnAO2DM5DCMICSBEOWaO8lq0DZ472TYJ1Wilp4Zg1UhjZgjXylJ2V8iaEEtr4hl0-RcrEK31VPlClWNOU-WdJ-ZWXHc6FeKnz9E485XGat7jvmAd-3z2es5MRPwot_71gLzfXz6t12z3c3q2uujZJZ2o7EkJvcEBAL34BDL2IxgYTNAU19MZ5MWr0BkjZGFSMLngVwccx2oB6wS7-vomINrs5bXH-3liwQmqhfwDd30dl</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Scene text detection using sparse stroke information and MLP</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Chowdhury, A. R. ; Bhattacharya, U. ; Parui, S. K.</creator><creatorcontrib>Chowdhury, A. R. ; Bhattacharya, U. ; Parui, S. K.</creatorcontrib><description>In this article, we present a novel set of features for detection of text in images of natural scenes using a multi-layer perceptron (MLP) classifier. An estimate of the uniformity in stroke thickness is one of our features and we obtain the same using only a subset of the distance transform values of the concerned region. Estimation of the uniformity in stroke thickness on the basis of sparse sampling of the distance transform values is a novel approach. Another feature is the distance between the foreground and background colors computed in a perceptually uniform and illumination-invariant color space. Remaining features include two ratios of anti-parallel edge gradient orientations, a regularity measure between the skeletal representation and Canny edgemap of the object, average edge gradient magnitude, variation in the foreground gray levels and five others. Here, we present the results of the proposed approach on the ICDAR 2003 database and another database of scene images consisting of text of Indian scripts.</description><identifier>ISSN: 1051-4651</identifier><identifier>ISBN: 9781467322164</identifier><identifier>ISBN: 1467322164</identifier><identifier>EISSN: 2831-7475</identifier><identifier>EISBN: 9784990644109</identifier><identifier>EISBN: 4990644107</identifier><language>eng</language><publisher>IEEE</publisher><subject>Feature extraction ; Image color analysis ; Image edge detection ; Pattern recognition ; Robustness ; Transforms</subject><ispartof>Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012), 2012, p.294-297</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/6460130$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>310,311,781,785,790,791,2059,54925</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/6460130$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Chowdhury, A. R.</creatorcontrib><creatorcontrib>Bhattacharya, U.</creatorcontrib><creatorcontrib>Parui, S. K.</creatorcontrib><title>Scene text detection using sparse stroke information and MLP</title><title>Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012)</title><addtitle>ICPR</addtitle><description>In this article, we present a novel set of features for detection of text in images of natural scenes using a multi-layer perceptron (MLP) classifier. An estimate of the uniformity in stroke thickness is one of our features and we obtain the same using only a subset of the distance transform values of the concerned region. Estimation of the uniformity in stroke thickness on the basis of sparse sampling of the distance transform values is a novel approach. Another feature is the distance between the foreground and background colors computed in a perceptually uniform and illumination-invariant color space. Remaining features include two ratios of anti-parallel edge gradient orientations, a regularity measure between the skeletal representation and Canny edgemap of the object, average edge gradient magnitude, variation in the foreground gray levels and five others. Here, we present the results of the proposed approach on the ICDAR 2003 database and another database of scene images consisting of text of Indian scripts.</description><subject>Feature extraction</subject><subject>Image color analysis</subject><subject>Image edge detection</subject><subject>Pattern recognition</subject><subject>Robustness</subject><subject>Transforms</subject><issn>1051-4651</issn><issn>2831-7475</issn><isbn>9781467322164</isbn><isbn>1467322164</isbn><isbn>9784990644109</isbn><isbn>4990644107</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2012</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><sourceid>RIE</sourceid><recordid>eNotjMtKxDAUQOMLrON8gZv8QCGPmxe4kUEdoaKgrofb9FaiTjo0EfTvlVHO4iwOnAO2DM5DCMICSBEOWaO8lq0DZ472TYJ1Wilp4Zg1UhjZgjXylJ2V8iaEEtr4hl0-RcrEK31VPlClWNOU-WdJ-ZWXHc6FeKnz9E485XGat7jvmAd-3z2es5MRPwot_71gLzfXz6t12z3c3q2uujZJZ2o7EkJvcEBAL34BDL2IxgYTNAU19MZ5MWr0BkjZGFSMLngVwccx2oB6wS7-vomINrs5bXH-3liwQmqhfwDd30dl</recordid><startdate>201211</startdate><enddate>201211</enddate><creator>Chowdhury, A. R.</creator><creator>Bhattacharya, U.</creator><creator>Parui, S. K.</creator><general>IEEE</general><scope>6IE</scope><scope>6IL</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIL</scope></search><sort><creationdate>201211</creationdate><title>Scene text detection using sparse stroke information and MLP</title><author>Chowdhury, A. R. ; Bhattacharya, U. ; Parui, S. K.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i175t-fea4b5ada4a808084a9b0c569593e92db5780f3a854e26c92cc7982c48cfc69a3</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2012</creationdate><topic>Feature extraction</topic><topic>Image color analysis</topic><topic>Image edge detection</topic><topic>Pattern recognition</topic><topic>Robustness</topic><topic>Transforms</topic><toplevel>online_resources</toplevel><creatorcontrib>Chowdhury, A. R.</creatorcontrib><creatorcontrib>Bhattacharya, U.</creatorcontrib><creatorcontrib>Parui, S. K.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP All) 1998-Present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Chowdhury, A. R.</au><au>Bhattacharya, U.</au><au>Parui, S. K.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Scene text detection using sparse stroke information and MLP</atitle><btitle>Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012)</btitle><stitle>ICPR</stitle><date>2012-11</date><risdate>2012</risdate><spage>294</spage><epage>297</epage><pages>294-297</pages><issn>1051-4651</issn><eissn>2831-7475</eissn><isbn>9781467322164</isbn><isbn>1467322164</isbn><eisbn>9784990644109</eisbn><eisbn>4990644107</eisbn><abstract>In this article, we present a novel set of features for detection of text in images of natural scenes using a multi-layer perceptron (MLP) classifier. An estimate of the uniformity in stroke thickness is one of our features and we obtain the same using only a subset of the distance transform values of the concerned region. Estimation of the uniformity in stroke thickness on the basis of sparse sampling of the distance transform values is a novel approach. Another feature is the distance between the foreground and background colors computed in a perceptually uniform and illumination-invariant color space. Remaining features include two ratios of anti-parallel edge gradient orientations, a regularity measure between the skeletal representation and Canny edgemap of the object, average edge gradient magnitude, variation in the foreground gray levels and five others. Here, we present the results of the proposed approach on the ICDAR 2003 database and another database of scene images consisting of text of Indian scripts.</abstract><pub>IEEE</pub><tpages>4</tpages></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | ISSN: 1051-4651 |
ispartof | Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012), 2012, p.294-297 |
issn | 1051-4651 2831-7475 |
language | eng |
recordid | cdi_ieee_primary_6460130 |
source | IEEE Electronic Library (IEL) Conference Proceedings |
subjects | Feature extraction Image color analysis Image edge detection Pattern recognition Robustness Transforms |
title | Scene text detection using sparse stroke information and MLP |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-13T05%3A40%3A59IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Scene%20text%20detection%20using%20sparse%20stroke%20information%20and%20MLP&rft.btitle=Proceedings%20of%20the%2021st%20International%20Conference%20on%20Pattern%20Recognition%20(ICPR2012)&rft.au=Chowdhury,%20A.%20R.&rft.date=2012-11&rft.spage=294&rft.epage=297&rft.pages=294-297&rft.issn=1051-4651&rft.eissn=2831-7475&rft.isbn=9781467322164&rft.isbn_list=1467322164&rft_id=info:doi/&rft_dat=%3Cieee_6IE%3E6460130%3C/ieee_6IE%3E%3Curl%3E%3C/url%3E&rft.eisbn=9784990644109&rft.eisbn_list=4990644107&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=6460130&rfr_iscdi=true |