Arbitrary-Oriented Scene Text Detection via Rotation Proposals

This paper introduces a novel rotation-based framework for arbitrary-oriented text detection in natural scene images. We present the Rotation Region Proposal Networks (RRPN), which are designed to generate inclined proposals with text orientation angle information. The angle information is then adap...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:arXiv.org 2018-03
Hauptverfasser: Ma, Jianqi, Shao, Weiyuan, Ye, Hao, Wang, Li, Wang, Hong, Zheng, Yingbin, Xue, Xiangyang
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title arXiv.org
container_volume
creator Ma, Jianqi
Shao, Weiyuan
Ye, Hao
Wang, Li
Wang, Hong
Zheng, Yingbin
Xue, Xiangyang
description This paper introduces a novel rotation-based framework for arbitrary-oriented text detection in natural scene images. We present the Rotation Region Proposal Networks (RRPN), which are designed to generate inclined proposals with text orientation angle information. The angle information is then adapted for bounding box regression to make the proposals more accurately fit into the text region in terms of the orientation. The Rotation Region-of-Interest (RRoI) pooling layer is proposed to project arbitrary-oriented proposals to a feature map for a text region classifier. The whole framework is built upon a region-proposal-based architecture, which ensures the computational efficiency of the arbitrary-oriented text detection compared with previous text detection systems. We conduct experiments using the rotation-based framework on three real-world scene text detection datasets and demonstrate its superiority in terms of effectiveness and efficiency over previous approaches.
doi_str_mv 10.48550/arxiv.1703.01086
format Article
fullrecord <record><control><sourceid>proquest_arxiv</sourceid><recordid>TN_cdi_arxiv_primary_1703_01086</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2071969574</sourcerecordid><originalsourceid>FETCH-LOGICAL-a524-27be231283d6b7d1a3f6bd53ef25781c3b43519718f499773cee3e85165f269d3</originalsourceid><addsrcrecordid>eNotj11LwzAYhYMgOOZ-gFcWvG5N8ubzRhhTpzCYaO9L2r6FDG1qmo35762bVw8HDofzEHLDaCGMlPTexaM_FExTKCijRl2QGQdguRGcX5HFOO4opVxpLiXMyMMy1j5FF3_ybfTYJ2yzjwZ7zEo8puwREzbJhz47eJe9h-RO4S2GIYzuc7wml90EXPxzTsrnp3L1km-269fVcpM7yUXOdY0cGDfQqlq3zEGn6lYCdlxqwxqoBUhmNTOdsFZraBABjWRKdlzZFubk9jx7kquG6L-mx9WfZHWSnBp358YQw_cex1Ttwj7206eKU82sslIL-AXnnFNZ</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2071969574</pqid></control><display><type>article</type><title>Arbitrary-Oriented Scene Text Detection via Rotation Proposals</title><source>arXiv.org</source><source>Free E- Journals</source><creator>Ma, Jianqi ; Shao, Weiyuan ; Ye, Hao ; Wang, Li ; Wang, Hong ; Zheng, Yingbin ; Xue, Xiangyang</creator><creatorcontrib>Ma, Jianqi ; Shao, Weiyuan ; Ye, Hao ; Wang, Li ; Wang, Hong ; Zheng, Yingbin ; Xue, Xiangyang</creatorcontrib><description>This paper introduces a novel rotation-based framework for arbitrary-oriented text detection in natural scene images. We present the Rotation Region Proposal Networks (RRPN), which are designed to generate inclined proposals with text orientation angle information. The angle information is then adapted for bounding box regression to make the proposals more accurately fit into the text region in terms of the orientation. The Rotation Region-of-Interest (RRoI) pooling layer is proposed to project arbitrary-oriented proposals to a feature map for a text region classifier. The whole framework is built upon a region-proposal-based architecture, which ensures the computational efficiency of the arbitrary-oriented text detection compared with previous text detection systems. We conduct experiments using the rotation-based framework on three real-world scene text detection datasets and demonstrate its superiority in terms of effectiveness and efficiency over previous approaches.</description><identifier>EISSN: 2331-8422</identifier><identifier>DOI: 10.48550/arxiv.1703.01086</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Computer Science - Computer Vision and Pattern Recognition ; Computing time ; Feature maps ; Image detection ; Proposals ; Rotation</subject><ispartof>arXiv.org, 2018-03</ispartof><rights>2018. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,778,782,883,27908</link.rule.ids><backlink>$$Uhttps://doi.org/10.1109/TMM.2018.2818020$$DView published paper (Access to full text may be restricted)$$Hfree_for_read</backlink><backlink>$$Uhttps://doi.org/10.48550/arXiv.1703.01086$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Ma, Jianqi</creatorcontrib><creatorcontrib>Shao, Weiyuan</creatorcontrib><creatorcontrib>Ye, Hao</creatorcontrib><creatorcontrib>Wang, Li</creatorcontrib><creatorcontrib>Wang, Hong</creatorcontrib><creatorcontrib>Zheng, Yingbin</creatorcontrib><creatorcontrib>Xue, Xiangyang</creatorcontrib><title>Arbitrary-Oriented Scene Text Detection via Rotation Proposals</title><title>arXiv.org</title><description>This paper introduces a novel rotation-based framework for arbitrary-oriented text detection in natural scene images. We present the Rotation Region Proposal Networks (RRPN), which are designed to generate inclined proposals with text orientation angle information. The angle information is then adapted for bounding box regression to make the proposals more accurately fit into the text region in terms of the orientation. The Rotation Region-of-Interest (RRoI) pooling layer is proposed to project arbitrary-oriented proposals to a feature map for a text region classifier. The whole framework is built upon a region-proposal-based architecture, which ensures the computational efficiency of the arbitrary-oriented text detection compared with previous text detection systems. We conduct experiments using the rotation-based framework on three real-world scene text detection datasets and demonstrate its superiority in terms of effectiveness and efficiency over previous approaches.</description><subject>Computer Science - Computer Vision and Pattern Recognition</subject><subject>Computing time</subject><subject>Feature maps</subject><subject>Image detection</subject><subject>Proposals</subject><subject>Rotation</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GOX</sourceid><recordid>eNotj11LwzAYhYMgOOZ-gFcWvG5N8ubzRhhTpzCYaO9L2r6FDG1qmo35762bVw8HDofzEHLDaCGMlPTexaM_FExTKCijRl2QGQdguRGcX5HFOO4opVxpLiXMyMMy1j5FF3_ybfTYJ2yzjwZ7zEo8puwREzbJhz47eJe9h-RO4S2GIYzuc7wml90EXPxzTsrnp3L1km-269fVcpM7yUXOdY0cGDfQqlq3zEGn6lYCdlxqwxqoBUhmNTOdsFZraBABjWRKdlzZFubk9jx7kquG6L-mx9WfZHWSnBp358YQw_cex1Ttwj7206eKU82sslIL-AXnnFNZ</recordid><startdate>20180315</startdate><enddate>20180315</enddate><creator>Ma, Jianqi</creator><creator>Shao, Weiyuan</creator><creator>Ye, Hao</creator><creator>Wang, Li</creator><creator>Wang, Hong</creator><creator>Zheng, Yingbin</creator><creator>Xue, Xiangyang</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20180315</creationdate><title>Arbitrary-Oriented Scene Text Detection via Rotation Proposals</title><author>Ma, Jianqi ; Shao, Weiyuan ; Ye, Hao ; Wang, Li ; Wang, Hong ; Zheng, Yingbin ; Xue, Xiangyang</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a524-27be231283d6b7d1a3f6bd53ef25781c3b43519718f499773cee3e85165f269d3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>Computer Science - Computer Vision and Pattern Recognition</topic><topic>Computing time</topic><topic>Feature maps</topic><topic>Image detection</topic><topic>Proposals</topic><topic>Rotation</topic><toplevel>online_resources</toplevel><creatorcontrib>Ma, Jianqi</creatorcontrib><creatorcontrib>Shao, Weiyuan</creatorcontrib><creatorcontrib>Ye, Hao</creatorcontrib><creatorcontrib>Wang, Li</creatorcontrib><creatorcontrib>Wang, Hong</creatorcontrib><creatorcontrib>Zheng, Yingbin</creatorcontrib><creatorcontrib>Xue, Xiangyang</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection><collection>arXiv Computer Science</collection><collection>arXiv.org</collection><jtitle>arXiv.org</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Ma, Jianqi</au><au>Shao, Weiyuan</au><au>Ye, Hao</au><au>Wang, Li</au><au>Wang, Hong</au><au>Zheng, Yingbin</au><au>Xue, Xiangyang</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Arbitrary-Oriented Scene Text Detection via Rotation Proposals</atitle><jtitle>arXiv.org</jtitle><date>2018-03-15</date><risdate>2018</risdate><eissn>2331-8422</eissn><abstract>This paper introduces a novel rotation-based framework for arbitrary-oriented text detection in natural scene images. We present the Rotation Region Proposal Networks (RRPN), which are designed to generate inclined proposals with text orientation angle information. The angle information is then adapted for bounding box regression to make the proposals more accurately fit into the text region in terms of the orientation. The Rotation Region-of-Interest (RRoI) pooling layer is proposed to project arbitrary-oriented proposals to a feature map for a text region classifier. The whole framework is built upon a region-proposal-based architecture, which ensures the computational efficiency of the arbitrary-oriented text detection compared with previous text detection systems. We conduct experiments using the rotation-based framework on three real-world scene text detection datasets and demonstrate its superiority in terms of effectiveness and efficiency over previous approaches.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><doi>10.48550/arxiv.1703.01086</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier EISSN: 2331-8422
ispartof arXiv.org, 2018-03
issn 2331-8422
language eng
recordid cdi_arxiv_primary_1703_01086
source arXiv.org; Free E- Journals
subjects Computer Science - Computer Vision and Pattern Recognition
Computing time
Feature maps
Image detection
Proposals
Rotation
title Arbitrary-Oriented Scene Text Detection via Rotation Proposals
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-16T20%3A48%3A24IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_arxiv&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Arbitrary-Oriented%20Scene%20Text%20Detection%20via%20Rotation%20Proposals&rft.jtitle=arXiv.org&rft.au=Ma,%20Jianqi&rft.date=2018-03-15&rft.eissn=2331-8422&rft_id=info:doi/10.48550/arxiv.1703.01086&rft_dat=%3Cproquest_arxiv%3E2071969574%3C/proquest_arxiv%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2071969574&rft_id=info:pmid/&rfr_iscdi=true