A Cascaded R-CNN With Multiscale Attention and Imbalanced Samples for Traffic Sign Detection

In recent years, deep learning has been applied to traffic sign detection and has achieved excellent performance. However, two main challenges in traffic sign detection urgently need to be solved. First, some small traffic signs are more difficult to detect than...

Full description

Bibliographic details
Published in: IEEE access 2020, Vol. 8, p. 29742-29754
Main authors: Zhang, Jianming; Xie, Zhipeng; Sun, Juan; Zou, Xin; Wang, Jin
Format: Article
Language: English
Subjects:
Online access: Full text
container_end_page 29754
container_issue
container_start_page 29742
container_title IEEE access
container_volume 8
creator Zhang, Jianming
Xie, Zhipeng
Sun, Juan
Zou, Xin
Wang, Jin
description In recent years, deep learning has been applied to traffic sign detection and has achieved excellent performance. However, two main challenges in traffic sign detection urgently need to be solved. First, small traffic signs are more difficult to detect than large ones, so small signs often go undetected. Second, false signs are often detected because of interference from illumination variation, bad weather, and signs that resemble true traffic signs. To address these missed and false detections, we first propose a cascaded R-CNN to obtain multiscale features in pyramids. Each layer of the cascaded network except the first fuses the output bounding box of the previous layer for joint training, which benefits traffic sign detection. Then, we propose a multiscale attention method that obtains weighted multiscale features by dot-product and softmax; these are summed to refine the features, highlighting the traffic sign features and improving the accuracy of traffic sign detection. Finally, we increase the number of difficult negative samples for dataset balance and apply data augmentation during training to reduce the interference from complex environments and similar false traffic signs. The data augmentation method expands the German traffic sign training dataset by simulating complex environmental changes. We conduct numerous experiments to verify the effectiveness of the proposed algorithm. The accuracy and recall of our method are 98.7% and 90.5% on GTSDB, 99.7% and 83.62% on CCTSDB, and 98.9% and 85.6% on the LISA dataset, respectively.
doi_str_mv 10.1109/ACCESS.2020.2972338
format Article
fullrecord <record><control><sourceid>proquest_doaj_</sourceid><recordid>TN_cdi_proquest_journals_2454824331</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>8986614</ieee_id><doaj_id>oai_doaj_org_article_2df7b4af81cc4a83af5d44358eef4187</doaj_id><sourcerecordid>2454824331</sourcerecordid><originalsourceid>FETCH-LOGICAL-c408t-ce924205cf992625dcd3db8eb671e99d2f99cd80605dafda7d720dd22041cd603</originalsourceid><addsrcrecordid>eNpNkU9r3DAQxU1poSHNJ8hF0LO3-mdbOi5u2iykKXRTeimIWc0o1eK1t7L2kG9fbR1C5qLhx3tvBK-qrgVfCcHtp3Xf32y3K8klX0nbSaXMm-pCitbWqlHt21f7--pqnve8jCmo6S6q32vWw-wBCdmPur-_Z79i_sO-nYYcCx6IrXOmMcdpZDAi2xx2MMDoi3wLh-NAMwtTYg8JQoiebePjyD5TJn92fKjeBRhmunp-L6ufX24e-tv67vvXTb--q73mJteerNSSNz5YK1vZoEeFO0O7thNkLcrCPRre8gYhIHTYSY4oJdfCY8vVZbVZcnGCvTumeID05CaI7j-Y0qODlKMfyEkM3U5DMMJ7DUZBaFBr1RiioIXpStbHJeuYpr8nmrPbT6c0lu87qRttpFZKFJVaVD5N85wovFwV3J1bcUsr7tyKe26luK4XVySiF4expm2FVv8AC4-Hhg</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2454824331</pqid></control><display><type>article</type><title>A Cascaded R-CNN With Multiscale Attention and Imbalanced Samples for Traffic Sign Detection</title><source>IEEE Open Access Journals</source><source>DOAJ Directory of Open Access Journals</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><creator>Zhang, Jianming ; Xie, Zhipeng ; Sun, Juan ; Zou, Xin ; Wang, Jin</creator><creatorcontrib>Zhang, Jianming ; Xie, Zhipeng ; Sun, Juan ; Zou, Xin ; Wang, Jin</creatorcontrib><description>In recent years, the deep learning is applied to the field of traffic sign detection methods which achieves excellent performance. However, there are two main challenges in traffic sign detection to be solve urgently. For one thing, some traffic signs of small size are more difficult to detect than those of large size so that the small traffic signs are undetected. 
For another, some false signs are always detected because of interferences caused by the illumination variation, bad weather and some signs similar to the true traffic signs. Therefore, to solve the undetection and false detection, we first propose a cascaded R-CNN to obtain the multiscale features in pyramids. Each layer of the cascaded network except the first layer fuses the output bounding box of the previous one layer for joint training. This method contributes to the traffic sign detection. Then, we propose a multiscale attention method to obtain the weighted multiscale features by dot-product and softmax, which is summed to fine the features to highlight the traffic sign features and improve the accuracy of the traffic sign detection. Finally, we increase the number of difficult negative samples for dataset balance and data augmentation in the training to relieve the interference by complex environment and similar false traffic signs. The data augment method expands the German traffic sign training dataset by simulation of complex environment changes. We conduct numerous experiments to verify the effectiveness of our proposed algorithm. 
The accuracy and recall rate of our method are 98.7% and 90.5% in GTSDB, 99.7% and 83.62% in CCTSDB and 98.9% and 85.6% in Lisa dataset respectively.</description><identifier>ISSN: 2169-3536</identifier><identifier>EISSN: 2169-3536</identifier><identifier>DOI: 10.1109/ACCESS.2020.2972338</identifier><identifier>CODEN: IAECCG</identifier><language>eng</language><publisher>Piscataway: IEEE</publisher><subject>Algorithms ; attention ; convolutional neural network ; Datasets ; Deep learning ; Detection algorithms ; Feature extraction ; Image color analysis ; Machine learning ; Multiscale ; Object detection ; Pyramids ; Shape ; Signs ; Street signs ; Traffic control ; Traffic sign detection ; Traffic signs ; Training ; Weather</subject><ispartof>IEEE access, 2020, Vol.8, p.29742-29754</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2020</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c408t-ce924205cf992625dcd3db8eb671e99d2f99cd80605dafda7d720dd22041cd603</citedby><cites>FETCH-LOGICAL-c408t-ce924205cf992625dcd3db8eb671e99d2f99cd80605dafda7d720dd22041cd603</cites><orcidid>0000-0002-4278-0805 ; 0000-0001-5473-8738</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/8986614$$EHTML$$P50$$Gieee$$Hfree_for_read</linktohtml><link.rule.ids>314,780,784,864,2100,4022,27632,27922,27923,27924,54932</link.rule.ids></links><search><creatorcontrib>Zhang, Jianming</creatorcontrib><creatorcontrib>Xie, Zhipeng</creatorcontrib><creatorcontrib>Sun, Juan</creatorcontrib><creatorcontrib>Zou, Xin</creatorcontrib><creatorcontrib>Wang, Jin</creatorcontrib><title>A Cascaded R-CNN With Multiscale Attention and Imbalanced Samples for Traffic Sign Detection</title><title>IEEE 
access</title><addtitle>Access</addtitle><description>In recent years, the deep learning is applied to the field of traffic sign detection methods which achieves excellent performance. However, there are two main challenges in traffic sign detection to be solve urgently. For one thing, some traffic signs of small size are more difficult to detect than those of large size so that the small traffic signs are undetected. For another, some false signs are always detected because of interferences caused by the illumination variation, bad weather and some signs similar to the true traffic signs. Therefore, to solve the undetection and false detection, we first propose a cascaded R-CNN to obtain the multiscale features in pyramids. Each layer of the cascaded network except the first layer fuses the output bounding box of the previous one layer for joint training. This method contributes to the traffic sign detection. Then, we propose a multiscale attention method to obtain the weighted multiscale features by dot-product and softmax, which is summed to fine the features to highlight the traffic sign features and improve the accuracy of the traffic sign detection. Finally, we increase the number of difficult negative samples for dataset balance and data augmentation in the training to relieve the interference by complex environment and similar false traffic signs. The data augment method expands the German traffic sign training dataset by simulation of complex environment changes. We conduct numerous experiments to verify the effectiveness of our proposed algorithm. 
The accuracy and recall rate of our method are 98.7% and 90.5% in GTSDB, 99.7% and 83.62% in CCTSDB and 98.9% and 85.6% in Lisa dataset respectively.</description><subject>Algorithms</subject><subject>attention</subject><subject>convolutional neural network</subject><subject>Datasets</subject><subject>Deep learning</subject><subject>Detection algorithms</subject><subject>Feature extraction</subject><subject>Image color analysis</subject><subject>Machine learning</subject><subject>Multiscale</subject><subject>Object detection</subject><subject>Pyramids</subject><subject>Shape</subject><subject>Signs</subject><subject>Street signs</subject><subject>Traffic control</subject><subject>Traffic sign detection</subject><subject>Traffic signs</subject><subject>Training</subject><subject>Weather</subject><issn>2169-3536</issn><issn>2169-3536</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>ESBDL</sourceid><sourceid>RIE</sourceid><sourceid>DOA</sourceid><recordid>eNpNkU9r3DAQxU1poSHNJ8hF0LO3-mdbOi5u2iykKXRTeimIWc0o1eK1t7L2kG9fbR1C5qLhx3tvBK-qrgVfCcHtp3Xf32y3K8klX0nbSaXMm-pCitbWqlHt21f7--pqnve8jCmo6S6q32vWw-wBCdmPur-_Z79i_sO-nYYcCx6IrXOmMcdpZDAi2xx2MMDoi3wLh-NAMwtTYg8JQoiebePjyD5TJn92fKjeBRhmunp-L6ufX24e-tv67vvXTb--q73mJteerNSSNz5YK1vZoEeFO0O7thNkLcrCPRre8gYhIHTYSY4oJdfCY8vVZbVZcnGCvTumeID05CaI7j-Y0qODlKMfyEkM3U5DMMJ7DUZBaFBr1RiioIXpStbHJeuYpr8nmrPbT6c0lu87qRttpFZKFJVaVD5N85wovFwV3J1bcUsr7tyKe26luK4XVySiF4expm2FVv8AC4-Hhg</recordid><startdate>2020</startdate><enddate>2020</enddate><creator>Zhang, Jianming</creator><creator>Xie, Zhipeng</creator><creator>Sun, Juan</creator><creator>Zou, Xin</creator><creator>Wang, Jin</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. 
(IEEE)</general><scope>97E</scope><scope>ESBDL</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>7SR</scope><scope>8BQ</scope><scope>8FD</scope><scope>JG9</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0002-4278-0805</orcidid><orcidid>https://orcid.org/0000-0001-5473-8738</orcidid></search><sort><creationdate>2020</creationdate><title>A Cascaded R-CNN With Multiscale Attention and Imbalanced Samples for Traffic Sign Detection</title><author>Zhang, Jianming ; Xie, Zhipeng ; Sun, Juan ; Zou, Xin ; Wang, Jin</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c408t-ce924205cf992625dcd3db8eb671e99d2f99cd80605dafda7d720dd22041cd603</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Algorithms</topic><topic>attention</topic><topic>convolutional neural network</topic><topic>Datasets</topic><topic>Deep learning</topic><topic>Detection algorithms</topic><topic>Feature extraction</topic><topic>Image color analysis</topic><topic>Machine learning</topic><topic>Multiscale</topic><topic>Object detection</topic><topic>Pyramids</topic><topic>Shape</topic><topic>Signs</topic><topic>Street signs</topic><topic>Traffic control</topic><topic>Traffic sign detection</topic><topic>Traffic signs</topic><topic>Training</topic><topic>Weather</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Zhang, Jianming</creatorcontrib><creatorcontrib>Xie, Zhipeng</creatorcontrib><creatorcontrib>Sun, Juan</creatorcontrib><creatorcontrib>Zou, Xin</creatorcontrib><creatorcontrib>Wang, Jin</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE Open Access Journals</collection><collection>IEEE All-Society Periodicals Package (ASPP) 
1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Engineered Materials Abstracts</collection><collection>METADEX</collection><collection>Technology Research Database</collection><collection>Materials Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>IEEE access</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Zhang, Jianming</au><au>Xie, Zhipeng</au><au>Sun, Juan</au><au>Zou, Xin</au><au>Wang, Jin</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A Cascaded R-CNN With Multiscale Attention and Imbalanced Samples for Traffic Sign Detection</atitle><jtitle>IEEE access</jtitle><stitle>Access</stitle><date>2020</date><risdate>2020</risdate><volume>8</volume><spage>29742</spage><epage>29754</epage><pages>29742-29754</pages><issn>2169-3536</issn><eissn>2169-3536</eissn><coden>IAECCG</coden><abstract>In recent years, the deep learning is applied to the field of traffic sign detection methods which achieves excellent performance. However, there are two main challenges in traffic sign detection to be solve urgently. For one thing, some traffic signs of small size are more difficult to detect than those of large size so that the small traffic signs are undetected. 
For another, some false signs are always detected because of interferences caused by the illumination variation, bad weather and some signs similar to the true traffic signs. Therefore, to solve the undetection and false detection, we first propose a cascaded R-CNN to obtain the multiscale features in pyramids. Each layer of the cascaded network except the first layer fuses the output bounding box of the previous one layer for joint training. This method contributes to the traffic sign detection. Then, we propose a multiscale attention method to obtain the weighted multiscale features by dot-product and softmax, which is summed to fine the features to highlight the traffic sign features and improve the accuracy of the traffic sign detection. Finally, we increase the number of difficult negative samples for dataset balance and data augmentation in the training to relieve the interference by complex environment and similar false traffic signs. The data augment method expands the German traffic sign training dataset by simulation of complex environment changes. We conduct numerous experiments to verify the effectiveness of our proposed algorithm. The accuracy and recall rate of our method are 98.7% and 90.5% in GTSDB, 99.7% and 83.62% in CCTSDB and 98.9% and 85.6% in Lisa dataset respectively.</abstract><cop>Piscataway</cop><pub>IEEE</pub><doi>10.1109/ACCESS.2020.2972338</doi><tpages>13</tpages><orcidid>https://orcid.org/0000-0002-4278-0805</orcidid><orcidid>https://orcid.org/0000-0001-5473-8738</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2169-3536
ispartof IEEE access, 2020, Vol.8, p.29742-29754
issn 2169-3536
2169-3536
language eng
recordid cdi_proquest_journals_2454824331
source IEEE Open Access Journals; DOAJ Directory of Open Access Journals; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals
subjects Algorithms
attention
convolutional neural network
Datasets
Deep learning
Detection algorithms
Feature extraction
Image color analysis
Machine learning
Multiscale
Object detection
Pyramids
Shape
Signs
Street signs
Traffic control
Traffic sign detection
Traffic signs
Training
Weather
title A Cascaded R-CNN With Multiscale Attention and Imbalanced Samples for Traffic Sign Detection
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-12T14%3A12%3A33IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_doaj_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20Cascaded%20R-CNN%20With%20Multiscale%20Attention%20and%20Imbalanced%20Samples%20for%20Traffic%20Sign%20Detection&rft.jtitle=IEEE%20access&rft.au=Zhang,%20Jianming&rft.date=2020&rft.volume=8&rft.spage=29742&rft.epage=29754&rft.pages=29742-29754&rft.issn=2169-3536&rft.eissn=2169-3536&rft.coden=IAECCG&rft_id=info:doi/10.1109/ACCESS.2020.2972338&rft_dat=%3Cproquest_doaj_%3E2454824331%3C/proquest_doaj_%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2454824331&rft_id=info:pmid/&rft_ieee_id=8986614&rft_doaj_id=oai_doaj_org_article_2df7b4af81cc4a83af5d44358eef4187&rfr_iscdi=true