FUSED-Net: Detecting Traffic Signs with Limited Data

Automatic Traffic Sign Recognition is paramount in modern transportation systems, motivating several research endeavors to focus on performance improvement by utilizing large-scale datasets. As the appearance of traffic signs varies across countries, curating large-scale datasets is often impractica...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Rahman, Md. Atiqur, Asad, Nahian Ibn, Omi, Md. Mushfiqul Haque, Hasan, Md. Bakhtiar, Ahmed, Sabbir, Kabir, Md. Hasanul
Format:	Artikel
Sprache:	eng
Schlagworte:	Computer Science - Artificial Intelligence Computer Science - Computer Vision and Pattern Recognition
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	Rahman, Md. Atiqur Asad, Nahian Ibn Omi, Md. Mushfiqul Haque Hasan, Md. Bakhtiar Ahmed, Sabbir Kabir, Md. Hasanul
description	Automatic Traffic Sign Recognition is paramount in modern transportation systems, motivating several research endeavors to focus on performance improvement by utilizing large-scale datasets. As the appearance of traffic signs varies across countries, curating large-scale datasets is often impractical; and requires efficient models that can produce satisfactory performance using limited data. In this connection, we present 'FUSED-Net', built-upon Faster RCNN for traffic sign detection, enhanced by Unfrozen Parameters, Pseudo-Support Sets, Embedding Normalization, and Domain Adaptation while reducing data requirement. Unlike traditional approaches, we keep all parameters unfrozen during training, enabling FUSED-Net to learn from limited samples. The generation of a Pseudo-Support Set through data augmentation further enhances performance by compensating for the scarcity of target domain data. Additionally, Embedding Normalization is incorporated to reduce intra-class variance, standardizing feature representation. Domain Adaptation, achieved by pre-training on a diverse traffic sign dataset distinct from the target domain, improves model generalization. Evaluating FUSED-Net on the BDTSD dataset, we achieved 2.4x, 2.2x, 1.5x, and 1.3x improvements of mAP in 1-shot, 3-shot, 5-shot, and 10-shot scenarios, respectively compared to the state-of-the-art Few-Shot Object Detection (FSOD) models. Additionally, we outperform state-of-the-art works on the cross-domain FSOD benchmark under several scenarios.
doi_str_mv	10.48550/arxiv.2409.14852
format	Article
fullrecord	<record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2409_14852</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2409_14852</sourcerecordid><originalsourceid>FETCH-arxiv_primary_2409_148523</originalsourceid><addsrcrecordid>eNpjYJA0NNAzsTA1NdBPLKrILNMzMjGw1DMEihhxMpi4hQa7uuj6pZZYKbiklqQml2TmpSuEFCWmpWUmKwRnpucVK5RnlmQo-GTmZpakpii4JJYk8jCwpiXmFKfyQmluBnk31xBnD12w-fEFRZm5iUWV8SB74sH2GBNWAQCJgzDu</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>FUSED-Net: Detecting Traffic Signs with Limited Data</title><source>arXiv.org</source><creator>Rahman, Md. Atiqur ; Asad, Nahian Ibn ; Omi, Md. Mushfiqul Haque ; Hasan, Md. Bakhtiar ; Ahmed, Sabbir ; Kabir, Md. Hasanul</creator><creatorcontrib>Rahman, Md. Atiqur ; Asad, Nahian Ibn ; Omi, Md. Mushfiqul Haque ; Hasan, Md. Bakhtiar ; Ahmed, Sabbir ; Kabir, Md. Hasanul</creatorcontrib><description>Automatic Traffic Sign Recognition is paramount in modern transportation systems, motivating several research endeavors to focus on performance improvement by utilizing large-scale datasets. As the appearance of traffic signs varies across countries, curating large-scale datasets is often impractical; and requires efficient models that can produce satisfactory performance using limited data. In this connection, we present 'FUSED-Net', built-upon Faster RCNN for traffic sign detection, enhanced by Unfrozen Parameters, Pseudo-Support Sets, Embedding Normalization, and Domain Adaptation while reducing data requirement. Unlike traditional approaches, we keep all parameters unfrozen during training, enabling FUSED-Net to learn from limited samples. The generation of a Pseudo-Support Set through data augmentation further enhances performance by compensating for the scarcity of target domain data. Additionally, Embedding Normalization is incorporated to reduce intra-class variance, standardizing feature representation. Domain Adaptation, achieved by pre-training on a diverse traffic sign dataset distinct from the target domain, improves model generalization. Evaluating FUSED-Net on the BDTSD dataset, we achieved 2.4x, 2.2x, 1.5x, and 1.3x improvements of mAP in 1-shot, 3-shot, 5-shot, and 10-shot scenarios, respectively compared to the state-of-the-art Few-Shot Object Detection (FSOD) models. Additionally, we outperform state-of-the-art works on the cross-domain FSOD benchmark under several scenarios.</description><identifier>DOI: 10.48550/arxiv.2409.14852</identifier><language>eng</language><subject>Computer Science - Artificial Intelligence ; Computer Science - Computer Vision and Pattern Recognition</subject><creationdate>2024-09</creationdate><rights>http://creativecommons.org/licenses/by/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,777,882</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2409.14852$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2409.14852$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Rahman, Md. Atiqur</creatorcontrib><creatorcontrib>Asad, Nahian Ibn</creatorcontrib><creatorcontrib>Omi, Md. Mushfiqul Haque</creatorcontrib><creatorcontrib>Hasan, Md. Bakhtiar</creatorcontrib><creatorcontrib>Ahmed, Sabbir</creatorcontrib><creatorcontrib>Kabir, Md. Hasanul</creatorcontrib><title>FUSED-Net: Detecting Traffic Signs with Limited Data</title><description>Automatic Traffic Sign Recognition is paramount in modern transportation systems, motivating several research endeavors to focus on performance improvement by utilizing large-scale datasets. As the appearance of traffic signs varies across countries, curating large-scale datasets is often impractical; and requires efficient models that can produce satisfactory performance using limited data. In this connection, we present 'FUSED-Net', built-upon Faster RCNN for traffic sign detection, enhanced by Unfrozen Parameters, Pseudo-Support Sets, Embedding Normalization, and Domain Adaptation while reducing data requirement. Unlike traditional approaches, we keep all parameters unfrozen during training, enabling FUSED-Net to learn from limited samples. The generation of a Pseudo-Support Set through data augmentation further enhances performance by compensating for the scarcity of target domain data. Additionally, Embedding Normalization is incorporated to reduce intra-class variance, standardizing feature representation. Domain Adaptation, achieved by pre-training on a diverse traffic sign dataset distinct from the target domain, improves model generalization. Evaluating FUSED-Net on the BDTSD dataset, we achieved 2.4x, 2.2x, 1.5x, and 1.3x improvements of mAP in 1-shot, 3-shot, 5-shot, and 10-shot scenarios, respectively compared to the state-of-the-art Few-Shot Object Detection (FSOD) models. Additionally, we outperform state-of-the-art works on the cross-domain FSOD benchmark under several scenarios.</description><subject>Computer Science - Artificial Intelligence</subject><subject>Computer Science - Computer Vision and Pattern Recognition</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNpjYJA0NNAzsTA1NdBPLKrILNMzMjGw1DMEihhxMpi4hQa7uuj6pZZYKbiklqQml2TmpSuEFCWmpWUmKwRnpucVK5RnlmQo-GTmZpakpii4JJYk8jCwpiXmFKfyQmluBnk31xBnD12w-fEFRZm5iUWV8SB74sH2GBNWAQCJgzDu</recordid><startdate>20240923</startdate><enddate>20240923</enddate><creator>Rahman, Md. Atiqur</creator><creator>Asad, Nahian Ibn</creator><creator>Omi, Md. Mushfiqul Haque</creator><creator>Hasan, Md. Bakhtiar</creator><creator>Ahmed, Sabbir</creator><creator>Kabir, Md. Hasanul</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20240923</creationdate><title>FUSED-Net: Detecting Traffic Signs with Limited Data</title><author>Rahman, Md. Atiqur ; Asad, Nahian Ibn ; Omi, Md. Mushfiqul Haque ; Hasan, Md. Bakhtiar ; Ahmed, Sabbir ; Kabir, Md. Hasanul</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-arxiv_primary_2409_148523</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Computer Science - Artificial Intelligence</topic><topic>Computer Science - Computer Vision and Pattern Recognition</topic><toplevel>online_resources</toplevel><creatorcontrib>Rahman, Md. Atiqur</creatorcontrib><creatorcontrib>Asad, Nahian Ibn</creatorcontrib><creatorcontrib>Omi, Md. Mushfiqul Haque</creatorcontrib><creatorcontrib>Hasan, Md. Bakhtiar</creatorcontrib><creatorcontrib>Ahmed, Sabbir</creatorcontrib><creatorcontrib>Kabir, Md. Hasanul</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Rahman, Md. Atiqur</au><au>Asad, Nahian Ibn</au><au>Omi, Md. Mushfiqul Haque</au><au>Hasan, Md. Bakhtiar</au><au>Ahmed, Sabbir</au><au>Kabir, Md. Hasanul</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>FUSED-Net: Detecting Traffic Signs with Limited Data</atitle><date>2024-09-23</date><risdate>2024</risdate><abstract>Automatic Traffic Sign Recognition is paramount in modern transportation systems, motivating several research endeavors to focus on performance improvement by utilizing large-scale datasets. As the appearance of traffic signs varies across countries, curating large-scale datasets is often impractical; and requires efficient models that can produce satisfactory performance using limited data. In this connection, we present 'FUSED-Net', built-upon Faster RCNN for traffic sign detection, enhanced by Unfrozen Parameters, Pseudo-Support Sets, Embedding Normalization, and Domain Adaptation while reducing data requirement. Unlike traditional approaches, we keep all parameters unfrozen during training, enabling FUSED-Net to learn from limited samples. The generation of a Pseudo-Support Set through data augmentation further enhances performance by compensating for the scarcity of target domain data. Additionally, Embedding Normalization is incorporated to reduce intra-class variance, standardizing feature representation. Domain Adaptation, achieved by pre-training on a diverse traffic sign dataset distinct from the target domain, improves model generalization. Evaluating FUSED-Net on the BDTSD dataset, we achieved 2.4x, 2.2x, 1.5x, and 1.3x improvements of mAP in 1-shot, 3-shot, 5-shot, and 10-shot scenarios, respectively compared to the state-of-the-art Few-Shot Object Detection (FSOD) models. Additionally, we outperform state-of-the-art works on the cross-domain FSOD benchmark under several scenarios.</abstract><doi>10.48550/arxiv.2409.14852</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier	DOI: 10.48550/arxiv.2409.14852
ispartof
issn
language	eng
recordid	cdi_arxiv_primary_2409_14852
source	arXiv.org
subjects	Computer Science - Artificial Intelligence Computer Science - Computer Vision and Pattern Recognition
title	FUSED-Net: Detecting Traffic Signs with Limited Data
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-18T00%3A43%3A14IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=FUSED-Net:%20Detecting%20Traffic%20Signs%20with%20Limited%20Data&rft.au=Rahman,%20Md.%20Atiqur&rft.date=2024-09-23&rft_id=info:doi/10.48550/arxiv.2409.14852&rft_dat=%3Carxiv_GOX%3E2409_14852%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true