FUSED-Net: Detecting Traffic Signs with Limited Data

Automatic Traffic Sign Recognition is paramount in modern transportation systems, motivating several research endeavors to focus on performance improvement by utilizing large-scale datasets. As the appearance of traffic signs varies across countries, curating large-scale datasets is often impractica...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Rahman, Md. Atiqur, Asad, Nahian Ibn, Omi, Md. Mushfiqul Haque, Hasan, Md. Bakhtiar, Ahmed, Sabbir, Kabir, Md. Hasanul
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Rahman, Md. Atiqur
Asad, Nahian Ibn
Omi, Md. Mushfiqul Haque
Hasan, Md. Bakhtiar
Ahmed, Sabbir
Kabir, Md. Hasanul
description Automatic Traffic Sign Recognition is paramount in modern transportation systems, motivating several research endeavors to focus on performance improvement by utilizing large-scale datasets. As the appearance of traffic signs varies across countries, curating large-scale datasets is often impractical; and requires efficient models that can produce satisfactory performance using limited data. In this connection, we present 'FUSED-Net', built-upon Faster RCNN for traffic sign detection, enhanced by Unfrozen Parameters, Pseudo-Support Sets, Embedding Normalization, and Domain Adaptation while reducing data requirement. Unlike traditional approaches, we keep all parameters unfrozen during training, enabling FUSED-Net to learn from limited samples. The generation of a Pseudo-Support Set through data augmentation further enhances performance by compensating for the scarcity of target domain data. Additionally, Embedding Normalization is incorporated to reduce intra-class variance, standardizing feature representation. Domain Adaptation, achieved by pre-training on a diverse traffic sign dataset distinct from the target domain, improves model generalization. Evaluating FUSED-Net on the BDTSD dataset, we achieved 2.4x, 2.2x, 1.5x, and 1.3x improvements of mAP in 1-shot, 3-shot, 5-shot, and 10-shot scenarios, respectively compared to the state-of-the-art Few-Shot Object Detection (FSOD) models. Additionally, we outperform state-of-the-art works on the cross-domain FSOD benchmark under several scenarios.
doi_str_mv 10.48550/arxiv.2409.14852
format Article
fullrecord <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2409_14852</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2409_14852</sourcerecordid><originalsourceid>FETCH-arxiv_primary_2409_148523</originalsourceid><addsrcrecordid>eNpjYJA0NNAzsTA1NdBPLKrILNMzMjGw1DMEihhxMpi4hQa7uuj6pZZYKbiklqQml2TmpSuEFCWmpWUmKwRnpucVK5RnlmQo-GTmZpakpii4JJYk8jCwpiXmFKfyQmluBnk31xBnD12w-fEFRZm5iUWV8SB74sH2GBNWAQCJgzDu</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>FUSED-Net: Detecting Traffic Signs with Limited Data</title><source>arXiv.org</source><creator>Rahman, Md. Atiqur ; Asad, Nahian Ibn ; Omi, Md. Mushfiqul Haque ; Hasan, Md. Bakhtiar ; Ahmed, Sabbir ; Kabir, Md. Hasanul</creator><creatorcontrib>Rahman, Md. Atiqur ; Asad, Nahian Ibn ; Omi, Md. Mushfiqul Haque ; Hasan, Md. Bakhtiar ; Ahmed, Sabbir ; Kabir, Md. Hasanul</creatorcontrib><description>Automatic Traffic Sign Recognition is paramount in modern transportation systems, motivating several research endeavors to focus on performance improvement by utilizing large-scale datasets. As the appearance of traffic signs varies across countries, curating large-scale datasets is often impractical; and requires efficient models that can produce satisfactory performance using limited data. In this connection, we present 'FUSED-Net', built-upon Faster RCNN for traffic sign detection, enhanced by Unfrozen Parameters, Pseudo-Support Sets, Embedding Normalization, and Domain Adaptation while reducing data requirement. Unlike traditional approaches, we keep all parameters unfrozen during training, enabling FUSED-Net to learn from limited samples. The generation of a Pseudo-Support Set through data augmentation further enhances performance by compensating for the scarcity of target domain data. Additionally, Embedding Normalization is incorporated to reduce intra-class variance, standardizing feature representation. Domain Adaptation, achieved by pre-training on a diverse traffic sign dataset distinct from the target domain, improves model generalization. Evaluating FUSED-Net on the BDTSD dataset, we achieved 2.4x, 2.2x, 1.5x, and 1.3x improvements of mAP in 1-shot, 3-shot, 5-shot, and 10-shot scenarios, respectively compared to the state-of-the-art Few-Shot Object Detection (FSOD) models. Additionally, we outperform state-of-the-art works on the cross-domain FSOD benchmark under several scenarios.</description><identifier>DOI: 10.48550/arxiv.2409.14852</identifier><language>eng</language><subject>Computer Science - Artificial Intelligence ; Computer Science - Computer Vision and Pattern Recognition</subject><creationdate>2024-09</creationdate><rights>http://creativecommons.org/licenses/by/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,777,882</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2409.14852$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2409.14852$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Rahman, Md. Atiqur</creatorcontrib><creatorcontrib>Asad, Nahian Ibn</creatorcontrib><creatorcontrib>Omi, Md. Mushfiqul Haque</creatorcontrib><creatorcontrib>Hasan, Md. Bakhtiar</creatorcontrib><creatorcontrib>Ahmed, Sabbir</creatorcontrib><creatorcontrib>Kabir, Md. Hasanul</creatorcontrib><title>FUSED-Net: Detecting Traffic Signs with Limited Data</title><description>Automatic Traffic Sign Recognition is paramount in modern transportation systems, motivating several research endeavors to focus on performance improvement by utilizing large-scale datasets. As the appearance of traffic signs varies across countries, curating large-scale datasets is often impractical; and requires efficient models that can produce satisfactory performance using limited data. In this connection, we present 'FUSED-Net', built-upon Faster RCNN for traffic sign detection, enhanced by Unfrozen Parameters, Pseudo-Support Sets, Embedding Normalization, and Domain Adaptation while reducing data requirement. Unlike traditional approaches, we keep all parameters unfrozen during training, enabling FUSED-Net to learn from limited samples. The generation of a Pseudo-Support Set through data augmentation further enhances performance by compensating for the scarcity of target domain data. Additionally, Embedding Normalization is incorporated to reduce intra-class variance, standardizing feature representation. Domain Adaptation, achieved by pre-training on a diverse traffic sign dataset distinct from the target domain, improves model generalization. Evaluating FUSED-Net on the BDTSD dataset, we achieved 2.4x, 2.2x, 1.5x, and 1.3x improvements of mAP in 1-shot, 3-shot, 5-shot, and 10-shot scenarios, respectively compared to the state-of-the-art Few-Shot Object Detection (FSOD) models. Additionally, we outperform state-of-the-art works on the cross-domain FSOD benchmark under several scenarios.</description><subject>Computer Science - Artificial Intelligence</subject><subject>Computer Science - Computer Vision and Pattern Recognition</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNpjYJA0NNAzsTA1NdBPLKrILNMzMjGw1DMEihhxMpi4hQa7uuj6pZZYKbiklqQml2TmpSuEFCWmpWUmKwRnpucVK5RnlmQo-GTmZpakpii4JJYk8jCwpiXmFKfyQmluBnk31xBnD12w-fEFRZm5iUWV8SB74sH2GBNWAQCJgzDu</recordid><startdate>20240923</startdate><enddate>20240923</enddate><creator>Rahman, Md. Atiqur</creator><creator>Asad, Nahian Ibn</creator><creator>Omi, Md. Mushfiqul Haque</creator><creator>Hasan, Md. Bakhtiar</creator><creator>Ahmed, Sabbir</creator><creator>Kabir, Md. Hasanul</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20240923</creationdate><title>FUSED-Net: Detecting Traffic Signs with Limited Data</title><author>Rahman, Md. Atiqur ; Asad, Nahian Ibn ; Omi, Md. Mushfiqul Haque ; Hasan, Md. Bakhtiar ; Ahmed, Sabbir ; Kabir, Md. Hasanul</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-arxiv_primary_2409_148523</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Computer Science - Artificial Intelligence</topic><topic>Computer Science - Computer Vision and Pattern Recognition</topic><toplevel>online_resources</toplevel><creatorcontrib>Rahman, Md. Atiqur</creatorcontrib><creatorcontrib>Asad, Nahian Ibn</creatorcontrib><creatorcontrib>Omi, Md. Mushfiqul Haque</creatorcontrib><creatorcontrib>Hasan, Md. Bakhtiar</creatorcontrib><creatorcontrib>Ahmed, Sabbir</creatorcontrib><creatorcontrib>Kabir, Md. Hasanul</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Rahman, Md. Atiqur</au><au>Asad, Nahian Ibn</au><au>Omi, Md. Mushfiqul Haque</au><au>Hasan, Md. Bakhtiar</au><au>Ahmed, Sabbir</au><au>Kabir, Md. Hasanul</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>FUSED-Net: Detecting Traffic Signs with Limited Data</atitle><date>2024-09-23</date><risdate>2024</risdate><abstract>Automatic Traffic Sign Recognition is paramount in modern transportation systems, motivating several research endeavors to focus on performance improvement by utilizing large-scale datasets. As the appearance of traffic signs varies across countries, curating large-scale datasets is often impractical; and requires efficient models that can produce satisfactory performance using limited data. In this connection, we present 'FUSED-Net', built-upon Faster RCNN for traffic sign detection, enhanced by Unfrozen Parameters, Pseudo-Support Sets, Embedding Normalization, and Domain Adaptation while reducing data requirement. Unlike traditional approaches, we keep all parameters unfrozen during training, enabling FUSED-Net to learn from limited samples. The generation of a Pseudo-Support Set through data augmentation further enhances performance by compensating for the scarcity of target domain data. Additionally, Embedding Normalization is incorporated to reduce intra-class variance, standardizing feature representation. Domain Adaptation, achieved by pre-training on a diverse traffic sign dataset distinct from the target domain, improves model generalization. Evaluating FUSED-Net on the BDTSD dataset, we achieved 2.4x, 2.2x, 1.5x, and 1.3x improvements of mAP in 1-shot, 3-shot, 5-shot, and 10-shot scenarios, respectively compared to the state-of-the-art Few-Shot Object Detection (FSOD) models. Additionally, we outperform state-of-the-art works on the cross-domain FSOD benchmark under several scenarios.</abstract><doi>10.48550/arxiv.2409.14852</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.2409.14852
ispartof
issn
language eng
recordid cdi_arxiv_primary_2409_14852
source arXiv.org
subjects Computer Science - Artificial Intelligence
Computer Science - Computer Vision and Pattern Recognition
title FUSED-Net: Detecting Traffic Signs with Limited Data
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-18T00%3A43%3A14IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=FUSED-Net:%20Detecting%20Traffic%20Signs%20with%20Limited%20Data&rft.au=Rahman,%20Md.%20Atiqur&rft.date=2024-09-23&rft_id=info:doi/10.48550/arxiv.2409.14852&rft_dat=%3Carxiv_GOX%3E2409_14852%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true