Hybrid Attention for Robust RGB-T Pedestrian Detection in Real-World Conditions
Multispectral pedestrian detection has gained significant attention in recent years, particularly in autonomous driving applications. To address the challenges posed by adversarial illumination conditions, the combination of thermal and visible images has demonstrated its advantages. However, existi...
Gespeichert in:
Veröffentlicht in: | arXiv.org 2024-11 |
---|---|
Hauptverfasser: | , , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | arXiv.org |
container_volume | |
creator | Rathinam, Arunkumar Pauly, Leo Abd El Rahman Shabayek Rharbaoui, Wassim Anis Kacem Gaudillière, Vincent Aouada, Djamila |
description | Multispectral pedestrian detection has gained significant attention in recent years, particularly in autonomous driving applications. To address the challenges posed by adversarial illumination conditions, the combination of thermal and visible images has demonstrated its advantages. However, existing fusion methods rely on the critical assumption that the RGB-Thermal (RGB-T) image pairs are fully overlapping. These assumptions often do not hold in real-world applications, where only partial overlap between images can occur due to sensors configuration. Moreover, sensor failure can cause loss of information in one modality. In this paper, we propose a novel module called the Hybrid Attention (HA) mechanism as our main contribution to mitigate performance degradation caused by partial overlap and sensor failure, i.e. when at least part of the scene is acquired by only one sensor. We propose an improved RGB-T fusion algorithm, robust against partial overlap and sensor failure encountered during inference in real-world applications. We also leverage a mobile-friendly backbone to cope with resource constraints in embedded systems. We conducted experiments by simulating various partial overlap and sensor failure scenarios to evaluate the performance of our proposed method. The results demonstrate that our approach outperforms state-of-the-art methods, showcasing its superiority in handling real-world challenges. |
format | Article |
fullrecord | <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_3125867400</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3125867400</sourcerecordid><originalsourceid>FETCH-proquest_journals_31258674003</originalsourceid><addsrcrecordid>eNqNjMsKwjAUBYMgWLT_cMF1IE362mp9dKeUgsvSmltIKYkm6cK_t4of4OrAzHAWJOBCRDSPOV-R0LmBMcbTjCeJCMilfHVWSdh5j9oro6E3FirTTc5Ddd7TGq4o0XmrWg0H9Hj_VkpDhe1Ib8aOEgqjpfpwtyHLvh0dhr9dk-3pWBclfVjznOafZjCT1bNqRMSTPM1ixsR_1RuyHj3O</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3125867400</pqid></control><display><type>article</type><title>Hybrid Attention for Robust RGB-T Pedestrian Detection in Real-World Conditions</title><source>Free E- Journals</source><creator>Rathinam, Arunkumar ; Pauly, Leo ; Abd El Rahman Shabayek ; Rharbaoui, Wassim ; Anis Kacem ; Gaudillière, Vincent ; Aouada, Djamila</creator><creatorcontrib>Rathinam, Arunkumar ; Pauly, Leo ; Abd El Rahman Shabayek ; Rharbaoui, Wassim ; Anis Kacem ; Gaudillière, Vincent ; Aouada, Djamila</creatorcontrib><description>Multispectral pedestrian detection has gained significant attention in recent years, particularly in autonomous driving applications. To address the challenges posed by adversarial illumination conditions, the combination of thermal and visible images has demonstrated its advantages. However, existing fusion methods rely on the critical assumption that the RGB-Thermal (RGB-T) image pairs are fully overlapping. These assumptions often do not hold in real-world applications, where only partial overlap between images can occur due to sensors configuration. Moreover, sensor failure can cause loss of information in one modality. In this paper, we propose a novel module called the Hybrid Attention (HA) mechanism as our main contribution to mitigate performance degradation caused by partial overlap and sensor failure, i.e. when at least part of the scene is acquired by only one sensor. We propose an improved RGB-T fusion algorithm, robust against partial overlap and sensor failure encountered during inference in real-world applications. We also leverage a mobile-friendly backbone to cope with resource constraints in embedded systems. We conducted experiments by simulating various partial overlap and sensor failure scenarios to evaluate the performance of our proposed method. The results demonstrate that our approach outperforms state-of-the-art methods, showcasing its superiority in handling real-world challenges.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Algorithms ; Attention ; Embedded systems ; Failure ; Image acquisition ; Performance degradation ; Performance evaluation ; Robustness ; Sensors</subject><ispartof>arXiv.org, 2024-11</ispartof><rights>2024. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>780,784</link.rule.ids></links><search><creatorcontrib>Rathinam, Arunkumar</creatorcontrib><creatorcontrib>Pauly, Leo</creatorcontrib><creatorcontrib>Abd El Rahman Shabayek</creatorcontrib><creatorcontrib>Rharbaoui, Wassim</creatorcontrib><creatorcontrib>Anis Kacem</creatorcontrib><creatorcontrib>Gaudillière, Vincent</creatorcontrib><creatorcontrib>Aouada, Djamila</creatorcontrib><title>Hybrid Attention for Robust RGB-T Pedestrian Detection in Real-World Conditions</title><title>arXiv.org</title><description>Multispectral pedestrian detection has gained significant attention in recent years, particularly in autonomous driving applications. To address the challenges posed by adversarial illumination conditions, the combination of thermal and visible images has demonstrated its advantages. However, existing fusion methods rely on the critical assumption that the RGB-Thermal (RGB-T) image pairs are fully overlapping. These assumptions often do not hold in real-world applications, where only partial overlap between images can occur due to sensors configuration. Moreover, sensor failure can cause loss of information in one modality. In this paper, we propose a novel module called the Hybrid Attention (HA) mechanism as our main contribution to mitigate performance degradation caused by partial overlap and sensor failure, i.e. when at least part of the scene is acquired by only one sensor. We propose an improved RGB-T fusion algorithm, robust against partial overlap and sensor failure encountered during inference in real-world applications. We also leverage a mobile-friendly backbone to cope with resource constraints in embedded systems. We conducted experiments by simulating various partial overlap and sensor failure scenarios to evaluate the performance of our proposed method. The results demonstrate that our approach outperforms state-of-the-art methods, showcasing its superiority in handling real-world challenges.</description><subject>Algorithms</subject><subject>Attention</subject><subject>Embedded systems</subject><subject>Failure</subject><subject>Image acquisition</subject><subject>Performance degradation</subject><subject>Performance evaluation</subject><subject>Robustness</subject><subject>Sensors</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><recordid>eNqNjMsKwjAUBYMgWLT_cMF1IE362mp9dKeUgsvSmltIKYkm6cK_t4of4OrAzHAWJOBCRDSPOV-R0LmBMcbTjCeJCMilfHVWSdh5j9oro6E3FirTTc5Ddd7TGq4o0XmrWg0H9Hj_VkpDhe1Ib8aOEgqjpfpwtyHLvh0dhr9dk-3pWBclfVjznOafZjCT1bNqRMSTPM1ixsR_1RuyHj3O</recordid><startdate>20241106</startdate><enddate>20241106</enddate><creator>Rathinam, Arunkumar</creator><creator>Pauly, Leo</creator><creator>Abd El Rahman Shabayek</creator><creator>Rharbaoui, Wassim</creator><creator>Anis Kacem</creator><creator>Gaudillière, Vincent</creator><creator>Aouada, Djamila</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20241106</creationdate><title>Hybrid Attention for Robust RGB-T Pedestrian Detection in Real-World Conditions</title><author>Rathinam, Arunkumar ; Pauly, Leo ; Abd El Rahman Shabayek ; Rharbaoui, Wassim ; Anis Kacem ; Gaudillière, Vincent ; Aouada, Djamila</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_31258674003</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Algorithms</topic><topic>Attention</topic><topic>Embedded systems</topic><topic>Failure</topic><topic>Image acquisition</topic><topic>Performance degradation</topic><topic>Performance evaluation</topic><topic>Robustness</topic><topic>Sensors</topic><toplevel>online_resources</toplevel><creatorcontrib>Rathinam, Arunkumar</creatorcontrib><creatorcontrib>Pauly, Leo</creatorcontrib><creatorcontrib>Abd El Rahman Shabayek</creatorcontrib><creatorcontrib>Rharbaoui, Wassim</creatorcontrib><creatorcontrib>Anis Kacem</creatorcontrib><creatorcontrib>Gaudillière, Vincent</creatorcontrib><creatorcontrib>Aouada, Djamila</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Rathinam, Arunkumar</au><au>Pauly, Leo</au><au>Abd El Rahman Shabayek</au><au>Rharbaoui, Wassim</au><au>Anis Kacem</au><au>Gaudillière, Vincent</au><au>Aouada, Djamila</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>Hybrid Attention for Robust RGB-T Pedestrian Detection in Real-World Conditions</atitle><jtitle>arXiv.org</jtitle><date>2024-11-06</date><risdate>2024</risdate><eissn>2331-8422</eissn><abstract>Multispectral pedestrian detection has gained significant attention in recent years, particularly in autonomous driving applications. To address the challenges posed by adversarial illumination conditions, the combination of thermal and visible images has demonstrated its advantages. However, existing fusion methods rely on the critical assumption that the RGB-Thermal (RGB-T) image pairs are fully overlapping. These assumptions often do not hold in real-world applications, where only partial overlap between images can occur due to sensors configuration. Moreover, sensor failure can cause loss of information in one modality. In this paper, we propose a novel module called the Hybrid Attention (HA) mechanism as our main contribution to mitigate performance degradation caused by partial overlap and sensor failure, i.e. when at least part of the scene is acquired by only one sensor. We propose an improved RGB-T fusion algorithm, robust against partial overlap and sensor failure encountered during inference in real-world applications. We also leverage a mobile-friendly backbone to cope with resource constraints in embedded systems. We conducted experiments by simulating various partial overlap and sensor failure scenarios to evaluate the performance of our proposed method. The results demonstrate that our approach outperforms state-of-the-art methods, showcasing its superiority in handling real-world challenges.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | EISSN: 2331-8422 |
ispartof | arXiv.org, 2024-11 |
issn | 2331-8422 |
language | eng |
recordid | cdi_proquest_journals_3125867400 |
source | Free E- Journals |
subjects | Algorithms Attention Embedded systems Failure Image acquisition Performance degradation Performance evaluation Robustness Sensors |
title | Hybrid Attention for Robust RGB-T Pedestrian Detection in Real-World Conditions |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-06T20%3A31%3A27IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Hybrid%20Attention%20for%20Robust%20RGB-T%20Pedestrian%20Detection%20in%20Real-World%20Conditions&rft.jtitle=arXiv.org&rft.au=Rathinam,%20Arunkumar&rft.date=2024-11-06&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E3125867400%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3125867400&rft_id=info:pmid/&rfr_iscdi=true |