Hybrid Attention for Robust RGB-T Pedestrian Detection in Real-World Conditions

Multispectral pedestrian detection has gained significant attention in recent years, particularly in autonomous driving applications. To address the challenges posed by adversarial illumination conditions, the combination of thermal and visible images has demonstrated its advantages. However, existi...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:arXiv.org 2024-11
Hauptverfasser: Rathinam, Arunkumar, Pauly, Leo, Abd El Rahman Shabayek, Rharbaoui, Wassim, Anis Kacem, Gaudillière, Vincent, Aouada, Djamila
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title arXiv.org
container_volume
creator Rathinam, Arunkumar
Pauly, Leo
Abd El Rahman Shabayek
Rharbaoui, Wassim
Anis Kacem
Gaudillière, Vincent
Aouada, Djamila
description Multispectral pedestrian detection has gained significant attention in recent years, particularly in autonomous driving applications. To address the challenges posed by adversarial illumination conditions, the combination of thermal and visible images has demonstrated its advantages. However, existing fusion methods rely on the critical assumption that the RGB-Thermal (RGB-T) image pairs are fully overlapping. These assumptions often do not hold in real-world applications, where only partial overlap between images can occur due to sensors configuration. Moreover, sensor failure can cause loss of information in one modality. In this paper, we propose a novel module called the Hybrid Attention (HA) mechanism as our main contribution to mitigate performance degradation caused by partial overlap and sensor failure, i.e. when at least part of the scene is acquired by only one sensor. We propose an improved RGB-T fusion algorithm, robust against partial overlap and sensor failure encountered during inference in real-world applications. We also leverage a mobile-friendly backbone to cope with resource constraints in embedded systems. We conducted experiments by simulating various partial overlap and sensor failure scenarios to evaluate the performance of our proposed method. The results demonstrate that our approach outperforms state-of-the-art methods, showcasing its superiority in handling real-world challenges.
format Article
fullrecord <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_3125867400</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3125867400</sourcerecordid><originalsourceid>FETCH-proquest_journals_31258674003</originalsourceid><addsrcrecordid>eNqNjMsKwjAUBYMgWLT_cMF1IE362mp9dKeUgsvSmltIKYkm6cK_t4of4OrAzHAWJOBCRDSPOV-R0LmBMcbTjCeJCMilfHVWSdh5j9oro6E3FirTTc5Ddd7TGq4o0XmrWg0H9Hj_VkpDhe1Ib8aOEgqjpfpwtyHLvh0dhr9dk-3pWBclfVjznOafZjCT1bNqRMSTPM1ixsR_1RuyHj3O</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3125867400</pqid></control><display><type>article</type><title>Hybrid Attention for Robust RGB-T Pedestrian Detection in Real-World Conditions</title><source>Free E- Journals</source><creator>Rathinam, Arunkumar ; Pauly, Leo ; Abd El Rahman Shabayek ; Rharbaoui, Wassim ; Anis Kacem ; Gaudillière, Vincent ; Aouada, Djamila</creator><creatorcontrib>Rathinam, Arunkumar ; Pauly, Leo ; Abd El Rahman Shabayek ; Rharbaoui, Wassim ; Anis Kacem ; Gaudillière, Vincent ; Aouada, Djamila</creatorcontrib><description>Multispectral pedestrian detection has gained significant attention in recent years, particularly in autonomous driving applications. To address the challenges posed by adversarial illumination conditions, the combination of thermal and visible images has demonstrated its advantages. However, existing fusion methods rely on the critical assumption that the RGB-Thermal (RGB-T) image pairs are fully overlapping. These assumptions often do not hold in real-world applications, where only partial overlap between images can occur due to sensors configuration. Moreover, sensor failure can cause loss of information in one modality. In this paper, we propose a novel module called the Hybrid Attention (HA) mechanism as our main contribution to mitigate performance degradation caused by partial overlap and sensor failure, i.e. when at least part of the scene is acquired by only one sensor. We propose an improved RGB-T fusion algorithm, robust against partial overlap and sensor failure encountered during inference in real-world applications. We also leverage a mobile-friendly backbone to cope with resource constraints in embedded systems. We conducted experiments by simulating various partial overlap and sensor failure scenarios to evaluate the performance of our proposed method. The results demonstrate that our approach outperforms state-of-the-art methods, showcasing its superiority in handling real-world challenges.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Algorithms ; Attention ; Embedded systems ; Failure ; Image acquisition ; Performance degradation ; Performance evaluation ; Robustness ; Sensors</subject><ispartof>arXiv.org, 2024-11</ispartof><rights>2024. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>780,784</link.rule.ids></links><search><creatorcontrib>Rathinam, Arunkumar</creatorcontrib><creatorcontrib>Pauly, Leo</creatorcontrib><creatorcontrib>Abd El Rahman Shabayek</creatorcontrib><creatorcontrib>Rharbaoui, Wassim</creatorcontrib><creatorcontrib>Anis Kacem</creatorcontrib><creatorcontrib>Gaudillière, Vincent</creatorcontrib><creatorcontrib>Aouada, Djamila</creatorcontrib><title>Hybrid Attention for Robust RGB-T Pedestrian Detection in Real-World Conditions</title><title>arXiv.org</title><description>Multispectral pedestrian detection has gained significant attention in recent years, particularly in autonomous driving applications. To address the challenges posed by adversarial illumination conditions, the combination of thermal and visible images has demonstrated its advantages. However, existing fusion methods rely on the critical assumption that the RGB-Thermal (RGB-T) image pairs are fully overlapping. These assumptions often do not hold in real-world applications, where only partial overlap between images can occur due to sensors configuration. Moreover, sensor failure can cause loss of information in one modality. In this paper, we propose a novel module called the Hybrid Attention (HA) mechanism as our main contribution to mitigate performance degradation caused by partial overlap and sensor failure, i.e. when at least part of the scene is acquired by only one sensor. We propose an improved RGB-T fusion algorithm, robust against partial overlap and sensor failure encountered during inference in real-world applications. We also leverage a mobile-friendly backbone to cope with resource constraints in embedded systems. We conducted experiments by simulating various partial overlap and sensor failure scenarios to evaluate the performance of our proposed method. The results demonstrate that our approach outperforms state-of-the-art methods, showcasing its superiority in handling real-world challenges.</description><subject>Algorithms</subject><subject>Attention</subject><subject>Embedded systems</subject><subject>Failure</subject><subject>Image acquisition</subject><subject>Performance degradation</subject><subject>Performance evaluation</subject><subject>Robustness</subject><subject>Sensors</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><recordid>eNqNjMsKwjAUBYMgWLT_cMF1IE362mp9dKeUgsvSmltIKYkm6cK_t4of4OrAzHAWJOBCRDSPOV-R0LmBMcbTjCeJCMilfHVWSdh5j9oro6E3FirTTc5Ddd7TGq4o0XmrWg0H9Hj_VkpDhe1Ib8aOEgqjpfpwtyHLvh0dhr9dk-3pWBclfVjznOafZjCT1bNqRMSTPM1ixsR_1RuyHj3O</recordid><startdate>20241106</startdate><enddate>20241106</enddate><creator>Rathinam, Arunkumar</creator><creator>Pauly, Leo</creator><creator>Abd El Rahman Shabayek</creator><creator>Rharbaoui, Wassim</creator><creator>Anis Kacem</creator><creator>Gaudillière, Vincent</creator><creator>Aouada, Djamila</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20241106</creationdate><title>Hybrid Attention for Robust RGB-T Pedestrian Detection in Real-World Conditions</title><author>Rathinam, Arunkumar ; Pauly, Leo ; Abd El Rahman Shabayek ; Rharbaoui, Wassim ; Anis Kacem ; Gaudillière, Vincent ; Aouada, Djamila</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_31258674003</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Algorithms</topic><topic>Attention</topic><topic>Embedded systems</topic><topic>Failure</topic><topic>Image acquisition</topic><topic>Performance degradation</topic><topic>Performance evaluation</topic><topic>Robustness</topic><topic>Sensors</topic><toplevel>online_resources</toplevel><creatorcontrib>Rathinam, Arunkumar</creatorcontrib><creatorcontrib>Pauly, Leo</creatorcontrib><creatorcontrib>Abd El Rahman Shabayek</creatorcontrib><creatorcontrib>Rharbaoui, Wassim</creatorcontrib><creatorcontrib>Anis Kacem</creatorcontrib><creatorcontrib>Gaudillière, Vincent</creatorcontrib><creatorcontrib>Aouada, Djamila</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Rathinam, Arunkumar</au><au>Pauly, Leo</au><au>Abd El Rahman Shabayek</au><au>Rharbaoui, Wassim</au><au>Anis Kacem</au><au>Gaudillière, Vincent</au><au>Aouada, Djamila</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>Hybrid Attention for Robust RGB-T Pedestrian Detection in Real-World Conditions</atitle><jtitle>arXiv.org</jtitle><date>2024-11-06</date><risdate>2024</risdate><eissn>2331-8422</eissn><abstract>Multispectral pedestrian detection has gained significant attention in recent years, particularly in autonomous driving applications. To address the challenges posed by adversarial illumination conditions, the combination of thermal and visible images has demonstrated its advantages. However, existing fusion methods rely on the critical assumption that the RGB-Thermal (RGB-T) image pairs are fully overlapping. These assumptions often do not hold in real-world applications, where only partial overlap between images can occur due to sensors configuration. Moreover, sensor failure can cause loss of information in one modality. In this paper, we propose a novel module called the Hybrid Attention (HA) mechanism as our main contribution to mitigate performance degradation caused by partial overlap and sensor failure, i.e. when at least part of the scene is acquired by only one sensor. We propose an improved RGB-T fusion algorithm, robust against partial overlap and sensor failure encountered during inference in real-world applications. We also leverage a mobile-friendly backbone to cope with resource constraints in embedded systems. We conducted experiments by simulating various partial overlap and sensor failure scenarios to evaluate the performance of our proposed method. The results demonstrate that our approach outperforms state-of-the-art methods, showcasing its superiority in handling real-world challenges.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier EISSN: 2331-8422
ispartof arXiv.org, 2024-11
issn 2331-8422
language eng
recordid cdi_proquest_journals_3125867400
source Free E- Journals
subjects Algorithms
Attention
Embedded systems
Failure
Image acquisition
Performance degradation
Performance evaluation
Robustness
Sensors
title Hybrid Attention for Robust RGB-T Pedestrian Detection in Real-World Conditions
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-06T20%3A31%3A27IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Hybrid%20Attention%20for%20Robust%20RGB-T%20Pedestrian%20Detection%20in%20Real-World%20Conditions&rft.jtitle=arXiv.org&rft.au=Rathinam,%20Arunkumar&rft.date=2024-11-06&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E3125867400%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3125867400&rft_id=info:pmid/&rfr_iscdi=true