FDTrack: A Dual-Head Focus Tracking Network With Frequency Enhancement

The RGB-T tracking approach combines the advantages of visible and thermal sensors to achieve accurate target tracking in complex scenarios. However, previous RGB-T trackers based on the self-attention (SA) mechanism overlook crucial high-frequency information (such as texture, edges, and colors) th...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE sensors journal 2025-01, Vol.25 (2), p.3879-3897
Hauptverfasser: Gao, Zhao, Zhou, Dongming, Cao, Jinde, Liu, Yisong, Shan, Qingqing
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The RGB-T tracking approach combines the advantages of visible and thermal sensors to achieve accurate target tracking in complex scenarios. However, previous RGB-T trackers based on the self-attention (SA) mechanism overlook crucial high-frequency information (such as texture, edges, and colors) that is essential for object prediction. To address these challenges, we propose a frequency-enhanced dual-head focus tracking network (FDTrack) for RGB-T tracking. FDTrack comprises four main components: high-frequency feature enhancement (HFFE), wavelet multifrequency (WMF) interaction, autonomous modality prediction (AMP), and search focus preprocessing (SFP). HFFE refines the features from the ViT backbones within specific modalities by adaptively amplifying high-frequency features. In contrast, WMF facilitates communication between different frequency bands to enhance the interaction of RGB-T features from a frequency perspective. To improve tracking robustness under extreme scenes, AMP incorporates dual prediction heads and determines the final outcome through feature matching. SFP adjusts the convolution kernel size based on pixel-to-target distance and preprocesses the search region with Gaussian blur to reduce background clutter interference and emphasize the target. Extensive experimental results demonstrate that FDTrack achieves competitive performance compared to state-of-the-art algorithms across various datasets, including RGBT210, RGBT234, and LasHeR, showcasing its cutting-edge capabilities in this field.
ISSN:1530-437X
1558-1748
DOI:10.1109/JSEN.2024.3506929