Global-to-Local Spatial-Spectral Awareness Transformer Network for Hyperspectral Anomaly Detection

Hyperspectral anomaly detection (HAD) is a key technology in Earth observation and remote sensing monitoring. Thanks to their powerful deep feature extraction abilities, deep convolutional networks (DCNs) perform excellently in HAD. Nevertheless, limited by the...

Bibliographic Details
Published in: IEEE Transactions on Geoscience and Remote Sensing, 2024, Vol. 62, p. 1-22
Main Authors: He, Xu; Zhou, Shilin; Ling, Qiang; Li, Miao; Li, Zhaoxu; Zhang, Yuyuan; Lin, Zaiping
Format: Article
Language: English
Description
Abstract: Hyperspectral anomaly detection (HAD) is a key technology in Earth observation and remote sensing monitoring. Thanks to their powerful deep feature extraction abilities, deep convolutional networks (DCNs) perform excellently in HAD. However, constrained by their local receptive fields, DCN-based detection methods struggle to capture long-range dependencies from a global perspective. In contrast, vision transformers (ViTs) perform better at global feature extraction but still disregard local dependencies. To address this, we propose a novel method, the global-to-local spatial-spectral awareness transformer (G2LSSAT) network, in which a global transformer block (GTB) and a local transformer block (LTB) are deployed in sequence to capture deep reconstruction characteristics from the global view to the local view in the spatial-spectral domain. Specifically, the GTB is designed to explore global spatial-spectral characteristics by means of a crossbar-based global sparse attention module. The image, after this global pass, is then divided into multiple local patches, and the LTB is devised to learn local spatial-spectral features by means of a patch-based local self-invisible attention module. In addition, because the conventional self-attention module in ViTs tends to reconstruct abnormal pixels unexpectedly well, we introduce an invisible diagonal mask (IDM), embedded in the LTB module, that masks out each pixel itself in the receptive field so that the pixel is reconstructed from global and local spatial-spectral features of other pixels. Extensive experimental results on six datasets demonstrate the superiority of the proposed G2LSSAT over other state-of-the-art detectors.
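The diagonal-masking idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' G2LSSAT implementation: it assumes a single attention head with identity query/key/value projections (no learned weights), and the function names `masked_self_attention` and `reconstruction_error` are hypothetical, introduced only for this illustration. It shows how setting the diagonal of the attention score matrix to negative infinity forces each pixel to be reconstructed from the other pixels in its patch.

```python
import math


def masked_self_attention(tokens, mask_diagonal=True):
    """Dot-product self-attention over a patch of pixel spectra.

    tokens: list of equal-length lists of floats, one spectrum per pixel.
    Assumption: identity Q/K/V projections, single head (illustration only).
    With mask_diagonal=True, each pixel's score against itself is set to
    -inf before the softmax, so it receives zero weight from itself.
    """
    n = len(tokens)
    d = len(tokens[0])
    if mask_diagonal and n < 2:
        raise ValueError("need at least two pixels when the diagonal is masked")
    outputs = []
    for i in range(n):
        scores = []
        for j in range(n):
            if mask_diagonal and i == j:
                scores.append(-math.inf)  # the pixel cannot see itself
            else:
                dot = sum(a * b for a, b in zip(tokens[i], tokens[j]))
                scores.append(dot / math.sqrt(d))
        # Numerically stable softmax; exp(-inf) evaluates to 0.0.
        top = max(scores)
        weights = [math.exp(s - top) for s in scores]
        total = sum(weights)
        weights = [w / total for w in weights]
        # Output is the attention-weighted average of all pixel spectra.
        outputs.append(
            [sum(weights[j] * tokens[j][k] for j in range(n)) for k in range(d)]
        )
    return outputs


def reconstruction_error(original, reconstructed):
    """Squared Euclidean distance between a spectrum and its reconstruction."""
    return sum((a - b) ** 2 for a, b in zip(original, reconstructed))


# Toy patch: three identical background pixels and one anomalous pixel.
patch = [[1.0, 0.0], [1.0, 0.0], [1.0, 0.0], [0.0, 1.0]]
masked = masked_self_attention(patch, mask_diagonal=True)
unmasked = masked_self_attention(patch, mask_diagonal=False)
errors_masked = [reconstruction_error(p, r) for p, r in zip(patch, masked)]
errors_unmasked = [reconstruction_error(p, r) for p, r in zip(patch, unmasked)]
```

In this toy patch, the anomalous pixel attends to itself when the diagonal is not masked, so its reconstruction retains part of its own spectrum. With the mask, it is rebuilt from background pixels only, and its reconstruction error is larger than that of the background pixels, which is the property a reconstruction-based detector relies on.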
ISSN: 0196-2892, 1558-0644
DOI: 10.1109/TGRS.2024.3456799