GraphMamba: An Efficient Graph Structure Learning Vision Mamba for Hyperspectral Image Classification

Efficient extraction of spectral sequences and geospatial information is crucial in hyperspectral image (HSI) classification. Recurrent neural networks (RNNs) and Transformers excel in capturing long-range spectral features, while convolutional neural networks (CNNs) excel in aggregating spatial inf...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on geoscience and remote sensing 2024, Vol.62, p.1-14
Hauptverfasser: Yang, Aitao, Li, Min, Ding, Yao, Fang, Leyuan, Cai, Yaoming, He, Yujie
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Efficient extraction of spectral sequences and geospatial information is crucial in hyperspectral image (HSI) classification. Recurrent neural networks (RNNs) and Transformers excel in capturing long-range spectral features, while convolutional neural networks (CNNs) excel in aggregating spatial information through convolutional kernels. However, RNNs and Transformers suffer from low-computational efficiency, and CNNs have limitations in perceiving global contextual information. To address these issues, this article proposes GraphMamba-an efficient graph structure learning vision Mamba for HSI classification. Specifically, GraphMamba is a novel hyperspectral information processing paradigm that preserves spatial-spectral features by constructing spatial-spectral cubes and employs a linear spectral encoder to enhance the operability of subsequent tasks. The core components of GraphMamba include the HyperMamba module, which enhances computational efficiency, and the SpatialGCN module, designed for adaptive spatial context awareness. The HyperMamba mitigates clutter interference by employing a global mask (GM) and introduces a parallel training and inference architecture to alleviate computational bottlenecks. Meanwhile, the SpatialGCN utilizes weighted multihop aggregation (WMA) for spatial encoding, emphasizing highly correlated spatial structural features. This approach enables flexible aggregation of contextual information while minimizing spatial noise interference. Notably, the encoding modules of the proposed GraphMamba architecture are both flexible and scalable, providing a novel approach for the joint mining of spatial-spectral information in hyperspectral images. Extensive experiments were conducted on three different scales of real HSI datasets. When compared with state-of-the-art classification methods, GraphMamba demonstrated superior performance. The core code will be released at https://github.com/ahappyyang/GraphMamba .
ISSN:0196-2892
1558-0644
DOI:10.1109/TGRS.2024.3493101