Unsupervised multi-level spatio-spectral fusion transformer for hyperspectral image super-resolution

•Unsupervised transformer-based network proposed for fusion-based super-resolution, achieving the state-of-the-art performance in terms of both quantitative results and visual quality.•Multi-level features fusion strategy is constructed for HSI-MSI fusion, leveraging feature interactions between spa...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Optics and laser technology 2024-09, Vol.176, p.111032, Article 111032
Hauptverfasser: Cao, Xuheng, Lian, Yusheng, Li, Jin, Wang, Kaixuan, Ma, Chao
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:•Unsupervised transformer-based network proposed for fusion-based super-resolution, achieving the state-of-the-art performance in terms of both quantitative results and visual quality.•Multi-level features fusion strategy is constructed for HSI-MSI fusion, leveraging feature interactions between spatio-spectral domain.•A cross-attention block S2F-FAB is investigated to incorporate the spatial details reconstruction and spectral information integration into attention mechanism, modeling the cross spatio-spectral similarity of HSI. Fusing a low spatial resolution hyperspectral image (LR-HSI) with a high spatial resolution multispectral image (HR-MSI) is widely used for HSI super-resolution. Recent works still face problems in exploring global spatio-spectral correlation and lack effective utilization of multi-level features of the inputs (i.e., HR-MSI and LR-HSI), which results in a lack of similarity between the reconstruction and the inputs, ultimately causing significant spatio-spectral distortion. To solve these problems, we design an Unsupervised Multi-level Spatio-spectral Fusion Transformer (UMSFT). In UMSFT, a novel multi-level features fusion strategy is constructed, which fuses the hierarchy features of the inputs via proposed Spatio-spectral Feature Fusion Attention Blocks (S2F-FAB) in a level-by-level manner, thereby fully exploring the interaction between the hierarchical features. The S2F-FAB is specially designed for HSI-MSI fusion consisting of two components: (1) a spatial fusion module (Spa-FM) is designed for spatial domain fusion, and its output is set as Values (V) of a transformer; (2) a novel spectral feature cross attention (Spe- FCA) formulates the features of LR-HSI and HR-MSI as Queries (Q) and Keys (K), respectively, and achieves spectral domain fusion by applying attention mechanism along the spectral dimension. Incorporating spatial detail reconstruction and spectral feature integration into the attention mechanism, the S2F-FAB efficiently exploits the spatio-spectral correlation between target HR-HSI and inputs. Experimental results on three public datasets and our real-world images show the superiority of our method as compared with eleven state-of-the-art methods. Codes will be available at https://github.com/Caoxuheng/HIFtool.
ISSN:0030-3992
DOI:10.1016/j.optlastec.2024.111032