Supervised-unsupervised combined transformer for spectral compressive imaging reconstruction

•The supervised-unsupervised transformer simultaneously learns generalized and specific priors.•a supervised Spatio-Spectral Transformer network obtains a preliminary hyperspectral imaging.•Multi-level feature refinement mechanism improves reconstruction accuracy. To solve the low spatial and/or tem...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Optics and lasers in engineering 2024-04, Vol.175, p.108030, Article 108030
Hauptverfasser: Zhou, Han, Lian, Yusheng, Li, Jin, Liu, Zilong, Cao, Xuheng, Ma, Chao
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:•The supervised-unsupervised transformer simultaneously learns generalized and specific priors.•a supervised Spatio-Spectral Transformer network obtains a preliminary hyperspectral imaging.•Multi-level feature refinement mechanism improves reconstruction accuracy. To solve the low spatial and/or temporal resolution problem which the conventional hyperspectral cameras often suffer from, spectral compressive imaging systems (SCI) have attracted more attention recently. Recovering a hyperspectral image (HSI) from its corresponding 2D coded image is an ill-posed inverse problem, and learning accurate prior from HSI and 2D coded image is essential to solve this inverse problem. Existing methods only use supervised networks that focus on learning generalized prior from training datasets, or only use unsupervised networks that focus on learning specific prior from 2D coded image, resulting in the inability to learn both generalized and specific priors. Also, when learning the priors, existing methods cannot simultaneously give consideration to both global and local scales, as well as both spatial and spectral dimensions. To cope with this problem, in this paper, we propose a Supervised-Unsupervised Combined Transformer Network (SUCTNet) composed by a supervised Spatio-spectral Transformer network (SSTNet) and an Unsupervised Multi-level Feature Refinement network (UMFRNet). Specifically, we first develop the SSTNet to learn generalized prior and obtain a preliminary HSI. In SSTNet, the proposed spatial encoding and spectral decoding network architecture enables it to simultaneously consider both spatial and spectral dimensions, and a proposed Global and Local Multi head Self Attention block (GL-MSA) enables it simultaneously to consider both global and local scales. Then, the preliminary HSI is fed into the proposed UMFRNet to learn specific prior and obtain the target HSI. In UMFRNet, a proposed multi-level feature refinement mechanism and the physical imaging model of SCI are used to improve reconstruction accuracy and generalization performance. Extensive experiments show that our method significantly outperforms state-of-the-art (SOTA) methods on simulated and real datasets. Codes will be available at https://github.com/Vzhouhan/SUCTNet.
ISSN:0143-8166
1873-0302
DOI:10.1016/j.optlaseng.2024.108030