ConvTransNet: A CNN-Transformer Network for Change Detection With Multiscale Global-Local Representations

Change detection (CD) in optical remote sensing images has significantly benefited from the development of deep convolutional neural networks (CNNs) due to their strong capability of local modeling in bitemporal images. In addition, the recent rise of transformer modules has led to the improvement o...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on geoscience and remote sensing 2023, Vol.61, p.1-15
Hauptverfasser: Li, Weiming, Xue, Lihui, Wang, Xueqian, Li, Gang
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Change detection (CD) in optical remote sensing images has significantly benefited from the development of deep convolutional neural networks (CNNs) due to their strong capability of local modeling in bitemporal images. In addition, the recent rise of transformer modules has led to the improvement of global feature extraction of bitemporal remote sensing images. Note that the existing simple cascade of deep CNNs and transformer modules shows limited CD performance on small changed areas due to deficiencies of multiscale information therein. To address the aforementioned issue, we propose a new CNN-transformer network (ConvTransNet) with multiscale framework to better exploit global-local information in optical remote sensing images. In our ConvTransNet, we propose the parallel-branch ConvTrans block as the basic component to generate global-local features, i.e., adaptively integrates the global features summarized by a transformer-based branch and the local features extracted by a convolution-based branch, providing better identifiability between changed areas and unchanged areas. By fusing multiple global-local features with different scales, our ConvTransNet improves the robustness of the CD performance on changed areas with different sizes, especially small changed areas. Experiments on two public CD datasets of optical remote sensing images, i.e., LEVIR-CD and CDD, demonstrate that our ConvTransNet achieves enhanced CD performance than the other commonly used methods.
ISSN:0196-2892
1558-0644
DOI:10.1109/TGRS.2023.3272694