ConvTransNet: A CNN-Transformer Network for Change Detection With Multiscale Global-Local Representations
Published in: | IEEE Transactions on Geoscience and Remote Sensing 2023, Vol.61, p.1-15 |
---|---|
Main authors: | , , , |
Format: | Article |
Language: | eng |
Subjects: | |
Summary: | Change detection (CD) in optical remote sensing images has benefited significantly from the development of deep convolutional neural networks (CNNs), owing to their strong capability for local modeling in bitemporal images. In addition, the recent rise of transformer modules has improved global feature extraction from bitemporal remote sensing images. However, the existing simple cascade of deep CNNs and transformer modules shows limited CD performance on small changed areas because it lacks multiscale information. To address this issue, we propose a new CNN-transformer network (ConvTransNet) with a multiscale framework to better exploit global-local information in optical remote sensing images. In ConvTransNet, we propose the parallel-branch ConvTrans block as the basic component for generating global-local features: it adaptively integrates the global features summarized by a transformer-based branch and the local features extracted by a convolution-based branch, making changed and unchanged areas easier to distinguish. By fusing multiple global-local features at different scales, ConvTransNet improves the robustness of CD performance on changed areas of different sizes, especially small ones. Experiments on two public CD datasets of optical remote sensing images, LEVIR-CD and CDD, demonstrate that ConvTransNet achieves better CD performance than other commonly used methods. |
ISSN: | 0196-2892 1558-0644 |
DOI: | 10.1109/TGRS.2023.3272694 |
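
The summary describes a parallel-branch block that blends global (transformer-style) and local (convolution-style) features, applied at several scales. The following is a minimal NumPy sketch of that idea only, not the authors' implementation: the 1-D token layout, the moving-average stand-in for convolution, the projection-free self-attention, the sigmoid gate used for "adaptive" fusion, and every function name (`local_branch`, `global_branch`, `conv_trans_block`, `multiscale_features`) are assumptions made for illustration. The bitemporal comparison step is omitted.

```python
import numpy as np

def softmax(z, axis=-1):
    z = z - z.max(axis=axis, keepdims=True)  # stabilise the exponentials
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)

def local_branch(x):
    """Stand-in for a convolution: 3-tap moving average along the token axis."""
    padded = np.pad(x, ((1, 1), (0, 0)), mode="edge")
    return (padded[:-2] + padded[1:-1] + padded[2:]) / 3.0

def global_branch(x):
    """Stand-in for a transformer layer: single-head self-attention without learned projections."""
    scores = x @ x.T / np.sqrt(x.shape[1])
    return softmax(scores, axis=-1) @ x

def conv_trans_block(x, w_gate):
    """Run both branches in parallel and blend them with a per-token sigmoid gate (assumed fusion rule)."""
    loc, glo = local_branch(x), global_branch(x)
    gate = 1.0 / (1.0 + np.exp(-((loc + glo) @ w_gate)))  # shape (tokens, 1)
    return gate * glo + (1.0 - gate) * loc

def multiscale_features(x, w_gate, scales=(1, 2, 4)):
    """Apply the block at several resolutions and average the upsampled results."""
    n = x.shape[0]  # must be divisible by every scale
    outs = []
    for s in scales:
        coarse = x.reshape(n // s, s, -1).mean(axis=1)  # downsample by averaging s neighbours
        fused = conv_trans_block(coarse, w_gate)
        outs.append(np.repeat(fused, s, axis=0))        # nearest-neighbour upsample back to n tokens
    return np.mean(outs, axis=0)

rng = np.random.default_rng(0)
x = rng.normal(size=(8, 4))        # 8 tokens, 4 channels (toy input)
w_gate = rng.normal(size=(4, 1))   # hypothetical gate weights
features = multiscale_features(x, w_gate)
print(features.shape)  # (8, 4)
```

Because the gate lies in (0, 1), each fused value is a convex combination of the two branch outputs, which is one simple way to read "adaptively integrates"; the paper itself may use a different fusion mechanism.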