Query Adaptive Transformer and Multi-Prototype Rectification for Few-Shot Remote Sensing Image Segmentation
Deep learning has emerged as a powerful tool for semantic segmentation tasks. However, in some data-deficient and resource-limited scenarios, networks are constrained to learn novel concepts. To tackle this problem, few-shot segmentation (FSS) has been proposed by the machine learning community, aim...
Gespeichert in:
Veröffentlicht in: | IEEE transactions on geoscience and remote sensing 2024-12, p.1-1 |
---|---|
Hauptverfasser: | , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Deep learning has emerged as a powerful tool for semantic segmentation tasks. However, in some data-deficient and resource-limited scenarios, networks are constrained to learn novel concepts. To tackle this problem, few-shot segmentation (FSS) has been proposed by the machine learning community, aiming at segmenting novel objects by leveraging a handful of annotated samples. However, most advanced FSS algorithms suffer from severe performance degradation when directly applied to remote sensing image (RSI) domains. Challenges arise due to the unique characteristics of RSIs. To address the large intra-class variation brought by various imaging conditions and large-scale variations, a Query Adaptive Transformer (QAT) is proposed. QAT incorporates query priors into the feature extraction process, adapting RSI features to novel objects from query RSIs. It is noteworthy that query priors are extracted via a Query Prior Generation (QPG) module devoid of any query label information. To alleviate interference brought by complex object distribution, a multi-prototype strategy is adopted instead of representing objects with a single prototype ambiguously. Moreover, we rectify the prototypes using a Prior Injection Module (PIM), thereby fully leveraging the advantages offered by query priors. The superiority of our method is validated through comprehensive experiments on the public iSAID-5 i dataset and comparisons with state-of-the-art methods. Finally, we propose a novel cross-domain setting to investigate the potential and generalizability of few-shot RSI segmentation for several Earth observation applications. |
---|---|
ISSN: | 0196-2892 |
DOI: | 10.1109/TGRS.2024.3511622 |