A Local-Sparse-Information-Aggregation Transformer with Explicit Contour Guidance for SAR Ship Detection

Ship detection in synthetic aperture radar (SAR) images has witnessed rapid development in recent years, especially after the adoption of convolutional neural network (CNN)-based methods. Recently, a transformer using self-attention and a feed forward neural network with a encoder-decoder structure...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Remote sensing (Basel, Switzerland) Switzerland), 2022-10, Vol.14 (20), p.5247
Hauptverfasser:	Shi, Hao, Chai, Bingqian, Wang, Yupei, Chen, Liang
Format:	Artikel
Sprache:	eng
Schlagworte:	Accuracy Agglomeration Algorithms Artificial neural networks Clutter Coders contour enhancement Contours Deep learning deformable attention Deformation effects Efficiency Encoders-Decoders Formability Machine learning Neural networks Probability distribution Radar detection Radar imaging Receptive field SAR ship detection Sensors Shape Ships Swin Transformer Synthetic aperture radar
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Ship detection in synthetic aperture radar (SAR) images has witnessed rapid development in recent years, especially after the adoption of convolutional neural network (CNN)-based methods. Recently, a transformer using self-attention and a feed forward neural network with a encoder-decoder structure has received much attention from researchers, due to its intrinsic characteristics of global-relation modeling between pixels and an enlarged global receptive field. However, when adapting transformers to SAR ship detection, one challenging issue cannot be ignored. Background clutter, such as a coast, an island, or a sea wave, made previous object detectors easily miss ships with a blurred contour. Therefore, in this paper, we propose a local-sparse-information-aggregation transformer with explicit contour guidance for ship detection in SAR images. Based on the Swin Transformer architecture, in order to effectively aggregate sparse meaningful cues of small-scale ships, a deformable attention mechanism is incorporated to change the original self-attention mechanism. Moreover, a novel contour-guided shape-enhancement module is proposed to explicitly enforce the contour constraints on the one-dimensional transformer architecture. Experimental results show that our proposed method achieves superior performance on the challenging HRSID and SSDD datasets.
ISSN:	2072-4292 2072-4292
DOI:	10.3390/rs14205247