Learning Higher Quality Rotation Invariance Features for Multioriented Object Detection in Remote Sensing Images

Detailed description

Bibliographic details
Published in: IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2021, Vol. 14, pp. 5842-5853
Main authors: Zhang, Caiguang; Xiong, Boli; Li, Xiao; Zhang, Jinqian; Kuang, Gangyao
Format: Article
Language: English
Online access: Full text
Description
Summary: Multioriented object detection is an important yet challenging task in remote sensing images because of the bird's-eye-view perspective, complex backgrounds, and densely packed objects, and it has become a focus of detection research in this domain. Although existing methods based on oriented heads have made substantial progress, they learn little about the essential rotation invariance of the objects. In this article, a novel framework is proposed that learns high-quality rotation-invariant features of multioriented objects by three measures. Given a remote sensing image, the multiscale semantic segmentation feature fusion module first merges the global semantic segmentation features predicted by the semantic segmentation branch with the multiscale features extracted by the backbone with FPN, in order to distinguish objects from complex backgrounds. The resulting discriminative features are then used by the rotation mainstream, whose structure is similar to Cascade R-CNN; it extracts higher-quality rotation-invariant features and predicts more accurate location information by adaptively adjusting the distribution of the samples through progressive intersection-over-union (IoU) thresholds. To help the mainstream predict more accurate oriented bounding boxes, horizontal tributaries, which leverage the reciprocal relationship between oriented detection and horizontal detection, are added to the latter two stages. Extensive experiments on three public remote sensing datasets, i.e., Gaofen Airplane, HRSC2016, and DOTA, demonstrate that, without bells and whistles, the proposed method achieves superior performance compared with existing state-of-the-art methods for multioriented detection. Moreover, the overall system achieves 59.264% mAP for airplane detection in the 2020 Gaofen challenge, ranking third in the final.
ISSN: 1939-1404, 2151-1535
DOI: 10.1109/JSTARS.2021.3085665
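
Illustration: the summary mentions that the cascade-style mainstream adjusts the sample distribution through progressive IoU thresholds. The following is only a minimal sketch of that general idea, not the authors' implementation. It assumes axis-aligned boxes for simplicity (the article works with oriented boxes), and the thresholds 0.5, 0.6, and 0.7 are the values commonly associated with Cascade R-CNN, not values stated in this record. The names `iou` and `positives_per_stage` are hypothetical.

```python
from typing import List, Sequence, Tuple

# Axis-aligned box as (x1, y1, x2, y2); a simplification of oriented boxes.
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def positives_per_stage(
    proposals: Sequence[Box],
    gt: Box,
    thresholds: Sequence[float] = (0.5, 0.6, 0.7),  # assumed values
) -> List[List[Box]]:
    """For each stage, keep the proposals whose IoU with the ground
    truth reaches that stage's threshold. Later stages use stricter
    thresholds, so they train on fewer but better-localized samples."""
    return [[p for p in proposals if iou(p, gt) >= t] for t in thresholds]


gt_box: Box = (0.0, 0.0, 10.0, 10.0)
candidates: List[Box] = [
    (0.0, 0.0, 10.0, 10.0),
    (1.0, 0.0, 11.0, 10.0),
    (2.0, 0.0, 12.0, 10.0),
    (3.0, 0.0, 13.0, 10.0),
    (6.0, 0.0, 16.0, 10.0),
]
stage_counts = [len(s) for s in positives_per_stage(candidates, gt_box)]
print(stage_counts)
```

In a real cascade, each stage also refines the boxes before the next stage resamples them, which this sketch omits; it only shows how raising the threshold narrows the set of positive samples.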