A Double-Head Global Reasoning Network for Object Detection of Remote Sensing Images

Object detection of remote sensing images is a fundamental and important task for a wide range of applications, such as agriculture, military, and geological exploration. However, it is still a challenging task due to complex background, heavy occlusion, and object ambiguities. Complex remote sensin...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE transactions on geoscience and remote sensing 2024, Vol.62, p.1-16
Hauptverfasser:	Shi, Jingye, Zhi, Ruicong, Zhao, Jingru, Hou, Yuewu, Zhao, Zeyong, Wu, Jingyi, Liu, Jiqiang
Format:	Artikel
Sprache:	eng
Schlagworte:	Artificial neural networks Classification Cognition Detection Detectors Double-head global reasoning network (DGRN) Embedding Feature extraction Geological surveys Graph neural networks graph reasoning Graphs Localization Marine vehicles Neural networks object detection Object recognition Occlusion Performance enhancement Proposals Reasoning Remote sensing remote sensing images YOLO
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Object detection of remote sensing images is a fundamental and important task for a wide range of applications, such as agriculture, military, and geological exploration. However, it is still a challenging task due to complex background, heavy occlusion, and object ambiguities. Complex remote sensing scenes contain rich visual and spatial relationships between proposals, while most existing methods ignore these relationships. In this article, we introduce a novel double-head global reasoning network (DGRN), which endows an object detection model with the ability to adapt global reasoning by propagating visual and spatial embeddings between positive proposals (foreground) and negative proposals (background). Instead of propagating the features of positive proposals, we evolve high-level embeddings globally to explore potential information in all proposals. Specifically, our method first classifies and locates positive proposals and negative proposals simultaneously, and then adaptively learns two sparse region-to-region undirected graphs for classification and location, respectively. Finally, the graph reasoning module (GRM) conducts the propagation of proposals' embedding to improve the performance of object classification and localization. Without any prior knowledge, our DGRN method explores reasoning graphs between proposals for object classification and location, respectively, and then uses graph convolutional neural network (GCN) to propagate information and achieve relational reasoning. Our method is light-weighted and flexible enough to enhance many object detection models, making object detection models the capability of relational reasoning. The experimental results on DOTA and DIOR showed that the proposed method had better detection performance than object detection networks without considering relationships.
ISSN:	0196-2892 1558-0644
DOI:	10.1109/TGRS.2023.3347798