TIR-Net: Task Integration Based on Rotated Convolution Kernel for Oriented Object Detection in Aerial Images
The application of oriented object detection in the field of aerial images has gained substantial attention and made significant progress. However, most one-stage object detectors struggle to extract rotation-invariant features of oriented objects using ordinary convolutions. And the structure of tw...
Gespeichert in:
Veröffentlicht in: | IEEE transactions on geoscience and remote sensing 2024, Vol.62, p.1-13 |
---|---|
Hauptverfasser: | , , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | The application of oriented object detection in the field of aerial images has gained substantial attention and made significant progress. However, most one-stage object detectors struggle to extract rotation-invariant features of oriented objects using ordinary convolutions. And the structure of two parallel vision subtasks can result in the inconsistency between the classification and regression. In this article, we propose a Task Integration based on a Rotated convolution kernel Network (TIR-Net) consisting of three modules: selective rotation of the kernel (SRK), regression feature refinement (RFR), and task integration (TI). Specifically, an SRK module enhances classification features by applying rotated convolution kernels selectively, introducing rotational invariance to the features. An RFR module places more emphasis on feature extraction of large aspect ratio objects to improve their perceptibility. A TI module integrates classification and regression features to alleviate the inconsistency between classification and regression. Experimental evaluations on the benchmarks for oriented object detection indicate that our method achieves excellent detection performance. |
---|---|
ISSN: | 0196-2892 1558-0644 |
DOI: | 10.1109/TGRS.2024.3412167 |