Sinextnet: A New Small Object Detection Model for Aerial Images Based on PP-Yoloe

Although object detection has achieved great success in the field of computer vision in the past few years, the performance of detecting small objects has not yet achieved ideal results. For instance, UAV aerial photography object detection plays an important role in traffic monitoring and other fie...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of Artificial Intelligence and Soft Computing Research 2024-06, Vol.14 (3), p.251-265
Hauptverfasser: Zhang, Wenkang, Hong, Zhiyong, Xiong, Liping, Zeng, Zhiqiang, Cai, Zhishun, Tan, Kunyu
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Although object detection has achieved great success in the field of computer vision in the past few years, the performance of detecting small objects has not yet achieved ideal results. For instance, UAV aerial photography object detection plays an important role in traffic monitoring and other fields, but it faces some great challenges. The objects in aerial images are mainly small objects, the resolution of whom is low and the feature expression ability of whom is very weak. Information will be lost in high-dimensional feature maps, and this information is very important for the classification and positioning of small objects. The most common way to improve small object detection accuracy is to use high-resolution images, but this incurs additional computational costs. To address the above-mentioned problems, this article proposes a new model SINextNet, which uses a new dilated convolution module SINext block. This module is based on depth-separable convolution, and can improve the receptive field of the model. While extracting small object features, it can combine small object features with background information, greatly improving the feature expression ability of small objects. The experimental results indicate that the method proposed in this paper can achieve advanced performance across multiple aerial datasets.
ISSN:2449-6499
2449-6499
DOI:10.2478/jaiscr-2024-0014