AD-RCNN: Adaptive Dynamic Neural Network For Small Object Detection

With the large-scale commercialization of 5G networks, Internet of Things (IoT) applications keep on emerging in recent years. Real-time environmental awareness is an essential part of various IoT applications, e.g., self-driving vehicles. Object detection plays a fundamental role in real-time envir...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE internet of things journal 2023-03, Vol.10 (5), p.1-1
Hauptverfasser:	Ou, Zhonghong, Wang, Zhaofengnian, Xiao, Fenrui, Xiong, Baiqiao, Zhang, Hongxing, Song, Meina, Zheng, Yan, Hui, Pan
Format:	Artikel
Sprache:	eng
Schlagworte:	Adaptive systems Autonomous cars Commercialization dynamic network Feature extraction Frames per second Generative adversarial networks High resolution Image resolution Internet of Things Neural networks Object detection Object recognition Proposals Real time small object detection Training
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	With the large-scale commercialization of 5G networks, Internet of Things (IoT) applications keep on emerging in recent years. Real-time environmental awareness is an essential part of various IoT applications, e.g., self-driving vehicles. Object detection plays a fundamental role in real-time environmental awareness, which is responsible for acquiring valuable object information from the environment automatically. Despite of the fast progress for object detection in general, small object detection still faces challenges. Because of the restricted scales, small objects are only capable of generating relatively week features after multiple convolutional layers, thus causing low detection accuracy. Existing schemes mostly focus on extracting rich multi-scale features, e.g., generating high resolution features through Generative Adversarial Networks (GAN), or generating multi-scale features through feature combination. Nevertheless, these schemes require complex network implementation, and usually suffer from high processing delay because of high resolution images. To resolve the problems mentioned above, we propose an Adaptive Dynamic neural network (AD-RCNN) that consists of three fundamental improvements. We first propose a dynamic region proposal network to improve the quality of region proposals. We then introduce a visual attention scheme to generate features of regions. At last, we put forward an adaptive dynamic training module to optimize final detection results. Experimental results demonstrate that AD-RCNN outperforms the state-of-the-art from the perspectives of mAP and Frames per Second (FPS). Specifically, at the resolution of 1024 of TT100K dataset, AD-RCNN achieves 68.8% mAP, which outperforms the baseline Faster RCNN by 8.52%.
ISSN:	2327-4662 2327-4662
DOI:	10.1109/JIOT.2022.3215469