AD-RCNN: Adaptive Dynamic Neural Network For Small Object Detection
With the large-scale commercialization of 5G networks, Internet of Things (IoT) applications keep on emerging in recent years. Real-time environmental awareness is an essential part of various IoT applications, e.g., self-driving vehicles. Object detection plays a fundamental role in real-time envir...
Gespeichert in:
Veröffentlicht in: | IEEE internet of things journal 2023-03, Vol.10 (5), p.1-1 |
---|---|
Hauptverfasser: | , , , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | With the large-scale commercialization of 5G networks, Internet of Things (IoT) applications keep on emerging in recent years. Real-time environmental awareness is an essential part of various IoT applications, e.g., self-driving vehicles. Object detection plays a fundamental role in real-time environmental awareness, which is responsible for acquiring valuable object information from the environment automatically. Despite of the fast progress for object detection in general, small object detection still faces challenges. Because of the restricted scales, small objects are only capable of generating relatively week features after multiple convolutional layers, thus causing low detection accuracy. Existing schemes mostly focus on extracting rich multi-scale features, e.g., generating high resolution features through Generative Adversarial Networks (GAN), or generating multi-scale features through feature combination. Nevertheless, these schemes require complex network implementation, and usually suffer from high processing delay because of high resolution images. To resolve the problems mentioned above, we propose an Adaptive Dynamic neural network (AD-RCNN) that consists of three fundamental improvements. We first propose a dynamic region proposal network to improve the quality of region proposals. We then introduce a visual attention scheme to generate features of regions. At last, we put forward an adaptive dynamic training module to optimize final detection results. Experimental results demonstrate that AD-RCNN outperforms the state-of-the-art from the perspectives of mAP and Frames per Second (FPS). Specifically, at the resolution of 1024 of TT100K dataset, AD-RCNN achieves 68.8% mAP, which outperforms the baseline Faster RCNN by 8.52%. |
---|---|
ISSN: | 2327-4662 2327-4662 |
DOI: | 10.1109/JIOT.2022.3215469 |