Multiscale anchor box and optimized classification with faster R‐CNN for object detection

Bibliographic details
Published in: IET Image Processing, 2023-04, Vol. 17 (5), p. 1322-1333
Main authors: Wang, Sheng‐Ye; Qu, Zhong
Format: Article
Language: English
Online access: Full text
Description
Summary: For a two‐stage object detector such as the faster region‐convolutional neural network (Faster R‐CNN), improving the accuracy of object recognition depends on the proposal boxes generated by the region proposal algorithm. Because of the limitations of the anchor settings in Faster R‐CNN, the proposal boxes generated by the region proposal network (RPN) are large, which easily causes a large number of overflows during the sliding search. To improve the accuracy of object detection and mitigate the overflow problem of the anchor box, multi‐scale anchor box and moving overflow anchor box strategies are introduced. Then, to increase the positive sample range of the foreground, a hierarchical weight cross‐entropy classification function is used for binary classification in the RPN. These strategies improve the accuracy of object detection. The method achieves 76.2% AP on the Pascal VOC 2007 (VOC 07) dataset, which is 2.7% higher than Faster R‐CNN. On the Pascal VOC 2012 (VOC 12) test set, it achieves 75.6% AP, an improvement of 2.5% over Faster R‐CNN.
ISSN: 1751-9659, 1751-9667
DOI: 10.1049/ipr2.12714
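
The summary names three ideas (multi‐scale anchors, moving overflowing anchors, and a weighted cross‐entropy for the RPN's binary classifier) without giving their details. The following is only a minimal Python sketch of what such steps could look like. Every function name, the scale and ratio values, the "translate the box back inside the image" rule, and the single `pos_weight` factor are assumptions made for illustration; they are not taken from the paper, and the paper's hierarchical weighting scheme may differ.

```python
import math
from itertools import product


def make_anchors(cx, cy, scales, ratios):
    """Return (x1, y1, x2, y2) anchors centred on (cx, cy), one per scale/ratio pair."""
    anchors = []
    for s, r in product(scales, ratios):
        # r is height / width; the anchor area stays s * s for every ratio.
        w = s / math.sqrt(r)
        h = s * math.sqrt(r)
        anchors.append((cx - w / 2, cy - h / 2, cx + w / 2, cy + h / 2))
    return anchors


def shift_inside(box, img_w, img_h):
    """Translate an overflowing box back inside the image (assumed rule).

    The box keeps its size; a box larger than the image is clipped instead.
    """
    x1, y1, x2, y2 = box
    dx = -x1 if x1 < 0 else min(0.0, img_w - x2)
    dy = -y1 if y1 < 0 else min(0.0, img_h - y2)
    x1, x2 = x1 + dx, x2 + dx
    y1, y2 = y1 + dy, y2 + dy
    return (max(x1, 0.0), max(y1, 0.0), min(x2, img_w), min(y2, img_h))


def weighted_bce(p, label, pos_weight=2.0):
    """Binary cross-entropy with a larger weight on foreground (label 1).

    A plain weighted variant, used here as a stand-in for the paper's
    hierarchical weighting, which the summary does not specify.
    """
    eps = 1e-7
    p = min(max(p, eps), 1.0 - eps)  # keep log() finite
    return -(pos_weight * label * math.log(p) + (1 - label) * math.log(1.0 - p))


if __name__ == "__main__":
    # Anchors near the top-left corner overflow the image and are moved back in.
    for a in make_anchors(8, 8, scales=[32, 64], ratios=[0.5, 1.0, 2.0]):
        print(a, "->", shift_inside(a, img_w=100, img_h=100))
    print(weighted_bce(0.5, 1), weighted_bce(0.5, 0))
```

Shifting rather than discarding an overflowing anchor keeps it available as a training sample near the image border, which is one plausible reading of the "moving overflow anchor box" strategy.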