Improved YOLOv5 for aerial images based on attention mechanism
Object detection based on unmanned aerial vehicle(UAV) platforms is essential for both engineering and research. Complex scale problems in UAV application scenarios require strong regression localization capabilities from target detection algorithms. Nonetheless, due to the constraints of UAV platfo...
Gespeichert in:
Veröffentlicht in: | IEEE access 2023-01, Vol.11, p.1-1 |
---|---|
Hauptverfasser: | , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Object detection based on unmanned aerial vehicle(UAV) platforms is essential for both engineering and research. Complex scale problems in UAV application scenarios require strong regression localization capabilities from target detection algorithms. Nonetheless, due to the constraints of UAV platform, it is difficult to increase accuracy by deepening the network. Therefore, this paper presents an improved YOLOv5 with an attention mechanism, consisting a Convolution-Swin Transformer Block(CSTB) utilizing Swin Transformer as well as a Convolution-block Attention Module(CBAM) to improve network positioning accuracy. In addition, this paper incorporates Bidirectional Feature Pyramid Network(BiFPN) [1], Spatial Pyramid Pooling-Fast(SPPF) and some network components to increase the average precision while maintaining the limited size of the model. Experiments on Visdrone2019 dataset show that the proposed approach can raise the mean Average Precision(mAP) by 5.4% compared to YOLOv5, with only 18% increase in model size. |
---|---|
ISSN: | 2169-3536 2169-3536 |
DOI: | 10.1109/ACCESS.2023.3277931 |