Detection of objects in satellite and aerial imagery using channel and spatially attentive YOLO-CSL for surveillance

A novel lightweight Rotational Object Detection algorithm is proposed to overcome the shortcomings of conventional computer-vision-aided object detection methods used in Remote Sensing and Surveillance that overlook the variability in size and orientation of objects in satellite and aerial images. T...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Image and vision computing 2024-07, Vol.147, p.105070, Article 105070
Hauptverfasser:	Chaurasia, Divyansh, Patro, B.D.K.
Format:	Artikel
Sprache:	eng
Schlagworte:	Channel attention Circular smooth label Computer vision Remote sensing Rotational object detection Spatial attention
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	A novel lightweight Rotational Object Detection algorithm is proposed to overcome the shortcomings of conventional computer-vision-aided object detection methods used in Remote Sensing and Surveillance that overlook the variability in size and orientation of objects in satellite and aerial images. This advanced algorithm integrates a branch dedicated to angle prediction and employs the circular smooth label (CSL) method for angle classification. This approach is suitable for scenarios that require detection in rotational boxes. Our work is further distinguished by the introduction of a novel Channel and Spatial Attention (CSA) module, which is seamlessly integrated into the YOLOv5-CSL framework via the C3CS module. This module accentuates pertinent features through both the channel and spatial attention mechanisms. In addition, bicubic interpolation and the GELU activation function were incorporated into the YOLOv5-CSLA model. Our model achieved 57.86 mAP on the challenging DOTA v2 dataset surpassing the second-best method by 0.20 points and simultaneously consuming 11 million fewer parameters and 103 fewer GFLOPs (our model consumes 25 M Params and 54 GFLOPs), justifying its suitability for deployment on a large majority of platforms, as the compute required is a challenge in real-time deployment scenarios. •Integration of channel and spatial attention in YOLOv5-CSL.•Integration of circular smooth labels and BCE with logit loss using YOLOv5.•The model achieved an mAP of 57.86 on the DOTA-v2 satellite images dataset.•The model consumes just 54 GFLOPS.
ISSN:	0262-8856 1872-8138
DOI:	10.1016/j.imavis.2024.105070