CTM-YOLOv8n: A Lightweight Pedestrian Traffic-Sign Detection and Recognition Model with Advanced Optimization

Traffic-sign detection and recognition (TSDR) is crucial to avoiding harm to pedestrians, especially children, from intelligent connected vehicles and has become a research hotspot. However, due to motion blurring, partial occlusion, and smaller sign sizes, pedestrian TSDR faces increasingly signifi...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:World Electric Vehicle Journal 2024-07, Vol.15 (7), p.285
Hauptverfasser: Chen, Qiang, Dai, Zhongmou, Xu, Yi, Gao, Yuezhen
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Traffic-sign detection and recognition (TSDR) is crucial to avoiding harm to pedestrians, especially children, from intelligent connected vehicles and has become a research hotspot. However, due to motion blurring, partial occlusion, and smaller sign sizes, pedestrian TSDR faces increasingly significant challenges. To overcome these difficulties, a CTM-YOLOv8n model is proposed based on the YOLOv8n model. With the aim of extracting spatial features more efficiently and making the network faster, the C2f Faster module is constructed to replace the C2f module in the head, which applies filters to only a few input channels while leaving the remaining ones untouched. To enhance small-sign detection, a tiny-object-detection (TOD) layer is designed and added to the first C2f layer in the backbone. Meanwhile, the seventh Conv layer, eighth C2f layer, and connected detection head are deleted to reduce the quantity of model parameters. Eventually, the original CIoU is replaced by the MPDIoU, which is better for training deep models. During experiments, the dataset is augmented, which contains the choice of categories ‘w55’ and ‘w57’ in the TT100K dataset and a collection of two types of traffic signs around the schools in Tianjin. Empirical results demonstrate the efficacy of our model, showing enhancements of 5.2% in precision, 10.8% in recall, 7.0% in F1 score, and 4.8% in mAP@0.50. However, the number of parameters is reduced to 0.89M, which is only 30% of the YOLOv8n model. Furthermore, the proposed CTM-YOLOv8n model shows superior performance when tested against other advanced TSDR models.
ISSN:2032-6653
2032-6653
DOI:10.3390/wevj15070285