YOLO sparse training and model pruning for street view house numbers recognition

This paper proposes a YOLO (You Only Look Once) sparse training and model pruning technique for recognizing house numbers in street view images. YOLO is a popular object detection algorithm that has achieved state-of-the-art performance in various computer vision tasks. However, its large model size...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of physics. Conference series 2023-12, Vol.2646 (1), p.12025
Hauptverfasser: Zhang, Ruohao, Lu, Yijie, Song, Zhengfei
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:This paper proposes a YOLO (You Only Look Once) sparse training and model pruning technique for recognizing house numbers in street view images. YOLO is a popular object detection algorithm that has achieved state-of-the-art performance in various computer vision tasks. However, its large model size and computational complexity limit its deployment on resource-constrained devices such as smartphones and embedded systems. To address this issue, we use a sparse training technique that trains YOLO with L1 norm regularization to encourage the network to learn sparse representations. This results in a significant reduction in the number of parameters and computation without sacrificing accuracy. Furthermore, we apply a model pruning technique to the sparse-trained model to reduce the model size and computation. We evaluate our proposed method on the SVHN (Street View House Numbers) dataset. We show that it performs comparably to the original YOLO model while reducing the model size by 5% and the computation time by 7%. Overall, our proposed YOLO sparse training and model pruning technique provides an effective solution for deploying YOLO-based object detection models on resource-constrained devices.
ISSN:1742-6588
1742-6596
DOI:10.1088/1742-6596/2646/1/012025