Object Detection and Tracking using YOLO v3 Framework for Increased Resolution Video

Bibliographic details
Published in:	International journal of innovative technology and exploring engineering 2020-04, Vol.9 (6), p.118-125
Main authors:	Shakil A., Shaikh, Kureshi, Dr. A. K.
Format:	Article
Language:	English
Online access:	Full text

Description
Summary:	The proposed system detects and tracks vehicles in high-resolution video. It detects each object (vehicle) and recognizes it by comparing its features with the features of objects stored in a database. If the features match, the object is tracked. The implementation has two phases, an offline process and an online process. In the offline process, image data are passed to a feature extractor and then to the YOLO v3 model, and a weight file is generated from the pre-trained YOLO v3 model. In the online phase, real-time video is passed to the feature extractor, and the extracted features are applied to the pre-trained YOLO v3 model. The weight file produced offline is the second input to the pre-trained model. The YOLO v3 model processes each video frame using the extracted features and the weight file, and outputs a classified image. YOLO v3 uses Darknet-53, together with Keras and libraries including OpenCV, TensorFlow, and NumPy. The proposed system is implemented on an Intel Pentium G500 PC with 8 GB, running Windows 7. The system is tested on the PASCAL VOC dataset, and the results obtained are accuracy 80%, precision 80%, recall 100%, F1-score 88%, mAP 76.7%, and 0.018%. The system is implemented in Python 3.6.0 and is also tested on real-time video at 1280x720 and 1920x1080 resolutions. The execution time for one frame of video at 1280x720 (HD) and 1920x1080 (FHD) is 1.840 seconds and 4.414808 seconds, respectively, with an accuracy of 80%.
ISSN:	2278-3075
DOI:	10.35940/ijitee.E3038.049620
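
The figures in the summary can be cross-checked with a little arithmetic. The sketch below assumes the standard definition of the F1-score as the harmonic mean of precision and recall; the summary does not say how the authors computed it, and the function names `f1_score` and `frames_per_second` are illustrative, not taken from the article. With precision 80% and recall 100%, this definition gives about 88.9%, which is in line with the reported 88%. The second helper converts the reported per-frame execution times into an implied throughput.

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall, both given as fractions in [0, 1]."""
    if precision + recall == 0:
        return 0.0  # avoid division by zero when both inputs are zero
    return 2 * precision * recall / (precision + recall)


def frames_per_second(seconds_per_frame):
    """Throughput implied by a per-frame execution time in seconds."""
    return 1.0 / seconds_per_frame


# Precision and recall as reported in the summary.
precision, recall = 0.80, 1.00
print("F1 = {:.1%}".format(f1_score(precision, recall)))

# Per-frame execution times as reported in the summary.
for label, seconds in [("1280x720", 1.840), ("1920x1080", 4.414808)]:
    print("{}: {:.3f} frames/s".format(label, frames_per_second(seconds)))
```

Both reported frame times correspond to well under one frame per second on the stated hardware.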