Speeding up Viola-Jones algorithm using multi-Core GPU implementation

Graphic Processing Units (GPUs) offer cheap and high-performance computation capabilities by offloading compute-intensive portions of the application to the GPU, while the remainder of the code still runs on a CPU. This paper introduces an multi-GPU CUDA implementation of training of object detectio...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Masek, Jan, Burget, Radim, Uher, Vaclav, Guney, Selda
Format:	Tagungsbericht
Sprache:	eng
Schlagworte:	Acceleration CUDA Detectors Face face detection Graphics processing units high performance computing Instruction sets multi-GPU Testing Training Viola-Jones detector
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Graphic Processing Units (GPUs) offer cheap and high-performance computation capabilities by offloading compute-intensive portions of the application to the GPU, while the remainder of the code still runs on a CPU. This paper introduces an multi-GPU CUDA implementation of training of object detection using Viola-Jones algorithm that has accelerated of two the most time consuming operations in training process by using two dual-core NVIDIA GeForce GTX 690. When compared to single thread implementation on Intel Core i7 3770 with 3.7 GHz frequency, the first accelerated part of training process was speeded up 151 times and the second accelerated part was speeded up 124 times using two dual-core GPUs. This paper examines overall computational time of the Viola-Jones training process with the use of: one core CPU, one GPU, two GPUs, 3 GPUs and 4GPUs. Trained detector was applied on testing set containing real world images.
DOI:	10.1109/TSP.2013.6614050