Human Action Recognition Using Improved Salient Dense Trajectories

Human action recognition in videos is a topic of active research in computer vision. Dense trajectory (DT) features were shown to be efficient for representing videos in state-of-the-art approaches. In this paper, we present a more effective approach of video representation using improved salient de...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Computational Intelligence and Neuroscience 2016-01, Vol.2016 (2016), p.376-386
Hauptverfasser:	Huo, Guanying, Zhou, Yan, Cheng, Haisu, Li, Qingwu
Format:	Artikel
Sprache:	eng
Schlagworte:	Accuracy Algorithms Analysis Datasets as Topic Human Activities Human acts Human behavior Human motion Humans Image Processing, Computer-Assisted Intelligence Machine vision Movement Moving object recognition Object recognition (Computers) Pattern recognition Pattern Recognition, Automated Representations Sensors Sparsity State of the art Tracking Trajectories
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Human action recognition in videos is a topic of active research in computer vision. Dense trajectory (DT) features were shown to be efficient for representing videos in state-of-the-art approaches. In this paper, we present a more effective approach of video representation using improved salient dense trajectories: first, detecting the motion salient region and extracting the dense trajectories by tracking interest points in each spatial scale separately and then refining the dense trajectories via the analysis of the motion saliency. Then, we compute several descriptors (i.e., trajectory displacement, HOG, HOF, and MBH) in the spatiotemporal volume aligned with the trajectories. Finally, in order to represent the videos better, we optimize the framework of bag-of-words according to the motion salient intensity distribution and the idea of sparse coefficient reconstruction. Our architecture is trained and evaluated on the four standard video actions datasets of KTH, UCF sports, HMDB51, and UCF50, and the experimental results show that our approach performs competitively comparing with the state-of-the-art results.
ISSN:	1687-5265 1687-5273
DOI:	10.1155/2016/6750459