Recognizing Human Action at a Distance in Video by Key Poses

Bibliographic details
Published in:	IEEE transactions on circuits and systems for video technology 2011-09, Vol.21 (9), p.1228-1241
Main authors:	Mukherjee, S., Biswas, S. K., Mukherjee, D. P.
Format:	Article
Language:	English
Subjects:	Action recognition; Applied sciences; centrality measure; Circuits; Detection, estimation, filtering, equalization, prediction; Exact sciences and technology; Graphs; Histograms; Human; Humans; Image processing; Information, signal and communications theory; key pose; Legged locomotion; Mathematical analysis; Methodology; Optical attenuators; Optical imaging; Pattern recognition; Recognition; Signal and communications theory; Signal processing; Signal, noise; Telecommunications and information theory; Vectors (mathematics); Visual; Visualization; Vocabulary

Description
Abstract:	In this paper, we propose a graph theoretic technique for recognizing human actions at a distance in a video by modeling the visual senses associated with poses. The proposed methodology follows a bag-of-word approach that starts with a large vocabulary of poses (visual words) and derives a refined and compact codebook of key poses using centrality measure of graph connectivity. We introduce a "meaningful" threshold on centrality measure that selects key poses for each action type. Our contribution includes a novel pose descriptor based on histogram of oriented optical flow evaluated in a hierarchical fashion on a video frame. This pose descriptor combines both pose information and motion pattern of the human performer into a multidimensional feature vector. We evaluate our methodology on four standard activity-recognition datasets demonstrating the superiority of our method over the state-of-the-art.
ISSN:	1051-8215, 1558-2205
DOI:	10.1109/TCSVT.2011.2135290