Statistical Motion Information Extraction and Representation for Semantic Video Analysis

In this paper, an approach to semantic video analysis that is based on the statistical processing and representation of the motion signal is presented. Overall, the examined video is temporally segmented into shots and for every resulting shot appropriate motion features are extracted; using these,...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE transactions on circuits and systems for video technology 2009-10, Vol.19 (10), p.1513-1528
Hauptverfasser:	Papadopoulos, G.T., Briassouli, A., Mezaris, V., Kompatsiaris, I., Strintzis, M.G.
Format:	Artikel
Sprache:	eng
Schlagworte:	Applied sciences Circuits Data mining Exact sciences and technology Feature extraction Hidden Markov models Hidden Markov models (HMMs) Image motion analysis Image processing Information analysis Information processing Information theory Information, signal and communications theory Kurtosis Mathematical models Miscellaneous Motion analysis motion representation News Optical noise Pixels Representations semantic video analysis Semantics Shot Signal analysis Signal and communications theory Signal processing Signal representation. Spectral analysis Signal, noise Telecommunications and information theory
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	In this paper, an approach to semantic video analysis that is based on the statistical processing and representation of the motion signal is presented. Overall, the examined video is temporally segmented into shots and for every resulting shot appropriate motion features are extracted; using these, hidden Markov models (HMMs) are employed for performing the association of each shot with one of the semantic classes that are of interest. The novel contributions of this paper lie in the areas of motion information processing and representation. Regarding the motion information processing, the kurtosis of the optical flow motion estimates is calculated for identifying which motion values originate from true motion rather than measurement noise. Additionally, unlike the majority of the approaches of the relevant literature that are mainly limited to global- or camera-level motion representations, a new representation for providing local-level motion information to HMMs is also presented. It focuses only on the pixels where true motion is observed. For the selected pixels, energy distribution-related information, as well as a complementary set of features that highlight particular spatial attributes of the motion signal, are extracted. Experimental results, as well as comparative evaluation, from the application of the proposed approach in the domains of Tennis , News and Volleyball broadcast video, and Human Action video demonstrate the efficiency of the proposed method.
ISSN:	1051-8215 1558-2205
DOI:	10.1109/TCSVT.2009.2026932