Sparse Coding of Shape Trajectories for Facial Expression and Action Recognition

Bibliographic Details
Published in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2020-10, Vol. 42 (10), pp. 2594-2607
Main Authors: Tanfous, Amor Ben; Drira, Hassen; Amor, Boulbaba Ben
Format: Article
Language: English
Abstract: The detection and tracking of human landmarks in video streams has gained in reliability, partly due to the availability of affordable RGB-D sensors. The analysis of such time-varying geometric data plays an important role in automatic human behavior understanding. However, suitable shape representations, as well as their temporal evolutions, termed trajectories, often lie on nonlinear manifolds. This imposes an additional constraint, namely nonlinearity, on the use of conventional machine learning techniques. As a solution, this paper adapts the well-known Sparse Coding and Dictionary Learning approach to study time-varying shapes on the Kendall shape spaces of 2D and 3D landmarks. We illustrate effective coding of 3D skeletal sequences for action recognition and of 2D facial landmark sequences for macro- and micro-expression recognition. To overcome the inherent nonlinearity of the shape spaces, both intrinsic and extrinsic solutions are explored. As a main result, shape trajectories give rise to more discriminative time-series with suitable computational properties, including sparsity and a vector space structure. Extensive experiments conducted on commonly used datasets demonstrate the competitiveness of the proposed approaches with respect to the state-of-the-art.
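
The extrinsic route mentioned in the abstract (flatten a manifold-valued shape trajectory into a tangent space, then apply standard sparse coding) can be pictured with a short sketch. The following is a minimal illustration under stated assumptions, not the authors' implementation: scikit-learn's DictionaryLearning stands in for the dictionary learning stage, the reference shape is a naively re-normalized mean rather than a Fréchet mean, and the input trajectory is random placeholder data.

```python
import numpy as np
from sklearn.decomposition import DictionaryLearning

def to_preshape(X):
    """Send a k x m landmark matrix to Kendall preshape: remove
    translation (center the centroid) and scale (unit Frobenius norm)."""
    Xc = X - X.mean(axis=0, keepdims=True)
    return Xc / np.linalg.norm(Xc)

def align(X, mu):
    """Orthogonal Procrustes alignment of preshape X onto the reference mu.
    (A full Kendall treatment would restrict to rotations, det R = +1.)"""
    U, _, Vt = np.linalg.svd(mu.T @ X)
    return X @ (U @ Vt).T

def log_map(X, mu):
    """Log map on the unit (preshape) sphere: the tangent vector at mu
    pointing toward X, giving a flat vector-space representation."""
    c = np.clip(np.sum(X * mu), -1.0, 1.0)
    theta = np.arccos(c)
    if theta < 1e-8:
        return np.zeros_like(X)
    return (theta / np.sin(theta)) * (X - c * mu)

# Toy trajectory: T frames of k 2D landmarks (random placeholder data).
rng = np.random.default_rng(0)
T, k, m = 40, 25, 2
traj = rng.standard_normal((T, k, m))

pre = np.stack([to_preshape(F) for F in traj])
mu = to_preshape(pre.mean(axis=0))            # simple reference shape
tang = np.stack([log_map(align(P, mu), mu).ravel() for P in pre])

# Sparse-code each frame against a learned dictionary; the sequence of
# codes is a sparse, vector-valued time-series.
dl = DictionaryLearning(n_components=16, transform_algorithm="omp",
                        transform_n_nonzero_coefs=5, random_state=0)
codes = dl.fit_transform(tang)                # (T, 16) sparse codes
print(codes.shape, np.count_nonzero(codes, axis=1).mean())
```

Each frame then contributes one sparse code vector, and classifying the resulting code sequences rather than the raw manifold-valued shapes is what restores the sparsity and vector space structure the abstract refers to.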
ISSN: 0162-8828, 1939-3539, 2160-9292
DOI: 10.1109/TPAMI.2019.2932979