Continuous Audiovisual Emotion Recognition Using Feature Selection and LSTM
Published in: Journal of Signal Processing, 2020/11/01, Vol. 24(6), pp. 229-235
Main Authors:
Format: Article
Language: English
Subjects:
Online Access: Full text
Abstract: Speech and visual information are the most dominant modalities through which humans perceive emotion. A method for recognizing human emotion from these modalities is proposed that utilizes feature selection and long short-term memory (LSTM) neural networks. A feature selection method based on support vector regression selects the relevant features from among the thousands of features obtained by extending speech and video features via bag-of-X-words. The LSTM neural networks are then trained on the selected features and optimized separately for each emotion dimension. Instead of utterance-level emotion recognition, time-frame-based processing is performed to enable continuous emotion recognition using a database labeled for each time frame. Experimental results reveal that the system with feature selection predicts emotion dimensions for a single language more effectively than the baseline system without feature selection. Performance is measured in terms of the concordance correlation coefficient averaged over the valence, arousal, and liking dimensions.
ISSN: 1342-6230, 1880-1013
DOI: 10.2299/jsp.24.229
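The evaluation metric named in the abstract, the concordance correlation coefficient (CCC), is a standard agreement measure that penalizes both scale and location shifts between predictions and labels. A minimal sketch of its computation follows; the function name and the sample values are illustrative and not taken from the paper.

```python
import numpy as np

def ccc(y_true, y_pred):
    """Concordance correlation coefficient between labels and predictions.

    CCC = 2*cov(t, p) / (var(t) + var(p) + (mean(t) - mean(p))^2),
    using population (ddof=0) variances and covariance.
    """
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    mu_t, mu_p = y_true.mean(), y_pred.mean()
    var_t, var_p = y_true.var(), y_pred.var()
    cov = ((y_true - mu_t) * (y_pred - mu_p)).mean()
    return 2.0 * cov / (var_t + var_p + (mu_t - mu_p) ** 2)

# Perfect agreement yields 1.0; a constant offset lowers the score
print(ccc([0.1, 0.5, 0.9], [0.1, 0.5, 0.9]))  # → 1.0
```

In a setup like the one described, such a score would be computed per emotion dimension (valence, arousal, liking) over the time-frame predictions and then averaged.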