Gated spatio and temporal convolutional neural network for activity recognition: towards gated multimodal deep learning

Detailed description

Bibliographic details
Published in:	EURASIP journal on image and video processing 2017-12, Vol.2017 (1), p.1-12, Article 85
Main authors:	Yudistira, Novanto; Kurita, Takio
Format:	Article
Language:	English
Subjects:	Action recognition; Applications of Visual Analysis of Human Behaviour; Artificial neural networks; Biometrics; CNN; Cues; Deep learning; Engineering; Feature recognition; Gated network; Human activity recognition; Image Processing and Computer Vision; Neural networks; Pattern Recognition; Signal, Image and Speech Processing
Online access:	Full text

Description
Summary:	Human activity recognition requires both visual and temporal cues, making it challenging to integrate these important modalities. The usual schemes for integration are averaging and fixing the weights of both features for all samples. However, how much weight is needed for each sample and modality is still an open question. A mixture of experts via a gating Convolutional Neural Network (CNN) is one promising architecture for adaptively weighting every sample within a dataset. In this paper, rather than just averaging or using fixed weights, we investigate how a natural associative cortex such as a network integrates expert networks to form a gating CNN scheme. Starting from Red Green Blue color model (RGB) values and optical flows, we show that with proper treatment, the gating CNN scheme works well, indicating future approaches to information integration in future activity recognition.
ISSN:	1687-5281, 1687-5176
DOI:	10.1186/s13640-017-0235-9
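
Illustrative note: the summary contrasts fusing the RGB and optical-flow streams with averaging or fixed weights against weighting each sample adaptively through a gating network. The sketch below shows that contrast in its simplest form. It is not the authors' implementation: the function names, the softmax over gate outputs, and all numeric values are assumptions made up for illustration, and the gate outputs are hard-coded where the paper would use a gating CNN.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def average_fusion(rgb_probs, flow_probs):
    """Baseline: the same fixed 0.5/0.5 weights for every sample."""
    return [0.5 * r + 0.5 * f for r, f in zip(rgb_probs, flow_probs)]


def gated_fusion(rgb_probs, flow_probs, gate_logits):
    """Per-sample fusion.

    gate_logits is a hypothetical gating-network output for ONE sample,
    with one logit per expert stream (RGB, optical flow). The softmax
    turns the logits into weights that sum to 1.
    """
    w_rgb, w_flow = softmax(gate_logits)
    fused = [w_rgb * r + w_flow * f for r, f in zip(rgb_probs, flow_probs)]
    return fused, (w_rgb, w_flow)


# Toy class probabilities for one sample over three activity classes.
# These numbers are invented for illustration, not results from the paper.
rgb_probs = [0.7, 0.2, 0.1]    # RGB (spatial) expert
flow_probs = [0.2, 0.6, 0.2]   # optical-flow (temporal) expert
gate_logits = [0.0, 2.0]       # assumed gate output favoring the flow expert

avg = average_fusion(rgb_probs, flow_probs)
fused, weights = gated_fusion(rgb_probs, flow_probs, gate_logits)

print("average fusion:", avg)
print("gated fusion:  ", fused, "weights:", weights)
```

With these toy inputs the two schemes pick different classes: averaging follows the RGB expert, while the gated combination follows the optical-flow expert because the assumed gate assigns it the larger weight for this sample.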