A deep neural network model for multi-view human activity recognition

Bibliographic Details
Published in: PLoS ONE 2022-01, Vol. 17 (1), p. e0262181
Main authors: Putra, Prasetia Utama; Shima, Keisuke; Shimatani, Koji
Format: Article
Language: English
Online access: Full text
Description
Abstract: Multiple cameras are used to resolve the occlusion problem that often occurs in single-view human activity recognition. Building on the success of representation learning with deep neural networks (DNNs), recent works have proposed DNN models to estimate human activity from multi-view inputs. However, currently available datasets are inadequate for training DNN models to a high accuracy rate. To address this issue, this study presents a DNN model, trained with transfer learning and shared-weight techniques, to classify human activity from multiple cameras. The model comprises pre-trained convolutional neural networks (CNNs), attention layers, long short-term memory networks with residual learning (LSTMRes), and Softmax layers. The experimental results suggest that the proposed model achieves promising performance on challenging multi-view human activity recognition (MVHAR) datasets: IXMAS (97.27%) and i3DPost (96.87%). A competitive recognition rate was also observed in online classification.
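
The abstract describes the architecture but not its implementation. The following is a minimal sketch, assuming a PyTorch/torchvision implementation, of a multi-view classifier with a shared-weight pre-trained CNN backbone, an attention layer over views, an LSTM with a residual connection, and a softmax classifier. The ResNet-18 backbone, layer sizes, view-fusion scheme, and class count are illustrative assumptions, not the authors' published code.

    # Illustrative sketch only (not the authors' code). Assumes PyTorch and torchvision.
    import torch
    import torch.nn as nn
    from torchvision import models

    class MultiViewActivityNet(nn.Module):
        def __init__(self, num_classes: int):
            super().__init__()
            # Transfer learning: a pre-trained CNN backbone whose weights are shared
            # (the same module is applied) across all camera views.
            backbone = models.resnet18(weights=models.ResNet18_Weights.DEFAULT)
            self.cnn = nn.Sequential(*list(backbone.children())[:-1])  # -> (N, 512, 1, 1)
            self.feat_dim = 512
            # Attention layer producing a scalar score per view and frame.
            self.attn = nn.Linear(self.feat_dim, 1)
            # LSTM over time with a residual connection (input added back to output).
            self.lstm = nn.LSTM(self.feat_dim, self.feat_dim, batch_first=True)
            self.classifier = nn.Linear(self.feat_dim, num_classes)

        def forward(self, x: torch.Tensor) -> torch.Tensor:
            # x: (batch, views, time, channels, height, width)
            b, v, t, c, h, w = x.shape
            feats = self.cnn(x.reshape(b * v * t, c, h, w)).reshape(b, v, t, self.feat_dim)
            # Softmax attention over views, then attention-weighted fusion per frame.
            weights = torch.softmax(self.attn(feats), dim=1)   # (b, v, t, 1)
            fused = (weights * feats).sum(dim=1)               # (b, t, feat_dim)
            out, _ = self.lstm(fused)
            out = out + fused                                  # residual learning
            return self.classifier(out[:, -1])                 # class logits

    # Example: one clip seen by 2 cameras, 8 frames of 224x224 RGB, 11 action classes.
    model = MultiViewActivityNet(num_classes=11)
    logits = model(torch.randn(1, 2, 8, 3, 224, 224))
    probabilities = torch.softmax(logits, dim=-1)

Sharing the backbone weights across views keeps the parameter count independent of the number of cameras, which is one plausible reason such a design can be trained on relatively small multi-view datasets.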
ISSN: 1932-6203
DOI: 10.1371/journal.pone.0262181