Human activity classification based on sound recognition and residual convolutional neural network

Bibliographic details
Published in: Automation in construction 2020-06, Vol. 114, p. 103177, Article 103177
Main authors: Jung, Minhyuk; Chi, Seokho
Format: Article
Language: English
Description
Abstract: Human activity recognition is crucial for a better understanding of workers on construction sites and people in the built environment. Previous studies have proposed various ways in which sensing and machine learning techniques can be used to collect human activity data automatically. Sound recognition can complement the limitations of these methods because sound signals propagate easily in indoor environments where many physical obstacles exist, and because it can recognize not only sounds from human activities but also sounds from related objects at the same time. Therefore, this study develops a sound recognition-based human activity classification model using a residual neural network. Sound data were collected for ten classes representing people's daily activities in indoor environments. Features were then extracted from the sound data using the log Mel-filter bank energies method, and a residual neural network model with 34 convolutional layers was trained on the data. The model achieved an accuracy of 87.6%; per-class precision ranged from 76.8% to 92.6%, recall from 75.8% to 98.6%, and F1-score from 78.6% to 93.7%. The contribution of this study is to demonstrate that sound recognition can successfully classify people's indoor activities. Its limitation is that the method is monophonic: only one activity can be classified at a time.
• The human activity classification model was proposed based on a sound recognition method.
• The sound dataset for ten human activity classes was developed using open-source video and audio platforms.
• A deep residual neural network with 34 convolutional layers was designed to classify sound data converted into 2D spectrograms.
• The performance of the classification model by human activity class was evaluated and discussed.
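
The abstract reports overall accuracy together with per-class precision, recall, and F1-score. As a minimal sketch of how such figures are conventionally derived from a confusion matrix, the Python below uses the standard definitions. The function name `per_class_scores` and the 3-class matrix `toy` are hypothetical illustrations: they are neither the study's code nor its data, and the record does not say how the authors computed their scores.

```python
def per_class_scores(confusion):
    """Return (accuracy, per-class scores) from a square confusion matrix.

    confusion[i][j] is the number of samples whose true class is i
    and whose predicted class is j.
    """
    n = len(confusion)
    total = sum(sum(row) for row in confusion)
    correct = sum(confusion[i][i] for i in range(n))
    scores = []
    for k in range(n):
        tp = confusion[k][k]
        predicted_k = sum(confusion[i][k] for i in range(n))  # column sum
        actual_k = sum(confusion[k])                          # row sum
        precision = tp / predicted_k if predicted_k else 0.0
        recall = tp / actual_k if actual_k else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        scores.append({"precision": precision, "recall": recall, "f1": f1})
    accuracy = correct / total if total else 0.0
    return accuracy, scores


# Hypothetical toy data (assumption): 3 classes, 10 samples per true class.
toy = [
    [9, 1, 0],
    [2, 6, 2],
    [1, 1, 8],
]
accuracy, scores = per_class_scores(toy)
for k, s in enumerate(scores):
    print(k, round(s["precision"], 3), round(s["recall"], 3), round(s["f1"], 3))
print("accuracy", round(accuracy, 3))
```

The same computation extends unchanged to a ten-class matrix such as the one the study would produce. Reporting the per-class range, as the abstract does, shows which activities the classifier confuses most, which a single accuracy figure hides.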
ISSN: 0926-5805, 1872-7891
DOI: 10.1016/j.autcon.2020.103177