Deep learning-based human activity recognition using CNN, ConvLSTM, and LRCN

Human activity recognition (HAR) plays a crucial role in assisting the elderly and individuals with vascular dementia by providing support and monitoring for their daily activities. This paper presents a deep learning (DL)-based approach to HAR, leveraging convolutional neural network (CNN), convolu...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:International journal of cognitive computing in engineering 2024, Vol.5, p.259-268
Hauptverfasser: Uddin, Md. Ashraf, Talukder, Md. Alamin, Uzzaman, Muhammad Sajib, Debnath, Chandan, Chanda, Moumita, Paul, Souvik, Islam, Md. Manowarul, Khraisat, Ansam, Alazab, Ammar, Aryal, Sunil
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Human activity recognition (HAR) plays a crucial role in assisting the elderly and individuals with vascular dementia by providing support and monitoring for their daily activities. This paper presents a deep learning (DL)-based approach to HAR, leveraging convolutional neural network (CNN), convolutional long short-term memory (ConvLSTM), and long-term recurrent convolutional network (LRCN) architectures. These models are designed to extract spatial features and capture temporal dependencies in video data, enhancing the accuracy of activity classification. We conducted experiments on the UCF50 and HMDB51 video datasets, encompassing diverse human activities. Our evaluation demonstrates that the ConvLSTM model achieves an accuracy of 82% on UCF50 and 68% on HMDB51, while the LRCN model gives accuracies of 93.44% and 71.55%, respectively. Finally, the CNN model outperforms with an accuracy rate of 99.58% for the UCF50 and 92.70% for the HMDB51 datasets. These significant improvements showcase the effectiveness of integrating convolutional and recurrent neural networks for HAR tasks. Our research contributes to advancing HAR systems with potential applications in healthcare, assisted living, and surveillance. By accurately recognizing human activities, our models can assist in remote patient monitoring, fall detection, and public safety initiatives. These findings underscore the importance of DL in enhancing the quality of life and safety for individuals in various contexts. •Deep Learning approach enhances human activity accuracy in video data.•CNN, ConvLSTM, and LRCN capture spatial and temporal features in videos.•CNN model surpasses 99.58% accuracy on UCF50 and 92.70% on HMDB51 datasets.•Proposal aids in remote monitoring, fall detection, and public safety initiatives.
ISSN:2666-3074
2666-3074
DOI:10.1016/j.ijcce.2024.06.004