A review of Convolutional-Neural-Network-based action recognition
•We provide a review of Convolutional Neural Network based action recognition.•The review follows a clue of temporal information exploitation.•We discuss the performance of action recognition on recent large-scale benchmarks.•We indicate the limitations and future research of Convolutional Neural Ne...
Gespeichert in:
Veröffentlicht in: | Pattern recognition letters 2019-02, Vol.118, p.14-22 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | •We provide a review of Convolutional Neural Network based action recognition.•The review follows a clue of temporal information exploitation.•We discuss the performance of action recognition on recent large-scale benchmarks.•We indicate the limitations and future research of Convolutional Neural Network based action recognition.
Video action recognition is widely applied in video indexing, intelligent surveillance, multimedia understanding, and other fields. Recently, it was greatly improved by incorporating the learning of deep information using Convolutional Neural Network (CNN). This motivated us to review the notable CNN-based action recognition works. Because CNN is primarily designed to extract 2D spatial features from still image and videos are naturally viewed as 3D spatiotemporal signals, the core issue of extending the CNN from image to video is temporal information exploitation. We divide the solutions for exploiting temporal information exploration into three strategies: 1) 3D CNN; 2) taking the motion-related information as the CNN input; and 3) fusion. In this paper, we present a comprehensive review of the CNN-based action recognition methods according to these strategies. We also discuss the action recognition performance on recent large-scale benchmarks and the limitations and future research directions of CNN-based action recognition. This paper offers an objective and clear review of CNN-based action recognition and provides a guide for future research. |
---|---|
ISSN: | 0167-8655 1872-7344 |
DOI: | 10.1016/j.patrec.2018.05.018 |