Weighted voting of multi-stream convolutional neural networks for video-based action recognition using optical flow rhythms

•We propose a multi-stream approach to solve the human action recognition problem.•The approach is based on the weighted voting of convolutional neural networks.•The voting weights are defined using a simulated annealing metaheuristic.•Multiple visual rhythms are employed to improve the spatio-tempo...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Journal of visual communication and image representation 2021-05, Vol.77, p.103112, Article 103112
Hauptverfasser:	de Souza Brito, André, Bernardes Vieira, Marcelo, Moraes Villela, Saulo, Tacon, Hemerson, de Lima Chaves, Hugo, de Almeida Maia, Helena, Ttito Concha, Darwin, Pedrini, Helio
Format:	Artikel
Sprache:	eng
Schlagworte:	Action recognition Convolutional neural networks Optical flow rhythm
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	•We propose a multi-stream approach to solve the human action recognition problem.•The approach is based on the weighted voting of convolutional neural networks.•The voting weights are defined using a simulated annealing metaheuristic.•Multiple visual rhythms are employed to improve the spatio-temporal stream.•A novel temporal video representation is proposed to model long-term motion dynamics. Two of the most important premises of an ensemble are the diversity of its components and how to combine their votes. In this paper, we propose a multi-stream architecture based on the weighted voting of convolutional neural networks to deal with the problem of recognizing human actions in videos. A major challenge is how to include temporal aspects into this kind of approach. A key step in this direction is the selection of features that characterize the complexity of human actions in time. In this context, we propose a new stream, Optical Flow Rhythm, besides using other streams for diversity. To combine the streams, a voting system based on a new weighted average fusion method is introduced. In this scheme, the weights of classifiers are defined by an optimization process led by a metaheuristic. Experiments conducted on the UCF101 and HMDB51 datasets demonstrate that our method is comparable to state-of-the-art approaches.
ISSN:	1047-3203 1095-9076
DOI:	10.1016/j.jvcir.2021.103112