Combined action recognition method and system

The invention provides a combined action recognition method and system, and belongs to the technical field of computer vision, and the method comprises the steps: obtaining the geometric position information of an object contained in a to-be-recognized video; obtaining multi-modal features of each o...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	MI HUADONG, LU YANSHA, CHANG FALIANG, LIU CHUNSHENG, LI NANJUN
Format:	Patent
Sprache:	chi ; eng
Schlagworte:	CALCULATING COMPUTING COUNTING HANDLING RECORD CARRIERS PHYSICS PRESENTATION OF DATA RECOGNITION OF DATA RECORD CARRIERS
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	The invention provides a combined action recognition method and system, and belongs to the technical field of computer vision, and the method comprises the steps: obtaining the geometric position information of an object contained in a to-be-recognized video; obtaining multi-modal features of each object based on the geometric position coordinates; wherein the multi-modal features comprise geometric characterization features, visual characterization features and motion flow features; according to the multi-modal features, obtaining feature vectors representing interaction relationships among the objects; and based on the feature vectors representing the interaction relationship between the objects, performing action classification to obtain a final combined action recognition result. According to the method, through a Transform module based on a self-attention mechanism, the interaction relation of different objects in a video sequence in a time domain and a space domain is deduced, RGB visual representation