Multi-modal skeleton action recognition method and device

The invention relates to the field of artificial intelligence visual language multi-modality, in particular to a multi-modality skeleton action recognition method and device. The method comprises the following steps: acquiring and preprocessing skeleton data, and extracting corresponding skeleton or...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	CHEN QIJUN, LIU CHENGJU, ZENG QINYANG
Format:	Patent
Sprache:	chi ; eng
Schlagworte:	CALCULATING COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS COMPUTING COUNTING ELECTRIC DIGITAL DATA PROCESSING PHYSICS
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	The invention relates to the field of artificial intelligence visual language multi-modality, in particular to a multi-modality skeleton action recognition method and device. The method comprises the following steps: acquiring and preprocessing skeleton data, and extracting corresponding skeleton original features; a visual encoder and a multi-layer perceptron are used for extracting corresponding visual features from the preprocessed skeleton data, meanwhile, a text prompt mapper is used for mapping action labels into texts, and a text encoder and the multi-layer perceptron are used for extracting corresponding language features from the texts; a loss value is calculated through a loss function, a visual encoder, a text decoder and a multi-layer perceptron are trained, and the loss function is composed of a visual loss function, a visual language loss function and a language decoding loss function; and testing by using the trained visual encoder to obtain a skeleton action recognition and classification resu