Multi-modal middle school experiment step detection method and system based on moving target semantic enhancement

Bibliographic details
Main authors:	CHEN YI, XING WULUE, GU YANHUI, YUAN HAOMIAO, ZHOU JUNSHENG
Format:	Patent
Language:	chi ; eng
Subjects:	CALCULATING; COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS; COMPUTING; COUNTING; IMAGE DATA PROCESSING OR GENERATION, IN GENERAL; PHYSICS

Description
Summary:	The invention discloses a multi-modal method and system for detecting experiment steps in middle school experiment videos, based on moving target semantic enhancement. The method first preprocesses the video frames, obtains motion regions by frame differencing, detects the moving targets with a target detection technique, and extracts semantic time-sequence features with a BERT model. An encoder then models the temporal dependencies of the video features to obtain step-level visual time-sequence features. A decoder fuses these features with the moving target semantic features and constructs the relation between experiment steps and their corresponding targets, enabling accurate detection of the experiment steps in a middle school experiment video. The method can more effectively capture unique motion characteristics of the experimental steps, effectively distinguish different steps, realize accurate judgment of the experimental steps, an
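
The frame-difference step named in the summary can be illustrated with a minimal sketch. This is not the patent's implementation: the grayscale list-of-rows frame format, the threshold value of 25, and the helper names motion_mask and motion_region are all assumptions made for illustration. The target detection, BERT feature extraction, and encoder-decoder fusion stages are not shown.

```python
from typing import List, Optional, Tuple

# Assumed frame format: grayscale image as rows of 0-255 intensities.
Frame = List[List[int]]


def motion_mask(prev: Frame, curr: Frame, threshold: int = 25) -> List[List[bool]]:
    """Mark pixels whose absolute intensity change exceeds the threshold.

    The threshold of 25 is an illustrative assumption, not a value from the patent.
    """
    return [
        [abs(c - p) > threshold for p, c in zip(prev_row, curr_row)]
        for prev_row, curr_row in zip(prev, curr)
    ]


def motion_region(mask: List[List[bool]]) -> Optional[Tuple[int, int, int, int]]:
    """Return the inclusive bounding box (top, left, bottom, right) of changed pixels."""
    coords = [
        (r, c)
        for r, row in enumerate(mask)
        for c, changed in enumerate(row)
        if changed
    ]
    if not coords:
        return None  # no motion between the two frames
    rows = [r for r, _ in coords]
    cols = [c for _, c in coords]
    return min(rows), min(cols), max(rows), max(cols)


# Two synthetic 5x6 frames that differ at two pixels.
prev_frame = [[10] * 6 for _ in range(5)]
curr_frame = [row[:] for row in prev_frame]
curr_frame[1][2] = 200
curr_frame[3][4] = 180

print(motion_region(motion_mask(prev_frame, curr_frame)))  # (1, 2, 3, 4)
```

In a pipeline like the one the summary describes, a region such as this would presumably be cropped and passed to the target detector, so that only objects involved in motion contribute semantic features.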