Cross-modal visual tracking method and device based on adaptive convolution

The invention discloses a cross-modal visual tracking method and device based on self-adaptive convolution, and belongs to the technical field of computer vision, and the method comprises the steps: inputting a pair of registered multi-modal images, generating a weight tensor corresponding to the si...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	JIA YAQING, CAI XIANCHEN, LI CHENGLONG, ZHU QIWEN, TANG JIN
Format:	Patent
Sprache:	chi ; eng
Schlagworte:	CALCULATING COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS COMPUTING COUNTING IMAGE DATA PROCESSING OR GENERATION, IN GENERAL PHYSICS
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	The invention discloses a cross-modal visual tracking method and device based on self-adaptive convolution, and belongs to the technical field of computer vision, and the method comprises the steps: inputting a pair of registered multi-modal images, generating a weight tensor corresponding to the size of a feature graph after each layer of convolution through a self-adaptive convolution module, and generating a weight tensor corresponding to the size of a feature graph; self-adaptive fusion is carried out on input among different modals pixel by pixel, self-adaptive fusion of two modal features is carried out again on a fusion result and a single input modal feature, and cross-modal information interaction and single modal information enhancement are achieved; fine-tuning the fully connected layer according to a first frame collection sample of each video to cope with an instance-specific challenge; and finally, sending to the last layer of the full connection layer for binary classification operation to obta