Conditional object-centric slot attention learning for video and other sequential data

The present application relates to methods, systems, and non-transitory computer readable media associated with conditional object-centric slot attention learning for video and other sequential data. A method includes obtaining a first feature vector and a second feature vector representing content...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: GRAEF, KLAUS, JONSCHKOWSKI, ROBERT, KIEPE TOBIAS, DOSOVITZKY, ALEXANDER, ESSEYED, GIL, HAIGOLD GEORGE, MAHENDRAN ARUNGUNDRAM, STONE ANTHONY C, AGHDAM SARA SABOUR ROUH
Format: Patent
Sprache:chi ; eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The present application relates to methods, systems, and non-transitory computer readable media associated with conditional object-centric slot attention learning for video and other sequential data. A method includes obtaining a first feature vector and a second feature vector representing content of first and second image frames of an input video, respectively. The method may also include generating first slot vectors based on the first feature vectors, wherein each slot vector represents an attribute of a corresponding entity as represented in the first image frame; and generating prediction slot vectors based on the first slot vectors, the prediction slot vectors including corresponding prediction slot vectors representing transitions of attributes of corresponding entities from the first image frame to the second image frame. The method may also include generating second slot vectors based on the predicted slot vector and the second feature vector, the second slot vectors including corresponding slot vec