Using machine learning to train and use a model to perform automatic interface actions based on video and input datasets

Bibliographic Details
Main authors: Ecoffet, Adrien; Baker, Bowen; Clune, Jeffrey; Huizanga, Joost; Tang, Jie; Houghton, Brandon; Akkaya, Ilge; Zhokhov, Peter; Gonzalez, Raul Sampedro
Format: Patent
Language: English
Description
Summary: Disclosed herein are methods, systems, and computer-readable media for training a machine learning model to label unlabeled data and/or perform automated actions. In an embodiment, a method comprises receiving unlabeled digital video data, generating pseudo-labels for the unlabeled digital video data, the generating comprising receiving labeled digital video data, training an inverse dynamics model (IDM) using the labeled digital video data, and generating at least one pseudo-label for the unlabeled digital video data, wherein the at least one pseudo-label is based on a prediction, generated by the IDM, of one or more actions that mimic at least one timestep of the unlabeled digital video data. In some embodiments, the method further comprises adding the at least one pseudo-label to the unlabeled digital video data and further training the IDM or a machine learning model using the pseudo-labeled digital video data.
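
The pseudo-labeling loop in the summary can be illustrated with a small toy sketch. This is not the patented implementation: the one-dimensional "frames", the three-action set, the nearest-centroid stand-in for the IDM, and every name below (`ACTION_DELTAS`, `simulate`, `train_idm`, `predict_actions`) are assumptions chosen only to keep the example short and runnable. It fits the stand-in IDM on a small labeled set, predicts an action for each timestep of a larger unlabeled set, and then refits on the combined data.

```python
import numpy as np

# Hypothetical toy action set: each action shifts a 1-D "frame" value.
ACTION_DELTAS = np.array([-1.0, 0.0, 1.0])


def simulate(rng, n_steps, noise=0.05):
    """Return noisy 1-D frames (n_steps + 1 values) and the actions behind them."""
    actions = rng.integers(0, len(ACTION_DELTAS), size=n_steps)
    frames = np.concatenate([[0.0], np.cumsum(ACTION_DELTAS[actions])])
    return frames + rng.normal(0.0, noise, size=frames.shape), actions


def train_idm(frame_diffs, actions):
    """Stand-in IDM: the mean frame-to-frame change observed for each action."""
    return np.array(
        [frame_diffs[actions == a].mean() for a in range(len(ACTION_DELTAS))]
    )


def predict_actions(centroids, frame_diffs):
    """Predict, per timestep, the action whose centroid is closest to the change."""
    return np.abs(frame_diffs[:, None] - centroids[None, :]).argmin(axis=1)


rng = np.random.default_rng(0)
labeled_frames, labeled_actions = simulate(rng, 200)      # small labeled set
unlabeled_frames, hidden_actions = simulate(rng, 2000)    # actions withheld

# 1. Train the IDM on labeled data.
idm = train_idm(np.diff(labeled_frames), labeled_actions)

# 2. Generate one pseudo-label per timestep of the unlabeled data.
pseudo_labels = predict_actions(idm, np.diff(unlabeled_frames))
accuracy = float((pseudo_labels == hidden_actions).mean())

# 3. Add the pseudo-labels to the unlabeled data and train further on both sets.
all_diffs = np.concatenate([np.diff(labeled_frames), np.diff(unlabeled_frames)])
all_actions = np.concatenate([labeled_actions, pseudo_labels])
idm_refit = train_idm(all_diffs, all_actions)

print(f"pseudo-label accuracy: {accuracy:.3f}")
```

In this sketch the withheld actions are used only to score the pseudo-labels; with real video they would not be available, and the model trained in step 3 could equally be a separate policy rather than the IDM itself.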