Systems and Methods for Active Curriculum Learning

Bibliographic details
Main authors: Pogrebnyakov, Nicolai; Lee, Seung Min; Makrehchi, Masoud; Jafarpour, Borna; Sepehr, Firoozeh; Madyalkar, Vinod Vijaykumar
Format: Patent
Language: English
Description
Abstract: Computer systems and computer-implemented methods for training a machine learning model are provided. The methods include: selecting seed data from an unlabeled dataset; labeling the seed data and storing the labeled seed data in a data store; training the machine learning model in an initial iteration using the labeled seed data, where the machine learning model is trained to select a next subset of the unlabeled dataset; selecting a next subset of the unlabeled dataset; computing difficulty scores for at least the next subset of the unlabeled dataset; labeling the next subset of the unlabeled dataset; and training the machine learning model in a second iteration using the labeled next subset of the unlabeled dataset. The machine learning model is generally trained to select the next subset of the unlabeled dataset for a subsequent training iteration by presenting the labeled next subset of the unlabeled dataset in an order sorted based on the difficulty scores.
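
The steps listed in the abstract can be illustrated with a minimal sketch. It is not the patented implementation; several pieces are assumptions made only for illustration: a tiny logistic regression trained by SGD stands in for the machine learning model, uncertainty sampling stands in for the model's subset selection, predictive entropy stands in for the difficulty score, and the hypothetical `oracle` function stands in for the labeling step. All function and parameter names are invented here.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def sgd_epoch(w, X, y, lr=0.5):
    """One SGD pass over the examples in the order given (order matters)."""
    for x_i, y_i in zip(X, y):
        w = w - lr * (sigmoid(x_i @ w) - y_i) * x_i
    return w

def entropy(p):
    """Binary predictive entropy, used here as an assumed difficulty score."""
    p = np.clip(p, 1e-9, 1 - 1e-9)
    return -(p * np.log(p) + (1 - p) * np.log(1 - p))

def oracle(X):
    """Hypothetical annotator standing in for the labeling step."""
    return (X[:, 0] + X[:, 1] > 0).astype(float)

def active_curriculum(X_pool, seed_size=10, batch_size=10, iterations=3,
                      epochs=20, rng=None):
    if rng is None:
        rng = np.random.default_rng(0)
    unlabeled = np.arange(len(X_pool))

    # Select seed data, label it, and keep it in the (in-memory) data store.
    seed = rng.choice(unlabeled, size=seed_size, replace=False)
    store_X, store_y = X_pool[seed], oracle(X_pool[seed])
    unlabeled = np.setdiff1d(unlabeled, seed)

    # Initial training iteration on the labeled seed data.
    w = np.zeros(X_pool.shape[1])
    for _ in range(epochs):
        w = sgd_epoch(w, store_X, store_y)

    for _ in range(iterations):
        # Select the next subset: the points the model is least sure about
        # (assumed selection rule).
        p = sigmoid(X_pool[unlabeled] @ w)
        pick = unlabeled[np.argsort(np.abs(p - 0.5))[:batch_size]]

        # Compute difficulty scores for the selected subset, then label it.
        difficulty = entropy(sigmoid(X_pool[pick] @ w))
        X_new, y_new = X_pool[pick], oracle(X_pool[pick])

        # Train on the newly labeled subset, presented from easy to hard.
        order = np.argsort(difficulty)
        for _ in range(epochs):
            w = sgd_epoch(w, X_new[order], y_new[order])

        # Update the data store and the remaining unlabeled pool.
        store_X = np.vstack([store_X, X_new])
        store_y = np.concatenate([store_y, y_new])
        unlabeled = np.setdiff1d(unlabeled, pick)

    return w, store_X, store_y, unlabeled
```

Calling `active_curriculum(np.random.default_rng(1).normal(size=(200, 2)))` runs one seed round and three further rounds, so the data store ends with 40 labeled examples and 160 remain unlabeled. SGD is used in the sketch because its updates depend on presentation order, which is what makes the easy-to-hard sort meaningful.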