Oversampling for imbalanced test data

One or more computing devices, systems, and/or methods for oversampling for imbalanced test data are provided. A classifier is configured to classify data points as either belonging to a first class or a second class. A determination may be made that the first class and the second class are imbalanc...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Tsioutsiouliklis, Kostas, Kang, Changsung, Shang, Hongwei, Langlois, Jean-Marc
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:One or more computing devices, systems, and/or methods for oversampling for imbalanced test data are provided. A classifier is configured to classify data points as either belonging to a first class or a second class. A determination may be made that the first class and the second class are imbalanced where a first number of data points estimated to be part of the first class is a threshold amount less than a second number of data points estimated to be part of the second class. An oversampling ratio is determined for the first class. The oversampling ratio is used to select a sample set of data points for editorial labeling, where the sampling set of data points comprises a total number of data points below a threshold amount.