Oversampling for imbalanced test data
Main authors: | , , , |
---|---|
Format: | Patent |
Language: | eng |
Subjects: | |
Summary: | One or more computing devices, systems, and/or methods for oversampling for imbalanced test data are provided. A classifier is configured to classify data points as belonging to either a first class or a second class. A determination may be made that the first class and the second class are imbalanced, where a first number of data points estimated to be part of the first class is a threshold amount less than a second number of data points estimated to be part of the second class. An oversampling ratio is determined for the first class. The oversampling ratio is used to select a sample set of data points for editorial labeling, where the sample set of data points comprises a total number of data points below a threshold amount. |
---|
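
The steps named in the summary (imbalance check, oversampling ratio, budget-limited sample selection) can be illustrated with a minimal Python sketch. The record does not say how the oversampling ratio is computed, so the rule below is an assumption: it picks the ratio that makes the first class a chosen share of the sample in expectation. All function names, the `target_share` parameter, and the class labels `1` and `2` are hypothetical and are not taken from the patent.

```python
import random

def is_imbalanced(n_first, n_second, gap_threshold):
    """True when the first class is estimated to hold at least
    gap_threshold fewer data points than the second class."""
    return (n_second - n_first) >= gap_threshold

def oversampling_ratio(n_first, n_second, target_share=0.5):
    """Assumed rule (not from the source): weight r for the first class such
    that r*n_first / (r*n_first + n_second) equals target_share."""
    if n_first <= 0 or not 0 < target_share < 1:
        raise ValueError("need n_first > 0 and 0 < target_share < 1")
    return (target_share * n_second) / ((1 - target_share) * n_first)

def select_for_labeling(points, predicted, budget, ratio, seed=0):
    """Pick at most `budget` points for editorial labeling, weighting the
    predicted first class (label 1) by `ratio` relative to label 2."""
    rng = random.Random(seed)
    first = [p for p, c in zip(points, predicted) if c == 1]
    second = [p for p, c in zip(points, predicted) if c == 2]
    total = ratio * len(first) + len(second)
    if total == 0:
        return [], []
    # Split the budget in proportion to the weighted class sizes,
    # capped by how many points each class actually has.
    k_first = min(len(first), round(budget * ratio * len(first) / total))
    k_second = min(len(second), budget - k_first)
    return rng.sample(first, k_first), rng.sample(second, k_second)

if __name__ == "__main__":
    points = list(range(1000))
    predicted = [1] * 50 + [2] * 950   # classifier's estimated classes
    if is_imbalanced(50, 950, gap_threshold=500):
        r = oversampling_ratio(50, 950)
        minority, majority = select_for_labeling(points, predicted, 100, r)
        print(r, len(minority), len(majority))
```

With a labeling budget of 100 and the assumed default `target_share` of 0.5, the sketch splits the sample evenly between the two predicted classes even though the first class is only 5% of the pool, which is the effect oversampling is meant to achieve.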