Binary classification of imbalanced datasets using conformal prediction

[Display omitted] •Conformal Prediction finds a majority of active compounds in highly imbalanced data.•Separate distributions are used for the two classes during classification.•No balancing measures is need for imbalanced data using Conformal Prediction. Aggregated Conformal Prediction is used as...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of molecular graphics & modelling 2017-03, Vol.72, p.256-265
Hauptverfasser: Norinder, Ulf, Boyer, Scott
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:[Display omitted] •Conformal Prediction finds a majority of active compounds in highly imbalanced data.•Separate distributions are used for the two classes during classification.•No balancing measures is need for imbalanced data using Conformal Prediction. Aggregated Conformal Prediction is used as an effective alternative to other, more complicated and/or ambiguous methods involving various balancing measures when modelling severely imbalanced datasets. Additional explicit balancing measures other than those already apart of the Conformal Prediction framework are shown not to be required. The Aggregated Conformal Prediction procedure appears to be a promising approach for severely imbalanced datasets in order to retrieve a large majority of active minority class compounds while avoiding information loss or distortion.
ISSN:1093-3263
1873-4243
1873-4243
DOI:10.1016/j.jmgm.2017.01.008