Multi-label classification and label dependence in in silico toxicity prediction

Most computational predictive models are specifically trained for a single toxicity endpoint and lack the ability to learn dependencies between endpoints, such as those targeting similar biological pathways. In this study, we compare the performance of 3 multi-label classification (MLC) models, name...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Toxicology in vitro 2021-08, Vol.74, p.105157-105157, Article 105157
Hauptverfasser:	Yap, Xiu Huan, Raymer, Michael
Format:	Artikel
Sprache:	eng
Schlagworte:	Classification Classifiers Computer applications Label dependence Learning Model accuracy Multi-label classification Partitioning Prediction models Predictions Regression analysis Stacking Statistical analysis Tox21 Toxicity Toxicity prediction
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Most computational predictive models are specifically trained for a single toxicity endpoint and lack the ability to learn dependencies between endpoints, such as those targeting similar biological pathways. In this study, we compare the performance of 3 multi-label classification (MLC) models, namely Classifier Chains (CC), Label Powersets (LP) and Stacking (SBR), against independent classifiers (Binary Relevance) on Tox21 challenge data. Also, we develop a novel label dependence measure that shows full range of values, even at low prior probabilities, for the purpose of data-driven label partitioning. Using Logistic Regression as the base classifier and random label partitioning (k = 3), CC show statistically significant improvements in model performance using Hamming and multi-label accuracy scores (p
ISSN:	0887-2333 1879-3177
DOI:	10.1016/j.tiv.2021.105157