On the Misleading Use of QF32 for QSAR Model Comparison

Quantitative Structure – Activity Relationship (QSAR) models play a central role in medicinal chemistry, toxicology and computer‐assisted molecular design, as well as a support for regulatory decisions and animal testing reduction. Thus, assessing their predictive ability becomes an essential step f...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Molecular informatics 2019-01, Vol.38 (1-2), p.n/a
Hauptverfasser: Consonni, Viviana, Todeschini, Roberto, Ballabio, Davide, Grisoni, Francesca
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Quantitative Structure – Activity Relationship (QSAR) models play a central role in medicinal chemistry, toxicology and computer‐assisted molecular design, as well as a support for regulatory decisions and animal testing reduction. Thus, assessing their predictive ability becomes an essential step for any prospective application. Many metrics have been proposed to estimate the model predictive ability of QSARs, which have created confusion on how models should be evaluated and properly compared. Recently, we showed that the metric QF32 is particularly well‐suited for comparing the external predictivity of different models developed on the same training dataset. However, when comparing models developed on different training data, this function becomes inadequate and only dispersion measures like the root‐mean‐square error (RMSE) should be used. The intent of this work is to provide clarity on the correct and incorrect uses of QF32 , discussing its behavior towards the training data distribution and illustrating some cases in which QF32 estimates may be misleading. Hereby, we encourage the usage of measures of dispersions when models trained on different datasets have to be compared and evaluated.
ISSN:1868-1743
1868-1751
DOI:10.1002/minf.201800029