In Silico Predictions of Human Skin Permeability using Nonlinear Quantitative Structure–Property Relationship Models

Purpose Predicting human skin permeability of chemical compounds accurately and efficiently is useful for developing dermatological medicines and cosmetics. However, previous work have two problems; 1) quality of databases used, and 2) methods for prediction models. In this paper, we attempt to solv...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Pharmaceutical research 2015-07, Vol.32 (7), p.2360-2371
Hauptverfasser: Baba, Hiromi, Takahara, Jun-ichi, Mamitsuka, Hiroshi
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Purpose Predicting human skin permeability of chemical compounds accurately and efficiently is useful for developing dermatological medicines and cosmetics. However, previous work have two problems; 1) quality of databases used, and 2) methods for prediction models. In this paper, we attempt to solve these two problems. Methods We first compile, by carefully screening from the literature, a novel dataset of chemical compounds with permeability coefficients, measured under consistent experimental conditions. We then apply machine learning techniques such as support vector regression (SVR) and random forest (RF) to our database to develop prediction models. Molecular descriptors are fully computationally obtained, and greedy stepwise selection is employed for descriptor selection. Prediction models are internally and externally validated. Results We generated an original, new database on human skin permeability of 211 different compounds from aqueous donors. Nonlinear SVR achieved the best performance among linear SVR, nonlinear SVR, and RF. The determination coefficient, root mean square error, and mean absolute error of nonlinear SVR in external validation were 0.910, 0.342, and 0.282, respectively. Conclusions We provided one of the largest datasets with purely experimental log k p and developed reliable and accurate prediction models for screening active ingredients and seeking unsynthesized compounds of dermatological medicines and cosmetics.
ISSN:0724-8741
1573-904X
DOI:10.1007/s11095-015-1629-y