Quantitative Structure−Property Relationship (QSPR) Prediction of Liquid Viscosities of Pure Organic Compounds Employing Random Forest Regression

Bibliographic details
Published in: Industrial & Engineering Chemistry Research, 2009-11, Vol. 48 (21), p. 9708-9712
Main authors: Rajappan, Remya; Shingade, Prashant D; Natarajan, Ramanathan; Jayaraman, Valadi K
Format: Article
Language: English
Description
Abstract: A quantitative structure−property relationship (QSPR) approach was used to develop a predictive model for viscosities of pure organic liquids using a set of 403 compounds that belong to diverse classes of organic chemicals. A pool of 116 descriptors that encode topostructural, topochemical, electrotopological, geometrical, and quantum chemical properties of the organic compounds was used to develop QSPR models, based on the robust Random Forest (RF) regression algorithm. The performance of the algorithm, in terms of correlation coefficients and mean square errors, was determined to be good. The capability of the algorithm to build models and select the most-informative features simultaneously is very useful for several quantitative structure−activity/property relationship tasks. The eight most-dominant features selected by the RF regression algorithm primarily contained predictors that encode characteristics of atoms and groups that form hydrogen bonds, as well as factors involving molecular shape and size.
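
Illustrative sketch (not part of the catalog record or the article): the abstract notes that RF regression builds a model and ranks features at the same time. The pure-Python example below shows one way this can work, assuming variance-reduction splits and an importance score summed over split gains. It is not the authors' implementation. The data are synthetic (five made-up descriptors, of which only descriptors 0 and 3 drive the target), and all function names and settings (`fit_forest`, `n_try`, `min_leaf`, and so on) are hypothetical choices for illustration.

```python
import random

def sq_dev(ys):
    """Sum of squared deviations from the mean (0.0 for an empty list)."""
    if not ys:
        return 0.0
    m = sum(ys) / len(ys)
    return sum((v - m) ** 2 for v in ys)

def best_split(X, y, rows, features, min_leaf):
    """Return (gain, feature, threshold) with the largest variance reduction, or None."""
    parent = sq_dev([y[i] for i in rows])
    best = None
    for f in features:
        values = sorted({X[i][f] for i in rows})
        for lo, hi in zip(values, values[1:]):
            t = (lo + hi) / 2.0
            left = [y[i] for i in rows if X[i][f] <= t]
            right = [y[i] for i in rows if X[i][f] > t]
            if len(left) < min_leaf or len(right) < min_leaf:
                continue
            gain = parent - sq_dev(left) - sq_dev(right)
            if best is None or gain > best[0]:
                best = (gain, f, t)
    return best

def build_tree(X, y, rows, n_try, min_leaf, importance, rng):
    """Grow one regression tree; each split's gain is credited to its feature."""
    mean = sum(y[i] for i in rows) / len(rows)
    # Only a random subset of descriptors is considered at each node.
    features = rng.sample(range(len(X[0])), n_try)
    split = best_split(X, y, rows, features, min_leaf)
    if split is None or split[0] <= 1e-12:
        return mean  # leaf: predict the mean target of its rows
    gain, f, t = split
    importance[f] += gain
    left = [i for i in rows if X[i][f] <= t]
    right = [i for i in rows if X[i][f] > t]
    return (f, t,
            build_tree(X, y, left, n_try, min_leaf, importance, rng),
            build_tree(X, y, right, n_try, min_leaf, importance, rng))

def fit_forest(X, y, n_trees=30, n_try=3, min_leaf=3, seed=0):
    """Fit trees on bootstrap samples; return (trees, normalized importances)."""
    rng = random.Random(seed)
    importance = [0.0] * len(X[0])
    trees = []
    for _ in range(n_trees):
        rows = [rng.randrange(len(X)) for _ in X]  # bootstrap sample
        trees.append(build_tree(X, y, rows, n_try, min_leaf, importance, rng))
    total = sum(importance) or 1.0
    return trees, [v / total for v in importance]

def predict_tree(node, x):
    """Walk from the root to a leaf; leaves are floats, inner nodes are tuples."""
    while isinstance(node, tuple):
        f, t, left, right = node
        node = left if x[f] <= t else right
    return node

def predict(trees, x):
    """Forest prediction: average of the individual tree predictions."""
    return sum(predict_tree(tree, x) for tree in trees) / len(trees)

# Synthetic stand-in data (assumption): the target depends on descriptors 0 and 3.
data_rng = random.Random(42)
X = [[data_rng.random() for _ in range(5)] for _ in range(80)]
y = [3.0 * row[0] + 2.0 * row[3] + data_rng.gauss(0.0, 0.1) for row in X]

trees, importance = fit_forest(X, y)
top = sorted(range(5), key=lambda f: importance[f], reverse=True)[:2]
print("most informative descriptors:", top)
```

In practice a tested library implementation would normally be preferred over a hand-written forest. The sketch only shows that the feature ranking comes out of the same fitting pass as the model.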
ISSN: 0888-5885, 1520-5045
DOI: 10.1021/ie8018406