Dimensionality of big data sets explored by Cluj descriptors
Dimensionality of a relatively big data set (95 compounds) observed for toxicity (mutagenicity) was explored in order to compute QSAR models. Distinct molecular descriptors were used. Dimensionality of data, using PCA, correlation plots and clustering, was evaluated. Analyzing data dimensionality al...
Gespeichert in:
Veröffentlicht in: | Studia Universitatis Babeș-Bolyai. Chemia 2017-01, Vol.62 (3), p.197-204 |
---|---|
Hauptverfasser: | , , , , , |
Format: | Artikel |
Sprache: | eng |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Dimensionality of a relatively big data set (95 compounds) observed for toxicity (mutagenicity) was explored in order to compute QSAR models. Distinct molecular descriptors were used. Dimensionality of data, using PCA, correlation plots and clustering, was evaluated. Analyzing data dimensionality allowed model optimization. Docking studies and PCA were used in order to expand data dimensionality. Pearson correlation coefficient ([r.sup.2]) values, obtained for both perceptive and predictive models, were satisfactory. Keywords: topological descriptor, QSAR, data dimensionality, mutagenity, principal component analysis (PCA), Ames test. |
---|---|
ISSN: | 1224-7154 2065-9520 |
DOI: | 10.24193/subbchem.2017.3.16 |