Performance Validation of Neural Network Based 13C NMR Prediction Using a Publicly Available Data Source

Bibliographic details
Published in: Journal of chemical information and modeling 2008-03, Vol.48 (3), p.550-555
Main authors: Blinov, K. A, Smurnyy, Y. D, Elyashberg, M. E, Churanova, T. S, Kvasha, M, Steinbeck, C, Lefebvre, B. A, Williams, A. J
Format: Article
Language: English
Online access: Full text
Description
Summary: The validation of a neural network based 13C NMR prediction algorithm against a test set drawn from NMRShiftDB, an open-source, publicly available database, is described. The validation was performed on a version of the database containing ca. 214 000 chemical shifts, as well as on two subsets of the database, to compare performance when overlap with the training set is taken into account. The first subset, the “excluded shift set”, contained ca. 93 000 chemical shifts that were absent from the ACD\CNMR DB used for training of the neural network and the ACD\CNMR prediction algorithm, while the second, the “included shift set”, contained ca. 121 000 shifts that were present in the ACD\CNMR DB training set. This work has shown that the mean error between experimental and predicted shifts is 1.59 ppm for the entire database, 1.47 ppm for the subset with included shifts, and 1.74 ppm for the subset with excluded shifts. Since similar work has been reported online for another algorithm, we compared the results with the errors determined using Robien's CNMR Neural Network Predictor, using the entire NMRShiftDB for program validation.
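The figures quoted in the summary can be checked for internal consistency: if the reported errors are per-shift averages (an assumption, since the summary does not say how the means were computed), the error for the entire database should be close to the shift-count-weighted average of the two subset errors. The sketch below uses only the rounded values from the summary; the function name `weighted_mean_error` is illustrative and does not come from the paper.

```python
def weighted_mean_error(subsets):
    """Combine per-subset mean errors into one overall mean.

    `subsets` is a list of (number_of_shifts, mean_error_ppm) pairs;
    each subset is weighted by its number of shifts.
    """
    total_shifts = sum(count for count, _ in subsets)
    if total_shifts == 0:
        raise ValueError("at least one shift is required")
    return sum(count * error for count, error in subsets) / total_shifts


# Rounded values from the summary (assumed to be per-shift averages).
included = (121_000, 1.47)  # shifts present in the training database
excluded = (93_000, 1.74)   # shifts absent from the training database

overall = weighted_mean_error([included, excluded])
print(f"{overall:.2f} ppm")  # prints "1.59 ppm"
```

The result agrees with the 1.59 ppm reported for the full set of ca. 214 000 shifts, which is also the sum of the two subset sizes.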
ISSN: 1549-9596, 1549-960X
DOI: 10.1021/ci700363r