A deep neural network combined with molecular fingerprints (DNN-MF) to develop predictive models for hydroxyl radical rate constants of water contaminants
[Display omitted] •Molecular fingerprints (MF) were inputs for a deep neural network (DNN) for QSAR.•MF can be generated and understood more easily than molecular descriptors.•DNN-MF is a fast, robust, and easy modeling approach for predictive models.•DNN-MF is promising in other environmental field...
Gespeichert in:
Veröffentlicht in: | Journal of hazardous materials 2020-02, Vol.383, p.121141-121141, Article 121141 |
---|---|
Hauptverfasser: | , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | [Display omitted]
•Molecular fingerprints (MF) were inputs for a deep neural network (DNN) for QSAR.•MF can be generated and understood more easily than molecular descriptors.•DNN-MF is a fast, robust, and easy modeling approach for predictive models.•DNN-MF is promising in other environmental fields, e.g., adsorption and toxicity.
This work combined a Deep Neural Network (DNN) with molecular fingerprints (MF) to develop models to predict the OH radical rate constants of 593 organic contaminants. Molecular descriptors, most often used in establishing quantitative structural-activity relationships (QSARs), were not used here because of their complicated generation processes that rely on advanced physicochemical and computational knowledge. Instead, we only fed the most basic information of the contaminant structures, i.e., MF encoding the types of atoms and how they are connected, to DNN and DNN then developed predictive models automatically. Here, a dataset containing 457 contaminants and their OH rate constants was first used to develop predictive models by DNN-MF. The hence developed models showed comparable accuracy to the traditional QSARs. The root mean square error (RMSE) values of the test sets were 0.358-0.384. The length of 2048 bits for the MF and 3 hidden layers (each with 1024 neurons) were found to be the optimal parameters for DNN. The model containing additional 89 micorpollutants in the training set was then successfully applied to predict the OH rate constants of 17 organophosphorus flame retardants and 29 additional micropollutants, with comparable accuracy to the reported molecular descriptors-based QSARs. |
---|---|
ISSN: | 0304-3894 1873-3336 |
DOI: | 10.1016/j.jhazmat.2019.121141 |