Predicting chemical structure using reinforcement learning with a stack-augmented conditional variational autoencoder

In this paper, a reinforcement learning model is proposed that can maximize the predicted binding affinity between a generated molecule and target proteins. The model used to generate molecules in the proposed model was the Stacked Conditional Variation AutoEncoder (Stack-CVAE), which acts as an age...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Journal of cheminformatics 2022-12, Vol.14 (1), p.83-12, Article 83
Hauptverfasser:	Kim, Hwanhee, Ko, Soohyun, Kim, Byung Ju, Ryu, Sung Jin, Ahn, Jaegyoon
Format:	Artikel
Sprache:	eng
Schlagworte:	Affinity Analysis Binding Chemical compounds Chemical properties Chemistry Chemistry and Materials Science Computational Biology/Bioinformatics Computational linguistics Computer Applications in Chemistry Conditional Variational AutoEencoder De novo drug design Documentation and Information in Chemistry Kinases Language processing Learning Molecular modelling Natural language interfaces Protein binding Proteins Raf kinases Reinforcement Reinforcement learning Sorafenib Theoretical and Computational Chemistry
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	In this paper, a reinforcement learning model is proposed that can maximize the predicted binding affinity between a generated molecule and target proteins. The model used to generate molecules in the proposed model was the Stacked Conditional Variation AutoEncoder (Stack-CVAE), which acts as an agent in reinforcement learning so that the resulting chemical formulas have the desired chemical properties and show high binding affinity with specific target proteins. We generated 1000 chemical formulas using the chemical properties of sorafenib and the three target kinases of sorafenib. Then, we confirmed that Stack-CVAE generates more of the valid and unique chemical compounds that have the desired chemical properties and predicted binding affinity better than other generative models. More detailed analysis for 100 of the top scoring molecules show that they are novel ones not found in existing chemical databases. Moreover, they reveal significantly higher predicted binding affinity score for Raf kinases than for other kinases. Furthermore, they are highly druggable and synthesizable.
ISSN:	1758-2946 1758-2946
DOI:	10.1186/s13321-022-00666-9