Predicting chemical structure using reinforcement learning with a stack-augmented conditional variational autoencoder-Reference-Cited by-同舟云学术

Predicting chemical structure using reinforcement learning with a stack-augmented conditional variational autoencoder

Published:2022-12-09 Issue:1 Volume:14 Page:
ISSN:1758-2946
Container-title:Journal of Cheminformatics
language:en
Short-container-title:J Cheminform

Author:

Kim Hwanhee,Ko Soohyun,Kim Byung Ju,Ryu Sung Jin,Ahn Jaegyoon

Abstract

AbstractIn this paper, a reinforcement learning model is proposed that can maximize the predicted binding affinity between a generated molecule and target proteins. The model used to generate molecules in the proposed model was the Stacked Conditional Variation AutoEncoder (Stack-CVAE), which acts as an agent in reinforcement learning so that the resulting chemical formulas have the desired chemical properties and show high binding affinity with specific target proteins. We generated 1000 chemical formulas using the chemical properties of sorafenib and the three target kinases of sorafenib. Then, we confirmed that Stack-CVAE generates more of the valid and unique chemical compounds that have the desired chemical properties and predicted binding affinity better than other generative models. More detailed analysis for 100 of the top scoring molecules show that they are novel ones not found in existing chemical databases. Moreover, they reveal significantly higher predicted binding affinity score for Raf kinases than for other kinases. Furthermore, they are highly druggable and synthesizable.

Funder

The Ministry of Science and ICT, Korea

Publisher

Springer Science and Business Media LLC

Subject

Library and Information Sciences,Computer Graphics and Computer-Aided Design,Physical and Theoretical Chemistry,Computer Science Applications

Link

https://link.springer.com/content/pdf/10.1186/s13321-022-00666-9.pdf

Reference47 articles.

1. Kim S, Chen J, Cheng T et al (2021) PubChem in 2021: new data content and improved web interfaces. Nucleic Acids Res 49:D1388–D1395. https://doi.org/10.1093/nar/gkaa971

2. Lin XX, Li X, Lin XX (2020) A review on applications of computational methods in drug screening and design. Molecules 25:1–17. https://doi.org/10.3390/molecules25061375

3. Shoichet BK (2005) Virtual screening of chemical libraries. Nature 432:862–865. https://doi.org/10.1038/nature03197

4. Scior T, Bender A, Tresadern G et al (2012) Recognizing pitfalls in virtual screening: a critical review. J Chem Inf Model 52:867–881. https://doi.org/10.1021/ci200528d

5. Cheng T, Li Q, Zhou Z et al (2012) Structure-based virtual screening for drug discovery: a problem-centric review. AAPS J 14:133–141. https://doi.org/10.1208/s12248-012-9322-0

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Drug Molecule Generation Method Based on Fusion of Protein Sequence Features;Lecture Notes in Computer Science;2024

2. moBRCA-net: a breast cancer subtype classification framework based on multi-omics attention neural networks;BMC Bioinformatics;2023-04-26