Optimization of binding affinities in chemical space with generative pre-trained transformer and deep reinforcement learning-Reference-Cited by-同舟云学术

Optimization of binding affinities in chemical space with generative pre-trained transformer and deep reinforcement learning

Published:2024-02-20 Issue: Volume:12 Page:757
ISSN:2046-1402
Container-title:F1000Research
language:en
Short-container-title:F1000Res

Author:

Xu Xiaopeng^ORCID,Zhou Juexiao,Zhu Chen,Zhan Qing,Li Zhongxiao^ORCID,Zhang Ruochi^ORCID,Wang Yu,Liao Xingyu,Gao Xin

Abstract

Background The key challenge in drug discovery is to discover novel compounds with desirable properties. Among the properties, binding affinity to a target is one of the prerequisites and usually evaluated by molecular docking or quantitative structure activity relationship (QSAR) models. Methods In this study, we developed SGPT-RL, which uses a generative pre-trained transformer (GPT) as the policy network of the reinforcement learning (RL) agent to optimize the binding affinity to a target. SGPT-RL was evaluated on the Moses distribution learning benchmark and two goal-directed generation tasks, with Dopamine Receptor D2 (DRD2) and Angiotensin-Converting Enzyme 2 (ACE2) as the targets. Both QSAR model and molecular docking were implemented as the optimization goals in the tasks. The popular Reinvent method was used as the baseline for comparison. Results The results on the Moses benchmark showed that SGPT-RL learned good property distributions and generated molecules with high validity and novelty. On the two goal-directed generation tasks, both SGPT-RL and Reinvent were able to generate valid molecules with improved target scores. The SGPT-RL method achieved better results than Reinvent on the ACE2 task, where molecular docking was used as the optimization goal. Further analysis shows that SGPT-RL learned conserved scaffold patterns during exploration. Conclusions The superior performance of SGPT-RL in the ACE2 task indicates that it can be applied to the virtual screening process where molecular docking is widely used as the criteria. Besides, the scaffold patterns learned by SGPT-RL during the exploration process can assist chemists to better design and discover novel lead candidates.

Funder

King Abdullah University of Science and Technology (KAUST) Office of Research Administration

Publisher

F1000 Research Ltd

Link

https://f1000research.com/articles/12-757/v2/pdf

Reference43 articles.

1. Multi-objective optimization methods in drug design.;C Nicolaou;Drug Discov. Today Technol.,2013

2. Principles of early drug discovery.;J Hughes;Br. J. Pharmacol.,2011

3. Deep learning for molecular design—a review of the state of the art.;D Elton;Molecular Systems Design & Engineering.,2019

4. Multi-constraint molecular generation based on conditional transformer, knowledge distillation and reinforcement learning.;J Wang;Nat. Mach. Intell.,2021

5. Machine learning for molecular and materials science.;K Butler;Nature.,2018

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. HELM-GPT: de novo macrocyclic peptide design using generative pre-trained transformer;Bioinformatics;2024-06