Exploration on learning molecular docking with deep learning models

Author:

Xie Qin1,Ma Wei12,Zhang Jianhang1,Li Shiliang2,Deng Xiaobing3,Xu Youjun1,Zhang Weilin1

Affiliation:

1. Infinite Intelligence Pharma Beijing 100083 China

2. Shanghai Key Laboratory of New Drug Design State Key Laboratory of Bioreactor Engineering School of Pharmacy East China University of Science and Technology Shanghai 200237 China

3. College of Chemistry and Molecular Engineering Peking University Beijing 100871 China

Abstract

A deep learning‐powered VS approach combined with two free docking programs are proposed and evaluated for screening an ultra‐large compound library to obtain diverse potential active compounds rapidly and efficiently. We found that it is a practical and transferable strategy to significantly reduce computational cost.BackgroundMolecular docking‐based virtual screening (VS) aims to choose ligands with potential pharmacological activities from millions or even billions of molecules. This process could significantly cut down the number of compounds that need to be experimentally tested. However, during the docking calculation, many molecules have low affinity for a particular protein target, which waste a lot of computational resources.MethodsWe implemented a fast and practical molecular screening approach called DL‐DockVS (deep learning dock virtual screening) by using deep learning models (regression and classification models) to learn the outcomes of pipelined docking programs step‐by‐step.ResultsIn this study, we showed that this approach could successfully weed out compounds with poor docking scores while keeping compounds with potentially high docking scores against 10 DUD‐E protein targets. A self‐built dataset of about 1.9 million molecules was used to further verify DL‐DockVS, yielding good results in terms of recall rate, active compounds enrichment factor and runtime speed.ConclusionsWe comprehensively evaluate the practicality and effectiveness of DL‐DockVS against 10 protein targets. Due to the improvements of runtime and maintained success rate, it would be a useful and promising approach to screen ultra‐large compound libraries in the age of big data. It is also very convenient for researchers to make a well‐trained model of one specific target for predicting other chemical libraries and high docking‐score molecules without docking computation again.

Publisher

Wiley

Subject

Applied Mathematics,Computer Science Applications,Biochemistry, Genetics and Molecular Biology (miscellaneous),Modeling and Simulation

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3