Enhanced compound-protein binding affinity prediction by representing protein multimodal information via a coevolutionary strategy

Author:

Guo Binjie12345,Zheng Hanyu12345,Jiang Haohan12345,Li Xiaodan12345,Guan Naiyu12345,Zuo Yanming345,Zhang Yicheng12345,Yang Hengfu6,Wang Xuhua123457ORCID

Affiliation:

1. Department of Neurobiology and Department of Rehabilitation Medicine , First Affiliated Hospital, , Hangzhou, Zhejiang Province 310058 , China

2. Zhejiang University School of Medicine , First Affiliated Hospital, , Hangzhou, Zhejiang Province 310058 , China

3. Liangzhu Laboratory , MOE Frontier Science Center for Brain Science and Brain-machine Integration, State Key Laboratory of Brain-machine Intelligence, , 1369 West Wenyi Road, Hangzhou 311121 , China

4. Zhejiang University , MOE Frontier Science Center for Brain Science and Brain-machine Integration, State Key Laboratory of Brain-machine Intelligence, , 1369 West Wenyi Road, Hangzhou 311121 , China

5. NHC and CAMS Key Laboratory of Medical Neurobiology, Zhejiang University , Hangzhou 310058 , China

6. School of Computer Science, Hunan First Normal University , Changsha, 410205 Hunan , China

7. Co-innovation Center of Neuroregeneration, Nantong University , Nantong, 226001 Jiangsu , China

Abstract

AbstractDue to the lack of a method to efficiently represent the multimodal information of a protein, including its structure and sequence information, predicting compound-protein binding affinity (CPA) still suffers from low accuracy when applying machine-learning methods. To overcome this limitation, in a novel end-to-end architecture (named FeatNN), we develop a coevolutionary strategy to jointly represent the structure and sequence features of proteins and ultimately optimize the mathematical models for predicting CPA. Furthermore, from the perspective of data-driven approach, we proposed a rational method that can utilize both high- and low-quality databases to optimize the accuracy and generalization ability of FeatNN in CPA prediction tasks. Notably, we visually interpret the feature interaction process between sequence and structure in the rationally designed architecture. As a result, FeatNN considerably outperforms the state-of-the-art (SOTA) baseline in virtual drug evaluation tasks, indicating the feasibility of this approach for practical use. FeatNN provides an outstanding method for higher CPA prediction accuracy and better generalization ability by efficiently representing multimodal information of proteins via a coevolutionary strategy.

Funder

Fundamental Research Funds for the Central Universities

Leading Innovative and Entrepreneur Team Introduction Program of Zhejiang

Natural Science Foundation of Zhejiang Province

National Natural Science Foundation of China

The Scientific and Technological Innovation 2030 Program of China

Publisher

Oxford University Press (OUP)

Subject

Molecular Biology,Information Systems

Reference75 articles.

1. Drug-target interaction prediction: databases, web servers and computational models;Chen;Brief Bioinform,2016

2. From virtuality to reality—virtual screening in lead discovery and lead optimization: a medicinal chemistry perspective;Rester;Curr Opin Drug Discov Devel,2008

3. BindingDB in 2015: a public database for medicinal chemistry, computational chemistry and systems pharmacology;Gilson;Nucleic Acids Res,2016

4. RCSB protein data Bank: architectural advances towards integrated searching and efficient access to macromolecular structure data from the PDB archive;Rose;J Mol Biol,2021

5. DeepDTA: deep drug-target binding affinity prediction;Ozturk;Bioinformatics,2018

Cited by 4 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3