Precise modelling and interpretation of bioactivities of ligands targeting G protein-coupled receptors

Author:

Wu Jiansheng12,Liu Ben3,Chan Wallace K B4,Wu Weijian5,Pang Tao6,Hu Haifeng3,Yan Shancheng12,Ke Xiaoyan7,Zhang Yang89

Affiliation:

1. School of Geographic and Biological Information, Nanjing University of Posts and Telecommunications, Nanjing, China

2. Smart Health Big Data Analysis and Location Services Engineering Lab of Jiangsu Province, Nanjing University of Posts and Telecommunications, Nanjing, China

3. School of Telecommunication and Information Engineering, Nanjing University of Posts and Telecommunications, Nanjing, China

4. Department of Pharmacology, University of Michigan, Ann Arbor, MI, USA

5. College of Computer and Information, Hohai University, Nanjing, China

6. Jiangsu Key Laboratory of Drug Screening, China Pharmaceutical University, Nanjing, China

7. Child Mental Health Research Center, Nanjing Brain Hospital, Nanjing Medical University, Nanjing, China

8. Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA

9. Department of Biological Chemistry, University of Michigan, Ann Arbor, MI, USA

Abstract

Abstract Motivation Accurate prediction and interpretation of ligand bioactivities are essential for virtual screening and drug discovery. Unfortunately, many important drug targets lack experimental data about the ligand bioactivities; this is particularly true for G protein-coupled receptors (GPCRs), which account for the targets of about a third of drugs currently on the market. Computational approaches with the potential of precise assessment of ligand bioactivities and determination of key substructural features which determine ligand bioactivities are needed to address this issue. Results A new method, SED, was proposed to predict ligand bioactivities and to recognize key substructures associated with GPCRs through the coupling of screening for Lasso of long extended-connectivity fingerprints (ECFPs) with deep neural network training. The SED pipeline contains three successive steps: (i) representation of long ECFPs for ligand molecules, (ii) feature selection by screening for Lasso of ECFPs and (iii) bioactivity prediction through a deep neural network regression model. The method was examined on a set of 16 representative GPCRs that cover most subfamilies of human GPCRs, where each has 300–5000 ligand associations. The results show that SED achieves excellent performance in modelling ligand bioactivities, especially for those in the GPCR datasets without sufficient ligand associations, where SED improved the baseline predictors by 12% in correlation coefficient (r2) and 19% in root mean square error. Detail data analyses suggest that the major advantage of SED lies on its ability to detect substructures from long ECFPs which significantly improves the predictive performance. Availability and implementation The source code and datasets of SED are freely available at https://zhanglab.ccmb.med.umich.edu/SED/. Supplementary information Supplementary data are available at Bioinformatics online.

Funder

National Science Foundation of China

Natural Science Foundation of the Higher Education Institutions of Jiangsu Province

key University Science Research Project of Jiangsu Province

Natural Science Foundation of Nanjing University of Posts and Telecommunications

National Science Foundation

Publisher

Oxford University Press (OUP)

Subject

Computational Mathematics,Computational Theory and Mathematics,Computer Science Applications,Molecular Biology,Biochemistry,Statistics and Probability

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3