MDeePred: novel multi-channel protein featurization for deep learning-based binding affinity prediction in drug discovery

Author:

Rifaioglu A S (1, 2), Cetin Atalay R (3, 4), Cansen Kahraman D (3), Doğan T (5, 6), Martin M (7), Atalay V (1)

Affiliation:

1. Department of Computer Engineering, Middle East Technical University, Ankara, Turkey

2. Department of Computer Engineering, İskenderun Technical University, Hatay, Turkey

3. Graduate School of Informatics, Middle East Technical University, Ankara, Turkey

4. Section of Pulmonary and Critical Care Medicine, The University of Chicago, Chicago, IL, USA

5. Department of Computer Engineering, Hacettepe University, Ankara, Turkey

6. Institute of Informatics, Hacettepe University, Ankara, Turkey

7. European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL–EBI), Cambridge, Hinxton, UK

Abstract

Motivation: Identification of interactions between bioactive small molecules and target proteins is crucial for novel drug discovery, drug repurposing and uncovering off-target effects. Due to the tremendous size of the chemical space, experimental bioactivity screening efforts require the aid of computational approaches. Although deep learning models have been successful in predicting bioactive compounds, effective and comprehensive featurization of proteins, to be given as input to deep neural networks, remains a challenge.

Results: Here, we present a novel protein featurization approach to be used in deep learning-based compound–target protein binding affinity prediction. In the proposed method, multiple types of protein features, such as sequence, structural, evolutionary and physicochemical properties, are incorporated within multiple 2D vectors, which are then fed to state-of-the-art pairwise input hybrid deep neural networks to predict the real-valued compound–target protein interactions. The method adopts the proteochemometric approach, where both the compound and target protein features are used at the input level to model their interaction. The whole system is called MDeePred, and it is a new method to be used for the purposes of computational drug discovery and repositioning. We evaluated MDeePred on well-known benchmark datasets and compared its performance with state-of-the-art methods. We also performed an in vitro comparative analysis of MDeePred predictions with selected kinase inhibitors' action on cancer cells. MDeePred is a scalable method with sufficiently high predictive performance. The featurization approach proposed here can also be utilized for other protein-related predictive tasks.

Availability and implementation: The source code, datasets, additional information and user instructions of MDeePred are available at https://github.com/cansyl/MDeePred.

Supplementary information: Supplementary data are available at Bioinformatics online.
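As a rough illustration of the multi-channel idea described in the abstract, the sketch below stacks several 2D residue-pair matrices for one protein sequence, one matrix per property. It is a minimal sketch under stated assumptions and not the MDeePred implementation: the function names (featurize, identity_score, hydropathy_gap), the choice of two channels, the use of Kyte-Doolittle hydropathy values, and the zero-padding to a fixed size are all illustrative choices that do not come from the abstract.

```python
from typing import Callable, List

# Kyte-Doolittle hydropathy index, used here only as an example
# physicochemical scale (assumption: the abstract names no specific scale).
HYDROPATHY = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

PairScore = Callable[[str, str], float]


def identity_score(a: str, b: str) -> float:
    """Toy sequence channel: 1.0 if both residues are identical, else 0.0."""
    return 1.0 if a == b else 0.0


def hydropathy_gap(a: str, b: str) -> float:
    """Toy physicochemical channel: absolute hydropathy difference."""
    return abs(HYDROPATHY[a] - HYDROPATHY[b])


def featurize(sequence: str, pair_scores: List[PairScore],
              max_len: int) -> List[List[List[float]]]:
    """Return one max_len x max_len matrix per scoring function.

    Entry (i, j) of a channel scores residue i against residue j.
    Sequences longer than max_len are truncated; shorter ones are
    zero-padded so every protein yields the same input size.
    """
    seq = sequence[:max_len]
    channels = []
    for score in pair_scores:
        grid = [[0.0] * max_len for _ in range(max_len)]
        for i, a in enumerate(seq):
            for j, b in enumerate(seq):
                grid[i][j] = score(a, b)
        channels.append(grid)
    return channels


channels = featurize("ACD", [identity_score, hydropathy_gap], max_len=4)
print(len(channels), len(channels[0]), len(channels[0][0]))  # prints: 2 4 4
```

The resulting channels-by-length-by-length array has the same layout as a multi-channel image, which is the kind of input a convolutional branch of a pairwise-input network can consume alongside a separate compound feature vector.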

Funder

Turkish Ministry of Development, KanSiL project

Newton/Katip Celebi Institutional Links program by TUBITAK

British Council

Publisher

Oxford University Press (OUP)

Subject

Computational Mathematics, Computational Theory and Mathematics, Computer Science Applications, Molecular Biology, Biochemistry, Statistics and Probability
