DeepMHCII: a novel binding core-aware deep interaction model for accurate MHC-II peptide binding affinity prediction

Author:

You Ronghui1,Qu Wei12,Mamitsuka Hiroshi34,Zhu Shanfeng125678ORCID

Affiliation:

1. Institute of Science and Technology for Brain-Inspired Intelligence and MOE Frontiers Center for Brain Sciences, Fudan University , Shanghai 200433, China

2. Shanghai Qi Zhi Institute , Shanghai 200030, China

3. Bioinformatics Center, Institute for Chemical Research, Kyoto University , Uji, Kyoto Prefecture, Japan

4. Department of Computer Science, Aalto University , Espoo, Finland

5. Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence (Fudan University), Ministry of Education , Shanghai 200433, China

6. Shanghai Key Lab of Intelligent Information Processing and Shanghai Institute of Artificial Intelligence Algorithm, Fudan University , Shanghai 200433, China

7. Zhangjiang Fudan International Innovation Center , Shanghai 200433, China

8. National Genomics Data Center, CAS Key Laboratory of Computational Biology, Bio-Med Big Data Center, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China

Abstract

Abstract Motivation Computationally predicting major histocompatibility complex (MHC)-peptide binding affinity is an important problem in immunological bioinformatics. Recent cutting-edge deep learning-based methods for this problem are unable to achieve satisfactory performance for MHC class II molecules. This is because such methods generate the input by simply concatenating the two given sequences: (the estimated binding core of) a peptide and (the pseudo sequence of) an MHC class II molecule, ignoring biological knowledge behind the interactions of the two molecules. We thus propose a binding core-aware deep learning-based model, DeepMHCII, with a binding interaction convolution layer, which allows to integrate all potential binding cores (in a given peptide) with the MHC pseudo (binding) sequence, through modeling the interaction with multiple convolutional kernels. Results Extensive empirical experiments with four large-scale datasets demonstrate that DeepMHCII significantly outperformed four state-of-the-art methods under numerous settings, such as 5-fold cross-validation, leave one molecule out, validation with independent testing sets and binding core prediction. All these results and visualization of the predicted binding cores indicate the effectiveness of our model, DeepMHCII, and the importance of properly modeling biological facts in deep learning for high predictive performance and efficient knowledge discovery. Availability and implementation DeepMHCII is publicly available at https://github.com/yourh/DeepMHCII. Supplementary information Supplementary data are available at Bioinformatics online.

Funder

National Natural Science Foundation of China

Shanghai Municipal Science and Technology Major Project

Shanghai Center for Brain Science and Brain-Inspired Technology

Shanghai Municipal Science and Technology Major

Information Technology Facility, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute for Biological Sciences, Chinese Academy of Sciences

MEXT KAKENHI

Publisher

Oxford University Press (OUP)

Subject

Computational Mathematics,Computational Theory and Mathematics,Computer Science Applications,Molecular Biology,Biochemistry,Statistics and Probability

Reference32 articles.

1. Accurate pan-specific prediction of peptide-MHC class II binding affinity with improved binding core identification;Andreatta;Immunogenetics,2015

2. An automated benchmarking platform for MHC class II binding prediction methods;Andreatta;Bioinformatics,2018

3. The majority of immunogenic epitopes generate CD4+ T cells that are dependent on MHC class II-bound peptide-flanking residues;Arnold;J. Immunol,2002

4. Advances in the development of personalized neoantigen-based therapeutic cancer vaccines;Blass;Nat. Rev. Clin. Oncol,2021

5. RCSB protein data bank: powerful new tools for exploring 3D structures of biological macromolecules for basic and applied research and education in fundamental biology, biomedicine, biotechnology, bioengineering and energy sciences;Burley;Nucleic Acids Res,2021

Cited by 19 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3