Improved the Protein Complex Prediction with Protein Language Models-Reference-Cited by-同舟云学术

Improved the Protein Complex Prediction with Protein Language Models

Published:2022-09-17 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Chen Bo^ORCID,Xie Ziwei,Qiu Jiezhong,Ye Zhaofeng,Xu Jinbo^ORCID,Tang Jie

Abstract

AbstractAlphaFold-Multimer has greatly improved protein complex structure prediction, but its accuracy also depends on the quality of the multiple sequence alignment (MSA) formed by the interacting homologs (i.e., interologs) of the complex under prediction. Here we propose a novel method, denoted as ESMPair, that can identify interologs of a complex by making use of protein language models (PLMs). We show that ESMPair can generate better interologs than the default MSA generation method in AlphaFold-Multimer. Our method results in better complex structure prediction than AlphaFold-Multimer by a large margin (+10.7% in terms of the Top-5 best DockQ), especially when the predicted complex structures have low confidence. We further show that by combining several MSA generation methods, we may yield even better complex structure prediction accuracy than Alphafold-Multimer (+22% in terms of the Top-5 best DockQ). We systematically analyze the impact factors of our algorithm and find out the diversity of MSA of interologs significantly affects the prediction accuracy. Moreover, we show that ESMPair performs particularly well on complexes in eucaryotes.

Publisher

Cold Spring Harbor Laboratory

Reference60 articles.

1. Principles of protein-protein interactions.

2. Liddington, R.C. : Structural basis of protein-protein interactions. Protein-Protein Interactions, 3–14 (2004)

3. Conserved patterns of protein interaction in multiple species

4. Common and specific signatures of gene expression and protein–protein interactions in autoimmune diseases;Genes & Immunity,2013

5. Network analytics in the age of big data

Cited by 6 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Transformer models in biomedicine;BMC Medical Informatics and Decision Making;2024-07-29

2. Evaluation of AlphaFold antibody–antigen modeling with implications for improving predictive accuracy;Protein Science;2023-12-27

3. Recent Advances and Challenges in Protein Structure Prediction;Journal of Chemical Information and Modeling;2023-12-18

4. Intelligent Protein Design and Molecular Characterization Techniques: A Comprehensive Review;Molecules;2023-11-30

5. Protein–DNA binding sites prediction based on pre-trained protein language model and contrastive learning;Briefings in Bioinformatics;2023-11-22