Refining Embedding-Based Binding Predictions by Leveraging AlphaFold2 Structures-Reference-Cited by-同舟云学术

Refining Embedding-Based Binding Predictions by Leveraging AlphaFold2 Structures

Published:2022-09-03 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Endres Leopold,Olenyi Tobias^ORCID,Erckert Kyra^ORCID,Weißenow Konstantin,Rost Burkhard^ORCID,Littmann Maria^ORCID

Abstract

AbstractBackgroundIdentifying residues in a protein involved in ligand binding is important for understanding its function. bindEmbed21DL is a Machine Learning method which predicts protein-ligand binding on a per-residue level using embeddings derived from the protein Language Model (pLM) ProtT5. This method relies solely on sequences, making it easily applicable to all proteins. However, highly reliable protein structures are now accessible through the AlphaFold Protein Structure Database or can be predicted using AlphaFold2 and ColabFold, allowing the incorporation of structural information into such sequence-based predictors.ResultsHere, we propose bindAdjust which leverages predicted distance maps to adjust the binding probabilities of bindEmbed21DL to subsequently boost performance. bindAdjust raises the recall of bindEmbed21DL from 47±2% to 53±2% at a precision of 50% for small molecule binding. For binding to metal ions and nucleic acids, bindAdjust serves as a filter to identify good predictions focusing on the binding site rather than isolated residues. Further investigation of two examples shows that bindAdjust is in fact able to add binding predictions which are not close in sequence but close in structure, extending the binding residue predictions of bindEmbed21DL to larger binding stretches or binding sites.ConclusionDue to its simplicity and speed, the algorithm of bindAdjust can easily refine binding predictions also from other tools than bindEmbed21DL and, in fact, could be applied to any protein prediction task.

Publisher

Cold Spring Harbor Laboratory

Reference28 articles.

1. Statistical and machine learning approaches to predicting protein–ligand interactions

2. Beyond annotation transfer by homology: novel protein-function prediction methods to assist drug discovery

3. Automatic prediction of protein function

4. Evolutionary couplings and sequence variation effect predict protein binding sites

5. MSA-Regularized Protein Sequence Transformer toward Predicting Genome-Wide Chemical-Protein Interactions: Application to GPCRome Deorphanization

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. The opportunities and challenges posed by the new generation of deep learning-based protein structure predictors;Current Opinion in Structural Biology;2023-04