Does Inter-Protein Contact Prediction Benefit from Multi-Modal Data and Auxiliary Tasks?-Reference-Cited by-同舟云学术

Does Inter-Protein Contact Prediction Benefit from Multi-Modal Data and Auxiliary Tasks?

Published:2022-12-02 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Talukder Arghamitra,Yin Rujie,Sun Yuanfei,Shen Yang^ORCID,You Yuning

Abstract

AbstractApproaches toin silicoprediction of protein structures have been revolutionized by AlphaFold2, while those topredict interfaces between proteinsare relatively underdeveloped, owing to the overly complicated yet relatively limited data of protein–protein complexes. In short, proteins are 1D sequences of amino acids folding into 3D structures, and interact to form assemblies to function. We believe that such intricate scenarios are better modeled with additional indicative information that reflects their multi-modality nature and multi-scale functionality. To improve binary prediction of inter-protein residue-residue contacts, we propose to augment input features with multi-modal representations and to synergize the objective with auxiliary predictive tasks. (i) We first progressively add three protein modalities into models: protein sequences, sequences with evolutionary information, and structure-aware intra-protein residue contact maps. We observe thatutilizing all data modalities delivers the best prediction precision. Analysis reveals that evolutionary and structural information benefit predictions on the difficult and rigid protein complexes, respectively, assessed by the resemblance to native residue contacts in bound complex structures. (ii) We next introduce three auxiliary tasks via self-supervised pre-training (binary prediction of protein-protein interaction (PPI)) and multi-task learning (prediction of inter-protein residue–residue distances and angles). Although PPI prediction is reported to benefit from predicting inter-contacts (as causal interpretations), it is not found vice versa in our study. Similarly, the finer-grained distance and angle predictions did not appear to uniformly improve contact prediction either. This again reflects the high complexity of protein–protein complex data, for whichdesigning and incorporating synergistic auxiliary tasks remains challenging.

Publisher

Cold Spring Harbor Laboratory

Reference26 articles.

1. Accurate de novo prediction of protein contact map by ultra-deep learning model;PLoS computational biology,2017

2. Improved protein structure prediction using potentials from deep learning;Nature,2020

3. Deeplearning contact-map guided protein structure prediction in casp13;Proteins: Structure, Function, and Bioinformatics,2019

4. Highly accurate protein structure prediction with AlphaFold

5. Accurate prediction of protein structures and interactions using a three-track neural network