Abstract
Predicting protein-protein interactions from sequences is an important goal of computational biology. Various sources of information can be used to this end. Starting from the sequences of two interacting protein families, one can use phylogeny or residue coevolution to infer which paralogs are specific interaction partners within each species. We show that these two signals can be combined to improve the performance of the inference of interaction partners among paralogs. For this, we first align the sequence-similarity graphs of the two families through simulated annealing, yielding a robust partial pairing. We next use this partial pairing to seed a coevolution-based iterative pairing algorithm. This combined method improves performance over either separate method. The improvement obtained is striking in the difficult cases where the average number of paralogs per species is large or where the total number of sequences is modest.
Funder
H2020 Marie Skłodowska-Curie Actions
European Research Council
Institut de Biologie Paris Seine
National Science Foundation
Publisher
Public Library of Science (PLoS)
Subject
Computational Theory and Mathematics,Cellular and Molecular Neuroscience,Genetics,Molecular Biology,Ecology,Modeling and Simulation,Ecology, Evolution, Behavior and Systematics
Reference50 articles.
1. Highly accurate protein structure prediction with AlphaFold;J Jumper;Nature,2021
2. Computed structures of core eukaryotic protein complexes;IR Humphreys;Science,2021
3. Improved prediction of protein-protein interactions using AlphaFold2;P Bryant;Nat Commun,2022
4. Evans R, O’Neill M, Pritzel A, Antropova N, Senior A, Green T, et al. Protein complex prediction with AlphaFold-Multimer. BioRxiv Preprint; p.
5. Inter-residue, inter-protein and inter-family coevolution: bridging the scales;H Szurmant;Current Opinion in Structural Biology,2018
Cited by
5 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献