Abstract
AbstractComplex biological processes in cells are embedded in the interactome, representing the complete set of protein-protein interactions. Mapping and analyzing the protein structures are essential to fully comprehending these processes’ molecular details. Therefore, knowing the structural coverage of the interactome is essential to show the current limitations. Structural modeling of protein-protein interactions requires accurate protein structures. In this study, we mapped all experimental structures to the reference human proteome. Later, we found the enrichment in structural coverage when complementary methods such as homology modeling and deep learning (AlphaFold) are included. We then collected the interactions from the literature and databases to form the reference human interactome resulting in 117,897 non-redundant interactions. When we analyzed the structural coverage of the interactome, we found that the number of experimentally determined protein complex structures is scarce, corresponding to 3.95% of all binary interactions. We also analyzed known and modeled structures to potentially construct the structural interactome with a docking method. Our analysis showed that 12.97% of the interactions from HuRI, 73.62%, and 32.94% from the filtered versions of STRING and HIPPIE could potentially be modeled with a high structural coverage or accuracy, respectively. Overall, this paper provides an overview of the current state of structural coverage of the human proteome and interactome.Significance StatementWe gathered binary protein-protein interactions from three prominent interactome databases to create a comprehensive human reference interactome. We quantified the structural coverage of the human interactome using already available structural data from four different sources. We further evaluate the percentage of interactions that can be accurately predicted using docking methods.
Publisher
Cold Spring Harbor Laboratory