Abstract
Protein domains are conserved structural and functional units and are the functional building blocks of proteins. Evolutionary expansion means that domain families are often represented by many members in a species, which are found in various configurations with other domains, which have evolved new specificity for interacting partners. Here, we develop a structure-based interface analysis to comprehensively map domain interfaces from available experimental and predicted structures, including interfaces with other macromolecules and intraprotein interfaces (such as might exist between domains in a protein). We hypothesized that a comprehensive approach to contact mapping of domains could yield new insights. Specifically, we use it to gain information about how domains selectivity interact with ligands, whether domain-domain interfaces of repeated domain partnerships are conserved across diverse proteins, and identify regions of conserved post-translational modifications, using relationship to interaction interfaces as a method to hypothesize the effect of post-translational modifications (and mutations). We applied this approach to the human SH2 domain family, an extensive modular unit that is the foundation of phosphotyrosine-mediated signaling, where we identified a novel approach to understanding the binding selectivity of SH2 domains and evidence that there is coordinated and conserved regulation of multiple SH2 domain binding interfaces by tyrosine and serine/threonine phosphorylation and acetylation, suggesting that multiple signaling systems can regulate protein activity and SH2 domain interactions in a regulated manner. We provide the extensive features of the human SH2 domain family and this modular approach, as an open source Python package for COmprehensive Domain Interface Analysis of Contacts (Co-DIAC).
Publisher
Cold Spring Harbor Laboratory