Author:
Beer Hodaya, Sherill-Rofe Dana, Unterman Irene, Bloch Idit, Isseroff Mendel, Stupp Doron, Sharon Elad, Zisman Elad, Tabach Yuval
Abstract
Cross-species protein conservation patterns, as directed by natural selection, reflect the interplay between protein function, protein-protein interaction, and evolution. Since the beginning of the genomic era, proteins have been characterized as either conserved or not conserved. This simple classification became archaic and cursory once data on protein orthologs became available for thousands of species. To enrich the language used to describe protein conservation patterns, and to understand their biological significance, we classified 20,294 human proteins against 1,096 species. Analyses of the conservation patterns of human proteins in different eukaryotic clades yielded extremely variable and rich patterns that had never been characterized or studied before. Using mathematical classifications, we defined seven conservation motifs that describe the evolution of human proteins: Steps, Critical, Lately Developed, Plateau, Clade Loss, Trait Loss, and Gain. Overall, our work offers novel terms for conservation patterns and defines a new language intended to comprehensively describe protein evolution. This novel terminology enables the classification of proteins based on evolution, reveals aspects of protein evolution, and improves the understanding of protein functions.
Publisher
Cold Spring Harbor Laboratory