Abstract
AbstractGlycosyltransferases (GTs) are prevalent across the tree of life and regulate nearly all aspects of cellular functions by catalyzing synthesis of glycosidic linkages between diverse donor and acceptor substrates. Despite the availability of GT sequences from diverse organisms, the evolutionary basis for their complex and diverse modes of catalytic and regulatory functions remain enigmatic. Here, based on deep mining of over half a million GT-A fold sequences from diverse organisms, we define a minimal core component shared among functionally diverse enzymes. We find that variations in the common core and the emergence of hypervariable loops extending from the core contributed to the evolution of catalytic and functional diversity. We provide a phylogenetic framework relating diverse GT-A fold families for the first time and show that inverting and retaining mechanisms emerged multiple times independently during the course of evolution. We identify conserved modes of donor and acceptor recognition in evolutionarily divergent families and pinpoint the sequence and structural features for functional specialization. Using the evolutionary information encoded in primary sequences, we trained a machine learning classifier to predict donor specificity with nearly 88% accuracy and deployed it for the annotation of understudied GTs in five model organisms. Our studies provide an evolutionary framework for investigating the complex relationships connecting GT-A fold sequence, structure, function and regulation.
Publisher
Cold Spring Harbor Laboratory
Reference46 articles.
1. A. Varki , P. Gagneux , “Biological Functions of Glycans” in Essentials of Glycobiology, 3rd Ed., A. Varki , et al., Eds. (Cold Spring Harbor Laboratory Press, 2015) (September 29, 2019).
2. O-GlcNAc Modification Protects against Protein Misfolding and Aggregation in Neurodegenerative Disease;ACS Chem. Neurosci.,2019
3. C. J. Day , E. A. Semchenko , V. Korolik , Glycoconjugates Play a Key Role in Campylobacter jejuni Infection: Interactions between Host and Pathogen. Front. Cell. Infect. Microbiol. 2 (2012).
4. Recent structures, evolution and mechanisms of glycosyltransferases
5. The carbohydrate-active enzymes database (CAZy) in 2013