Affiliation:
1. University of Houston-Downtown
2. Wichita State University
3. University of Arizona
Abstract
Abstract
O-linked glycosylation of proteins is an essential post-translational modification process in Homo sapiens, where the attachment of a sugar moiety occurs at the oxygen atom of serine and/or threonine residues. This modification plays a pivotal role in various biological and cellular functions. While threonine or serine residues in a protein sequence are potential sites for O-linked glycosylation, not all threonine or serine residues are O-linked glycosylated. Furthermore, the modification is reversible. Hence, it is of vital importance to characterize if and when O-linked glycosylation occurs. We propose a multi-layer perceptron-based approach termed OglyPred-PLM which leverages the contextualized embeddings produced from the ProtT5-XL-UniRef50 protein language model that significantly improves the prediction performance of human O-linked glycosylation sites. OglyPred-PLM surpassed the performance of other indispensable O-linked glycosylation predictors on the independent benchmark dataset. This demonstrates that OglyPred-PLM is a powerful and unique computational tool to predict O-linked glycosylation sites in proteins and thus will accelerate the discovery of unknown O-linked glycosylation sites in proteins.
Publisher
Research Square Platform LLC
Reference66 articles.
1. Yang, X.-m. in Advanced Research on Computer Education, Simulation and Modeling. (eds Song Lin & Xiong Huang) 445–450 (Springer Berlin Heidelberg).
2. Colley, K. J., Varki, A. & Kinoshita, T. in Essentials of Glycobiology (eds A. Varki et al.) 41–49 (2015).
3. Adaptive immune activation: glycosylation does matter;Wolfert MA;Nat Chem Biol,2013
4. The heterotaxy gene GALNT11 glycosylates Notch to orchestrate cilia type and laterality;Boskovski MT;Nature,2013
5. Prediction of O-glycosylation sites based on multi-scale composition of amino acids and feature selection;Chen Y;Med Biol Eng Comput,2015