UniPTM: Multiple PTM site prediction on full-length protein sequence-Reference-Cited by-同舟云学术

UniPTM: Multiple PTM site prediction on full-length protein sequence

Published:2024-08-06 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Meng Lingkuan^ORCID,Lin Jiecong^ORCID,Cheng Ke^ORCID,Xu Kui,Sun Hongyan^ORCID,Wong Ka-Chun^ORCID

Abstract

AbstractPost-translational modifications (PTMs) enrich the functional diversity of proteins by attaching chemical groups to the side chains of amino acids. In recent years, a myr-iad of AI models have been proposed to predict many specific types of PTMs. However, those models typically adopt the sliding window approach to extract short and equal-length protein fragments from full-length proteins for model training. Unfortunately, such a subtle step results in the loss of long-range information from distal amino acids, which may impact the PTM formation process. In this study, we introduce UniPTM, a window-free model designed to train and test on natural and full-length protein sequences, enabling the prediction of multiple types of PTMs in a holistic manner. Moreover, we established PTMseq, the first comprehensive dataset of full-length pro-tein sequences with annotated PTMs, to train and validate our model. UniPTM has undergone extensive validations and significantly outperforms existing models, eluci-dating the influence of protein sequence completeness on PTM. Consequently, UniPTM offers interpretable and biologically meaningful predictions, enhancing our understand-ing of protein functionally and regulation. The source code and PTMseq dataset for UniPTM are available athttps://www.github.com/TransPTM/UniPTM.

Publisher

Cold Spring Harbor Laboratory

Reference40 articles.

1. UniProt: a worldwide hub of protein knowledge

2. Meng, L. ; Chan, W.-S. ; Huang, L. ; Liu, L. ; Chen, X. ; Zhang, W. ; Wang, F. ; Cheng, K. ; Sun, H. ; Wong, K.-C . Mini-review: Recent advances in post-translational modification site prediction based on deep learning. Computational and Structural Biotechnology Journal 2022,

3. Essential Role for Protein Kinase B (PKB) in Insulin-induced Glycogen Synthase Kinase 3 Inactivation

4. 50 years of protein acetylation: from gene regulation to epigenetics, metabolism and beyond

5. Targeting protein methylation: from chemical tools to precision medicines;Cellular and molecular life sciences,2019