RMTLysPTM: recognizing multiple types of lysine PTM sites by deep analysis on sequences-Reference-Cited by-同舟云学术

RMTLysPTM: recognizing multiple types of lysine PTM sites by deep analysis on sequences

Published:2023-11-22 Issue:1 Volume:25 Page:
ISSN:1467-5463
Container-title:Briefings in Bioinformatics
language:en
Short-container-title:

Author:

Chen Lei¹^ORCID,Chen Yuwei¹

Affiliation:

1. College of Information Engineering, Shanghai Maritime University , Shanghai 201306 , People’s Republic of China

Abstract

Abstract Post-translational modification (PTM) occurs after a protein is translated from ribonucleic acid. It is an important living creature life phenomenon because it is implicated in almost all cellular processes. Identification of PTM sites from a given protein sequence is a hot topic in bioinformatics. Lots of computational methods have been proposed, and they provide good performance. However, most previous methods can only tackle one PTM type. Few methods consider multiple PTM types. In this study, a multi-label classification model, named RMTLysPTM, was developed to recognize four types of lysine (K) PTM sites, including acetylation, crotonylation, methylation and succinylation. The surrounding sites of a lysine site were selected to constitute a peptide segment, representing the lysine at the center. Deep analysis was conducted to count the distribution of 2-residues with fixed location across the four types of lysine PTM sites. By aggregating the distribution information of 2-residues in one peptide segment, the peptide segment was encoded by informative features. Furthermore, a prediction engine that can precisely capture the traits of the above representations was designed to recognize the types of lysine PTM sites. The cross-validation results on two datasets (Qiu and CPLM training datasets) suggested that the model had extremely high performance and RMTLysPTM had strong generalization ability by testing it on protein Q16778 and CPLM testing datasets. The model was found to be generally superior to all previous models and those using popular methods and features. A web server was set up for RMTLysPTM, and it can be accessed at http://119.3.127.138/.

Publisher

Oxford University Press (OUP)

Subject

Molecular Biology,Information Systems

Link

https://academic.oup.com/bib/article-pdf/25/1/bbad450/55465220/bbad450.pdf

Reference44 articles.

1. Posttranslational Modification

2. Phosphoproteomics

3. iPTM-mLys: identifying multiple lysine PTM sites and their different types;Qiu;Bioinformatics,2016

4. Improved prediction of lysine acetylation by support vector machines;Li;Protein Pept Lett,2009

5. LAceP: lysine acetylation site prediction using logistic regression classifiers;Hou;PloS One,2014

Cited by 9 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Identification of gene and protein signatures associated with long-term effects of COVID-19 on the immune system after patient recovery by analyzing single-cell multi-omics data using a machine learning approach;Vaccine;2024-10

2. Prediction of Solubility of Proteins in Escherichia coli Based on Functional and Structural Features Using Machine Learning Methods;The Protein Journal;2024-09-07

3. Machine Learning in Identifying Marker Genes for Congenital Heart Diseases of Different Cardiac Cell Types;Life;2024-08-19

4. Identifying Key Clinical Indicators Associated with the Risk of Death in Hospitalized COVID-19 Patients;Current Bioinformatics;2024-08-01

5. PMiSLocMF: predicting miRNA subcellular localizations by incorporating multi-source features of miRNAs;Briefings in Bioinformatics;2024-07-25