A hybrid feature extraction scheme for efficient malonylation site prediction

Published:2022-04-06 Issue:1 Volume:12 Page:
ISSN:2045-2322
Container-title:Scientific Reports
language:en
Short-container-title:Sci Rep

Author:

Sorkhi Ali Ghanbari,Pirgazi Jamshid,Ghasemi Vahid

Abstract

Lysine malonylation is one of the most important post-translational modifications (PTMs) and affects cellular function. Predicting malonylation sites in proteins can help uncover the mechanisms behind these cellular functions. Experimental methods are one way to identify such sites, but they are typically costly and time-consuming. Recently, machine-learning methods have been proposed to tackle this problem; they have been shown to reduce cost and time and to increase accuracy. However, these approaches also have specific shortcomings, including inappropriate feature extraction from protein sequences, high-dimensional features, and inefficient underlying classifiers. This paper proposes a machine-learning method to address these problems. In the proposed approach, seven different features are extracted. The extracted features are then combined and ranked by Fisher's score (F-score), and the most efficient ones are selected. Malonylation sites are then predicted using various classifiers. Simulation results show that the proposed method performs acceptably compared with some state-of-the-art approaches. In addition, the XGBoost classifier, built on extracted features such as TFCRF, achieves a higher prediction rate than the other methods. The code is publicly available at: https://github.com/jimy2020/Malonylation-site-prediction
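The ranking step described in the abstract can be illustrated with a minimal sketch. It assumes the common two-class F-score (squared distance of each class mean from the overall mean, divided by the sum of the per-class sample variances); the abstract does not give the exact formula, so this definition, the function names `f_scores` and `top_k`, and the toy data are illustrative assumptions, not the authors' implementation.

```python
from statistics import mean, variance


def f_scores(rows, labels):
    """Return one F-score per feature column (assumed two-class definition).

    rows   : list of equal-length feature vectors
    labels : list of 0/1 class labels (1 = malonylated site, 0 = not)
    Each class needs at least two samples so the sample variance is defined.
    """
    scores = []
    for col in zip(*rows):  # iterate over feature columns
        pos = [v for v, y in zip(col, labels) if y == 1]
        neg = [v for v, y in zip(col, labels) if y == 0]
        overall = mean(col)
        # Between-class term: how far each class mean sits from the overall mean.
        between = (mean(pos) - overall) ** 2 + (mean(neg) - overall) ** 2
        # Within-class term: sum of the per-class sample variances.
        within = variance(pos) + variance(neg)
        scores.append(between / within if within > 0 else 0.0)
    return scores


def top_k(rows, labels, k):
    """Indices of the k features with the largest F-score, best first."""
    scores = f_scores(rows, labels)
    ranked = sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)
    return ranked[:k]


# Toy data (hypothetical): feature 0 separates the classes, feature 1 does not.
rows = [[0.0, 1.0], [0.5, 0.0], [2.0, 1.0], [2.5, 0.0]]
labels = [0, 0, 1, 1]
selected = top_k(rows, labels, 1)
```

In a pipeline like the one described, the selected column indices would be used to reduce the combined feature matrix before it is passed to a classifier.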

Publisher

Springer Science and Business Media LLC

Subject

Multidisciplinary

Link

https://www.nature.com/articles/s41598-022-08555-9.pdf

References: 42 articles.

1. Peng, C. et al. The first identification of lysine malonylation substrates and its regulatory enzyme. Mol. Cell Proteomics. 10(12), 012658. https://doi.org/10.1074/mcp.M111.012658 (2011).

2. Bao, X., Zhao, Q., Yang, T., Fung, Y. M. E. & Li, X. D. A chemical probe for lysine malonylation. Angew. Chem. Int. Ed. 52(18), 4883–4886. https://doi.org/10.1002/anie.201300252 (2013).

3. Du, Y. et al. Lysine malonylation is elevated in type 2 diabetic mouse models and enriched in metabolic associated proteins. Mol. Cell Proteomics 14(1), 227–236 (2015).

4. Gallego, M. & Virshup, D. M. Post-translational modifications regulate the ticking of the circadian clock. Nat. Rev. Mol. Cell Biol. 8, 139–148 (2007).

5. Luna, L. et al. Dynamic relocalization of hOGG1 during the cell cycle is disrupted in cells harbouring the hOGG1-Cys326 polymorphic variant. Nucleic Acids Res. 33, 1813 (2005).

Cited by 6 articles.

1. A Computational Predictor for Accurate Identification of Tumor Homing Peptides by Integrating Sequential and Deep BiLSTM Features;Interdisciplinary Sciences: Computational Life Sciences;2024-05-11

2. An ensemble computational model for prediction of clathrin protein by coupling machine learning with discrete cosine transform;Journal of Biomolecular Structure and Dynamics;2024-03-18

3. EACVP: An ESM-2 LM Framework Combined CNN and CBAM Attention to Predict Anti-coronavirus Peptides;Current Medicinal Chemistry;2024-03-15

4. Analysis and review of techniques and tools based on machine learning and deep learning for prediction of lysine malonylation sites in protein sequences;Database;2024-01-01

5. Prediction of Amyloid Proteins Using Embedded Evolutionary & Ensemble Feature Selection Based Descriptors With eXtreme Gradient Boosting Model;IEEE Access;2023