A method for identifying moonlighting proteins based on linear discriminant analysis and bagging-SVM

Published: 2022-08-15 Volume: 13
ISSN: 1664-8021
Container-title: Frontiers in Genetics
Short-container-title: Front. Genet.

Author:

Chen Yu, Li Sai, Guo Jifeng

Abstract

Moonlighting proteins have at least two independent functions and are widely found in animals, plants, and microorganisms. They play important roles in signal transduction, cell growth and movement, tumor inhibition, DNA synthesis and repair, and the metabolism of biological macromolecules. Because moonlighting proteins are difficult to find through biological experiments, many researchers identify them with bioinformatics methods, but the accuracy of these methods is relatively low. We therefore propose a new method. In this study, we use SVMProt-188D features as input, combine linear discriminant analysis with basic machine-learning classifiers, and apply a bagging ensemble to the best-performing classifier, the support vector machine. The resulting model identifies moonlighting proteins accurately and efficiently: it achieves an accuracy of 93.26% and an F-score of 0.946 on the MPFit dataset, which is better than the existing MEL-MP model. It also achieves good results on two other moonlighting protein datasets.
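The pipeline described in the abstract (188-dimensional features, linear discriminant analysis, then a bagged support vector machine) can be sketched roughly as follows. This is only an illustrative sketch using scikit-learn, not the authors' implementation: the data is synthetic (`make_classification` stands in for real SVMProt-188D feature vectors), and the function names `build_model` and `run_demo`, the RBF kernel, the scaling step, and the number of bagged estimators are all assumptions made for the example.

```python
from sklearn.datasets import make_classification
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.ensemble import BaggingClassifier
from sklearn.metrics import accuracy_score, f1_score
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC


def build_model(n_estimators=10, seed=0):
    """Hypothetical helper: scaling -> LDA projection -> bagged SVM."""
    return make_pipeline(
        StandardScaler(),
        # With two classes, LDA can project onto at most one discriminant axis.
        LinearDiscriminantAnalysis(n_components=1),
        # Bagging trains each SVM on a bootstrap resample and votes on the result.
        # The base estimator is passed positionally so the call works across
        # scikit-learn versions that name this parameter differently.
        BaggingClassifier(SVC(kernel="rbf"), n_estimators=n_estimators,
                          random_state=seed),
    )


def run_demo(seed=0):
    """Fit on synthetic 188-dimensional data and return (accuracy, F-score)."""
    # Synthetic stand-in for SVMProt-188D features (assumption, not real data).
    X, y = make_classification(n_samples=400, n_features=188,
                               n_informative=20, random_state=seed)
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.25, stratify=y, random_state=seed)
    model = build_model(seed=seed)
    model.fit(X_train, y_train)
    y_pred = model.predict(X_test)
    return accuracy_score(y_test, y_pred), f1_score(y_test, y_pred)


if __name__ == "__main__":
    acc, f1 = run_demo()
    print(f"accuracy={acc:.3f}  F-score={f1:.3f}")
```

The scores printed by this sketch come from random synthetic data and say nothing about the results reported in the paper; they only show how accuracy and F-score would be computed for such a model.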

Publisher

Frontiers Media SA

Subject

Genetics (clinical), Genetics, Molecular Medicine

Cited by 3 articles.

1. Assessment of subvisible particles in biopharmaceuticals with image feature extraction and machine learning;Chemometrics and Intelligent Laboratory Systems;2024-02

2. Molecular functions of moonlighting proteins in cell metabolic processes;Biochimica et Biophysica Acta (BBA) - Molecular Cell Research;2024-01

3. Leveraging Graph Machine Learning for Moonlighting Protein Prediction: A PPI Network and Physiochemical Feature Approach;2023-11-16