Affiliation:
1. School of Science, Jiangnan University, Wuxi 214122, China
Abstract
The prediction of protein subcellular localization not only is important for the study of protein structure and function but also can facilitate the design and development of new drugs. In recent years, feature extraction methods based on protein evolution information have attracted much attention and made good progress. Based on the protein position-specific score matrix (PSSM) obtained by PSI-BLAST, PSSM-GSD method is proposed according to the data distribution characteristics. In order to reflect the protein sequence information as much as possible, AAO method, PSSM-AAO method, and PSSM-GSD method are fused together. Then, conditional entropy-based classifier chain algorithm and support vector machine are used to locate multilabel proteins. Finally, we test Gpos-mPLoc and Gneg-mPLoc datasets, considering the severe imbalance of data, and select SMOTE algorithm to expand a few sample; the experiment shows that the AAO + PSSM
method in the paper achieved 83.1% and 86.8% overall accuracy, respectively. After experimental comparison of different methods, AAO + PSSM
has good performance and can effectively predict protein subcellular location.
Funder
National Natural Science Foundation of China
Subject
General Engineering,General Mathematics
Reference28 articles.
1. Review of Protein Subcellular Localization Prediction
2. Discrimination of Intracellular and Extracellular Proteins Using Amino Acid Composition and Residue-pair Frequencies
3. Protein subcellular localization prediction based on reduced representation of amino acid and statistical characteristic;H. Yang;Chinese Journal of Bioinformatics,2015
4. Prediction of protein subcellular localization based on multilayer sparse coding;X. J. Chen;Sheng wu gong cheng xue bao = Chinese journal of biotechnology,2019
5. Prediction of protein cellular attributes using pseudo-amino acid composition
Cited by
9 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献