Author:
Yao Yu-hua, Lv Ya-ping, Li Ling, Xu Hui-min, Ji Bin-bin, Chen Jing, Li Chun, Liao Bo, Nan Xu-ying
Abstract
Background
Predicting the subcellular localization of proteins is an important component of bioinformatics, with significant value for drug design and other applications. Many computational tools for predicting protein subcellular location have been developed in recent decades; however, existing methods differ in the protein sequence representation techniques and classification algorithms they adopt.
Results
In this paper, we first introduce two protein sequence encoding schemes: dipeptide information with space and Gapped k-mer information. We then introduce a quad-tree-based method for calculating the Gapped k-mer information.
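As an illustration of the first encoding scheme, the sketch below computes a spaced dipeptide frequency vector. It assumes that "dipeptide information with space" means the normalized counts of residue pairs separated by a fixed number of intervening positions; the abstract does not give the exact definition, so the function name `gapped_dipeptide_features`, the `gap` parameter, and the normalization are hypothetical choices rather than the authors' method. The quad-tree calculation is not reproduced here.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids, in a fixed order that defines the feature layout.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def gapped_dipeptide_features(sequence, gap):
    """Return a 400-dimensional frequency vector of residue pairs.

    Assumption: a "dipeptide with space" is a pair of residues at
    positions i and i + gap + 1, so gap=0 gives ordinary dipeptides.
    """
    pairs = Counter()
    for i in range(len(sequence) - gap - 1):
        a, b = sequence[i], sequence[i + gap + 1]
        # Skip pairs that contain non-standard residue symbols.
        if a in AMINO_ACIDS and b in AMINO_ACIDS:
            pairs[a + b] += 1
    total = sum(pairs.values())
    # Normalize counts to frequencies; an empty sequence yields all zeros.
    return [
        pairs[a + b] / total if total else 0.0
        for a, b in product(AMINO_ACIDS, repeat=2)
    ]
```

Each value of `gap` yields a vector of fixed length 400 regardless of sequence length, which is what allows sequences of different lengths to be fed to a standard classifier.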
Conclusions
The prediction results show that this method not only reduces the dimension but also improves the prediction precision of protein subcellular localization.
Publisher
Springer Science and Business Media LLC
Subject
Applied Mathematics, Computer Science Applications, Molecular Biology, Biochemistry, Structural Biology
Cited by
10 articles.