A Novel Gene Selection Algorithm based on Sparse Representation and Minimum-redundancy Maximum-relevancy of Maximum Compatibility Center
Author:
Chen Min1, Zhang Yi2, Li Zejun1, Li Ang1, Liu Wenhua1, Liu Liubin3, Chen Zheng1
Affiliation:
1. School of Computer Science and Technology, Hunan Institute of Technology, 421002 Hengyang, China 2. School of Information Science and Engineering, Guilin University of Technology, 541004 Guilin, China 3. Cloud Collaboration Technology Group, Cisco System Inc., 95035 Milpitas, CA, United States
Abstract
Background:
Tumor classification is important for accurate diagnosis and personalized
treatment and has recently received great attention. Analysis of gene expression profile has shown relevant
biological significance and thus has become a research hotspot and a new challenge for bio-data
mining. In the research methods, some algorithms can identify few genes but with great time
complexity, some algorithms can get small time complex methods but with unsatisfactory classification
accuracy, this article proposed a new extraction method for gene expression profile.
Methods:
In this paper, we propose a classification method for tumor subtypes based on the Minimum-
Redundancy Maximum-Relevancy (MRMR) of maximum compatibility center. First, we performed a
fuzzy clustering of gene expression profiles based on the compatibility relation. Next, we used the
sparse representation coefficient to assess the importance of the gene for the category, extracted the
top-ranked genes, and removed the uncorrelated genes. Finally, the MRMR search strategy was used to
select the characteristic gene, reject the redundant gene, and obtain the final subset of characteristic
genes.
Results:
Our method and four others were tested on four different datasets to verify its effectiveness.
Results show that the classification accuracy and standard deviation of our method are better than
those of other methods.
Conclusion:
Our proposed method is robust, adaptable, and superior in classification. This method can
help us discover the susceptibility genes associated with complex diseases and understand the interaction
between these genes. Our technique provides a new way of thinking and is important to understand
the pathogenesis of complex diseases and prevent diseases, diagnosis and treatment.
Funder
Science-Technology of Hunan Province, China Nature Science Foundation of Hunan Province, China National Nature Science Foundation of China
Publisher
Bentham Science Publishers Ltd.
Subject
Molecular Biology,Biochemistry
Reference56 articles.
1. Lamb J, Crawford ED, Peck D, Modell JW, Blat IC, Wrobel MJ, Lerner J, Brunet J-P, Subramanian A, Ross KN. The connectivity map: using gene-expression signatures to connect small molecules, genes, and disease. 2. Huang G, Zhou H, Li Y, Xu L. Alignment-free comparison of genome sequences by a new numerical characterization. 3. Li X. A fast and exhaustive method for heterogeneity and epistasis analysis based on multi-objective optimization. 4. Emilsson V, Thorleifsson G, Zhang B, Leonardson AS, Zink F, Zhu J, Carlson S, Helgason A, Walters GB, Gunnarsdottir S. Genetics of gene expression and its effect on disease. 5. Huang G, Chu C, Huang T, Kong X, Zhang Y, Zhang N, Cai Y-D. Exploring mouse protein function multiple approaches.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|