Author:
Scheubert Lena, Luštrek Mitja, Schmidt Rainer, Repsilber Dirk, Fuellen Georg
Abstract
Background
Alzheimer’s disease has been known for more than 100 years, yet its underlying molecular mechanisms are still not completely understood. Identifying the genes involved in the processes at work in the Alzheimer’s-affected brain is an important step towards such an understanding. Genes that are differentially expressed between diseased and healthy brains are promising candidates.
Results
Based on microarray data, we identify potential biomarkers and biomarker combinations using three feature selection methods: information gain, the mean decrease in accuracy of a random forest, and a wrapper that combines a genetic algorithm with a support vector machine (GA/SVM). Information gain and random forest are two commonly used methods, and we compare their output to the results obtained from GA/SVM. Although GA/SVM is rarely used for the analysis of microarray data, it identifies genes that classify tissues into different classes at least as well as the two reference methods.
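As an illustration of the simplest of the three methods, the following is a minimal sketch of ranking genes by information gain. It is not taken from the paper: the median split used to discretise expression values, the gene names and the toy expression values are all assumptions made for this example, and the authors may have used a different discretisation.

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total) for n in Counter(labels).values())

def information_gain(values, labels):
    """Information gain of one gene after a median split into low/high expression.

    The median split is an assumption of this sketch, not the paper's procedure.
    """
    ordered = sorted(values)
    mid = len(ordered) // 2
    median = ordered[mid] if len(ordered) % 2 else (ordered[mid - 1] + ordered[mid]) / 2
    low = [lab for v, lab in zip(values, labels) if v <= median]
    high = [lab for v, lab in zip(values, labels) if v > median]
    # Weighted entropy that remains after splitting the samples on this gene.
    remainder = sum(len(part) / len(labels) * entropy(part) for part in (low, high) if part)
    return entropy(labels) - remainder

# Hypothetical toy data: one list of expression values per gene, one label per sample.
expression = {
    "gene_a": [1.0, 1.2, 0.9, 1.1, 3.0, 3.2, 2.9, 3.1],  # separates the classes
    "gene_b": [1.0, 3.0, 1.1, 3.1, 0.9, 2.9, 1.2, 3.2],  # unrelated to the classes
}
labels = ["healthy"] * 4 + ["AD"] * 4

# Rank genes from most to least informative.
ranking = sorted(expression, key=lambda g: information_gain(expression[g], labels), reverse=True)
print(ranking)
```

Note that information gain scores each gene on its own, so a ranking of this kind cannot tell whether two highly ranked genes carry redundant information.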
Conclusion
Compared to the other methods, GA/SVM has the advantage of finding small, less redundant sets of genes that, in combination, show superior classification characteristics. The biological significance of the genes and gene pairs is discussed.
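The idea of a wrapper that searches for small gene sets can be sketched as follows. This is a hedged illustration, not the authors' implementation: to stay dependency-free it swaps the support vector machine for a nearest-centroid classifier evaluated by leave-one-out accuracy, and the size penalty of 0.01 per gene, the population size, the number of generations and the toy data are all assumptions of this example.

```python
import random

def loo_accuracy(samples, labels, mask):
    """Leave-one-out accuracy of a nearest-centroid classifier on the selected genes.

    Nearest-centroid is a stand-in for the SVM used in a real GA/SVM wrapper.
    """
    genes = [i for i, keep in enumerate(mask) if keep]
    if not genes:
        return 0.0
    correct = 0
    for held in range(len(samples)):
        centroids = {}
        for cls in set(labels):
            rows = [samples[j] for j in range(len(samples)) if j != held and labels[j] == cls]
            centroids[cls] = [sum(r[g] for r in rows) / len(rows) for g in genes]

        def dist(cls):
            return sum((samples[held][g] - c) ** 2 for g, c in zip(genes, centroids[cls]))

        if min(sorted(centroids), key=dist) == labels[held]:
            correct += 1
    return correct / len(samples)

def ga_select(samples, labels, generations=20, pop_size=12, seed=0):
    """Evolve binary gene masks; fitness rewards accuracy and penalises set size."""
    rng = random.Random(seed)
    n_genes = len(samples[0])

    def fitness(mask):
        return loo_accuracy(samples, labels, mask) - 0.01 * sum(mask)

    population = [[rng.randint(0, 1) for _ in range(n_genes)] for _ in range(pop_size)]
    history = []  # best fitness seen at the start of each generation
    for _ in range(generations):
        population.sort(key=fitness, reverse=True)
        history.append(fitness(population[0]))
        parents = population[: pop_size // 2]  # truncation selection; survivors are kept
        children = []
        while len(parents) + len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            cut = rng.randrange(1, n_genes)   # one-point crossover
            child = a[:cut] + b[cut:]
            flip = rng.randrange(n_genes)     # mutate exactly one bit
            child[flip] = 1 - child[flip]
            children.append(child)
        population = parents + children
    best = max(population, key=fitness)
    return best, fitness(best), history

# Hypothetical toy data: 8 samples x 4 genes; only the first gene tracks the class.
samples = [
    [1.0, 5.0, 2.0, 7.0], [1.2, 2.0, 6.0, 3.0], [0.9, 6.0, 3.0, 5.0], [1.1, 3.0, 5.0, 4.0],
    [3.0, 5.5, 2.5, 6.0], [3.2, 2.5, 5.5, 3.5], [2.9, 6.5, 3.5, 4.5], [3.1, 3.5, 4.5, 6.5],
]
labels = ["healthy"] * 4 + ["AD"] * 4

best_mask, best_fitness, history = ga_select(samples, labels)
print(best_mask, best_fitness)
```

Because each candidate is a whole subset scored by a classifier, the search evaluates genes in combination, and the size penalty pushes it towards small sets. Keeping the better half of each generation means the best fitness never decreases from one generation to the next.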
Publisher
Springer Science and Business Media LLC
Subject
Applied Mathematics, Computer Science Applications, Molecular Biology, Biochemistry, Structural Biology
Cited by
24 articles.