Affiliation:
1. Oakland University, MI
2. William Beaumont Hospital, Royal Oak, MI
Abstract
A high level of expertise, rigorous algorithms and methods are needed to adequately mine and harness Mass Spectrometer generated data due to its unique nature and structure. Hitherto, peptide ions are matched with theoretical results and/or public databases in order to identify expressed proteins in analyzed protein source samples, but this is done on a spectrum by spectrum basis. In this study, we present a mechanism that extends the principle of K-nearest neighbor algorithm for mining pools of mass spectrometer saliva data towards discovering and characterizing patterns for diagnosing Alzheimer's disease. The methodology discusses feature selection by correlation matrix, matrix to vector decomposition, an extension of Euclidean distance formula, and successfully classifies donor samples into the three stages of Alzheimer's disease with over 85% accuracy without collaborating clinical records.
Publisher
Association for Computing Machinery (ACM)
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献