Abstract
AbstractParkinson’s disease (PD) is a progressive neurodegenerative disorder affecting the central nervous system, often diagnosed in its advanced stages due to the absence of sensitive biomarkers. With this objective in mind, our study conducted a comprehensive analysis of differentially expressed genes (DEGs) sourced from blood-based microarray datasets to uncover potential biomarkers and developed a machine learning based classifier to conduct two step validations. By analyzing gene expression of three projects, we identified 678 DEGs, consisting of 337 genes showing upregulation and 341 genes presenting downregulation. Additionally, insights from functional enrichment and the protein-protein network analysis indicate thatHLA-F,IRF-1, andRPS28have the potential to serve as biomarkers for diagnosing PD. Simultaneously, we employed feature selection techniques such as Least Absolute Shrinkage and Selection Operator with Cross Validation (LassoCV) followed by Recursive Feature Elimination with Cross Validation (REFCV) to filter our initial dataset of 13,249 genes down to 43 genes, which were subsequently used to train the machine learning-based classifier models. These 43 genes formed the basis for training and testing various machine learning models, including logistic regression, random forest, naive Bayes, k-nearest neighbors, support vector machine, and deep learning based artificial neural networks. Our models demonstrated robust performance, with Support Vector Machine outperforming others by 0.65 accuracy (95%CI: 0.58-0.66), 0.70 AUC-ROC (95%CI: 0.70-0.71) and 0.35 MCC (95%CI: 0.34-0.39). The model was implemented to develop the PitArray tool for non-invasive detection of PD from blood. PitArray is available at:https://github.com/Arittra95/PitArray.Key PointsHLA-F, IRF-1,andRPS28were identified as potential biomarkers for Parkinson’s disease diagnosis.Several sophisticated feature selection methods recognized 43 genes which were then used to build a machine learning model.A Support Vector Machine based tool named PitArray was developed which could distinguish Parkinson’s disease patients from healthy people based on blood transcriptome data.
Publisher
Cold Spring Harbor Laboratory
Reference71 articles.
1. Parkinson disease
2. “Parkinson disease.” https://www.who.int/news-room/fact-sheets/detail/parkinson-disease (accessed Sep. 19, 2023).
3. “Parkinson’s Disease: Challenges, Progress, and Promise,” National Institute of Neurological Disorders and Stroke. https://www.ninds.nih.gov/current-research/focus-disorders/focus-parkinsons-disease-research/parkinsons-disease-challenges-progress-and-promise (accessed Sep. 19, 2023).
4. A. Kouli , K. M. Torsney , and W.-L. Kuan , “Parkinson’s Disease: Etiology, Neuropathology, and Pathogenesis,” in Parkinson’s Disease: Pathogenesis and Clinical Aspects, T. B. Stoker and J. C. Greenland , Eds., Brisbane (AU): Codon Publications, 2018. Accessed: Sep. 19, 2023. [Online]. Available: http://www.ncbi.nlm.nih.gov/books/NBK536722/
5. Neuropathological assessment of Parkinson's disease: refining the diagnostic criteria