Advancing Alzheimer's Disease Risk Prediction: Development and Validation of a Machine Learning-Based Preclinical Screening Model (Preprint)

Authors:

Cao Shihua, Wang Bingsheng, Shi Yankai, Yao Jiani, Lou Xiajing, He Danni, Chen Yanfei, Qi Wenhao, Wang Bing, Dong Chaoqun, Dong Chaoqun, Zhu Xiaohong, Shi Aili, Cheng Lingling

Abstract

BACKGROUND

Alzheimer's disease (AD), the most prevalent form of dementia, poses a significant challenge for individuals aged 65 and older. Most existing AD risk prediction tools are highly accurate, but their complexity and limited accessibility hinder practical use.

OBJECTIVE

Our goal was to leverage machine learning techniques to develop a prediction model that is not only highly efficient but also cost-effective.

METHODS

Using data from 2,968 individuals from the National Alzheimer's Coordinating Center, we constructed models with two popular machine learning algorithms, random forest and XGBoost (a gradient boosting method), alongside traditional logistic regression. Model performance was evaluated on five key criteria: the Brier score, accuracy (ACC), specificity (SPE), sensitivity (SEN), and area under the receiver operating characteristic curve (AUC).
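The five evaluation criteria can be illustrated with a minimal sketch. This is not the authors' code: the function names, the 0.5 decision threshold, and the toy labels and probabilities are all assumptions made for illustration, and the abstract does not say how the threshold was chosen.

```python
# Minimal sketch of the five criteria named above, in plain Python.
# The 0.5 threshold and the toy data are illustrative assumptions.

def confusion_counts(y_true, y_prob, threshold=0.5):
    """Count true/false positives and negatives at a probability threshold."""
    tp = fp = tn = fn = 0
    for y, p in zip(y_true, y_prob):
        pred = 1 if p >= threshold else 0
        if y == 1 and pred == 1:
            tp += 1
        elif y == 1 and pred == 0:
            fn += 1
        elif y == 0 and pred == 1:
            fp += 1
        else:
            tn += 1
    return tp, fp, tn, fn

def brier_score(y_true, y_prob):
    """Mean squared difference between predicted probability and outcome."""
    return sum((p - y) ** 2 for y, p in zip(y_true, y_prob)) / len(y_true)

def auc(y_true, y_prob):
    """AUC as the probability that a random positive outranks a random
    negative (ties count as half), computed over all positive/negative pairs."""
    pos = [p for y, p in zip(y_true, y_prob) if y == 1]
    neg = [p for y, p in zip(y_true, y_prob) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def evaluate(y_true, y_prob, threshold=0.5):
    tp, fp, tn, fn = confusion_counts(y_true, y_prob, threshold)
    return {
        "brier": brier_score(y_true, y_prob),
        "acc": (tp + tn) / len(y_true),
        "spe": tn / (tn + fp),   # true negative rate
        "sen": tp / (tp + fn),   # true positive rate
        "auc": auc(y_true, y_prob),
    }

# Synthetic toy data, not from the study.
y_true = [1, 0, 1, 0, 0, 1, 0, 0]
y_prob = [0.9, 0.2, 0.6, 0.4, 0.7, 0.3, 0.1, 0.5]
metrics = evaluate(y_true, y_prob)
print(metrics)
```

ACC, SPE, and SEN depend on the chosen threshold, whereas the Brier score and AUC are computed directly from the predicted probabilities.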

RESULTS

The average age of the 2,968 participants was 71.1 years (SD 6.8), and 60.3% were female. The prevalence of AD was 23.15% (n=687). The machine learning-based Boruta algorithm identified 16 significant predictors from 33 potential risk factors, with a minimum root mean squared error (RMSE) of 0.27 when the top 5 variables were selected (education level, depression, rapid eye movement sleep disorder, age, and anxiety). We used SHAP feature importance from the gradient boosting decision tree model to rank the top 20 significant predictors and, based on cross-validation results, selected the top 4 variables (education level, age, marital status, and depression) to construct our model. The ensemble algorithms XGBoost and random forest performed better than the logistic regression model. Notably, XGBoost outperformed the other models, achieving an AUC of 0.78, ACC of 0.691, SPE of 0.677, SEN of 0.739, precision (PRE) of 0.403, and Brier score of 0.140.
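The reported XGBoost metrics can be related to one another with a short worked calculation. The sketch below assumes, hypothetically, that the metrics were computed on data with the same AD prevalence as the full cohort (687 of 2,968). The abstract does not state the evaluation split, so this is a consistency check and not a reproduction of the authors' analysis.

```python
# Worked check: accuracy and precision implied by sensitivity, specificity,
# and prevalence. Assumes the evaluation data share the full-cohort
# prevalence, which the abstract does not confirm.

def accuracy_from_rates(sen, spe, prevalence):
    """ACC = SEN * prevalence + SPE * (1 - prevalence)."""
    return sen * prevalence + spe * (1 - prevalence)

def precision_from_rates(sen, spe, prevalence):
    """PRE = TP / (TP + FP), expressed as fractions of the population."""
    tp = sen * prevalence
    fp = (1 - spe) * (1 - prevalence)
    return tp / (tp + fp)

prevalence = 687 / 2968          # AD cases / participants, from the abstract
sen, spe = 0.739, 0.677          # reported XGBoost sensitivity, specificity

implied_acc = accuracy_from_rates(sen, spe, prevalence)
implied_pre = precision_from_rates(sen, spe, prevalence)
print(f"implied ACC = {implied_acc:.3f}, implied PRE = {implied_pre:.3f}")
```

Under this assumption the implied accuracy rounds to the reported 0.691, while the implied precision (about 0.408) is slightly above the reported 0.403. A gap of that size could arise if the evaluation set had a slightly different class balance than the full cohort, but the abstract does not give enough detail to confirm this.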

CONCLUSIONS

Individual characteristics and psychological status were more important predictors than past medical history. Machine learning-based AD risk assessment tools for older adults are easily accessible and show reasonable discriminative ability, which may help guide preclinical screening for AD in the elderly population.

Publisher

JMIR Publications Inc.
