The comparison between multiple linear regression and machine learning methods in predicting cognitive function in Chinese type 2 diabetes-Reference-Cited by-同舟云学术

The comparison between multiple linear regression and machine learning methods in predicting cognitive function in Chinese type 2 diabetes

Published:2023-06-28 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Liu Chi-Hao¹,Peng Chung-Hsin²,Huang Li-Ying²,Chen Fang-Yu²,Kuo Chun-Heng²,Wu Chung-Ze³,Cheng Yu-Fang³

Affiliation:

1. Kaohsiung Armed Forces General Hospital

2. Fu Jen Catholic University

3. Taipei Medical University

Abstract

Abstract The prevalence of type 2 diabetes (T2D) has been increasing drastically in recent decades. In the same time, it has been noted that dementia is related to T2D. In the past, traditional multiple linear regression (MLR) is the most commonly used method in analyzing these kinds of relationships. However, machine learning methods (Mach-L) have been emerged recently. These methods could capture non-linear relationships better than the MLR. In the present study, we enrolled old T2D and used four different Mach-L methods to analyze the relationships between risk factors and cognitive function. Our goals were first, to compare the accuracy between MLR and Mach-L in predicting cognitive function and second, to rank importance of the risks for impaired cognitive function in T2D. There were 197 old T2D enrolled (98 men and 99 women). Demographic and biochemistry data were used as independent variables and the cognitive function assessment (CFA) score was measured by Montreal Cognitive Assessment which was regarded as independent variable. In addition to traditional MLR, random forest (RF), stochastic gradient boosting (SGB), Naïve Byer’s classifier (NB) and eXtreme gradient boosting (XGBoost) were also applied. Our results showed that all the RF, SGB, NB and XGBoost outperformed than the MLR. Education level, age, frailty score, fasting plasma glucose and body mass index were identified as the important factors from the more to the less important. In conclusion, our study demonstrated that RF, SGB, NB and XGBoost are more accurate than the MLR and in predicting CFA score. By these methods, the importance ranks of the risk factors are education level, age, frailty score, fasting plasma glucose and body mass index accordingly in a Chinese T2D cohort.

Publisher

Research Square Platform LLC

Reference37 articles.

1. IDF Diabetes. Atlas https://diabetesatlas.org/.

2. Diabetes as a risk factor for dementia and mild cognitive impairment: a meta-analysis of longitudinal studies;Cheng G;Intern Med J,2012

3. Type 2 Diabetes as a Risk Factor for Dementia in Women Compared With Men: A Pooled Analysis of 2.3 Million People Comprising More Than 100,000 Cases of Dementia;Chatterjee S;Diabetes Care,2016

4. An updated meta-analysis of cohort studies: Diabetes and risk of Alzheimer's disease;Zhang J;Diabetes Res Clin Pract,2017

5. Physical inactivity, cardiometabolic disease, and risk of dementia: an individual-participant meta-analysis;Kivimaki M;BMJ,2019