Affiliation:
1. Yonsei University
2. Gauss Labs
3. Yonsei University College of Medicine
Abstract
Abstract
Objective
As delayed diagnosis of diabetes increases the risk of irreversible diabetes complications, detecting undiagnosed diabetes using a prediction model could be useful. Recently, machine learning-based disease prediction models have been used; however, the performance of the machine learning-based prediction model and traditional statistics-based prediction models in predicting undiagnosed diabetes has not been compared. Therefore, we developed a machine learning-based undiagnosed diabetes prediction model and compared its prediction performance with that of a traditional statistics-based prediction model.
Methods
We used the 2014–2020 Korean National Health and Nutrition Examination Survey (KNHANES) (N = 32,827). The KNHANES 2014–2018 data were used as training and internal validation sets and the 2019–2020 data as external validation sets. The receiver operating characteristic curve area under the curve (AUC) was used to compare the prediction performance of the machine learning-based and the traditional statistics-based prediction models.
Results
Using sex, age, resting heart rate, and waist circumference as features, the machine learning-based model showed a higher AUC (0.788 vs. 0.740) than that of the traditional statistical-based prediction model. Using sex, age, waist circumference, family history of diabetes, hypertension, alcohol consumption, and smoking status as features, the machine learning-based prediction model showed a higher AUC (0.802 vs. 0.759) than the traditional statistical-based prediction model. The machine learning-based prediction model using features for maximum prediction performance showed a higher AUC (0.819 vs. 0.765) than the traditional statistical-based prediction model.
Conclusions
machine learning-based prediction models using anthropometric and lifestyle measurements showed good performance in predicting undiagnosed diabetes. The machine learning-based prediction model outperformed the traditional statistics-based prediction models.
Publisher
Research Square Platform LLC
Reference45 articles.
1. Diabetes fact sheets in Korea, 2020: an appraisal of current status;Jung C-H;Diabetes & metabolism journal,2021
2. Will new diagnostic criteria for diabetes mellitus change phenotype of patients with diabetes? Reanalysis of European epidemiological data;Group DS;BMJ,1998
3. Kim SR. The effects of diabetes self-awareness on diabeteic patients' subjective health level [dissertation]: Ajou university; 2013.
4. Prevalence of diabetes, impaired fasting glucose, and impaired glucose tolerance in US adults: the Third National Health and Nutrition Examination Survey, 1988–1994;Harris MI;Diabetes Care,1998
5. A simple screening score for diabetes for the Korean population: development, validation, and comparison with other scores;Lee Y-h;Diabetes Care,2012