Author:
Bobo Liu,Xiaofei Kang,Zongyue Zhang,Liying Hou,Yidan Wang
Abstract
Objective to compare the predictive efficacy of random forest, BP neural network, gradient boosting tree and plain Bayesian models for the prevalence of diabetes. Practical application: by measuring the basic indicators such as individual height, weight, triglyceride, etc., the model can be used to predict the probability of individual disease, and then targeted to improve some indicators of the body, to achieve the effect of diabetes prevention intervention, and to provide new ideas for diabetes prevention research. Methods Using the 2009 survey data from the China Health and Nutrition Survey (CHNS), the data for men and women were statistically analyzed by dividing them into four groups according to the visceral fat index (VAI). Subsequently, the processed samples were divided into training sets and test sets by 4:1, and four machine learning models, namely, random forest, BP neural network, gradient lifting tree, and naive Bayes, were constructed. The experiment was conducted using a five-fold cross validation method, and the prediction effect was evaluated through indicators such as sensitivity, accuracy, and AUC. Results One-way ANOVA showed that the differences in height, weight, waist circumference, triglycerides, high-density lipoprotein cholesterol, body mass index, fasting blood glucose, and glycosylated hemoglobin among different VAI quartile groups were statistically significant (P<0.05). Comparison of prediction effects of four models: sensitivity 75.75%, 90.77%, 76.31%, 98.57%, accuracy 74.80%, 87.82%, 74.64%, 92.00%, AUC 0.713, 0.716, 0.668, 0.676, and Jorden index 0.34, 0.27, 0.22 and 0.21. Conclusion Based on the CHNS 2009 survey data, the BP neural network model has a better effect and stability in predicting diabetes.
Reference20 articles.
1. Lin XL. Analysis of global, regional and national disease burden of diabetes from 1990 to 2017 [D]. Zhejiang University, 2020. DOI:10.27461/d. cnki. gzjdx. 2020.003332.
2. Yang JJ, Yu D, Wen W, et al. Association of Diabetes With All-Cause and Cause-Specific Mortality in Asia: A Pooled Analysis of More Than 1 Million Participants. JAMA Netw Open. 2019; 2(4):e192696. Published 2019 Apr 5. doi:10.1001/jamanetworkopen. 2019.2696
3. Zhang LW, Ruan MH, Liu JL, et al. Analysis of research and development trend in the field of diabetes [J/OL]. HEREDITY:1-26[2022-10-21]. DOI:10.16288/j. yczz. 22-272.
4. Ma SL, Xv YJ, Meng RL, et al. Effects of blood pressure and overweight/obesity on diabetes in Guangdong residents aged 40 and above [J]. Chinese Journal of Disease Control, 2022, 26(04):397-400+429. DOI:10.16462/j. cnki. zhjbkz. 2022.04.006.
5. Sun H, Saeedi P, Karuranga S, et al. IDF Diabetes Atlas: Global, regional and country-level diabetes prevalence estimates for 2021 and projections for 2045. Diabetes Res Clin Pract. 2022; 183:109119. DOI:10.1016/j. diabres. 2021.109119