Affiliation:
1. Qinghai University
2. Qinghai Provincial People’s Hospital
3. The Fourth People’s Hospital of Qinghai Province
Abstract
Abstract
OBJECTIVE
Using machine learning to construct a prediction model for the risk of diabetes kidney disease (DKD) in the American diabetes population and evaluate its effect.
METHODS
Firstly, a dataset of five cycles from 2009 to 2018 was obtained from the National Health and Nutrition Examination Survey (NHANES) database, weighted and then standardized (with the study population in the United States), and the data was processed and randomly grouped using R software. Next, variable selection for DKD patients was conducted using Lasso regression, two-way stepwise iterative regression, and random forest methods. A nomogram model was constructed for the risk prediction of DKD. Finally, the predictive performance, predictive value, calibration, and clinical effectiveness of the model were evaluated through the receipt of ROC curves, Brier score values, calibration curves (CC), and decision curves (DCA). And we will visualize it..
RESULTS
A total of 4371 participants were selected and included in this study. Patients were randomly divided into a training set (n = 3066 people) and a validation set (n = 1305 people) in a 7:3 ratio; Using machine learning algorithms and drawing Venn diagrams, five variables significantly correlated with DKD risk were identified, namely Age, Hba1c, ALB, Scr, and TP; The area under the ROC curve (AUC) of the training set evaluation index for this model is 0.735, the net benefit rate of DCA is 2% -90%, and the Brier score is 0.172; The area under the ROC curve of the validation set (AUC) is 0.717, and the DCA curve shows a good net benefit rate. The Brier score is 0.177, and the calibration curve results of the validation set and training set are almost consistent.
CONCLUSION
The DKD risk line chart model constructed in this study has good predictive performance, which helps to evaluate the risk of DKD as early as possible in clinical practice and formulate relevant intervention and treatment measures. The visual result can be used by doctors or individuals to estimate the probability of DKD risk, as a reference to help make better treatment decisions.
Publisher
Research Square Platform LLC
Reference32 articles.
1. Qi C, Mao X, Zhang Z, et al. Classification and Differential Diagnosis of Diabetic Nephropathy[J]. Journal of Diabetes Research, 2017, 2017: 8637138.
2. GBD 2021 Diabetes Collaborators. Global, regional, and national burden of diabetes from 1990 to 2021, with projections of prevalence to 2050: a systematic analysis for the Global Burden of Disease Study 2021. Lancet. 2023;402(10397):203–234. doi: 10.1016/S0140-6736(23)01301-6. Epub 2023 Jun 22. Erratum in: Lancet. 2023;402(10408):1132. PMID: 37356446; PMCID: PMC10364581.
3. IDF Diabetes Atlas: Global, regional and country-level diabetes prevalence estimates for 2021 and projections for 2045[J];Sun H;Diabetes Research and Clinical Practice,2022
4. N S. Diabetic Nephropathy: Challenges in Pathogenesis, Diagnosis, and Treatment[J]. BioMed research international, 2021, 2021.
5. A distributional code for value in dopamine-based reinforcement learning;Dabney W;Nature,2020