Improving Machine Learning Diabetes Prediction Models for the Utmost Clinical Effectiveness-Reference-Cited by-同舟云学术

Improving Machine Learning Diabetes Prediction Models for the Utmost Clinical Effectiveness

Published:2022-11-14 Issue:11 Volume:12 Page:1899
ISSN:2075-4426
Container-title:Journal of Personalized Medicine
language:en
Short-container-title:JPM

Author:

Shin Juyoung^ORCID,Lee Joonyub^ORCID,Ko Taehoon^ORCID,Lee Kanghyuck,Choi Yera,Kim Hun-Sung^ORCID

Abstract

The early prediction of diabetes can facilitate interventions to prevent or delay it. This study proposes a diabetes prediction model based on machine learning (ML) to encourage individuals at risk of diabetes to employ healthy interventions. A total of 38,379 subjects were included. We trained the model on 80% of the subjects and verified its predictive performance on the remaining 20%. Furthermore, the performances of several algorithms were compared, including logistic regression, decision tree, random forest, eXtreme Gradient Boosting (XGBoost), Cox regression, and XGBoost Survival Embedding (XGBSE). The area under the receiver operating characteristic curve (AUROC) of the XGBoost model was the largest, followed by those of the decision tree, logistic regression, and random forest models. For the survival analysis, XGBSE yielded an AUROC exceeding 0.9 for the 2- to 9-year predictions and a C-index of 0.934, while the Cox regression achieved a C-index of 0.921. After lowering the threshold from 0.5 to 0.25, the sensitivity increased from 0.011 to 0.236 for the 2-year prediction model and from 0.607 to 0.994 for the 9-year prediction model, while the specificity showed negligible changes. We developed a high-performance diabetes prediction model that applied the XGBSE algorithm with threshold adjustment. We plan to use this prediction model in real clinical practice for diabetes prevention after simplifying and validating it externally.

Funder

Daewoong Pharmaceutical company

Publisher

MDPI AG

Subject

Medicine (miscellaneous)

Link

https://www.mdpi.com/2075-4426/12/11/1899/pdf

Reference47 articles.

1. An investigation of the use of a general health examination center;J. Korean Acad. Fam. Med.,1991

2. National screening program for the transitional ages in Korea;J. Korean Med. Assoc.,2010

3. National health examination expansion policy;J. Korean Med. Assoc.,2017

4. Population-based screening for cancer: Hope and hype;Nat. Rev. Clin. Oncol.,2016

5. Environmental and genetic contributions to diabetes;Metabolism,2019

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. PyCaret for Predicting Type 2 Diabetes: A Phenotype- and Gender-Based Approach with the “Nurses’ Health Study” and the “Health Professionals’ Follow-Up Study” Datasets;Journal of Personalized Medicine;2024-07-29

2. Machine Learning Prediction of Autism Spectrum Disorder Through Linking Mothers’ and Children’s Electronic Health Record Data;2024-03-26