Comparative study on risk prediction model of type 2 diabetes based on machine learning theory: a cross-sectional study

Author:

Wang Shu,Chen Rong,Wang Shuang,Kong Danli,Cao Rudai,Lin Chunwen,Luo Ling,Huang Jialu,Zhang Qiaoli,Yu HaibingORCID,Ding Yuan Lin

Abstract

ObjectivesTo compare the prediction effects of six models based on machine learning theories, which can provide a methodological reference for predicting the risk of type 2 diabetes mellitus (T2DM).Setting and participantsThis study was based on the monitoring data of chronic disease risk factors in Dongguan residents from 2016 to 2018. The multistage cluster random sampling method was adopted at each monitoring site, and 4157 people were finally selected. In the initial population, we excluded individuals with more than 20% missing data and eventually included 4106 subjects.DesignK nearest neighbour algorithm and synthetic minority oversampling technique were used to process the data. Single factor analysis was used for preliminary selection of variables. The 10-fold cross-validation was used to optimise the parameters of some models. The accuracy, precision, recall and area under receiver operating characteristic curve (AUC) were used to evaluate the prediction effect of models, and Delong test was used to analyse the differences of AUC values of each model.ResultsAfter balancing data, the sample size increased to 8013, of which 4023 are patients with T2DM and 3990 in control group. The comparison results of the six models showed that back propagation neural network model has the best prediction effect with 93.7% accuracy, 94.6% accuracy, 92.8% recall and the AUC value of 0.977, followed by logistic model, support vector machine model, CART decision tree model and C4.5 decision tree model. Deep neural network has the worst prediction performance, with 84.5% accuracy, 86.1% precision, 82.9% recall and the AUC value of 0.845.ConclusionsIn this study, six types of risk prediction models for T2DM were constructed, and the predictive effects of these models were compared based on various indicators. The results showed that back propagation neural network based on the selected data set had the best prediction effect.

Funder

the Dongguan City Science and Technology Correspondent Project

the Innovation and entrepreneurship training program for college students of Guangdong Medical University

the Guangdong science and technology research project of traditional Chinese Medicine

the Characteristic Innovation Project of Guangdong Province General University

the Undergraduate Innovation Experiment Project of Guangdong Medical University

the Basic and Applied Basic Research Foundation of Guangdong Province Regional Joint Fund Project

the Natural Science Key Cultivation Project of Scientific Research Fund of Guangdong Medical University

the Natural Science Foundation of Basic and Applied Basic Research Foundation of Guangdong Province

the Medical Scientific Research Foundation of Guangdong Province

the Dongguan Social Development Technology Project

the Discipline Construction Project of Guangdong Medical University

the Zhanjiang City science and technology development special fund competitive allocation project

Publisher

BMJ

Subject

General Medicine

Cited by 1 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3