Machine Learning Techniques for Diabetes Mellitus Based on Lifestyle Predictors

Author:

Ansari Gufran Ahmad (1), Bhat Salliah Shafi (2), Ansari Mohd Dilshad (3)

Affiliation:

1. Department of Computer Science and Software Engineering, University of Hail, Kingdom of Saudi Arabia

2. B.S. Abdur Rahman Crescent Institute of Science and Technology, Chennai-48, India

3. SRM University Delhi-NCR, Sonepat, Haryana, India

Abstract

Background: The prevalence of diabetes has been rising in recent years, and prior research has shown Machine Learning Techniques (MLTs) to be useful tools for predicting it. Predicting diabetes early is necessary to improve medical outcomes and prevent its onset. This research examines the accuracy of six different MLTs for predicting diabetes using lifestyle data gathered from UCI (University of California, Irvine), and proposes a new framework for the early detection of diabetes based on lifestyle factors. Various MLTs, such as Logistic Regression (LR), Decision Tree Classification (DTC), Random Forest Classification (RFC), Support Vector Classification (SVC), and K-Nearest Neighbors Classification (KNC), have been evaluated with tenfold cross-validation, and the results obtained from the different techniques have been verified. Among all classification techniques, LR achieved the highest accuracy of 93%, with a precision of 92%, a recall of 94%, an F1 score of 93%, and a weighted average of 90%. The proposed framework can be used by the healthcare sector to predict diabetes early. It can also be used with datasets from various sectors that share diabetes-related data.

Method: In this paper, the proposed framework is used to predict Diabetes Mellitus (DM) in the healthcare system, diagnose various ailments, and assess whether the machine learning algorithms (MLAs) perform well. The proposed system has been developed based on MLTs for the classification of DM. The resulting intelligent framework for DM illustrates the full workflow from data input to output. The five algorithms (LR, DTC, RFC, SVC, and KNC) have been compared in terms of accuracy, precision, recall, and F1 score.

Results: Results have been obtained from the experimental setting using MLTs for DM prediction based on lifestyle predictors.
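
The comparison described under Method (five classifiers, tenfold cross-validation, four metrics) can be sketched roughly as follows. This is a minimal illustration using scikit-learn, not the authors' code: the data are a synthetic stand-in generated with `make_classification`, the 520-row size is borrowed from the count reported for the age feature, and the number of features, the scaling step, and all hyperparameters are assumptions.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import StratifiedKFold, cross_validate
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in for the lifestyle dataset (assumption: 520 rows, 10 features).
X, y = make_classification(
    n_samples=520, n_features=10, n_informative=5, random_state=0
)

# The five techniques named in the abstract, with assumed default-style settings.
models = {
    "LR": LogisticRegression(max_iter=1000),
    "DTC": DecisionTreeClassifier(random_state=0),
    "RFC": RandomForestClassifier(n_estimators=100, random_state=0),
    "SVC": SVC(),
    "KNC": KNeighborsClassifier(n_neighbors=5),
}

metrics = ["accuracy", "precision", "recall", "f1"]
# Tenfold cross-validation; stratification keeps class balance in each fold.
cv = StratifiedKFold(n_splits=10, shuffle=True, random_state=0)

results = {}
for name, model in models.items():
    # Scaling inside the pipeline is fitted on the training folds only.
    pipeline = make_pipeline(StandardScaler(), model)
    scores = cross_validate(pipeline, X, y, cv=cv, scoring=metrics)
    results[name] = {m: float(scores[f"test_{m}"].mean()) for m in metrics}

for name, scores in results.items():
    print(name, {m: round(v, 3) for m, v in scores.items()})
```

Because the data here are synthetic, the printed scores say nothing about the figures reported in the abstract; the sketch only shows how the mean of each metric over ten folds could be collected per model.
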
Descriptive statistics of the lifestyle characteristics have been reported along with their corresponding metrics, such as mean, standard deviation, minimum, and maximum. For instance, the count, mean, standard deviation, minimum, 25th percentile, 50th percentile, 75th percentile, and maximum of the age feature were 520.0, 48.02, 12.151, 16.0, 39.0, 47.5, 57.0, and 90.0, respectively, as shown in Fig. (10). Feature engineering is crucial to the process of constructing MLTs. Insignificant or incorrect features may degrade a model's performance. With careful feature selection, training time is drastically reduced and accuracy is increased. In machine learning frameworks, feature selection strategies include filter, wrapper, embedded, and hybrid techniques. Diabetes is a chronic and dangerous disease that affects an alarming number of people around the world. In this research work, MLTs have been used for early DM prediction based on biological variables. Data on patients' lifestyles have been thoroughly examined in order to create the framework. Canonical Correlation Analysis (CCA) has been used to select the ideal combination of lifestyle features. Finally, five alternative machine learning techniques have been applied with 10-fold cross-validation for disease prediction.

Conclusion: To our knowledge, this is the first time a framework has been proposed that yields prediction results considerably better than those from earlier research. The evaluation metrics indicate that the results obtained in this work are accurate and reliable.
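
The eight values reported for the age feature in the Results follow the usual layout of a summary table: count, mean, standard deviation, minimum, three quartiles, and maximum. As a minimal sketch of how such a summary could be computed, the snippet below uses only the Python standard library; the `describe` helper and the `ages` sample are hypothetical and are not taken from the study's data.

```python
import statistics


def describe(values):
    """Return count, mean, sample std, min, quartiles, and max for a numeric list."""
    # "inclusive" treats the data as the full population range and
    # interpolates linearly between neighbouring observations.
    q1, q2, q3 = statistics.quantiles(values, n=4, method="inclusive")
    return {
        "count": float(len(values)),
        "mean": statistics.mean(values),
        "std": statistics.stdev(values),  # sample standard deviation (n - 1)
        "min": float(min(values)),
        "25%": q1,
        "50%": q2,
        "75%": q3,
        "max": float(max(values)),
    }


# Hypothetical ages, for illustration only.
ages = [16, 30, 39, 45, 47, 48, 57, 60, 90]
summary = describe(ages)

for label, value in summary.items():
    print(f"{label:>5}: {value:.3f}")
```

Reading the reported age figures in this order gives a count of 520.0 records, a mean of 48.02, and a standard deviation of 12.151, which is why the first value is a count rather than a mean.
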

Publisher

Bentham Science Publishers Ltd.
