The Applicability of Machine Learning in Prediabetes Prediction
Author:
Vîrgolici Oana1, Vîrgolici Horia-Marius2, Bologa Ana Ramona3
Affiliation:
1. 1 Bucharest University of Economic Studies , Bucharest , Romania 2. 2 UMF„Carol Davila” , Bucharest , Romania 3. 3 Bucharest University of Economic Studies , Bucharest , Romania
Abstract
Abstract
In over 60% of patients with prediabetes, the evolution to diabetes can be stopped by changing lifestyle. The prevalence of diabetes mellitus was 11.6% and of prediabetes was 16.5% in the Romanian population aged 20-79 years. The deficiencies are not only at the health system level, but also in the application of prediabetes criteria existing in accredited guidelines. Although these guidelines imposed by the international diabetes federation are constantly updated, many doctors do not apply the recommended steps for diagnosis and treatment. We review, in the first part of the paper, several studies in predicting diabetes, which used different algorithms and techniques. In this second part of the paper, we propose a machine learning approach for prediabetes prediction, which uses kNN (k-Nearest Neighbors), DT (decision tree), SVM (Support Vectors Machines) and Logistic Regression (LR) algorithms. We used a dataset with 125 persons (men and women), with the following features: gender (S), serum glucose (G), serum triglycerides (TG), serum high-density lipoprotein cholesterol (HDL), waist circumference (WC) and systolic blood pressure (SBP). We used standardized medical criterion named Adult Treatment Panel III Guidelines (ATP III), which specifies that prediabetes diagnosis can be established if at least three of five parameters are outside the scale of their normal values. We obtained, for both algorithms, encouraging results in evaluating the models (in terms of confusion matrix, f1_score, accuracy_score).
Publisher
Walter de Gruyter GmbH
Subject
General Earth and Planetary Sciences,General Environmental Science
Reference36 articles.
1. Ahuja, R., Sharma, S.C., Ali, M. (2019). A diabetic disease prediction model based on classification algorithms. Ann Emerg Technol Comput., 3(3), 44–52. Retrieved from doi: 10.33166/AETiC.2019.03.005. 2. Alehegn, M., Joshi, R.R., Mulay, P. (2019). Diabetes analysis and prediction using random forest, KNN, Naïve Bayes, and J48: an ensemble approach. Int J Sci Technol Res., 8(9), 1346–1354. 3. Ameena, R.R., Ashadevi, B. (2020). Predictive analysis of diabetic women patients using R. Peter JD, Fernandes SL (eds) Systems simulation and modeling for cloud computing and big data applications” Elsevier Inc., Amsterdam. Retrieved from 10.1016/B978-0-12-819779-0.00006-X. 4. Amour Diwani, S., Sam, A. (2014). Diabetes forecasting using supervised learning techniques. Adv. Comput. Sci.: Int. J., 3(5), 10–18, Retrieved from: http://www.acsij.org/acsij/article/view/156. 5. Anuja Kumari, V., Chitra, R. (2013). Classification of diabetes disease using support vector machine”, Int. J. Eng. Res. Appl., 3, 1797–1801.
|
|