A Diabetic Disease Prediction Model Based on Classification Algorithms-Reference-Cited by-同舟云学术

A Diabetic Disease Prediction Model Based on Classification Algorithms

Published:2019-07-01 Issue:3 Volume:3 Page:44-52
ISSN:2516-029X
Container-title:Annals of Emerging Technologies in Computing
language:en
Short-container-title:AETiC

Author:

Ahuja Ravinder,Sharma Subhash C.,Ali Maaruf^ORCID

Abstract

Diabetes is one of the chronic diseases in the world, 246 million people are inflicted by this disease and according to a World Health Organisation (WHO) report, this figure will increase to 380 million sufferers by 2025. Many other debilitating and critical health issues may further develop if this disease is not diagnosed or remain unidentified. Machine Learning (ML) techniques are now being used in various fields like education, healthcare, business, recommendation system, etc. Healthcare data is complex and high in dimensionality and contains irrelevant information - due to this, the prediction accuracy is low. The Pima Indians Diabetes Dataset was used in this research, it consisted of 768 records. Firstly, the missing values are replaced by the median followed by Linear Discriminant Analysis. Using the Python programming language, feature selection techniques is applied in combination with five classification algorithms: Support Vector Machine (SVM), Multi-Layer Perceptron (MLP), Logistic Regression, Random Forest and Decision Tree. The aim of this paper is to compare the different classification algorithms in order to predict diabetes in patients more accurately. K-fold cross-validation is applied, considering k to be 2, 4, 5 and 10. The performance parameters taken are the: accuracy, precision, recall, F Score and area under the curve. Our study found that the MLP classifier gave the highest accuracy of 78.7% with a recall of 61.26%, precision of 72.45% and F1 Score of 65.97% for k = 4.

Publisher

International Association for Educators and Researchers (IAER)

Subject

Electrical and Electronic Engineering,General Computer Science

Reference33 articles.

1. P. Muntner, L.D. Colantonio, M. Cushman, D.C. Goff, G. Howard, V.J. Howard and M.M. Safford, “Validation of the atherosclerotic cardiovascular disease pooled cohort risk equations”, JAMA 311(14):1406–1415, 2014.

2. B.A. Hamburg and G.E. Inoff, “Relationships between behavioral factors and diabetic control in children and adolescents: A camp study”, Psychosomatic Medicine, 44(4), 321-339, 1982.

3. American Diabetes Association, “Diagnosis, and classification of diabetes mellitus”, Diabetes Care 37 (Supplement 1): S81–S90, 2014.

4. C. Fitzmaurice, C. Allen, R.M. Barber, L. Barregard, Z.A. Bhutta, H. Brenner and T. Fleming, “Global, regional, and national cancer incidence, mortality, years of life lost, years lived with disability, and disability-adjusted life-years for 32 cancer groups, 1990 to 2015: a systematic analysis for the global burden of disease study”, JAMA Oncol. 3(4):524–548, 2017.

5. Y. Shi and F.B. Hu, “The global implications of diabetes and cancer”, Lancet 9933(383):1947–1948, 2014.

Cited by 33 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A novel adaptive weight bi-directional long short-term memory (AWBi-LSTM) classifier model for heart stroke risk level prediction in IoT;PeerJ Computer Science;2024-08-20

2. A predictive machine learning framework for diabetes;Turkish Journal of Engineering;2024-07-28

3. A decision-making tool for the determination of the distribution center location in a humanitarian logistics network;Expert Systems with Applications;2024-03

4. The Use of Neural Networks for the Prediction of Type II Diabetes: A Comparison of Recent Advances and Perspectives;Smart Innovation, Systems and Technologies;2024

5. Exploring machine learning techniques for feature extraction and classification of diabetes related medical data: A comprehensive review;Internet of Things and Machine Learning for Type I and Type II Diabetes;2024