Author:
Jasim Abdulrahman Ahmed,Hazim Layth Rafea,Mohammedqasim Hayder,Mohammedqasem Roa’a,Ata Oguz,Salman Omar Hussein
Abstract
AbstractOne of the most fatal and serious diseases that humans have encountered is diabetes, an illness affecting thousands of individuals yearly. In this era of digital systems, diabetes prediction based on machine learning (ML) is gaining high momentum. One of the benefits of treating patients early in the course of their noncommunicable diseases (NCDs) is that they can avoid costly therapies when the illness worsens later in life. Incidentally, diabetes is complicated by the dearth of medical professionals in underserved areas, such as distant rural communities. In these situations, the Internet of Medical Things and machine learning (ML) models can be used to offer healthcare practitioners the necessary prediction tools to more effectively and timely make decisions, thus assisting the early identification and diagnosis of NCDs. In this study, four conventional and hyper-AdaBoost ML models were trained and tested on the PIMA Indian Diabetes dataset. Patients with diabetes were classified on the basis of laboratory findings. Pre-processing tasks, such as the handling of imbalanced data and missing values, were performed prior to feature importance and normalisation activities. The algorithm with the best performance was examined using precision, accuracy, F1, recall and area under the curve metrics. Then, all ML models were hyper parametrically tuned via grid search to optimise their performance and reduce their error times. The decision process was also evaluated to further enhance the models. The AdaBoost-ET model performed even when features were not selected for binary classification. The model proposed in this study can predict diabetes with unprecedented high accuracy compared with the models in previous studies.
Publisher
Springer Science and Business Media LLC
Reference35 articles.
1. WHO Global Report (2016) Global report on diabetes. Isbn 978:11
2. “2013–2020 GLOBAL ACTION PLAN FOR THE PREVENTION AND CONTROL OF NONCOMMUNICABLE DISEASES,” 2013, Accessed: Feb. 18, 2023. [Online]. Available: www.who.int
3. Kaur P, Sharma N, Singh A, Gill B (2019) CI-DPF: a Cloud IoT based framework for diabetes prediction. In: 2018 IEEE 9th Annual Information Technology, Electronics and Mobile Communication Conference, IEMCON 2018, pp 654–660.https://doi.org/10.1109/IEMCON.2018.8614775
4. Zaccardi F, Webb DR, Yates T, Davies MJ (2016) Pathophysiology of type 1 and type 2 diabetes mellitus: a 90-year perspective. Postgrad Med J 92(1084):63–69. https://doi.org/10.1136/POSTGRADMEDJ-2015-133281
5. Palimkar P, Shaw RN, Ghosh A (2022) Machine learning technique to prognosis diabetes disease: random forest classifier approach. Lect Notes Netw Syst 218:219–244. https://doi.org/10.1007/978-981-16-2164-2_19/COVER