Author:
Banday Mehroush,Zafar Sherin,Agarwal Parul,Alam M. Afshar
Abstract
Two of the leading causes of mortality in the US are diabetes and cardiovascular disease. The first step in halting the course of these disorders in patients is to recognize and anticipate them. Using survey data (as well as test findings), we assess the efficacy of machine learning algorithms for identifying patients who are at risk and pinpoint important characteristics in the data that are causing these diseases in the patients. Hunger, thirst, and frequent urination are signs of high blood sugar. Neglecting diabetes can lead to several issues. It’s critical to be able to identify diabetes early. In this study, we use supervised machine-learning techniques such as Decision Tree, KNN, (RF)Random Forest, Logistic Regression, Ada Boost, and Gradient Boosting to train on the actual data of 520 diabetics. The proposed work has achieved a more thorough comparative analysis between different datasets and their features that may be carried out to pinpoint all the essential characteristics for predicting diabetes.