Affiliation:
1. Department of Computer Science and Engineering, Manipal University Jaipur, Rajasthan, India
2. Department of Computer Science and Engineering, Presidencey University, Banglore, India
Abstract
Background:
Diabetes is spreading in the entire world. In a survey, it is observed that every
generation from child to old age people are suffering from diabetes. If diabetes is not identified in time,
it may lead to deadliest disease. Prediction of diabetes is of the utmost challenging task by machines. In
the human body, diabetes is one of the perilous maladies that creates depended disease such as kidney
disease, heart attack, blindness etc. Thus it is very important to diagnose diabetes in time.
Objective:
Our target is to develop a system using Artificial Neural Network (ANN), with the ability to
predict whether a patient suffers from diabetes or not.
Methods:
This paper illustrates various machine learning techniques in form of literature review; such
as Support Vector Machine, Naïve Bayes, K Nearest Neighbor, Decision Tree, Random Forest, etc. We
applied ANN to predict diabetes. In this paper, the architecture of ANN consists of four hidden layers each
of six neurons and one output layer with one neuron. Optimizer used for the architecture is ‘Adam’.
Results:
We have Pima Indian diabetes dataset of sufficient number of patients with nine different
symptoms with respect to the patients and nine different features in connection with the mathematical
computation/prediction. Hence we bifurcate the dataset into training and testing set in majority and minority
ratio of 80:20 respectively. It facilitates us the majority patient’s data to be used as training set
and minority data to be used as testing set. We train our network for multiple epoch with different activation
function. We used four hidden layers with six neurons in each hidden layer and one output layer.
On the hidden layer, we used multiple activation functions such as sigmoid, ReLU etc. and obtained
beat accuracy (88.71%) in 600 epochs with ReLU activation function. On the output layer, we used only
sigmoid activation function because we have only two classes in our dataset.
Conclusion:
Diabetes prediction by machine is a challenging task. So many machine learning algorithms
exist to predict the diabetes such as Naïve Bayes, decision tree, K nearest neighbor, support vector
machine etc. This paper presents a novel approach to predict whether a patient has diabetes or not
based on Pima Indian diabetes dataset. In this paper, we used artificial neural network to train out network
and it is observed that artificial neural network approach performs better than all other classifiers.
Publisher
Bentham Science Publishers Ltd.