Affiliation:
1. Kalasalingam Academy of Research and Education
2. Mepco Schlenk Engineering College
3. Velammal College of Engineering and Technology
Abstract
Abstract
Stroke is one of the deadliest diseases found in the world which is the second major reason for mortality rate. Early detection of stroke can reduce the mortality due to stroke. Inorder to diagnose it earlier several machine learning techniques are being utilized. This proves that machine learning can also be used for disease prediction for various diseases. Supervised machine learning algorithms has been used for stroke prediction. Important feature responsible for stroke prediction has been done. Inorder to balance the dataset hybrid sampling technique of SMOTE + ENN has been performed so that the results are much appreciable. After sampling, machine learning models has been used for stroke prediction using Logistic Regression (LR), KNearest Neighbour (KNN), Naïve Bayes (NB), Decision Tree (DT), Support Vector Classifier (SVC) and Random Forest (RF). With all the models Random Forest achieved higher performance with accuracy of 99%, recall of 100%, precision of 98% and f-score of 99%. The proposed work also extends the use of different classifiers like Voting, Bagging and Stacking methods. Among all the proposed classifiers stacking provides higher performance with all metrics. The stacking classifer achieved accuracy of 100%, recall of 100%, precision of 99% and f-score of 100%. From the results, it is known that Random forest algorithm perform better with data sampling of SMOTE + ENN than other models.
Publisher
Research Square Platform LLC
Reference22 articles.
1. National Center for Health Statistics. Multiple Cause of Death 2018–2021 on CDC WONDER Database
2. The changing global burden of cancer: transitions in human development and implications for cancer prevention and control;Bray F;Cancer: disease control priorities,2015
3. Stroke risk, phenotypes, and death in COVID-19: systematic review and newly reported cases;Fridman S;Neurology,2020
4. Heart disease and stroke statistics—2022 update: a report from the American Heart Association;Tsao CW;Circulation,2022
5. Machine learning for brain stroke: a review;Sirsat MS;J Stroke Cerebrovasc Dis,2020