RDET stacking classifier: a novel machine learning based approach for stroke prediction using imbalance data-Reference-Cited by-同舟云学术

RDET stacking classifier: a novel machine learning based approach for stroke prediction using imbalance data

Published:2023-11-21 Issue: Volume:9 Page:e1684
ISSN:2376-5992
Container-title:PeerJ Computer Science
language:en
Short-container-title:

Author:

Rehman Amjad¹,Alam Teg²³,Mujahid Muhammad¹^ORCID,Alamri Faten S.⁴,Ghofaily Bayan Al¹,Saba Tanzila¹

Affiliation:

1. Artificial Intelligence & Data Analytics Lab CCIS, Prince Sultan University, Riyadh, Saudi Arabia

2. Department of Industrial Engineering, College of Engineering, Prince Sattam bin Abdulaziz University, Al Kharj, Saudi Arabia

3. Azad Institute of Engineering and Technology, Azad Puram, Chandrawal via Bangla Bazar & Bijnour, Near CRPF Camp, Lucknow, India

4. Department of Mathematical Sciences, College of Science, Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia

Abstract

The main cause of stroke is the unexpected blockage of blood flow to the brain. The brain cells die if blood is not supplied to them, resulting in body disability. The timely identification of medical conditions ensures patients receive the necessary treatments and assistance. This early diagnosis plays a crucial role in managing symptoms effectively and enhancing the overall quality of life for individuals affected by the stroke. The research proposed an ensemble machine learning (ML) model that predicts brain stroke while reducing parameters and computational complexity. The dataset was obtained from an open-source website Kaggle and the total number of participants is 3,254. However, this dataset needs a significant class imbalance problem. To address this issue, we utilized Synthetic Minority Over-sampling Technique (SMOTE) and Adaptive Synthetic Sampling (ADAYSN), a technique for oversampling issues. The primary focus of this study centers around developing a stacking and voting approach that exhibits exceptional performance. We propose a stacking ensemble classifier that is more accurate and effective in predicting stroke disease in order to improve the classifier’s performance and minimize overfitting problems. To create a final stronger classifier, the study used three tree-based ML classifiers. Hyperparameters are used to train and fine-tune the random forest (RF), decision tree (DT), and extra tree classifier (ETC), after which they were combined using a stacking classifier and a k-fold cross-validation technique. The effectiveness of this method is verified through the utilization of metrics such as accuracy, precision, recall, and F1-score. In addition, we utilized nine ML classifiers with Hyper-parameter tuning to predict the stroke and compare the effectiveness of Proposed approach with these classifiers. The experimental outcomes demonstrated the superior performance of the stacking classification method compared to other approaches. The stacking method achieved a remarkable accuracy of 100% as well as exceptional F1-score, precision, and recall score. The proposed approach demonstrates a higher rate of accurate predictions compared to previous techniques.

Funder

Princess Nourah bint Abdulrahman University

Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia

Publisher

PeerJ

Subject

General Computer Science

Link

https://peerj.com/articles/cs-1684.pdf

Reference63 articles.

1. Deep and hybrid learning of MRI diagnosis for early detection of the progression stages in Alzheimer’s disease;Abunadi;Connection Science,2022

2. An automated glowworm swarm optimization with an inception-based deep convolutional neural network for COVID-19 diagnosis and classification;Abunadi;Healthcare,2022