Exploring Machine Learning for Predicting Cerebral Stroke: A Study in Discovery
-
Published:2024-02-07
Issue:4
Volume:13
Page:686
-
ISSN:2079-9292
-
Container-title:Electronics
-
language:en
-
Short-container-title:Electronics
Author:
Mia Rajib1ORCID, Khanam Shapla1, Mahjabeen Amira1ORCID, Ovy Nazmul Hoque1, Ghimire Deepak2ORCID, Park Mi-Jin3, Begum Mst Ismat Ara4, Hosen A. S. M. Sanwar5ORCID
Affiliation:
1. Department of Software Engineering, Daffodil International University, Dhaka 1216, Bangladesh 2. School of AI Convergence, College of Information Technology, Soongsil University, Seoul 06978, Republic of Korea 3. Department of Psychaitry, Seoul St. Mary’s Hospital, College of Medicine, The Catholic University of Korea, Seoul 03083, Republic of Korea 4. Department of Biomedical Sciences, Institute for Medical Science, Jeonbuk National University Medical School, Jeonju 54907, Republic of Korea 5. Department of Artificial Intelligence and Big Data, Woosong University, Daejeon 34606, Republic of Korea
Abstract
Cerebral strokes, the abrupt cessation of blood flow to the brain, lead to a cascade of events, resulting in cellular damage due to oxygen and nutrient deprivation. Contemporary lifestyle factors, including high glucose levels, heart disease, obesity, and diabetes, heighten the risk of stroke. This research investigates the application of robust machine learning (ML) algorithms, including logistic regression (LR), random forest (RF), and K-nearest neighbor (KNN), to the prediction of cerebral strokes. Stroke data is collected from Harvard Dataverse Repository. The data includes—clinical, physiological, behavioral, demographic, and historical data. The Synthetic Minority Oversampling Technique (SMOTE), adaptive synthetic sampling (ADASYN), and the Random Oversampling Technique (ROSE) are used to address class imbalances to improve the accuracy of minority classes. To address the challenge of forecasting strokes from partial and imbalanced physiological data, this study introduces a novel hybrid ML approach by combining a machine learning method with an oversampling technique called ADASYN_RF. ADASYN is an oversampling technique used to resample the imbalanced dataset then RF is implemented on the resampled dataset. Also, other oversampling techniques and ML models are implemented to compare the results. Notably, the RF algorithm paired with ADASYN achieves an exceptional performance of 99% detection accuracy, exhibiting its dominance in stroke prediction. The proposed approach enables cost-effective, precise stroke prediction, providing a valuable tool for clinical diagnosis.
Reference37 articles.
1. Fang, G., Liu, W., and Wang, L. (2020). A machine learning approach to select features important to stroke prognosis. Comput. Biol. Chem., 88. 2. Park, S.J., Hussain, I., Hong, S., Kim, D., Park, H., and Benjamin, H.C.M. (2020, January 4–6). Real-time gait monitoring system for consumer stroke prediction service. Proceedings of the 2020 IEEE International Conference on Consumer Electronics (ICCE), Las Vegas, NV, USA. 3. Rajora, M., Rathod, M., and Naik, N.S. (2021, January 7–10). Stroke prediction using machine learning in a distributed environment. Proceedings of the Distributed Computing and Internet Technology: 17th International Conference, ICDCIT 2021, Bhubaneswar, India. Proceedings 17. 4. Stroke disease detection and prediction using robust learning approaches;Tazin;J. Healthc. Eng.,2021 5. Classification of stroke disease using machine learning algorithms;Govindarajan;Neural Comput. Appl.,2020
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|