Predicting Risk of Stroke From Lab Tests Using Machine Learning Algorithms: Development and Evaluation of Prediction Models-Reference-Cited by-同舟云学术

Predicting Risk of Stroke From Lab Tests Using Machine Learning Algorithms: Development and Evaluation of Prediction Models

Published:2021-12-02 Issue:12 Volume:5 Page:e23440
ISSN:2561-326X
Container-title:JMIR Formative Research
language:en
Short-container-title:JMIR Form Res

Author:

Alanazi Eman M^ORCID,Abdou Aalaa^ORCID,Luo Jake^ORCID

Abstract

Background Stroke, a cerebrovascular disease, is one of the major causes of death. It causes significant health and financial burdens for both patients and health care systems. One of the important risk factors for stroke is health-related behavior, which is becoming an increasingly important focus of prevention. Many machine learning models have been built to predict the risk of stroke or to automatically diagnose stroke, using predictors such as lifestyle factors or radiological imaging. However, there have been no models built using data from lab tests. Objective The aim of this study was to apply computational methods using machine learning techniques to predict stroke from lab test data. Methods We used the National Health and Nutrition Examination Survey data sets with three different data selection methods (ie, without data resampling, with data imputation, and with data resampling) to develop predictive models. We used four machine learning classifiers and six performance measures to evaluate the performance of the models. Results We found that accurate and sensitive machine learning models can be created to predict stroke from lab test data. Our results show that the data resampling approach performed the best compared to the other two data selection techniques. Prediction with the random forest algorithm, which was the best algorithm tested, achieved an accuracy, sensitivity, specificity, positive predictive value, negative predictive value, and area under the curve of 0.96, 0.97, 0.96, 0.75, 0.99, and 0.97, respectively, when all of the attributes were used. Conclusions The predictive model, built using data from lab tests, was easy to use and had high accuracy. In future studies, we aim to use data that reflect different types of stroke and to explore the data to build a prediction model for each type.

Publisher

JMIR Publications Inc.

Subject

Computer Science Applications,Health Informatics,Medicine (miscellaneous)

Reference31 articles.

1. An Updated Definition of Stroke for the 21st Century

2. Heart Disease and Stroke Statistics—2019 Update: A Report From the American Heart Association

3. European Stroke Initiative Recommendations for Stroke Management – Update 2003

4. Lifestyle factors and stroke risk: Exercise, alcohol, diet, obesity, smoking, drug use, and stress

5. 2019 ACC/AHA Guideline on the Primary Prevention of Cardiovascular Disease

Cited by 15 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Prediction of stroke using single lead ECG signal: A deep learning approach;2024 IEEE International Conference on Digital Health (ICDH);2024-07-07

2. Leveraging multivariate analysis and adjusted mutual information to improve stroke prediction and interpretability;Neurosciences;2024-07

3. Research on overburden structural characteristics and support adaptability in cooperative mining of sectional coal pillar and bottom coal seam;Scientific Reports;2024-05-20

4. Predictive modelling and identification of key risk factors for stroke using machine learning;Scientific Reports;2024-05-20

5. Machine learning-based prognostication of mortality in stroke patients;Heliyon;2024-04