Rapid triage for ischemic stroke: a machine learning-driven approach in the context of predictive, preventive and personalised medicine-Reference-Cited by-同舟云学术

Rapid triage for ischemic stroke: a machine learning-driven approach in the context of predictive, preventive and personalised medicine

Published:2022-05-27 Issue:2 Volume:13 Page:285-298
ISSN:1878-5085
Container-title:EPMA Journal
language:en
Short-container-title:EPMA Journal

Author:

Zheng Yulu^ORCID,Guo Zheng^ORCID,Zhang Yanbo,Shang Jianjing,Yu Leilei,Fu Ping,Liu Yizhi,Li Xingang^ORCID,Wang Hao^ORCID,Ren Ling,Zhang Wei,Hou Haifeng^ORCID,Tan Xuerui,Wang Wei^ORCID,

Abstract

Abstract Background Recognising the early signs of ischemic stroke (IS) in emergency settings has been challenging. Machine learning (ML), a robust tool for predictive, preventive and personalised medicine (PPPM/3PM), presents a possible solution for this issue and produces accurate predictions for real-time data processing. Methods This investigation evaluated 4999 IS patients among a total of 10,476 adults included in the initial dataset, and 1076 IS subjects among 3935 participants in the external validation dataset. Six ML-based models for the prediction of IS were trained on the initial dataset of 10,476 participants (split participants into a training set [80%] and an internal validation set [20%]). Selected clinical laboratory features routinely assessed at admission were used to inform the models. Model performance was mainly evaluated by the area under the receiver operating characteristic (AUC) curve. Additional techniques—permutation feature importance (PFI), local interpretable model-agnostic explanations (LIME), and SHapley Additive exPlanations (SHAP)—were applied for explaining the black-box ML models. Results Fifteen routine haematological and biochemical features were selected to establish ML-based models for the prediction of IS. The XGBoost-based model achieved the highest predictive performance, reaching AUCs of 0.91 (0.90–0.92) and 0.92 (0.91–0.93) in the internal and external datasets respectively. PFI globally revealed that demographic feature age, routine haematological parameters, haemoglobin and neutrophil count, and biochemical analytes total protein and high-density lipoprotein cholesterol were more influential on the model’s prediction. LIME and SHAP showed similar local feature attribution explanations. Conclusion In the context of PPPM/3PM, we used the selected predictors obtained from the results of common blood tests to develop and validate ML-based models for the diagnosis of IS. The XGBoost-based model offers the most accurate prediction. By incorporating the individualised patient profile, this prediction tool is simple and quick to administer. This is promising to support subjective decision making in resource-limited settings or primary care, thereby shortening the time window for the treatment, and improving outcomes after IS.

Funder

National Natural Science Foundation of China

China-Australia International Collaborative Grant

European Union’s Horizon 2020 Research and Innovation Program under grant agreement

Key Technology Research and Development Program of Shandong

Edith Cowan University Higher Degree by Research Scholarship

Centre for Precision Health HDR Student Award ECU

Edith Cowan University

Publisher

Springer Science and Business Media LLC

Subject

Biochemistry (medical),Health Policy,Drug Discovery