Construction and validation of risk prediction models for pulmonary embolism in hospitalized patients based on different machine learning methods-Reference-Cited by-同舟云学术

Construction and validation of risk prediction models for pulmonary embolism in hospitalized patients based on different machine learning methods

Published:2024-06-25 Issue: Volume:11 Page:
ISSN:2297-055X
Container-title:Frontiers in Cardiovascular Medicine
language:
Short-container-title:Front. Cardiovasc. Med.

Author:

Huang Tao,Huang Zhihai,Peng Xiaodong,Pang Lingpin,Sun Jie,Wu Jinbo,He Jinman,Fu Kaili,Wu Jun,Sun Xishi

Abstract

ObjectiveThis study aims to apply different machine learning (ML) methods to construct risk prediction models for pulmonary embolism (PE) in hospitalized patients, and to evaluate and compare the predictive efficacy and clinical benefit of each model.MethodsWe conducted a retrospective study involving 332 participants (172 PE positive cases and 160 PE negative cases) recruited from Guangdong Medical University. Participants were randomly divided into a training group (70%) and a validation group (30%). Baseline data were analyzed using univariate analysis, and potential independent risk factors associated with PE were further identified through univariate and multivariate logistic regression analysis. Six ML models, namely Logistic Regression (LR), Decision Tree (DT), Random Forest (RF), Naive Bayes (NB), Support Vector Machine (SVM), and AdaBoost were developed. The predictive efficacy of each model was compared using the receiver operating characteristic (ROC) curve analysis and the area under the curve (AUC). Clinical benefit was assessed using decision curve analysis (DCA).ResultsLogistic regression analysis identified lower extremity deep venous thrombosis, elevated D-dimer, shortened activated partial prothrombin time, and increased red blood cell distribution width as potential independent risk factors for PE. Among the six ML models, the RF model achieved the highest AUC of 0.778. Additionally, DCA consistently indicated that the RF model offered the greatest clinical benefit.ConclusionThis study developed six ML models, with the RF model exhibiting the highest predictive efficacy and clinical benefit in the identification and prediction of PE occurrence in hospitalized patients.

Publisher

Frontiers Media SA

Reference46 articles.

1. Joint effects of cancer and variants in the factor 5 gene on the risk of venous thromboembolism;Gran;Haematologica,2016

2. Bioinformatics-based study to detect chemical compounds that show potential as treatments for pulmonary thromboembolism;Sun;Int J Mol Med,2019

3. Update in the diagnosis and management of acute pulmonary embolism for the non-respiratory physician;Ramjug;Clin Med (Lond),2021

4. Acute pulmonary embolism: a review;Freund;JAMA,2022

5. Comparison of the wells score with the revised Geneva score for assessing suspected pulmonary embolism: a systematic review and meta-analysis;Shen;J Thromb Thrombolysis,2016