Prediction, stratification, and explanation of the risk of Venous Thromboembolism in post-operative patients using Electronic Health Records (Preprint)

Author:

Edakalavan Smitha, Andraska Elizabeth, Myers Sara, Hanzel Robert, Neal Matthew, Visweswaran Shyam, Ceschin Rafael

Abstract

BACKGROUND

Postoperative venous thromboembolic events (VTE), encompassing deep vein thrombosis (DVT) and pulmonary embolism (PE), are preventable but can cause severe morbidity and mortality in post-surgical patients. VTE affects around 1 in 1000 individuals each year, and untreated VTE carries a 30% mortality rate, which makes prompt diagnosis and therapy vital. These events result in $1 billion in hospital costs every year. While prophylaxis is crucial for preventing VTE, some patients cannot readily receive it because of bleeding risk, allergies or adverse reactions, recent surgery or trauma, severe liver or kidney disease, or drug interactions. Of the patients who develop VTE after surgery, around 40% do so during the initial hospital stay, whereas 60% develop it within 30-90 days after discharge from the hospital. Existing VTE risk scoring systems, such as Caprini and Rogers, are suboptimal and can be cumbersome and difficult to translate into patient-specific clinical decision-making. Though machine learning approaches have been used to predict VTE risk in specific surgery types, no validated predictive models are currently used in clinical practice to calculate VTE risk across the broader population of post-operative patients.

OBJECTIVE

The study seeks to use both static and time-varying EHR patient data (aggregated per visit) to 1) predict the VTE risk for a surgical patient within 30 days post-discharge using machine learning (ML); 2) stratify the VTE risk of the patient into high, medium, and low-risk categories; 3) compare the performance of individual department-specific ML models to that of a unified model; and 4) determine which variables are most strongly associated with postoperative VTE.

METHODS

Structured EHR data from post-operative patients in our multi-center hospital system between 2013 and 2019 were used. Various machine learning algorithms, including linear regression with L1/L2 regularization, random forest, and eXtreme Gradient Boosting (XGBoost), were evaluated. Output probability thresholds were chosen to stratify the patients into low, medium, and high-risk categories. Department-specific ML models were developed, and their performance was compared with that of the unified model to determine whether constructing individual models is viable compared with a single unified model. Feature importance techniques were subsequently applied to identify the most influential features, followed by Rank Biased Overlap (RBO) analysis to quantify the overlap among the feature rankings of the department-specific models.
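
The RBO comparison of feature rankings can be illustrated with a minimal sketch. The abstract does not state which RBO variant or persistence parameter was used, so the code below assumes the extrapolated form for two equal-length rankings without duplicates and a hypothetical default of p = 0.9; the function name and the feature labels are illustrative only.

```python
def rank_biased_overlap(ranking_a, ranking_b, p=0.9):
    """Extrapolated RBO for two equal-length rankings without duplicates.

    Assumption: p (top-weightedness) defaults to 0.9; the source does not
    report the value used in the study.
    """
    if len(ranking_a) != len(ranking_b):
        raise ValueError("this sketch only handles equal-length rankings")
    if not 0.0 < p < 1.0:
        raise ValueError("p must lie strictly between 0 and 1")
    k = len(ranking_a)
    if k == 0:
        return 0.0

    seen_a, seen_b = set(), set()
    overlap = 0          # size of the intersection of the two prefixes
    weighted_sum = 0.0   # running sum of (overlap / depth) * p**depth

    for depth, (a, b) in enumerate(zip(ranking_a, ranking_b), start=1):
        if a == b:
            overlap += 1
        else:
            # Each new item counts if the other list has already shown it.
            overlap += (a in seen_b) + (b in seen_a)
        seen_a.add(a)
        seen_b.add(b)
        weighted_sum += (overlap / depth) * p ** depth

    # Extrapolate the agreement at depth k to the unseen tail.
    return (overlap / k) * p ** k + ((1 - p) / p) * weighted_sum


# Hypothetical top-3 feature rankings from two department-specific models.
dept_one = ["LOS", "BUN", "WBC"]
dept_two = ["LOS", "WBC", "BUN"]
score = rank_biased_overlap(dept_one, dept_two)
```

A score of 1 means the two rankings are identical and 0 means they share no features; values of p closer to 1 give more weight to agreement further down the lists.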

RESULTS

Our findings demonstrate the effectiveness of ML models in predicting and categorizing VTE risk among post-operative patients. The top-performing ML model achieved an F1 score of 0.76 with an area under the receiver operating characteristic curve (AUROC) of 0.83 and an area under the precision-recall curve (AUPRC) of 0.89. An ML output probability below 0.3 was used to designate patients as low risk, while probabilities exceeding 0.7 indicated high risk. Notably, department-specific models failed to surpass the performance of the unified model. Feature importance analysis highlighted the length of stay (LOS) as the most influential predictor, followed by laboratory results such as blood urea nitrogen (BUN) and white blood cell count (WBC), vital signs such as heart rate and temperature, and the patient’s age and body mass index (BMI). Our model integrates the dynamic aspects of a patient’s condition, incorporating changes in laboratory values and vital signs into its assessments. This represents a critical improvement over static models, which fail to account for the fluctuating clinical features of patients.
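
The threshold-based stratification described above can be sketched as follows. This is an illustration rather than the study's implementation: the function name is hypothetical, and assigning probabilities of exactly 0.3 or 0.7 to the medium category is an assumption drawn from the wording "less than 0.3" and "exceeding 0.7".

```python
def stratify_vte_risk(probability, low_threshold=0.3, high_threshold=0.7):
    """Map a model output probability to a low, medium, or high risk label.

    Assumption: boundary values (exactly 0.3 or 0.7) fall in the medium
    category; the source only specifies "< 0.3" and "> 0.7".
    """
    if not 0.0 <= probability <= 1.0:
        raise ValueError("probability must be between 0 and 1")
    if probability < low_threshold:
        return "low"
    if probability > high_threshold:
        return "high"
    return "medium"


# Hypothetical model outputs for five patients.
example_probabilities = [0.10, 0.30, 0.50, 0.70, 0.90]
labels = [stratify_vte_risk(prob) for prob in example_probabilities]
```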

CONCLUSIONS

We have demonstrated the utility of machine learning models in predicting and stratifying VTE risk among post-operative patients. Accurate risk assessment plays a pivotal role in VTE prevention, enabling the development of personalized VTE prophylaxis strategies and monitoring plans. Our research marks the initial phase in creating a decision support tool to provide automated risk assessments, guiding tailored screening and prophylaxis approaches for individual patients.

Publisher

JMIR Publications Inc.
