Maximizing Interpretability and Cost-Effectiveness of Surgical Site Infection (SSI) Predictive Models Using Feature-Specific Regularized Logistic Regression on Preoperative Temporal Data

Author:

Kocbek Primoz1ORCID,Fijacko Nino1,Soguero-Ruiz Cristina23,Mikalsen Karl Øyvind34,Maver Uros5ORCID,Povalej Brzan Petra16ORCID,Stozer Andraz5ORCID,Jenssen Robert37,Skrøvseth Stein Olav48,Stiglic Gregor16

Affiliation:

1. Faculty of Health Sciences, University of Maribor, Maribor 2000, Slovenia

2. Department of Signal Theory and Communications, Telematics and Computing, Universidad Rey Juan Carlos, Fuenlabrada 28943, Spain

3. UiT Machine Learning Group, UiT the Arctic University of Norway, Tromsø 9037, Norway

4. Department of Mathematics and Statistics, UiT the Arctic University of Norway, Tromsø 9037, Norway

5. Faculty of Medicine, University of Maribor, Maribor 2000, Slovenia

6. Faculty of Electrical Engineering and Computer Science, University of Maribor, Maribor 2000, Slovenia

7. Department of Physics and Technology, UiT the Arctic University of Norway, Tromsø 9037, Norway

8. Norwegian Centre for E-health Research, University Hospital of North Norway, Tromsø 9037, Norway

Abstract

This study describes a novel approach to solve the surgical site infection (SSI) classification problem. Feature engineering has traditionally been one of the most important steps in solving complex classification problems, especially in cases with temporal data. The described novel approach is based on abstraction of temporal data recorded in three temporal windows. Maximum likelihood L1-norm (lasso) regularization was used in penalized logistic regression to predict the onset of surgical site infection occurrence based on available patient blood testing results up to the day of surgery. Prior knowledge of predictors (blood tests) was integrated in the modelling by introduction of penalty factors depending on blood test prices and an early stopping parameter limiting the maximum number of selected features used in predictive modelling. Finally, solutions resulting in higher interpretability and cost-effectiveness were demonstrated. Using repeated holdout cross-validation, the baseline C-reactive protein (CRP) classifier achieved a mean AUC of 0.801, whereas our best full lasso model achieved a mean AUC of 0.956. Best model testing results were achieved for full lasso model with maximum number of features limited at 20 features with an AUC of 0.967. Presented models showed the potential to not only support domain experts in their decision making but could also prove invaluable for improvement in prediction of SSI occurrence, which may even help setting new guidelines in the field of preoperative SSI prevention and surveillance.

Funder

Javna Agencija za Raziskovalno Dejavnost RS

Publisher

Hindawi Limited

Subject

Applied Mathematics,General Immunology and Microbiology,General Biochemistry, Genetics and Molecular Biology,Modeling and Simulation,General Medicine

Cited by 14 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3