Comparison and interpretability of machine learning models to predict severity of chest injury

Author:

Kulshrestha Sujay12ORCID,Dligach Dmitriy345,Joyce Cara34,Gonzalez Richard12,O’Rourke Ann P6,Glazer Joshua M7,Stey Anne8,Kruser Jacqueline M9,Churpek Matthew M9,Afshar Majid9

Affiliation:

1. Burn and Shock Trauma Research Institute, Loyola University Chicago, Maywood, Illinois, USA

2. Department of Surgery, Loyola University Medical Center, Maywood, Illinois, USA

3. Center for Health Outcomes and Informatics Research, Health Sciences Division, Loyola University Chicago, Maywood, Illinois, USA

4. Department of Public Health Sciences, Stritch School of Medicine, Loyola University Chicago, Maywood, Illinois, USA

5. Department of Computer Science, Loyola University Chicago, Chicago, Illinois, USA

6. Department of Surgery, University of Wisconsin, Madison, Wisconsin, USA

7. Department of Emergency Medicine, University of Wisconsin, Madison, Wisconsin, USA

8. Department of Surgery, Northwestern University, Chicago, Illinois, USA

9. Department of Medicine, University of Wisconsin, Madison, Wisconsin, USA

Abstract

Abstract Objective Trauma quality improvement programs and registries improve care and outcomes for injured patients. Designated trauma centers calculate injury scores using dedicated trauma registrars; however, many injuries arrive at nontrauma centers, leaving a substantial amount of data uncaptured. We propose automated methods to identify severe chest injury using machine learning (ML) and natural language processing (NLP) methods from the electronic health record (EHR) for quality reporting. Materials and Methods A level I trauma center was queried for patients presenting after injury between 2014 and 2018. Prediction modeling was performed to classify severe chest injury using a reference dataset labeled by certified registrars. Clinical documents from trauma encounters were processed into concept unique identifiers for inputs to ML models: logistic regression with elastic net (EN) regularization, extreme gradient boosted (XGB) machines, and convolutional neural networks (CNN). The optimal model was identified by examining predictive and face validity metrics using global explanations. Results Of 8952 encounters, 542 (6.1%) had a severe chest injury. CNN and EN had the highest discrimination, with an area under the receiver operating characteristic curve of 0.93 and calibration slopes between 0.88 and 0.97. CNN had better performance across risk thresholds with fewer discordant cases. Examination of global explanations demonstrated the CNN model had better face validity, with top features including “contusion of lung” and “hemopneumothorax.” Discussion The CNN model featured optimal discrimination, calibration, and clinically relevant features selected. Conclusion NLP and ML methods to populate trauma registries for quality analyses are feasible.

Funder

National Institutes of Health

NIH

National Institute of General Medical Sciences

EarlySense

National Library of Medicine of the National Institutes of Health

NIH National Institute on Alcohol Abuse and Alcoholism

Publisher

Oxford University Press (OUP)

Subject

Health Informatics

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3