Machine learning natural language processing for identifying venous thromboembolism: systematic review and meta-analysis

Author:

Lam Barbara D.12ORCID,Chrysafi Pavlina3,Chiasakul Thita4ORCID,Khosla Harshit5,Karagkouni Dimitra6,McNichol Megan7,Adamski Alys8ORCID,Reyes Nimia8,Abe Karon8,Mantha Simon9,Vlachos Ioannis S.6ORCID,Zwicker Jeffrey I.9ORCID,Patell Rushad1ORCID

Affiliation:

1. 1Division of Hematology, Department of Medicine, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA

2. 2Division of Clinical Informatics, Department of Medicine, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA

3. 3Department of Medicine, Mount Auburn Hospital, Harvard Medical School, Boston, MA

4. 4Center of Excellence in Translational Hematology, Division of Hematology, Department of Medicine, Faculty of Medicine, Chulalongkorn University and King Chulalongkorn Memorial Hospital, Thai Red Cross Society, Bangkok, Thailand

5. 5Department of Medicine, Saint Vincent Hospital, Worcester, MA

6. 6Department of Pathology, Cancer Research Institute, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA

7. 7Library Sciences, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA

8. 8Division of Blood Disorders, National Center on Birth Defects and Developmental Disabilities, Centers for Disease Control and Prevention, Atlanta, GA

9. 9Division of Hematology, Department of Medicine, Memorial Sloan Kettering Cancer Center, New York, NY

Abstract

Abstract Venous thromboembolism (VTE) is a leading cause of preventable in-hospital mortality. Monitoring VTE cases is limited by the challenges of manual medical record review and diagnosis code interpretation. Natural language processing (NLP) can automate the process. Rule-based NLP methods are effective but time consuming. Machine learning (ML)-NLP methods present a promising solution. We conducted a systematic review and meta-analysis of studies published before May 2023 that use ML-NLP to identify VTE diagnoses in the electronic health records. Four reviewers screened all manuscripts, excluding studies that only used a rule-based method. A meta-analysis evaluated the pooled performance of each study’s best performing model that evaluated for pulmonary embolism and/or deep vein thrombosis. Pooled sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) with confidence interval (CI) were calculated by DerSimonian and Laird method using a random-effects model. Study quality was assessed using an adapted TRIPOD (Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis) tool. Thirteen studies were included in the systematic review and 8 had data available for meta-analysis. Pooled sensitivity was 0.931 (95% CI, 0.881-0.962), specificity 0.984 (95% CI, 0.967-0.992), PPV 0.910 (95% CI, 0.865-0.941) and NPV 0.985 (95% CI, 0.977-0.990). All studies met at least 13 of the 21 NLP-modified TRIPOD items, demonstrating fair quality. The highest performing models used vectorization rather than bag-of-words and deep-learning techniques such as convolutional neural networks. There was significant heterogeneity in the studies, and only 4 validated their model on an external data set. Further standardization of ML studies can help progress this novel technology toward real-world implementation.

Publisher

American Society of Hematology

Reference40 articles.

1. Venous thromboembolism risk and prophylaxis in the acute hospital care setting (ENDORSE study): a multinational cross-sectional study;Cohen;Lancet,2008

2. Long-term complications of medical patients with hospital-acquired venous thromboembolism;Fanikos;Thromb Haemost,2009

3. Call to action to prevent venous thromboembolism in hospitalized patients: a policy statement from the American Heart Association;Henke;Circulation,2020

4. Agency of Healthcare Research and Quality . Chapter 4. Choose the Model to Assess VTE and Bleeding Risk. Accessed 8 April 2024. https://www.ahrq.gov/patient-safety/settings/hospital/vtguide/guide4.html.

5. The Joint Commission . Venous Thromboembolism. Accessed 8 April 2024. https://www.jointcommission.org/measurement/measures/venous-thromboembolism/.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3