Potential Impact of an Artificial Intelligence–based Mammography Triage Algorithm on Performance and Workload in a Population-based Screening Sample

Author:

Watanabe Alyssa T12ORCID,Vu Hoanh2,Chim Chi Y2,Litt Andrew W3,Retson Tara4ORCID,Mayo Ray C5ORCID

Affiliation:

1. Department of Radiology, Keck School of Medicine, University of Southern California (USC) , Los Angeles, CA , USA

2. CureMetrix Incorporated , San Diego, CA , USA

3. Cornice Health Ventures, LLC , Miami Beach FL , USA

4. Department of Radiology, University of California San Diego (UCSD) , La Jolla, CA , USA

5. Department of Breast Imaging, University of Texas MD Anderson Cancer Center , Houston, TX , USA

Abstract

Abstract Objective To evaluate potential screening mammography performance and workload impact using a commercial artificial intelligence (AI)–based triage device in a population-based screening sample. Methods In this retrospective study, a sample of 2129 women who underwent screening mammograms were evaluated. The performance of a commercial AI-based triage device was compared with radiologists’ reports, actual outcomes, and national benchmarks using commonly used mammography metrics. Up to 5 years of follow-up examination results were evaluated in cases to establish benignity. The algorithm sorted cases into groups of “suspicious” and “low suspicion.” A theoretical workload reduction was calculated by subtracting cases triaged as “low suspicion” from the sample. Results At the default 93% sensitivity setting, there was significant improvement (P <.05) in the following triage simulation mean performance measures compared with actual outcome: 45.5% improvement in recall rate (13.4% to 7.3%; 95% CI, 6.2-8.3), 119% improvement in positive predictive value (PPV) 1 (5.3% to 11.6%; 95% CI, 9.96-13.4), 28.5% improvement in PPV2 (24.6% to 31.6%; 95% CI, 24.8-39.1), 20% improvement in sensitivity (83.3% to 100%; 95% CI, 100-100), and 7.2% improvement in specificity (87.2% to 93.5%; 95% CI, 92.4-94.5). A theoretical 62.5% workload reduction was possible. At the ultrahigh 99% sensitivity setting, a theoretical 27% workload reduction was possible. No cancers were missed by the algorithm at either sensitivity. Conclusion Artificial intelligence–based triage in this simulation demonstrated potential for significant improvement in mammography performance and predicted substantial theoretical workload reduction without any missed cancers.

Publisher

Oxford University Press (OUP)

Reference33 articles.

1. Screening mammography benefit controversies: sorting the evidence;Feig;Radiol Clin North America,2014

2. Consensus reads: the more sets of eyes interpreting a mammogram, the better for women;Hofvind;Radiology,2020

3. Sensitivity, specificity, and false negative rate for 1,682,504 screening mammography examinations from 2007-2013;Breast Cancer Surveillance Consortium (BCSC),2017

4. Accuracy of screening mammography interpretation by characteristics of radiologists;Barlow;J Natl Cancer Inst,2004

5. Diagnostic accuracy of digital screening mammography with and without computer-aided detection;Lehman;JAMA Intern Med,2015

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3