Improving breast cancer diagnostics with deep learning for MRI

Author:

Witowski Jan12ORCID,Heacock Laura1ORCID,Reig Beatriu1ORCID,Kang Stella K.13ORCID,Lewin Alana1,Pysarenko Kristine1,Patel Shalin1,Samreen Naziya1,Rudnicki Wojciech4ORCID,Łuczyńska Elżbieta4,Popiela Tadeusz5ORCID,Moy Linda1267ORCID,Geras Krzysztof J.12689ORCID

Affiliation:

1. Department of Radiology, New York University Grossman School of Medicine, New York, NY 10016, USA.

2. Center for Advanced Imaging Innovation and Research, New York University, New York, NY 10016, USA.

3. Department of Population Health, New York University Grossman School of Medicine, New York NY 10016, USA.

4. Electroradiology Department, Jagiellonian University Medical College, 31-126 Kraków, Poland.

5. Chair of Radiology, Jagiellonian University Medical College, 31-501 Kraków, Poland.

6. Vilcek Institute of Graduate Biomedical Sciences, New York University Grossman School of Medicine, New York, NY 10016, USA.

7. Perlmutter Cancer Center, New York University Langone Health, New York, NY 10016, USA.

8. Center for Data Science, New York University, New York NY 10011, USA.

9. Department of Computer Science, Courant Institute of Mathematical Sciences, New York University, New York NY 10012, USA.

Abstract

Dynamic contrast-enhanced magnetic resonance imaging (DCE-MRI) has a high sensitivity in detecting breast cancer but often leads to unnecessary biopsies and patient workup. We used a deep learning (DL) system to improve the overall accuracy of breast cancer diagnosis and personalize management of patients undergoing DCE-MRI. On the internal test set ( n = 3936 exams), our system achieved an area under the receiver operating characteristic curve (AUROC) of 0.92 (95% CI: 0.92 to 0.93). In a retrospective reader study, there was no statistically significant difference ( P = 0.19) between five board-certified breast radiologists and the DL system (mean ΔAUROC, +0.04 in favor of the DL system). Radiologists’ performance improved when their predictions were averaged with DL’s predictions [mean ΔAUPRC (area under the precision-recall curve), +0.07]. We demonstrated the generalizability of the DL system using multiple datasets from Poland and the United States. An additional reader study on a Polish dataset showed that the DL system was as robust to distribution shift as radiologists. In subgroup analysis, we observed consistent results across different cancer subtypes and patient demographics. Using decision curve analysis, we showed that the DL system can reduce unnecessary biopsies in the range of clinically relevant risk thresholds. This would lead to avoiding biopsies yielding benign results in up to 20% of all patients with BI-RADS category 4 lesions. Last, we performed an error analysis, investigating situations where DL predictions were mostly incorrect. This exploratory work creates a foundation for deployment and prospective analysis of DL-based models for breast MRI.

Publisher

American Association for the Advancement of Science (AAAS)

Subject

General Medicine

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3