Enhancing Accuracy in Breast Density Assessment Using Deep Learning: A Multicentric, Multi-Reader Study

Published: 2024-05-28
Volume: 14
Issue: 11
Page: 1117
ISSN: 2075-4418
Container-title: Diagnostics
Language: en

Author:

Biroš Marek¹, Kvak Daniel¹,², Dandár Jakub¹, Hrubý Robert¹, Janů Eva³, Atakhanova Anora¹, Al-antari Mugahed A.⁴

Affiliation:

1. Carebot, Ltd., 128 00 Prague, Czech Republic

2. Department of Simulation Medicine, Faculty of Medicine, Masaryk University, 625 00 Brno, Czech Republic

3. Department of Radiology, Masaryk Memorial Cancer Institute, 602 00 Brno, Czech Republic

4. Department of Artificial Intelligence and Data Science, Daeyang AI Center, Sejong University, Seoul 05006, Republic of Korea

Abstract

The evaluation of mammographic breast density, a critical indicator of breast cancer risk, is traditionally performed by radiologists through visual inspection of mammography images, using the Breast Imaging-Reporting and Data System (BI-RADS) breast density categories. However, this method is subject to substantial interobserver variability, leading to inconsistencies and potential inaccuracies in density assessment and subsequent risk estimation. To address this, we present a deep learning-based automatic detection algorithm (DLAD) designed for the automated evaluation of breast density. Our multicentric, multi-reader study draws on a diverse dataset of 122 full-field digital mammography studies (488 images in CC and MLO projections) sourced from three institutions. Two experienced radiologists conducted a retrospective analysis, establishing a ground truth for 72 mammography studies (BI-RADS class A: 18, BI-RADS class B: 43, BI-RADS class C: 7, BI-RADS class D: 4). The performance of the DLAD was then compared with that of five independent radiologists with varying levels of experience. The DLAD achieved an accuracy of 0.819 (95% CI: 0.736–0.903), an F1 score of 0.798 (0.594–0.905), a precision of 0.806 (0.596–0.896), a recall of 0.830 (0.650–0.946), and a Cohen’s kappa (κ) of 0.708 (0.562–0.841). This performance matches, and in four cases exceeds, that of the individual radiologists. The statistical analysis did not reveal a significant difference in accuracy between the DLAD and the radiologists, indicating that the model’s assessments are competitive with those of professional radiologists. These results demonstrate that the deep learning-based automatic detection algorithm can enhance the accuracy and consistency of breast density assessments, offering a reliable tool for improving breast cancer screening outcomes.
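
The metrics reported in the abstract (accuracy, precision, recall, F1 score, and Cohen's kappa, each with a 95% confidence interval) can be illustrated with a minimal Python sketch. The abstract does not state how the multi-class scores were averaged or how the intervals were computed, so macro averaging and a percentile bootstrap are assumptions made here for illustration only. The labels are a synthetic toy example, not study data, and all function names are hypothetical.

```python
import random
from collections import Counter

LABELS = ["A", "B", "C", "D"]  # BI-RADS density categories

def accuracy(y_true, y_pred):
    """Fraction of cases where the prediction equals the reference label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

def macro_prf(y_true, y_pred, labels=LABELS):
    """Macro-averaged precision, recall, and F1 (assumed averaging scheme).

    A class with an empty denominator contributes 0.0 to the average.
    """
    precisions, recalls, f1s = [], [], []
    pairs = list(zip(y_true, y_pred))
    for c in labels:
        tp = sum(t == c and p == c for t, p in pairs)
        fp = sum(t != c and p == c for t, p in pairs)
        fn = sum(t == c and p != c for t, p in pairs)
        prec = tp / (tp + fp) if tp + fp else 0.0
        rec = tp / (tp + fn) if tp + fn else 0.0
        f1 = 2 * prec * rec / (prec + rec) if prec + rec else 0.0
        precisions.append(prec)
        recalls.append(rec)
        f1s.append(f1)
    n = len(labels)
    return sum(precisions) / n, sum(recalls) / n, sum(f1s) / n

def cohen_kappa(y_true, y_pred):
    """Unweighted Cohen's kappa: (p_o - p_e) / (1 - p_e)."""
    n = len(y_true)
    p_o = accuracy(y_true, y_pred)
    true_counts, pred_counts = Counter(y_true), Counter(y_pred)
    # Chance agreement from the product of the marginal label frequencies.
    p_e = sum(true_counts[c] * pred_counts[c] for c in true_counts) / (n * n)
    if p_e == 1.0:
        return 0.0  # kappa is undefined here; this sketch returns 0.0
    return (p_o - p_e) / (1 - p_e)

def bootstrap_ci(y_true, y_pred, metric, n_boot=2000, alpha=0.05, seed=0):
    """Percentile bootstrap interval for a scalar metric (assumed CI method)."""
    rng = random.Random(seed)
    n = len(y_true)
    stats = []
    for _ in range(n_boot):
        # Resample cases with replacement, keeping label pairs together.
        idx = [rng.randrange(n) for _ in range(n)]
        stats.append(metric([y_true[i] for i in idx], [y_pred[i] for i in idx]))
    stats.sort()
    lower = stats[int((alpha / 2) * n_boot)]
    upper = stats[int((1 - alpha / 2) * n_boot) - 1]
    return lower, upper

# Synthetic toy labels for illustration only (not data from the study).
y_true = ["A", "A", "B", "B", "B", "C", "C", "D"]
y_pred = ["A", "B", "B", "B", "C", "C", "C", "D"]

acc = accuracy(y_true, y_pred)
precision, recall, f1 = macro_prf(y_true, y_pred)
kappa = cohen_kappa(y_true, y_pred)
acc_lower, acc_upper = bootstrap_ci(y_true, y_pred, accuracy)

print(f"accuracy={acc:.3f} (95% CI: {acc_lower:.3f}-{acc_upper:.3f})")
print(f"precision={precision:.3f} recall={recall:.3f} f1={f1:.3f} kappa={kappa:.3f}")
```

Because the ground-truth classes are imbalanced (18, 43, 7, and 4 studies), macro averaging weights the small classes C and D as heavily as class B. That may be one reason the reported intervals for precision, recall, and F1 are wider than the interval for accuracy, although the abstract does not confirm the averaging scheme.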

Funder

Carebot, Ltd.

Publisher

MDPI AG

Link

https://www.mdpi.com/2075-4418/14/11/1117/pdf
