Better performance of deep learning pulmonary nodule detection using chest radiography with pixel level labels in reference to computed tomography: data quality matters-Reference-Cited by-同舟云学术

Better performance of deep learning pulmonary nodule detection using chest radiography with pixel level labels in reference to computed tomography: data quality matters

Published:2024-07-10 Issue:1 Volume:14 Page:
ISSN:2045-2322
Container-title:Scientific Reports
language:en
Short-container-title:Sci Rep

Author:

Kim Jae Yong,Ryu Wi-Sun,Kim Dongmin,Kim Eun Young

Abstract

AbstractLabeling errors can significantly impact the performance of deep learning models used for screening chest radiographs. The deep learning model for detecting pulmonary nodules is particularly vulnerable to such errors, mainly because normal chest radiographs and those with nodules obscured by ribs appear similar. Thus, high-quality datasets referred to chest computed tomography (CT) are required to prevent the misclassification of nodular chest radiographs as normal. From this perspective, a deep learning strategy employing chest radiography data with pixel-level annotations referencing chest CT scans may improve nodule detection and localization compared to image-level labels. We trained models using a National Institute of Health chest radiograph-based labeling dataset and an AI-HUB CT-based labeling dataset, employing DenseNet architecture with squeeze-and-excitation blocks. We developed four models to assess whether CT versus chest radiography and pixel-level versus image-level labeling would improve the deep learning model’s performance to detect nodules. The models' performance was evaluated using two external validation datasets. The AI-HUB dataset with image-level labeling outperformed the NIH dataset (AUC 0.88 vs 0.71 and 0.78 vs. 0.73 in two external datasets, respectively; both p < 0.001). However, the AI-HUB data annotated at the pixel level produced the best model (AUC 0.91 and 0.86 in external datasets), and in terms of nodule localization, it significantly outperformed models trained with image-level annotation data, with a Dice coefficient ranging from 0.36 to 0.58. Our findings underscore the importance of accurately labeled data in developing reliable deep learning algorithms for nodule detection in chest radiography.

Funder

Gil Medical Center, Gachon University

Publisher

Springer Science and Business Media LLC

Link

https://www.nature.com/articles/s41598-024-66530-y.pdf

Reference45 articles.

1. Mortality, G. B. D., Causes of Death C. Global, regional, and national life expectancy, all-cause mortality, and cause-specific mortality for 249 causes of death, 1980–2015: A systematic analysis for the Global Burden of Disease Study 2015. Lancet 388, 1459–1544 (2016).

2. Brogdon, B. G., Kelsey, C. A. & Moseley, R. D. Jr. Factors affecting perception of pulmonary lesions. Radiol. Clin. North Am. 21, 633–654 (1983).

3. Forrest, J. V. & Friedman, P. J. Radiologic errors in patients with lung cancer. West J Med. 134, 485–490 (1981).

4. Levin, D. C., Rao, V. M., Parker, L. & Frangos, A. J. Analysis of radiologists’ imaging workload trends by place of service. J. Am. Coll. Radiol. 10, 760–763 (2013).

5. Bhargavan, M., Kaye, A. H., Forman, H. P. & Sunshine, J. H. Workload of radiologists in United States in 2006–2007 and trends since 1991–1992. Radiology. 252, 458–467 (2009).