Inter-observer variability between readers of CT images: all for one and one for all-Reference-Cited by-同舟云学术

Inter-observer variability between readers of CT images: all for one and one for all

Published:2021-08-10 Issue:2 Volume:2 Page:105-118
ISSN:2712-8962
Container-title:Digital Diagnostics
language:
Short-container-title:Digital Diagnostics

Author:

Kulberg Nikolas S.^ORCID,Reshetnikov Roman V.^ORCID,Novik Vladimir P.^ORCID,Elizarov Alexey B.^ORCID,Gusev Maxim A.^ORCID,Gombolevskiy Victor A.^ORCID,Vladzymyrskyy Anton V.^ORCID,Morozov Sergey P.^ORCID

Abstract

BACKGROUND: The markup of medical image datasets is based on the subjective interpretation of the observed entities by radiologists. There is currently no widely accepted protocol for determining ground truth based on radiologists reports. AIM: To assess the accuracy of radiologist interpretations and their agreement for the publicly available dataset CTLungCa-500, as well as the relationship between these parameters and the number of independent readers of CT scans. MATERIALS AND METHODS: Thirty-four radiologists took part in the dataset markup. The dataset included 536 patients who were at high risk of developing lung cancer. For each scan, six radiologists worked independently to create a report. After that, an arbitrator reviewed the lesions discovered by them. The number of true-positive, false-positive, true-negative, and false-negative findings was calculated for each reader to assess diagnostic accuracy. Further, the inter-observer variability was analyzed using the percentage agreement metric. RESULTS: An increase in the number of independent readers providing CT scan interpretations leads to accuracy increase associated with a decrease in agreement. The majority of disagreements were associated with the presence of a lung nodule in a specific site of the CT scan. CONCLUSION: If arbitration is provided, an increase in the number of independent initial readers can improve their combined accuracy. The experience and diagnostic accuracy of individual readers have no bearing on the quality of a crowd-tagging annotation. At four independent readings per CT scan, the optimal balance of markup accuracy and cost was achieved.

Publisher

ECO-Vector LLC

Link

https://journals.eco-vector.com/DD/article/viewFile/60622/pdf_2

Reference20 articles.

1. Morozov SP, Kulberg NS, Gombolevsky VA, et al. Moscow Radiology Dataset CTLungCa-500. 2018. (In Russ). Available from: https://mosmed.ai/datasets/ct_lungcancer_500/

2. A simplified cluster model and a tool adapted for collaborative labeling of lung cancer CT scans

3. Methodology and tools for creating training samples for artificial intelligence systems for recognizing lung cancer on CT images

4. Improving Performance by Multiple Interpretations of Chest Radiographs: Effectiveness and Cost

5. Accuracy and Its Relationship to Experience in the Interpretation of Chest Radiographs

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Results of the work of the Reference center for diagnostic radiology with using telemedicine technology;HEALTH CARE OF THE RUSSIAN FEDERATION;2024-04-29

2. Probabilistic Modeling of Inter- and Intra-observer Variability in Medical Image Segmentation;2023 IEEE/CVF International Conference on Computer Vision (ICCV);2023-10-01

3. Volumetry versus linear diameter lung nodule measurement: an ultra-low-dose computed tomography lung cancer screening study;Digital Diagnostics;2023-04-19

4. Automated analysis of lung lesions in COVID-19: comparison of standard and low-dose CT;The Siberian Journal of Clinical and Experimental Medicine;2023-01-17

5. Machine learning technologies in CT-based diagnostics and classification of intracranial hemorrhages;Voprosy neirokhirurgii imeni N.N. Burdenko;2023