Evaluation of Effectiveness of Self-Supervised Learning in Chest X-Ray Imaging to Reduce Annotated Images
-
Published:2024-03-08
Issue:4
Volume:37
Page:1618-1624
-
ISSN:2948-2933
-
Container-title:Journal of Imaging Informatics in Medicine
-
language:en
-
Short-container-title:J Digit Imaging. Inform. med.
Author:
Imagawa KunikiORCID, Shiomoto Kohei
Abstract
AbstractA significant challenge in machine learning-based medical image analysis is the scarcity of medical images. Obtaining a large number of labeled medical images is difficult because annotating medical images is a time-consuming process that requires specialized knowledge. In addition, inappropriate annotation processes can increase model bias. Self-supervised learning (SSL) is a type of unsupervised learning method that extracts image representations. Thus, SSL can be an effective method to reduce the number of labeled images. In this study, we investigated the feasibility of reducing the number of labeled images in a limited set of unlabeled medical images. The unlabeled chest X-ray (CXR) images were pretrained using the SimCLR framework, and then the representations were fine-tuned as supervised learning for the target task. A total of 2000 task-specific CXR images were used to perform binary classification of coronavirus disease 2019 (COVID-19) and normal cases. The results demonstrate that the performance of pretraining on task-specific unlabeled CXR images can be maintained when the number of labeled CXR images is reduced by approximately 40%. In addition, the performance was significantly better than that obtained without pretraining. In contrast, a large number of pretrained unlabeled images are required to maintain performance regardless of task specificity among a small number of labeled CXR images. In summary, to reduce the number of labeled images using SimCLR, we must consider both the number of images and the task-specific characteristics of the target images.
Funder
Tokyo City University
Publisher
Springer Science and Business Media LLC
Reference25 articles.
1. G. Litjens, T. Kooi, B. E. Bejnordi, A. A. A. Setio, F. Ciompi, M. Ghafoorian, J. A. W. M. van der Laak, B. van Ginneken, and C. I. Sánchez. A survey on deep learning in medical image analysis. Med Image Anal, 42:60–88, Dec 2017. 2. Ryuji Hamamoto, Kruthi Suvarna, Masayoshi Yamada, Kazuma Kobayashi, Norio Shinkai, Mototaka Miyake, Masamichi Takahashi, Shunichi Jinnai, Ryo Shimoyama, Akira Sakai, Ken Takasawa, Amina Bolatkan, Kanto Shozu, Ai Dozen, Hidenori Machino, Satoshi Takahashi, Ken Asada, Masaaki Komatsu, Jun Sese, and Syuzo Kaneko. Application of artificial intelligence technology in oncology: Towards the establishment of precision medicine. Cancers, 12(12), 2020. 3. Ibrahim Tolga Öztürk, Rostislav Nedelchev, Christian Heumann, Esteban Garces Arias, Marius Roger, Bernd Bischl, and Matthias Aßenmacher. How different is stereotypical bias across languages? arXiv preprint arXiv:2307.07331, 2023. 4. Bogdan A Bercean, Andreea Birhala, Paula G Ardelean, Ioana Barbulescu, Marius M Benta, Cristina D Rasadean, Dan Costachescu, Cristian Avramescu, Andrei Tenescu, Stefan Iarca, et al. Evidence of a cognitive bias in the quantification of covid-19 with ct: an artificial intelligence randomised clinical trial. Scientific Reports, 13(1):4887, 2023. 5. J Deng, W Dong, R Socher, LJ Li, Kai Li, and Li Fei-Fei. Imagenet: A large-scale hierarchical image database. 2009 ieee conference on computer vision and pattern recognition. miami. 2009.
|
|