Synergizing Deep Learning-Enabled Preprocessing and Human–AI Integration for Efficient Automatic Ground Truth Generation
-
Published:2024-04-28
Issue:5
Volume:11
Page:434
-
ISSN:2306-5354
-
Container-title:Bioengineering
-
language:en
-
Short-container-title:Bioengineering
Author:
Collazo Christopher1, Vargas Ian2, Cara Brendon2, Weinheimer Carla J.3, Grabau Ryan P.2, Goldgof Dmitry1ORCID, Hall Lawrence1, Wickline Samuel A.2, Pan Hua345
Affiliation:
1. College of Engineering, University of South Florida, Tampa, FL 33620, USA 2. The Heart Institute, College of Medicine, University of South Florida, Tampa, FL 33602, USA 3. Department of Medicine, Washington University in St. Louis, St. Louis, MO 63110, USA 4. Department of Pathology & Immunology, Washington University in St. Louis, St. Louis, MO 63110, USA 5. Department of Biomedical Engineering, Washington University in St. Louis, St. Louis, MO 63130, USA
Abstract
The progress of incorporating deep learning in the field of medical image interpretation has been greatly hindered due to the tremendous cost and time associated with generating ground truth for supervised machine learning, alongside concerns about the inconsistent quality of images acquired. Active learning offers a potential solution to these problems of expanding dataset ground truth by algorithmically choosing the most informative samples for ground truth labeling. Still, this effort incurs the costs of human labeling, which needs minimization. Furthermore, automatic labeling approaches employing active learning often exhibit overfitting tendencies while selecting samples closely aligned with the training set distribution and excluding out-of-distribution samples, which could potentially improve the model’s effectiveness. We propose that the majority of out-of-distribution instances can be attributed to inconsistent cross images. Since the FDA approved the first whole-slide image system for medical diagnosis in 2017, whole-slide images have provided enriched critical information to advance the field of automated histopathology. Here, we exemplify the benefits of a novel deep learning strategy that utilizes high-resolution whole-slide microscopic images. We quantitatively assess and visually highlight the inconsistencies within the whole-slide image dataset employed in this study. Accordingly, we introduce a deep learning-based preprocessing algorithm designed to normalize unknown samples to the training set distribution, effectively mitigating the overfitting issue. Consequently, our approach significantly increases the amount of automatic region-of-interest ground truth labeling on high-resolution whole-slide images using active deep learning. We accept 92% of the automatic labels generated for our unlabeled data cohort, expanding the labeled dataset by 845%. Additionally, we demonstrate expert time savings of 96% relative to manual expert ground-truth labeling.
Reference43 articles.
1. A survey on deep learning in medical image analysis;Litjens;Med. Image Anal.,2017 2. Maki, S., Furuya, T., Inoue, M., Shiga, Y., Inage, K., Eguchi, Y., Orita, S., and Ohtori, S. (2024). Machine Learning and Deep Learning in Spinal Injury: A Narrative Review of Algorithms in Diagnosis and Prognosis. J. Clin. Med., 13. 3. Advancements in cardiac structures segmentation: A comprehensive systematic review of deep learning in CT imaging;Alnasser;Front. Cardiovasc. Med.,2024 4. Alongi, P., Arnone, A., Vultaggio, V., Fraternali, A., Versari, A., Casali, C., Arnone, G., DiMeco, F., and Vetrano, I.G. (2024). Artificial Intelligence Analysis Using MRI and PET Imaging in Gliomas: A Narrative Review. Cancers, 16. 5. Deep learning for digital pathology image analysis: A comprehensive tutorial with selected use cases;Janowczyk;J. Pathol. Inform.,2016
|
|