Deep learning with uncertainty estimation for automatic tumor segmentation in PET/CT of head and neck cancers: impact of model complexity, image processing and augmentation-Reference-Cited by-同舟云学术

Deep learning with uncertainty estimation for automatic tumor segmentation in PET/CT of head and neck cancers: impact of model complexity, image processing and augmentation

Published:2024-08-30 Issue:5 Volume:10 Page:055038
ISSN:2057-1976
Container-title:Biomedical Physics & Engineering Express
language:
Short-container-title:Biomed. Phys. Eng. Express

Author:

Huynh Bao Ngoc^ORCID,Groendahl Aurora Rosvoll^ORCID,Tomic Oliver^ORCID,Liland Kristian Hovde^ORCID,Knudtsen Ingerid Skjei^ORCID,Hoebers Frank,van Elmpt Wouter,Dale Einar,Malinen Eirik^ORCID,Futsaether Cecilia Marie^ORCID

Abstract

Abstract Objective. Target volumes for radiotherapy are usually contoured manually, which can be time-consuming and prone to inter- and intra-observer variability. Automatic contouring by convolutional neural networks (CNN) can be fast and consistent but may produce unrealistic contours or miss relevant structures. We evaluate approaches for increasing the quality and assessing the uncertainty of CNN-generated contours of head and neck cancers with PET/CT as input. Approach. Two patient cohorts with head and neck squamous cell carcinoma and baseline 18F-fluorodeoxyglucose positron emission tomography and computed tomography images (FDG-PET/CT) were collected retrospectively from two centers. The union of manual contours of the gross primary tumor and involved nodes was used to train CNN models for generating automatic contours. The impact of image preprocessing, image augmentation, transfer learning and CNN complexity, architecture, and dimension (2D or 3D) on model performance and generalizability across centers was evaluated. A Monte Carlo dropout technique was used to quantify and visualize the uncertainty of the automatic contours. Main results. CNN models provided contours with good overlap with the manually contoured ground truth (median Dice Similarity Coefficient: 0.75–0.77), consistent with reported inter-observer variations and previous auto-contouring studies. Image augmentation and model dimension, rather than model complexity, architecture, or advanced image preprocessing, had the largest impact on model performance and cross-center generalizability. Transfer learning on a limited number of patients from a separate center increased model generalizability without decreasing model performance on the original training cohort. High model uncertainty was associated with false positive and false negative voxels as well as low Dice coefficients. Significance. High quality automatic contours can be obtained using deep learning architectures that are not overly complex. Uncertainty estimation of the predicted contours shows potential for highlighting regions of the contour requiring manual revision or flagging segmentations requiring manual inspection and intervention.

Funder

Kreftforeningen

Publisher

IOP Publishing

Link

https://iopscience.iop.org/article/10.1088/2057-1976/ad6dcd/pdf

Reference79 articles.

1. Interobserver variation of clinical oncologists compared to therapeutic radiographers (RTT) prostate contours on T2 weighted MRI;Adair Smith;Tech Innov Patient Support Radiat Oncol,2023

2. Training, validation, and clinical implementation of a deep-learning segmentation model for radiotherapy of loco-regional breast cancer;Almberg;Radiother. Oncol.,2022

3. Overview of the HECKTOR challenge at MICCAI 2022: automatic head and neck tumor segmentation and outcome prediction in PET/CT;Andrearczyk,2023a

4. Automatic head and neck tumor segmentation and outcome prediction relying on FDG-PET/CT images: findings from the second edition of the HECKTOR challenge;Andrearczyk;Med. Image Anal.,2023b

5. Towards a safe and efficient clinical implementation of machine learning in radiation oncology by exploring model interpretability, explainability and data-model dependency;Barragan-Montero;Phys. Med. Biol.,2022