Videomics of the Upper Aero-Digestive Tract Cancer: Deep Learning Applied to White Light and Narrow Band Imaging for Automatic Segmentation of Endoscopic Images-Reference-Cited by-同舟云学术

Videomics of the Upper Aero-Digestive Tract Cancer: Deep Learning Applied to White Light and Narrow Band Imaging for Automatic Segmentation of Endoscopic Images

Published:2022-06-01 Issue: Volume:12 Page:
ISSN:2234-943X
Container-title:Frontiers in Oncology
language:
Short-container-title:Front. Oncol.

Author:

Azam Muhammad Adeel,Sampieri Claudio,Ioppi Alessandro,Benzi Pietro,Giordano Giorgio Gregory,De Vecchi Marta,Campagnari Valentina,Li Shunlei,Guastini Luca,Paderno Alberto,Moccia Sara,Piazza Cesare,Mattos Leonardo S.,Peretti Giorgio

Abstract

IntroductionNarrow Band Imaging (NBI) is an endoscopic visualization technique useful for upper aero-digestive tract (UADT) cancer detection and margins evaluation. However, NBI analysis is strongly operator-dependent and requires high expertise, thus limiting its wider implementation. Recently, artificial intelligence (AI) has demonstrated potential for applications in UADT videoendoscopy. Among AI methods, deep learning algorithms, and especially convolutional neural networks (CNNs), are particularly suitable for delineating cancers on videoendoscopy. This study is aimed to develop a CNN for automatic semantic segmentation of UADT cancer on endoscopic images.Materials and MethodsA dataset of white light and NBI videoframes of laryngeal squamous cell carcinoma (LSCC) was collected and manually annotated. A novel DL segmentation model (SegMENT) was designed. SegMENT relies on DeepLabV3+ CNN architecture, modified using Xception as a backbone and incorporating ensemble features from other CNNs. The performance of SegMENT was compared to state-of-the-art CNNs (UNet, ResUNet, and DeepLabv3). SegMENT was then validated on two external datasets of NBI images of oropharyngeal (OPSCC) and oral cavity SCC (OSCC) obtained from a previously published study. The impact of in-domain transfer learning through an ensemble technique was evaluated on the external datasets.Results219 LSCC patients were retrospectively included in the study. A total of 683 videoframes composed the LSCC dataset, while the external validation cohorts of OPSCC and OCSCC contained 116 and 102 images. On the LSCC dataset, SegMENT outperformed the other DL models, obtaining the following median values: 0.68 intersection over union (IoU), 0.81 dice similarity coefficient (DSC), 0.95 recall, 0.78 precision, 0.97 accuracy. For the OCSCC and OPSCC datasets, results were superior compared to previously published data: the median performance metrics were, respectively, improved as follows: DSC=10.3% and 11.9%, recall=15.0% and 5.1%, precision=17.0% and 14.7%, accuracy=4.1% and 10.3%.ConclusionSegMENT achieved promising performances, showing that automatic tumor segmentation in endoscopic images is feasible even within the highly heterogeneous and complex UADT environment. SegMENT outperformed the previously published results on the external validation cohorts. The model demonstrated potential for improved detection of early tumors, more precise biopsies, and better selection of resection margins.

Publisher

Frontiers Media SA

Subject

Cancer Research,Oncology

Reference59 articles.

1. “Biologic Endoscopy”: Optimization of Upper Aerodigestive Tract Cancer Evaluation;Piazza;Curr Opin Otolaryngol Head Neck Surg,2011

2. Impact of Close and Positive Margins in Transoral Laser Microsurgery for Tis-T2 Glottic Cancer;Fiz;Front Oncol,2017

3. Narrow Band Imaging and High Definition Television in the Assessment of Laryngeal Cancer: A Prospective Study on 279 Patients;Piazza;Eur Arch Oto-Rhino-Laryngol 2009 2673,2010

4. Usefulness of Office Examination With Narrow Band Imaging for the Diagnosis of Head and Neck Squamous Cell Carcinoma and Follow-Up of Premalignant Lesions;Vilaseca;Head Neck,2017

5. Enhanced Contact Endoscopy for the Assessment of the Neoangiogenetic Changes in Precancerous and Cancerous Lesions of the Oral Cavity and Oropharynx;Carta;Eur Arch oto-rhino-laryngol,2016

Cited by 25 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Computer Vision and Videomics in Otolaryngology–Head and Neck Surgery;Otolaryngologic Clinics of North America;2024-10

2. Enhancing Oral Squamous Cell Carcinoma Detection Using Histopathological Images: A Deep Feature Fusion and Improved Haris Hawks Optimization-Based Framework;Bioengineering;2024-09-12

3. SCC-NET: Segmentation of Clinical Cancer image for Head and Neck Squamous Cell Carcinoma;2024-07-16

4. Multi‐Instance Learning for Vocal Fold Leukoplakia Diagnosis Using White Light and Narrow‐Band Imaging: A Multicenter Study;The Laryngoscope;2024-05-27

5. Laryngeal Cancer Screening During Flexible Video Laryngoscopy Using Large Computer Vision Models;Annals of Otology, Rhinology & Laryngology;2024-05-16