An automated approach for real-time informative frames classification in laryngeal endoscopy using deep learning

Published:2024-05-02 Issue:8 Volume:281 Page:4255-4264
ISSN:0937-4477
Container-title:European Archives of Oto-Rhino-Laryngology
language:en
Short-container-title:Eur Arch Otorhinolaryngol

Author:

Baldini Chiara, Azam Muhammad Adeel, Sampieri Claudio, Ioppi Alessandro, Ruiz-Sevilla Laura, Vilaseca Isabel, Alegre Berta, Tirrito Alessandro, Pennacchi Alessia, Peretti Giorgio, Moccia Sara, Mattos Leonardo S.

Abstract

Purpose: Informative image selection in laryngoscopy has the potential to improve automatic data extraction, either on its own (for selective data storage and a faster review process) or in combination with other artificial intelligence (AI) detection or diagnosis models. This paper aims to demonstrate the feasibility of AI-based automatic selection of informative laryngoscopy frames that can also run in real time, providing visual feedback to guide the otolaryngologist during the examination.

Methods: Several deep learning models were trained and tested on an internal dataset (n = 5147 images) and then tested on an external test set (n = 646 images) composed of both white light and narrow band images. Four videos were used to assess the real-time performance of the best-performing model.

Results: ResNet-50, pre-trained with the pretext strategy, reached precision = 95% vs. 97%, recall = 97% vs. 89%, and F1-score = 96% vs. 93% on the internal and external test sets, respectively (p = 0.062). The four testing videos are provided in the supplemental materials.

Conclusion: The deep learning model demonstrated excellent performance in identifying diagnostically relevant frames within laryngoscopic videos. With its solid accuracy and real-time capabilities, the system is promising for further development in a clinical setting, either autonomously for objective quality control or in conjunction with other algorithms within a comprehensive AI toolset aimed at enhancing tumor detection and diagnosis.
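The F1-scores quoted in the Results can be cross-checked against the quoted precision and recall, since F1 is the harmonic mean of the two. The snippet below is a minimal sketch of that arithmetic only; the function name `f1_score` and the `reported` dictionary are illustrative and not taken from the paper, and the inputs are the rounded percentages from the abstract rather than the authors' raw counts.

```python
def f1_score(precision: float, recall: float) -> float:
    """Return the harmonic mean of precision and recall."""
    if precision + recall == 0:
        return 0.0  # avoid division by zero when both metrics are zero
    return 2 * precision * recall / (precision + recall)


# Rounded values as quoted in the abstract (illustrative container, not from the paper).
reported = {
    "internal": {"precision": 0.95, "recall": 0.97, "f1": 0.96},
    "external": {"precision": 0.97, "recall": 0.89, "f1": 0.93},
}

for name, metrics in reported.items():
    computed = f1_score(metrics["precision"], metrics["recall"])
    print(f"{name}: computed F1 = {computed:.3f}, reported F1 = {metrics['f1']:.2f}")
```

Rounded to two decimals, the computed values agree with the reported 96% and 93%, so the three figures given for each test set are mutually consistent.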

Funder

Università degli Studi di Genova

Publisher

Springer Science and Business Media LLC

Link

https://link.springer.com/content/pdf/10.1007/s00405-024-08676-z.pdf

References (19 articles)

1. Piazza C, Cocco D, de Benedetto L et al (2010) Narrow band imaging and high definition television in the assessment of laryngeal cancer: a prospective study on 279 patients. Eur Arch Oto-Rhino-Laryngol 267(3):409–414. https://doi.org/10.1007/S00405-009-1121-6

2. Vilaseca I, Valls-Mateus M, Nogués A et al (2017) Usefulness of office examination with narrow band imaging for the diagnosis of head and neck squamous cell carcinoma and follow-up of premalignant lesions. Head Neck 39:1854–1863. https://doi.org/10.1002/HED.24849

3. Haug CJ, Drazen JM (2023) Artificial intelligence and machine learning in clinical medicine, 2023. N Engl J Med 388:1201–1208. https://doi.org/10.1056/NEJMRA2302038

4. Sampieri C, Baldini C, Azam MA et al (2023) Artificial intelligence for upper aerodigestive tract endoscopy and laryngoscopy: a guide for physicians and state-of-the-art review. Otolaryngol Head Neck Surg 169:811–829. https://doi.org/10.1002/OHN.343

5. Galdran A, Costa P, Campilho A (2019) Real-time informative laryngoscopic frame classification with pre-trained convolutional neural networks. Proc Int Symp Biomed Imag 2019:87–90. https://doi.org/10.1109/ISBI.2019.8759511