Interpretable Computer Vision to Detect and Classify Structural Laryngeal Lesions in Digital Flexible Laryngoscopic Images

Author:

Bur Andrés M.1,Zhang Tianxiao2,Chen Xiangyu2,Kavookjian Hannah1ORCID,Kraft Shannon1,Karadaghy Omar1ORCID,Farrokhian Nathan1ORCID,Mussatto Caroline3ORCID,Penn Joseph3ORCID,Wang Guanghui4

Affiliation:

1. Department of Otolaryngology–Head and Neck Surgery University of Kansas Medical Center Kansas City KS USA

2. Department of Electrical Engineering and Computer Science University of Kansas Lawrence KS USA

3. University of Kansas School of Medicine Kansas City KS USA

4. Department of Computer Science Toronto Metropolitan University Toronto ON Canada

Abstract

AbstractObjectiveTo localize structural laryngeal lesions within digital flexible laryngoscopic images and to classify them as benign or suspicious for malignancy using state‐of‐the‐art computer vision detection models.Study DesignCross‐sectional diagnostic studySettingTertiary care voice clinicMethodsDigital stroboscopic videos, demographic and clinical data were collected from patients evaluated for a structural laryngeal lesion. Laryngoscopic images were extracted from videos and manually labeled with bounding boxes encompassing the lesion. Four detection models were employed to simultaneously localize and classify structural laryngeal lesions in laryngoscopic images. Classification accuracy, intersection over union (IoU) and mean average precision (mAP) were evaluated as measures of classification, localization, and overall performance, respectively.ResultsIn total, 8,172 images from 147 patients were included in the laryngeal image dataset. Classification accuracy was 88.5 for individual laryngeal images and increased to 92.0 when all images belonging to the same sequence (video) were considered. Mean average precision across all four detection models was 50.1 using an IoU threshold of 0.5 to determine successful localization.ConclusionResults of this study showed that deep neural network‐based detection models trained using a labeled dataset of digital laryngeal images have the potential to classify structural laryngeal lesions as benign or suspicious for malignancy and to localize them within an image. This approach provides valuable insight into which part of the image was used by the model to determine a diagnosis, allowing clinicians to independently evaluate models' predictions.

Publisher

Wiley

Subject

Otorhinolaryngology,Surgery

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3