Affiliation:
1. Artificial Intelligence Development Center Changhua Christian Hospital Changhua Taiwan
2. Department of Otorhinolaryngology, Head and Neck Surgery Changhua Christian Hospital Changhua Taiwan
3. Doctoral Program in Translational Medicine National Chung Hsing University Taichung Taiwan
4. Rong Hsing Translational Medicine Research Center National Chung Hsing University Taichung Taiwan
5. Department of Post‐Baccalaureate Medicine, College of Medicine National Chung Hsing University Taichung Taiwan
Abstract
BackgroundWhile videostroboscopy is recognized as the most popular approach for investigating vocal fold function, evaluating the numerical values, such as the membranous glottal gap area, remains too time consuming for clinical applications.MethodsWe used a total of 2507 videostroboscopy images from 137 patients and developed five U‐Net‐based deep‐learning image segmentation models for automatic masking of the membranous glottal gap area. To further validate the models, we used another 410 images from 41 different patients.ResultsDuring development, all five models exhibited acceptable and similar metrics. While the VGG19 U‐Net had a long inference time of 1654 ms, the other four models had more practical inference times, ranging from 16 to 138 ms.During further validation, Efficient U‐Net demonstrated the highest intersection over union of 0.8455, the highest Dice coefficient of 0.9163, and the lowest Hausdorff distance of 1.5626. The normalized membranous glottal gap area index was also calculated and validated. Efficient U‐Net and VGG19 U‐Net exhibited the lowest mean squared errors (3.5476 and 3.3842) and the lowest mean absolute errors (1.8835 and 1.8396).ConclusionsAutomatic segmentation of the membranous glottal gap area can be achieved through U‐net‐based architecture. Considering the segmentation quality and speed, Efficient U‐Net is a reasonable choice for this task, while the other four models remain valuable competitors. The models' masked area enables possible calculation of the normalized membranous glottal gap area and analysis of the glottal area waveform, revealing promising clinical applications for this model.Level of EvidenceNA Laryngoscope, 134:2835–2843, 2024
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献