AI Detection of Glottic Neoplasm Using Voice Signals, Demographics, and Structured Medical Records

Author:

Wang Chi‐Te123ORCID,Chen Tsai‐Min45,Lee Nien‐Ting2,Fang Shih‐Hau36

Affiliation:

1. Department of Otolaryngology Head and Neck Surgery Far Eastern Memorial Hospital Taipei Taiwan

2. Center of Artificial Intelligence, Far Eastern Memorial Hospital Taipei Taiwan

3. Department of Electrical Engineering Yuan Ze University Taoyuan Taiwan

4. Graduate Program of Data Science, National Taiwan University and Academia Sinica Taipei Taiwan

5. Research Center for Information Technology Innovation, Academia Sinica Taipei Taiwan

6. Department of Electrical Engineering National Taiwan Normal University Taipei Taiwan

Abstract

ObjectiveThis study investigated whether artificial intelligence (AI) models combining voice signals, demographics, and structured medical records can detect glottic neoplasm from benign voice disorders.MethodsWe used a primary dataset containing 2–3 s of vowel “ah”, demographics, and 26 items of structured medical records (e.g., symptoms, comorbidity, smoking and alcohol consumption, vocal demand) from 60 patients with pathology‐proved glottic neoplasm (i.e., squamous cell carcinoma, carcinoma in situ, and dysplasia) and 1940 patients with benign voice disorders. The validation dataset comprised data from 23 patients with glottic neoplasm and 1331 patients with benign disorders. The AI model combined convolutional neural networks, gated recurrent units, and attention layers. We used 10‐fold cross‐validation (training–validation–testing: 8–1–1) and preserved the percentage between neoplasm and benign disorders in each fold.ResultsResults from the AI model using voice signals reached an area under the ROC curve (AUC) value of 0.631, and additional demographics increased this to 0.807. The highest AUC of 0.878 was achieved when combining voice, demographics, and medical records (sensitivity: 0.783, specificity: 0.816, accuracy: 0.815). External validation yielded an AUC value of 0.785 (voice plus demographics; sensitivity: 0.739, specificity: 0.745, accuracy: 0.745). Subanalysis showed that AI had higher sensitivity but lower specificity than human assessment (p < 0.01). The accuracy of AI detection with additional medical records was comparable with human assessment (82% vs. 83%, p = 0.78).ConclusionsVoice signal alone was insufficient for AI differentiation between glottic neoplasm and benign voice disorders, but additional demographics and medical records notably improved AI performance and approximated the prediction accuracy of humans.Level of EvidenceNA Laryngoscope, 2024

Funder

Ministry of Education

Publisher

Wiley

Reference46 articles.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3