Author:
Dankovičová Zuzana,Sovák Dávid,Drotár Peter,Vokorokos Liberios
Abstract
This paper addresses the processing of speech data and their utilization in a decision support system. The main aim of this work is to utilize machine learning methods to recognize pathological speech, particularly dysphonia. We extracted 1560 speech features and used these to train the classification model. As classifiers, three state-of-the-art methods were used: K-nearest neighbors, random forests, and support vector machine. We analyzed the performance of classifiers with and without gender taken into account. The experimental results showed that it is possible to recognize pathological speech with as high as a 91.3% classification accuracy.
Subject
Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science
Reference29 articles.
1. Advances in a Multimodal Approach for Dysphagia Analysis Based on Automatic Voice Analysis;Lopez-de-Ipina,2016
2. Psycho-Acoustic Evaluation of Voice;Hirano,1981
3. Clinical Measurement of Speech and Voice;Baken,2000
4. Robust and complex approach of pathological speech signal analysis
5. Understanding technology adoption in clinical care: Clinician adoption behavior of a point-of-care reminder system
Cited by
38 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献