Author:
Martin Vincent P.,Rouas Jean-Luc,Micoulaud-Franchi Jean-Arthur,Philip Pierre,Krajewski Jarek
Abstract
This article presents research on the detection of pathologies affecting speech through automatic analysis. Voice processing has indeed been used for evaluating several diseases such as Parkinson, Alzheimer, or depression. If some studies present results that seem sufficient for clinical applications, this is not the case for the detection of sleepiness. Even two international challenges and the recent advent of deep learning techniques have still not managed to change this situation. This article explores the hypothesis that the observed average performances of automatic processing find their cause in the design of the corpora. To this aim, we first discuss and refine the concept of sleepiness related to the ground-truth labels. Second, we present an in-depth study of four corpora, bringing to light the methodological choices that have been made and the underlying biases they may have induced. Finally, in light of this information, we propose guidelines for the design of new corpora.
Reference90 articles.
1. Applying multiple classifiers and non-linear dynamics features for detecting sleepiness from speech;Krajewski;Neurocomputing,2011
2. Privacy implications of voice and speech analysis-information disclosure by inference;Kröger,2020
3. Acoustic analysis of voice and speech characteristics in early untreated parkinson's disease;Rusz;Models and Analysis of Vocal Emissions for Biomedical Applications: 7th International Workshop,2011
4. 193842
WeinerJ
HerffC
SchultzT
Speech-Based Detection of Alzheimer's Disease in Conversational German2016
5. Using ASR posterior probability and acoustic features for voice disorder classification;Tulics,2020
Cited by
8 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献