Author:
Chlasta Karol,Wołk Krzysztof
Abstract
Dementia, a prevalent disorder of the brain, has negative effects on individuals and society. This paper concerns using Spontaneous Speech (ADReSS) Challenge of Interspeech 2020 to classify Alzheimer's dementia. We used (1) VGGish, a deep, pretrained, Tensorflow model as an audio feature extractor, and Scikit-learn classifiers to detect signs of dementia in speech. Three classifiers (LinearSVM, Perceptron, 1NN) were 59.1% accurate, which was 3% above the best-performing baseline models trained on the acoustic features used in the challenge. We also proposed (2) DemCNN, a new PyTorch raw waveform-based convolutional neural network model that was 63.6% accurate, 7% more accurate then the best-performing baseline linear discriminant analysis model. We discovered that audio transfer learning with a pretrained VGGish feature extractor performs better than the baseline approach using automatically extracted acoustic features. Our DepCNN exhibits good generalization capabilities. Both methods presented in this paper offer progress toward new, innovative, and more effective computer-based screening of dementia through spontaneous speech.
Reference39 articles.
1. Tensorflow: a system for large-scale machine learning,;Abadi,2016
2. Abu-El-HaijaS.
KothariN.
LeeJ.
NatsevP.
TodericiG.
VaradarajanB.
Youtube-8m: a large-scale video classification benchmark. arXiv preprint arXiv:1609.086752016
3. Early diagnosis of Alzheimer's type dementia using continuous speech recognition,;Baldas,2010
4. Epidemiology of multimorbidity and implications for health care, research, and medical education: a cross-sectional study;Barnett;Lancet,2012
5. Google colaboratory,;Bisong,2019
Cited by
19 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献