Classification of dementia from spoken speech using feature selection and the bag of acoustic words model
-
Published:2024
Issue:1
Volume:4
Page:45-65
-
ISSN:2771-392X
-
Container-title:Applied Computing and Intelligence
-
language:
-
Short-container-title:ACI
Author:
Niemelä Marko1, von Bonsdorff Mikaela23, Äyrämö Sami14, Kärkkäinen Tommi1
Affiliation:
1. Faculty of Information Technology, University of Jyväskylä, Jyväskylä, Finland 2. Faculty of Sport and Health Sciences and Gerontology Research Center, University of Jyväskylä, Jyväskylä, Finland 3. Public Health Programme, Folkhälsan Research Center, Helsinki, Finland 4. Wellbeing Services County of Central Finland, Finland
Abstract
<p>Memory disorders and dementia are a central factor in the decline of functioning and daily activities in older individuals. The workload related to standardized speech tests in clinical settings has led to a growing emphasis on developing automatic machine learning techniques for analyzing naturally spoken speech. This study presented a bag of acoustic words approach for distinguishing dementia patients from control individuals based on audio speech recordings. In this approach, each individual's speech was segmented into voiced periods, and these segments were characterized by acoustic features using the open-source openSMILE library. Word histogram representations were formed from the characterized speech segments of each speaker, which were used for classifying subjects. The formation of word histograms involved a clustering phase where feature vectors were quantized. It is well-known that partitional clustering involves instability in clustering results due to the selection of starting points, which can cause variability in classification outcomes. This study aimed to address instability by utilizing robust K-spatial-medians clustering, efficient K-means$ ++ $ clustering initialization, and selecting the smallest clustering error from repeated clusterings. Additionally, the study employed feature selection based on the Wilcoxon signed-rank test to achieve computational efficiency in the methods. The results showed that it is possible to achieve a consistent 75% classification accuracy using only twenty-five features, both with the external ADReSS 2020 test data and through leave-one-subject-out cross-validation of the entire dataset. The results rank at the top compared to international research, where the same dataset and only acoustic features have been used to diagnose patients.</p>
Publisher
American Institute of Mathematical Sciences (AIMS)
Reference53 articles.
1. M. W. Bondi, D. P. Salmon, A. W. Kaszniak, The neuropsychology of dementia, In: Neuropsychological assessment of neuropsychiatric and neuromedical disorders, Oxford: Oxford University Press, 2009,159–198. 2. World Health Organization, Global action plan on the public health response to dementia 2017–2025, World Health Organization, 2017. 3. R. N. Kalaria, G. E. Maestre, R. Arizaga, R. P. Friedland, D. Galasko, K. Hall, et al., Alzheimer's disease and vascular dementia in developing countries: prevalence, management, and risk factors, Lancet Neurol., 7 (2008), 812–826. http://dx.doi.org/10.1016/S1474-4422(08)70169-8 4. T. Ngandu, J. Lehtisalo, A. Solomon, E. Levälahti, S. Ahtiluoto, R. Antikainen, et al., A 2 year multidomain intervention of diet, exercise, cognitive training, and vascular risk monitoring versus control to prevent cognitive decline in at-risk elderly people (FINGER): a randomised controlled trial, Lancet Neurol., 385 (2015), 2255–2263. http://dx.doi.org/10.1016/S0140-6736(15)60461-5 5. M. F. Folstein, S. E. Folstein, P. R. McHugh, "Mini-mental state": a practical method for grading the cognitive state of patients for the clinician, J. Psychiat. Res., 12 (1975), 189–198.
|
|