Improving speech depression detection using transfer learning with wav2vec 2.0 in low-resource environments-Reference-Cited by-同舟云学术

Improving speech depression detection using transfer learning with wav2vec 2.0 in low-resource environments

Published:2024-04-25 Issue:1 Volume:14 Page:
ISSN:2045-2322
Container-title:Scientific Reports
language:en
Short-container-title:Sci Rep

Author:

Zhang Xu,Zhang Xiangcheng,Chen Weisi,Li Chenlong,Yu Chengyuan

Abstract

AbstractDepression, a pervasive global mental disorder, profoundly impacts daily lives. Despite numerous deep learning studies focused on depression detection through speech analysis, the shortage of annotated bulk samples hampers the development of effective models. In response to this challenge, our research introduces a transfer learning approach for detecting depression in speech, aiming to overcome constraints imposed by limited resources. In the context of feature representation, we obtain depression-related features by fine-tuning wav2vec 2.0. By integrating 1D-CNN and attention pooling structures, we generate advanced features at the segment level, thereby enhancing the model's capability to capture temporal relationships within audio frames. In the realm of prediction results, we integrate LSTM and self-attention mechanisms. This incorporation assigns greater weights to segments associated with depression, thereby augmenting the model's discernment of depression-related information. The experimental results indicate that our model has achieved impressive F1 scores, reaching 79% on the DAIC-WOZ dataset and 90.53% on the CMDC dataset. It outperforms recent baseline models in the field of speech-based depression detection. This provides a promising solution for effective depression detection in low-resource environments.

Publisher

Springer Science and Business Media LLC

Link

https://www.nature.com/articles/s41598-024-60278-1.pdf

Reference46 articles.

1. World Health Organization. Depression and other common mental disorders: global health estimates. World Health Organization. (2017).

2. World Health Organization. Depression: Overview, Impact and Response. https://www.who.int/health-topics/depression.(2020).

3. Evans-Lacko, S. et al. Socio-economic variations in the mental health treatment gap for people with anxiety, mood, and substance use disorders: Results from the WHO World Mental Health (WMH) surveys. Psychol. Med. 48(9), 1560–1571 (2018).

4. Herrman, H. et al. Time for united action on depression: A Lancet-World Psychiatric Association Commission. Lancet. 399(10328), 957–1022 (2022).

5. Dumpala, S. H. et al. Manifestation of depression in speech overlaps with characteristics used to represent and recognize speaker identity. Sci. Rep. 13, 11155 (2023).