Diagnostic accuracy of deep learning using speech samples in depression: a systematic review and meta-analysis

Author:

Liu Lidan1,Liu Lu1,Wafa Hatem A1,Tydeman Florence1,Xie Wanqing234,Wang Yanzhong1ORCID

Affiliation:

1. Department of Population Health Sciences, School of Life Course and Population Sciences, Faculty of Life Sciences & Medicine, King's College London , London, SE1 1UL, United Kingdom

2. Department of Intelligent Medical Engineering, School of Biomedical Engineering, Anhui Medical University , Hefei, 230032, China

3. Department of Psychology, School of Mental Health and Psychological Sciences, Anhui Medical University , Hefei, 230032, China

4. Beth Israel Deaconess Medical Center, Harvard Medical School, Harvard University , Boston, MA, 02115, United States

Abstract

Abstract Objective This study aims to conduct a systematic review and meta-analysis of the diagnostic accuracy of deep learning (DL) using speech samples in depression. Materials and Methods This review included studies reporting diagnostic results of DL algorithms in depression using speech data, published from inception to January 31, 2024, on PubMed, Medline, Embase, PsycINFO, Scopus, IEEE, and Web of Science databases. Pooled accuracy, sensitivity, and specificity were obtained by random-effect models. The diagnostic Precision Study Quality Assessment Tool (QUADAS-2) was used to assess the risk of bias. Results A total of 25 studies met the inclusion criteria and 8 of them were used in the meta-analysis. The pooled estimates of accuracy, specificity, and sensitivity for depression detection models were 0.87 (95% CI, 0.81-0.93), 0.85 (95% CI, 0.78-0.91), and 0.82 (95% CI, 0.71-0.94), respectively. When stratified by model structure, the highest pooled diagnostic accuracy was 0.89 (95% CI, 0.81-0.97) in the handcrafted group. Discussion To our knowledge, our study is the first meta-analysis on the diagnostic performance of DL for depression detection from speech samples. All studies included in the meta-analysis used convolutional neural network (CNN) models, posing problems in deciphering the performance of other DL algorithms. The handcrafted model performed better than the end-to-end model in speech depression detection. Conclusions The application of DL in speech provided a useful tool for depression detection. CNN models with handcrafted acoustic features could help to improve the diagnostic performance. Protocol registration The study protocol was registered on PROSPERO (CRD42023423603).

Funder

King’s College London—China Scholarship Council

Publisher

Oxford University Press (OUP)

Reference62 articles.

1. Epidemiology, burden, and disability in depression and anxiety;Lepine;J Clin Psychiatry,2001

2. The burden of depression and anxiety in general medicine;Lecrubier;J Clin Psychiatry,2021

3. Diagnostic and statistical manual of mental disorders: DSM-5;Guha;Ref Rev,2014

4. Automated assessment of psychiatric disorders using speech: a systematic review;Low;Laryngoscope Investig Otolaryngol,2020

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3