Detection of dementia on raw voice recordings using deep learning: A Framingham Heart Study

Author:

Xue Chonghua,Karjadi Cody,Paschalidis Ioannis Ch.,Au Rhoda,Kolachalama Vijaya B.ORCID

Abstract

AbstractBackgroundIdentification of reliable, affordable, and easy-to-use strategies for detection of dementia are sorely needed. Digital technologies, such as individual voice recordings, offer an attractive modality to assess cognition but methods that could automatically analyze such data are not readily available.Methods and findingsWe used 1264 digital voice recordings of neuropsychological examinations administered to participants from the Framingham Heart Study (FHS), a community-based longitudinal observational study. The recordings were 73 minutes in duration, on average, and contained at least two speakers (participant and clinician). Of the total voice recordings, 483 were of participants with normal cognition (NC), 451 recordings were of participants with mild cognitive impairment (MCI), and 330 were of participants with dementia (DE). We developed two deep learning models (a two-level long short-term memory (LSTM) network and a convolutional neural network (CNN)), which used the raw audio recordings to classify if the recording included a participant with only NC or only DE and to differentiate between recordings corresponding to those that were non-demented (NDE (NC+MCI)) and DE. Based on 5-fold cross-validation, the LSTM model achieved a mean (±std) area under the receiver operating characteristic curve (AUC) of 0.740±0.017, mean balanced accuracy of 0.647±0.027, and mean weighted F1-score of 0.596±0.047 in predicting cases with DE from those with NC. The CNN model achieved a mean AUC of 0.805±0.027, mean balanced accuracy of 0.743±0.015, and mean weighted F1-score of 0.742±0.033 in predicting cases with DE from those with NC. For the task related to classification of participants with DE from NDE, the LSTM model achieved a mean AUC of 0.734±0.014, mean balanced accuracy of 0.675±0.013, and mean weighted F1-score of 0.671±0.015. The CNN model achieved a mean AUC of 0.746±0.021, mean balanced accuracy of 0.652±0.020, and mean weighted F1-score of 0.635±0.031 in predicting cases with DE from those who were NDE.ConclusionThis proof-of-concept study demonstrates the potential that raw audio recordings of neuropsychological testing performed on individuals recruited within a community cohort setting can facilitate dementia screening.

Publisher

Cold Spring Harbor Laboratory

Cited by 1 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3