Towards reconstructing intelligible speech from the human auditory cortex-Reference-Cited by-同舟云学术

Towards reconstructing intelligible speech from the human auditory cortex

Published:2018-06-19 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Akbari Hassan,Khalighinejad Bahar,Herrero Jose L.,Mehta Ashesh D.,Mesgarani Nima

Abstract

AbstractAuditory stimulus reconstruction is a technique that finds the best approximation of the acoustic stimulus from the population of evoked neural activity. Reconstructing speech from the human auditory cortex creates the possibility of a speech neuroprosthetic to establish a direct communication with the brain and has been shown to be possible in both overt and covert conditions. However, the low quality of the reconstructed speech has severely limited the utility of this method for brain-computer interface (BCI) applications. To advance the state-of-the-art in speech neuroprosthesis, we combined the recent advances in deep learning with the latest innovations in speech synthesis technologies to reconstruct closed-set intelligible speech from the human auditory cortex. We investigated the dependence of reconstruction accuracy on linear and nonlinear (deep neural network) regression methods and the acoustic representation that is used as the target of reconstruction, including auditory spectrogram and speech synthesis parameters. In addition, we compared the reconstruction accuracy from low and high neural frequency ranges. Our results show that a deep neural network model that directly estimates the parameters of a speech synthesizer from all neural frequencies achieves the highest subjective and objective scores on a digit recognition task, improving the intelligibility by 65% over the baseline method which used linear regression to reconstruct the auditory spectrogram. These results demonstrate the efficacy of deep learning and speech synthesis algorithms for designing the next generation of speech BCI systems, which not only can restore communications for paralyzed patients but also have the potential to transform human-computer interaction technologies.

Publisher

Cold Spring Harbor Laboratory

Reference87 articles.

1. Reading a Neural Code

2. Naturalistic stimuli increase the rate and efficiency of information transmission by primary auditory afferents

3. Influence of Context and Behavior on Stimulus Reconstruction From Neural Activity in Primary Auditory Cortex

4. Reconstruction of Natural Scenes from Ensemble Responses in the Lateral Geniculate Nucleus

5. Incorporating Naturalistic Correlation Structure Improves Spectrogram Reconstruction from Neuronal Activity in the Songbird Auditory Midbrain

Cited by 9 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. High-resolution neural recordings improve the accuracy of speech decoding;Nature Communications;2023-11-06

2. High-resolution neural recordings improve the accuracy of speech decoding;2022-05-20

3. Brain-Computer Interface: Applications to Speech Decoding and Synthesis to Augment Communication;Neurotherapeutics;2022-01

4. Generalizable EEG encoding models with naturalistic audiovisual stimuli;2021-01-18

5. Automatic Speech Separation Enables Brain-Controlled Hearable Technologies;SpringerBriefs in Electrical and Computer Engineering;2021