Intelligible speech synthesis from neural decoding of spoken sentences-Reference-Cited by-同舟云学术

Intelligible speech synthesis from neural decoding of spoken sentences

Published:2018-11-29 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Anumanchipalli Gopala K.,Chartier Josh,Chang Edward F.

Abstract

AbstractThe ability to read out, or decode, mental content from brain activity has significant practical and scientific implications1. For example, technology that translates cortical activity into speech would be transformative for people unable to communicate as a result of neurological impairment2,3,4. Decoding speech from neural activity is challenging because speaking requires extremely precise and dynamic control of multiple vocal tract articulators on the order of milliseconds. Here, we designed a neural decoder that explicitly leverages the continuous kinematic and sound representations encoded in cortical activity5,6 to generate fluent and intelligible speech. A recurrent neural network first decoded vocal tract physiological signals from direct cortical recordings, and then transformed them to acoustic speech output. Robust decoding performance was achieved with as little as 25 minutes of training data. Naïve listeners were able to accurately identify these decoded sentences. Additionally, speech decoding was not only effective for audibly produced speech, but also when participants silently mimed speech. These results advance the development of speech neuroprosthetic technology to restore spoken communication in patients with disabling neurological disorders.

Publisher

Cold Spring Harbor Laboratory

Reference53 articles.

1. Brain–computer interfaces for communication and control

2. Key considerations in designing a speech brain computer interface;J Physiol Paris,2016

3. Brain–Computer Interfaces for Augmentative and Alternative Communication: A Tutorial;American journal of speech-language pathology,2018

4. Electrocorticographic representations of segmental features in continuous speech;Frontiers in human neuroscience,2015

Cited by 6 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Tracing Responsibility and Neuroprosthesis-Mediated Speech;Techno:Phil – Aktuelle Herausforderungen der Technikphilosophie;2024

2. Mouth2Audio: intelligible audio synthesis from videos with distinctive vowel articulation;International Journal of Speech Technology;2023-05-25

4. Brain-Computer Interface: Applications to Speech Decoding and Synthesis to Augment Communication;Neurotherapeutics;2022-01

5. Towards Speech Synthesis from Intracranial Signals;SpringerBriefs in Electrical and Computer Engineering;2020