A brain-to-text framework of decoding natural tonal sentences

Published: 2024-03-18

Author:

Zhang Daohan, Wang Zhenjie, Qian Youkun, Zhao Zehao, Liu Yan, Hao Xiaotao, Li Wanxin, Lu Shuo, Zhu Honglin, Chen Luyao, Xu Kunyu, Li Yuanning, Lu Junfeng

Abstract

Speech brain-computer interfaces (BCIs) directly translate brain activity into speech sound and text, yet decoding tonal languages like Mandarin Chinese poses a significant, unexplored challenge. Despite successful cases in non-tonal languages, Mandarin is harder to decode because of its distinct syllabic structures and because lexical tones carry pivotal lexical information. Here we designed a brain-to-text framework to decode Mandarin tonal sentences from invasive neural recordings. Our modular approach separately decodes speech onset, base syllables, and lexical tones, then integrates them with contextual information through Bayesian likelihood and the Viterbi decoder. The results demonstrate accurate tone and syllable decoding despite the variability of continuous naturalistic speech production, surpassing previous intracranial Mandarin tonal syllable decoders in decoding accuracy. We also verified the robustness of our decoding framework and showed that the model hyperparameters can be generalized across participants who vary in gender, age, educational background, pronunciation behavior, and electrode coverage. Our pilot study sheds light on the feasibility of more generalizable brain-to-text decoding of natural tonal sentences from highly heterogeneous patients.
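The integration step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`emission_scores`, `viterbi`), the back-off constant `FLOOR`, the bigram transition table, and all probabilities below are hypothetical toy values. The sketch also assumes that the syllable and tone classifier outputs are conditionally independent, so their log-likelihoods can be summed, and it leaves out speech onset detection entirely.

```python
import math

FLOOR = math.log(1e-6)  # assumed back-off log-probability for unseen transitions


def emission_scores(syllable_probs, tone_probs, states):
    """Log-likelihood of each (syllable, tone) state at one position,
    assuming the two classifier outputs are conditionally independent."""
    return {
        (syl, tone): math.log(syllable_probs[syl]) + math.log(tone_probs[tone])
        for syl, tone in states
    }


def viterbi(emissions, log_trans, log_prior):
    """Return the most likely state sequence given per-position emission
    log-likelihoods, bigram transition log-probabilities, and a log prior."""
    states = list(log_prior)
    score = {s: log_prior[s] + emissions[0][s] for s in states}
    backpointers = []
    for em in emissions[1:]:
        new_score, pointer = {}, {}
        for cur in states:
            # Best predecessor for `cur` under the running score plus transition.
            prev = max(states, key=lambda p: score[p] + log_trans.get((p, cur), FLOOR))
            new_score[cur] = score[prev] + log_trans.get((prev, cur), FLOOR) + em[cur]
            pointer[cur] = prev
        score = new_score
        backpointers.append(pointer)
    # Trace back from the best final state.
    path = [max(states, key=lambda s: score[s])]
    for pointer in reversed(backpointers):
        path.append(pointer[path[-1]])
    return path[::-1]


# Toy example (made-up numbers): the tone classifier slightly prefers tone 4
# for the second syllable, but the bigram context favors "ni3 hao3".
states = [("ni", 3), ("hao", 3), ("hao", 4)]
log_prior = {s: math.log(1 / 3) for s in states}
log_trans = {
    (("ni", 3), ("hao", 3)): math.log(0.8),
    (("ni", 3), ("hao", 4)): math.log(0.1),
    (("ni", 3), ("ni", 3)): math.log(0.1),
}
emissions = [
    emission_scores({"ni": 0.9, "hao": 0.1}, {3: 0.8, 4: 0.2}, states),
    emission_scores({"ni": 0.1, "hao": 0.9}, {3: 0.45, 4: 0.55}, states),
]

greedy = [max(em, key=em.get) for em in emissions]   # per-position argmax
decoded = viterbi(emissions, log_trans, log_prior)   # context-aware decoding
print(greedy, decoded)
```

In this toy case the per-position argmax picks tone 4 for the second syllable, while the Viterbi path uses the transition prior to recover tone 3. That is the kind of correction contextual information is meant to provide when tone evidence is ambiguous.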

Publisher

Cold Spring Harbor Laboratory

References (51 articles)

1. Gilakjani, A. P. & Ahmadi, M. R. A study of factors affecting EFL learners’ English listening comprehension and the strategies for improvement. (2011).

2. Speech synthesis from neural decoding of spoken sentences

3. Real-time decoding of question-and-answer speech dialogue using human cortical activity

4. Machine translation of cortical activity to text with an encoder–decoder framework

5. Neuroprosthesis for Decoding Speech in a Paralyzed Person with Anarthria