Decoding articulatory and phonetic components of naturalistic continuous speech from the distributed language network-Reference-Cited by-同舟云学术

Decoding articulatory and phonetic components of naturalistic continuous speech from the distributed language network

Published:2023-08-01 Issue:4 Volume:20 Page:046030
ISSN:1741-2560
Container-title:Journal of Neural Engineering
language:
Short-container-title:J. Neural Eng.

Author:

Thomas Tessy M^ORCID,Singh Aditya,Bullock Latané P,Liang Daniel,Morse Cale W,Scherschligt Xavier^ORCID,Seymour John P,Tandon Nitin

Abstract

Abstract Objective. The speech production network relies on a widely distributed brain network. However, research and development of speech brain–computer interfaces (speech-BCIs) has typically focused on decoding speech only from superficial subregions readily accessible by subdural grid arrays—typically placed over the sensorimotor cortex. Alternatively, the technique of stereo-electroencephalography (sEEG) enables access to distributed brain regions using multiple depth electrodes with lower surgical risks, especially in patients with brain injuries resulting in aphasia and other speech disorders. Approach. To investigate the decoding potential of widespread electrode coverage in multiple cortical sites, we used a naturalistic continuous speech production task. We obtained neural recordings using sEEG from eight participants while they read aloud sentences. We trained linear classifiers to decode distinct speech components (articulatory components and phonemes) solely based on broadband gamma activity and evaluated the decoding performance using nested five-fold cross-validation. Main Results. We achieved an average classification accuracy of 18.7% across 9 places of articulation (e.g. bilabials, palatals), 26.5% across 5 manner of articulation (MOA) labels (e.g. affricates, fricatives), and 4.81% across 38 phonemes. The highest classification accuracies achieved with a single large dataset were 26.3% for place of articulation, 35.7% for MOA, and 9.88% for phonemes. Electrodes that contributed high decoding power were distributed across multiple sulcal and gyral sites in both dominant and non-dominant hemispheres, including ventral sensorimotor, inferior frontal, superior temporal, and fusiform cortices. Rather than finding a distinct cortical locus for each speech component, we observed neural correlates of both articulatory and phonetic components in multiple hubs of a widespread language production network. Significance. These results reveal the distributed cortical representations whose activity can enable decoding speech components during continuous speech through the use of this minimally invasive recording method, elucidating language neurobiology and neural targets for future speech-BCIs.

Funder

National Institute of Neurological Disorders and Stroke

Publisher

IOP Publishing

Subject

Cellular and Molecular Neuroscience,Biomedical Engineering

Link

https://iopscience.iop.org/article/10.1088/1741-2552/ace9fb/pdf

Reference41 articles.

1. Classification of intended phoneme production from chronic intracortical microelectrode recordings in speech motor cortex;Brumberg;Front. Neurosci.,2011

2. Direct classification of all American English phonemes using signals from functional speech motor cortex;Mugler;J. Neural Eng.,2014

3. Differential representation of articulatory gestures and phonemes in precentral and inferior frontal gyri;Mugler;J. Neurosci.,2018