Categorical encoding of voice in human superior temporal cortex-Reference-Cited by-同舟云学术

Categorical encoding of voice in human superior temporal cortex

Published:2021-11-23 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Rupp Kyle,Hect Jasmine^ORCID,Remick Madison,Ghuman Avniel^ORCID,Chandrasekaran Bharath^ORCID,Holt Lori L.^ORCID,Abel Taylor J.^ORCID

Abstract

AbstractThe ability to recognize abstract features of voice during auditory perception is a complex, yet poorly understood, feat of human audition. For the listener, this occurs in near-automatic fasion to seamlessly extract complex cues from a highly variable auditory signal. Voice perception depends on specialized regions of auditory cortex, including superior temporal gyrus (STG) and superior temporal sulcus (STS). However, the nature of voice encoding at the cortical level remains poorly understoood. We leverage intracerebral recordings across human auditory cortex during presentation of voice and non-voice acoustic stimuli to examine voice encoding in auditory cortex, in eight patient-participants undergoing epilepsy surgery evaluation. We show that voice-selectivity increases along the auditory hierarchy from supratemporal plane (STP) to the STG and STS. Results show accurate decoding of vocalizations from human auditory cortical activity even in the complete absence of linguistic content. These findings show an early, less-selective temporal window of neural activity in the STG and STS followed by a sustained, strongly voice-selective window. We then developed encoding models that demonstrate divergence in the encoding of acoustic features along the auditory hierarchy, wherein STG/STS responses were best explained by voice category as opposed to the acoustic features of voice stimuli. This is in contrast to neural activity recorded from STP, in which responses were accounted for by acoustic features. These findings support a model of voice perception that engages categorical encoding mechanisms within STG and STS.Significance StatementVoice perception occurs via specialized networks in higher order auditory cortex, yet how voice features are encoded remains a central unanswered question. With human intracerebral recordings of auditory cortex, we provide evidence for categorical encoding of voice in STG and STS and occurs in the absence of linguistic content. This selectivity strengthens after an initial onset response and cannot be explained by simple acoustic features. Together, these data support the existence of sites within STG and STS that are specialized for voice perception.

Publisher

Cold Spring Harbor Laboratory

Reference50 articles.

1. The Primate Cortical Auditory System and Neural Representation of Conspecific Vocalizations

2. S. R. Mathias , K. von Kriegstein , “Voice Processing and Voice-Identity Recognition” in Timbre: Acoustics, Perception, and Cognition, Springer Handbook of Auditory Research., K. Siedenburg , C. Saitis , S. McAdams , A. N. Popper , R. R. Fay , Eds. (Springer International Publishing, 2019), pp. 175–209.

3. Newborn and fetal response to maternal voice

4. P. K. Kuhl , Early language acquisition: Cracking the speech code. Nat. Rev. Neurosci. (2004) https://doi.org/10.1038/nrn1533.

5. J. M. Zarate , X. Tian , K. J. P. Woods , D. Poeppel , Multiple levels of linguistic and paralinguistic features contribute to voice recognition. Sci. Rep. (2015) https://doi.org/10.1038/srep11475.