Cortical tracking of speakers’ formant changes predicts selective listening

Author:

Cervantes Constantino FranciscoORCID,Caputi Ángel A.

Abstract

AbstractA social scene is particularly informative when people are distinguishable. To understand somebody amid a ‘cocktail party’ chatter, we automatically index their voice. This ability is underpinned by parallel processing of vocal spectral contours from speech sounds, but it has not yet been established how this occurs in the brain’s cortex. We investigate single-trial neural tracking of slow modulations in speech formants using electroencephalography. Participants briefly listened to unfamiliar single speakers, and in addition, they performed a cocktail party comprehension task. Quantified through stimulus reconstruction methods, robust tracking was found in neural responses to slow (delta-theta range) modulations of the fourth and fifth formant band contours equivalent to the 3.5–5 KHz audible range. Instantaneous inter-formant spacing (ΔF), which also yields indexical information from the vocal tract, was similarly decodable. Moreover, EEG evidence of listeners’ spectral tracking abilities predicted their chances of succeeding at selective listening when faced with two-speaker speech mixtures. In summary, the results indicate that the communicating brain can rely on locking of cortical rhythms to major changes led by upper resonances of the vocal tract. Their corresponding articulatory mechanics hence continuously issue a fundamental credential for listeners to target in real time.Significance statementThe human voice is acoustically fingerprinted, thanks in part to the shape and function of the larynx and oral cavities as they change from person to person, and from one instant to the next. This study shows that a substantial portion of these time-varying signatures are consistently traced by cortical networks. People who are more capable of tracking these variations neurally are more likely to grasp speakers of their choice despite masking by others. Thus, the changes can be used for the brain to tell what pieces of an unfamiliar voice fit together and which ones to put apart. Solving this inference in real time represents a cornerstone to understanding communication in everyday’s social world.

Publisher

Cold Spring Harbor Laboratory

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3