Attentional Modulation of Hierarchical Speech Representations in a Multitalker Environment

Author:

Kiremitçi Ibrahim12,Yilmaz Özgür23,Çelik Emin12,Shahdloo Mo24,Huth Alexander G567,Çukur Tolga1237

Affiliation:

1. Neuroscience Program, Sabuncu Brain Research Center, Bilkent University, Ankara TR-06800, Turkey

2. National Magnetic Resonance Research Center (UMRAM), Bilkent University, Ankara TR-06800, Turkey

3. Department of Electrical and Electronics Engineering, Bilkent University, Ankara TR-06800, Turkey

4. Department of Experimental Psychology, Wellcome Centre for Integrative Neuroimaging, University of Oxford, Oxford OX3 9DU, UK

5. Department of Neuroscience, The University of Texas at Austin, Austin, TX 78712, USA

6. Department of Computer Science, The University of Texas at Austin, Austin, TX 78712, USA

7. Helen Wills Neuroscience Institute, University of California, Berkeley, CA 94702, USA

Abstract

Abstract Humans are remarkably adept in listening to a desired speaker in a crowded environment, while filtering out nontarget speakers in the background. Attention is key to solving this difficult cocktail-party task, yet a detailed characterization of attentional effects on speech representations is lacking. It remains unclear across what levels of speech features and how much attentional modulation occurs in each brain area during the cocktail-party task. To address these questions, we recorded whole-brain blood-oxygen-level-dependent (BOLD) responses while subjects either passively listened to single-speaker stories, or selectively attended to a male or a female speaker in temporally overlaid stories in separate experiments. Spectral, articulatory, and semantic models of the natural stories were constructed. Intrinsic selectivity profiles were identified via voxelwise models fit to passive listening responses. Attentional modulations were then quantified based on model predictions for attended and unattended stories in the cocktail-party task. We find that attention causes broad modulations at multiple levels of speech representations while growing stronger toward later stages of processing, and that unattended speech is represented up to the semantic level in parabelt auditory cortex. These results provide insights on attentional mechanisms that underlie the ability to selectively listen to a desired speaker in noisy multispeaker environments.

Funder

European Molecular Biology Organization

National Eye Institute

Publisher

Oxford University Press (OUP)

Subject

Cellular and Molecular Neuroscience,Cognitive Neuroscience

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3