Perceptual Fusion Tendency of Speech Sounds

Author:

Huang Ying,Li Jingyu,Zou Xuefei,Qu Tianshu,Wu Xihong,Mao Lihua,Wu Yanhong,Li Liang

Abstract

Abstract To discriminate and to recognize sound sources in a noisy, reverberant environment, listeners need to perceptually integrate the direct wave with the reflections of each sound source. It has been confirmed that perceptual fusion between direct and reflected waves of a speech sound helps listeners recognize this speech sound in a simulated reverberant environment with disrupting sound sources. When the delay between a direct sound wave and its reflected wave is sufficiently short, the two waves are perceptually fused into a single sound image as coming from the source location. Interestingly, compared with nonspeech sounds such as clicks and noise bursts, speech sounds have a much larger perceptual fusion tendency. This study investigated why the fusion tendency for speech sounds is so large. Here we show that when the temporal amplitude fluctuation of speech was artificially time reversed, a large perceptual fusion tendency of speech sounds disappeared, regardless of whether the speech acoustic carrier was in normal or reversed temporal order. Moreover, perceptual fusion of normal-order speech, but not that of time-reversed speech, was accompanied by increased coactivation of the attention-control-related, spatial-processing-related, and speech-processing-related cortical areas. Thus, speech-like acoustic carriers modulated by speech amplitude fluctuation selectively activate a cortical network for top–down modulations of speech processing, leading to an enhancement of perceptual fusion of speech sounds. This mechanism represents a perceptual-grouping strategy for unmasking speech under adverse conditions.

Publisher

MIT Press - Journals

Subject

Cognitive Neuroscience

Reference48 articles.

1. “What” and “where” in the human auditory system.;Alain;Proceedings of the National Academy of Sciences, U.S.A.,2001

2. The effect of spatial separation on informational and energetic masking of speech.;Arbogast;Journal of the Acoustical Society of America,2002

3. The effect of spatial separation on informational masking of speech in normal-hearing and hearing-impaired listeners.;Arbogast;Journal of the Acoustical Society of America,2005

4. Assessing the auditory dual-pathway model in humans.;Arnott;Neuroimage,2004

5. Human temporal lobe activation by speech and nonspeech sounds.;Binder;Cerebral Cortex,2000

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3