Statistics of natural reverberation enable perceptual separation of sound and space

Author:

Traer James,McDermott Josh H.

Abstract

In everyday listening, sound reaches our ears directly from a source as well as indirectly via reflections known as reverberation. Reverberation profoundly distorts the sound from a source, yet humans can both identify sound sources and distinguish environments from the resulting sound, via mechanisms that remain unclear. The core computational challenge is that the acoustic signatures of the source and environment are combined in a single signal received by the ear. Here we ask whether our recognition of sound sources and spaces reflects an ability to separate their effects and whether any such separation is enabled by statistical regularities of real-world reverberation. To first determine whether such statistical regularities exist, we measured impulse responses (IRs) of 271 spaces sampled from the distribution encountered by humans during daily life. The sampled spaces were diverse, but their IRs were tightly constrained, exhibiting exponential decay at frequency-dependent rates: Mid frequencies reverberated longest whereas higher and lower frequencies decayed more rapidly, presumably due to absorptive properties of materials and air. To test whether humans leverage these regularities, we manipulated IR decay characteristics in simulated reverberant audio. Listeners could discriminate sound sources and environments from these signals, but their abilities degraded when reverberation characteristics deviated from those of real-world environments. Subjectively, atypical IRs were mistaken for sound sources. The results suggest the brain separates sound into contributions from the source and the environment, constrained by a prior on natural reverberation. This separation process may contribute to robust recognition while providing information about spaces around us.

Funder

James S. McDonnell

HHS | NIH | National Institute on Deafness and Other Communication Disorders

Publisher

Proceedings of the National Academy of Sciences

Subject

Multidisciplinary

Reference57 articles.

1. Scene analysis in the natural environment;Lewicki;Front Psychol,2014

2. Sabine H (1953) Room acoustics. Trans IRE 1:4–12.

3. Frequency‐Correlation Functions of Frequency Responses in Rooms

4. Blesser B Salter L (2009) Spaces Speak, Are You Listening?: Experiencing Aural Architecture (MIT Press, Cambridge, MA).

5. Kuttruff H (2009) Room Acoustics (Spon Press, Oxon, UK), 4th ed, pp 204–251.

Cited by 90 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. End-to-End Deep Learning-Based Adaptation Control for Linear Acoustic Echo Cancellation;IEEE/ACM Transactions on Audio, Speech, and Language Processing;2024

2. Brouhaha: Multi-Task Training for Voice Activity Detection, Speech-to-Noise Ratio, and C50 Room Acoustics Estimation;2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU);2023-12-16

3. Yet Another Generative Model for Room Impulse Response Estimation;2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA);2023-10-22

4. Reverberation time control by acoustic metamaterials in a small room;Building and Environment;2023-10

5. A Two-Dimensional Threshold Test for Reverberation Time and Direct-to-Reverberant Ratio;2023 Immersive and 3D Audio: from Architecture to Automotive (I3DA);2023-09-05

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3