A Binaural Grouping Model for Predicting Speech Intelligibility in Multitalker Environments-Reference-Cited by-同舟云学术

A Binaural Grouping Model for Predicting Speech Intelligibility in Multitalker Environments

Published:2016-01 Issue: Volume:20 Page:233121651666991
ISSN:2331-2165
Container-title:Trends in Hearing
language:en
Short-container-title:Trends in Hearing

Author:

Mi Jing¹,Colburn H. Steven¹

Affiliation:

1. Boston University, Boston, MA, USA

Abstract

Spatially separating speech maskers from target speech often leads to a large intelligibility improvement. Modeling this phenomenon has long been of interest to binaural-hearing researchers for uncovering brain mechanisms and for improving signal-processing algorithms in hearing-assistive devices. Much of the previous binaural modeling work focused on the unmasking enabled by binaural cues at the periphery, and little quantitative modeling has been directed toward the grouping or source-separation benefits of binaural processing. In this article, we propose a binaural model that focuses on grouping, specifically on the selection of time-frequency units that are dominated by signals from the direction of the target. The proposed model uses Equalization-Cancellation (EC) processing with a binary decision rule to estimate a time-frequency binary mask. EC processing is carried out to cancel the target signal and the energy change between the EC input and output is used as a feature that reflects target dominance in each time-frequency unit. The processing in the proposed model requires little computational resources and is straightforward to implement. In combination with the Coherence-based Speech Intelligibility Index, the model is applied to predict the speech intelligibility data measured by Marrone et al. The predicted speech reception threshold matches the pattern of the measured data well, even though the predicted intelligibility improvements relative to the colocated condition are larger than some of the measured data, which may reflect the lack of internal noise in this initial version of the model.

Publisher

SAGE Publications

Subject

Speech and Hearing,Otorhinolaryngology

Link

http://journals.sagepub.com/doi/pdf/10.1177/2331216516669919

Reference39 articles.

1. The effect of spatial separation on informational masking of speech in normal-hearing and hearing-impaired listeners

2. Object continuity enhances selective auditory attention

3. Prediction of speech intelligibility in spatial noise and reverberation for normal-hearing and hearing-impaired listeners

4. Revision, extension, and evaluation of a binaural speech intelligibility model

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Spectro-Temporal Post-Filtering Via Short-Time Target Cancellation for Directional Speech Enhancement in a Dual-Microphone Hearing AID;ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2023-06-04

2. Using a blind EC mechanism for modelling the interaction between binaural and temporal speech processing;Acta Acustica;2022

3. Binaural speaker identification using the equalization-cancelation technique;EURASIP Journal on Audio, Speech, and Music Processing;2020-12

4. A framework for testing and comparing binaural models;Hearing Research;2018-03

5. Comparison of a target-equalization-cancellation approach and a localization approach to source separation;The Journal of the Acoustical Society of America;2017-11