A unified beamforming and source separation model for static and dynamic human-robot interaction-Reference-Cited by-同舟云学术

A unified beamforming and source separation model for static and dynamic human-robot interaction

Published:2024-03-01 Issue:3 Volume:4 Page:
ISSN:2691-1191
Container-title:JASA Express Letters
language:en
Short-container-title:

Author:

Wuth Jorge¹,Mahu Rodrigo¹,Cohen Israel²,Stern Richard M.³,Yoma Néstor Becerra¹^ORCID

Affiliation:

1. Speech Processing and Transmission Laboratory, Department of Electrical Engineering, University of Chile 1 , Av. Tupper 2007, Santiago, Chile

2. Technion–Israel Institute of Technology 2 , Haifa 3200003, Israel

3. Department of Electrical and Computer Engineering, Carnegie Mellon University 3 , Pittsburgh, Pennsylvania 15213, USA jwuths@uchile.cl , rmahus@gmail.com , icohen@ee.technion.ac.il , rms@cs.cmu.edu , nbecerra@ing.uchile.cl

Abstract

This paper presents a unified model for combining beamforming and blind source separation (BSS). The validity of the model's assumptions is confirmed by recovering target speech information in noise accurately using Oracle information. Using real static human-robot interaction (HRI) data, the proposed combination of BSS with the minimum-variance distortionless response beamformer provides a greater signal-to-noise ratio (SNR) than previous parallel and cascade systems that combine BSS and beamforming. In the difficult-to-model HRI dynamic environment, the system provides a SNR gain that was 2.8 dB greater than the results obtained with the cascade combination, where the parallel combination is infeasible.

Funder

Agencia Nacional de Investigación y Desarrollo

Publisher

Acoustical Society of America (ASA)

Link

https://pubs.aip.org/asa/jel/article-pdf/doi/10.1121/10.0025238/19709253/035203_1_10.0025238.pdf

Reference25 articles.

1. Learnable spectral dimension compression mapping for full-band speech enhancement;JASA Express Lett.,2023

2. Phase-aware deep speech enhancement: It's all about the frame length;JASA Express Lett.,2022

3. Direction-of-arrival estimation with blind surface impedance compensation for spherical microphone array;JASA Express Lett.,2021

4. Beamforming: A versatile approach to spatial filtering;IEEE ASSP Mag.,1988

5. Blind separation of speech mixtures via time-frequency masking;IEEE Trans. Signal Process.,2004