A nested generalized sidelobe canceller for source counting, localization, and signal separation in reverberant fields

Author:

Kung Fan-Jie1ORCID,Bai Mingsian R.1ORCID

Affiliation:

1. Department of Electrical Engineering, National Tsing Hua University , Hsinchu 300044, Taiwan

Abstract

This paper proposes a nested generalized sidelobe canceller (NGSC) for typical array signal processing tasks, including source counting, localization, and signal separation. Multiple blocking matrices are arranged in a nested structure to successively eliminate dominant sources until the remaining signal is predominantly incoherent noise. The microphone signals are dereverberated using a multichannel weighted prediction error algorithm. In source counting, the number of sound sources is determined by tracking the average power of the blocked signal. In source localization, the direction-of-arrival (DOA) is estimated via cosine similarity in conjunction with the golden section search. In signal separation, the estimated DOA enables speech separation in a linearly constrained minimum variance beamformer with postfiltering (LCMV-PF). Monte Carlo simulations are performed to compare the proposed NGSC approach with four baselines, minimum description length, second order statistic of the eigenvalue, multistage Wiener filtering, and multiple signal classification. The results show that NGSC achieves at least 28.80% higher source counting accuracy with a 1.04° lower root mean square degree error than the baselines. The signal-to-distortion ratio achieved by LCMV-PF is 1.52 dB higher than that achieved by the linearly constrained minimum power beamformer and the multichannel Wiener filter.

Funder

Ministry of Science and Technology

Publisher

Acoustical Society of America (ASA)

Subject

Acoustics and Ultrasonics,Arts and Humanities (miscellaneous)

Reference50 articles.

1. Robust localization of multiple sound sources based on BSS algorithms,2015

2. A novel directional framework for source counting and source separation in instantaneous underdetermined audio mixtures;IEEE/ACM Trans. Audio. Speech. Lang. Process.,2020

3. Speech processing for digital home assistants: Combining signal processing with deep-learning techniques;IEEE Signal Process. Mag.,2019

4. New insights into the MVDR beamformer in room acoustics;IEEE Trans. Audio. Speech. Lang. Process.,2010

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3