Deep Learning-Based Dereverberation for Sound Source Localization with Beamforming

Author:

Zhai Qingbo1ORCID,Ning Fangli1ORCID,Hou Hongjie1ORCID,Wei Juan2ORCID,Su Zhaojing3ORCID

Affiliation:

1. School of Mechanical Engineering, Northwestern Polytechnical University, Xi’an, Shaanxi Province, P. R. China

2. School of Telecommunications Engineering, Xidian University, Xi’an, Shaanxi Province, P. R. China

3. College of Arts, Shandong University of Science and Technology, Tsingtao, Shandong Province, P. R. China

Abstract

In this work, an algorithm that combines deep-learning based dereverberation and beamforming is proposed for sound source localization in the reverberant environment. The contribution is combining deep-learning based dereverberation and beamforming together. Through deep learning, the proposed algorithm can directly achieve dereverberation of the cross-spectral matrix of microphone array measurement signals in the reverberant environment, which can overcome the challenge of requiring multiple measurements like average beamforming. In this way, the proposed algorithm can be applied to the reverberant environment that requires efficient sound source localization. At the same time, the network of deep-learning based dereverberation is trained directly using the cross-spectral matrix of signals collected by microphone arrays, which can overcome the challenge of requiring prior knowledge of reflective surfaces like empirical dereverberation beamforming. In this way, the proposed algorithm can be applied to the reverberant environment composed of walls with unknown sound reflection coefficients. Both specular reflections, diffuse reflections, and background noise are considered in the reverberant environment. The cross-spectral matrix of microphone array measurement signals in the reverberant environment is first fed into the U-net for dereverberation and then combined with beamforming for sound source localization. The results in the test set show that the proposed algorithm can eliminate the influence of reverberation on sound source localization. The applicability limitation results of the proposed algorithm show the proposed algorithm’s strong robustness to unseen sound reflection coefficient, unseen SNR, and expandability to some degree for unseen source location. However, the proposed algorithm cannot provide stable and accurate sound source localization results under unseen room geometry. The proposed algorithm has higher spatial resolution and smaller errors than the average beamforming and the empirical dereverberation beamforming.

Funder

National Natural Science Foundation of China

the 2023 Key Industrial Chain Technology Research Project of Xi&an

Aeronautical Science Foundation of China

Key R&D Program of Shandong Provinc

Shaanxi key Research Program Project

Shaanxi Provincial Natural Science Basic Research Program Project

Publisher

World Scientific Pub Co Pte Ltd

Subject

Applied Mathematics,Computer Science Applications,Acoustics and Ultrasonics

Cited by 1 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3