Multiple Sound Source Localization, Separation, and Reconstruction by Microphone Array: A DNN-Based Approach

Published: 2022-03-28 Issue: 7 Volume: 12 Page: 3428
ISSN: 2076-3417
Container-title: Applied Sciences
Language: en
Short-container-title: Applied Sciences

Author:

Chen Long, Chen Guitong, Huang Lei, Choy Yat-Sze, Sun Weize

Abstract

Simultaneous localization, separation, and reconstruction of multiple sound sources are often necessary in settings such as conference rooms, living rooms, and supermarkets. Deep neural networks (DNNs) have achieved considerable success in time-domain signal separation and reconstruction, improving the intelligibility of speech signals. In this paper, we propose a hybrid microphone array signal processing approach for the nearfield scenario that combines the beamforming technique and a DNN. Using this method, the challenge of identifying both the sound source location and content can be overcome. Moreover, a sequenced virtual sound field reconstruction process makes the proposed approach well suited to a sound field that contains a dominant, stronger sound source and masked, weaker sound sources. Using this strategy, all traceable main sound sources in a given sound field can be discovered iteratively, loop by loop. The operational duration and accuracy of localization are further improved by substituting the broadband weighted multiple signal classification (BW-MUSIC) method for the conventional delay-and-sum (DAS) beamforming algorithm. The effectiveness of the proposed method for localizing and reconstructing speech signals was validated by simulations and experiments, with promising results. The localization results were accurate, and the similarity and correlation between the reconstructed and original signals were high.
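
For readers unfamiliar with the conventional delay-and-sum (DAS) baseline named in the abstract, the following is a minimal sketch of nearfield DAS localization on a toy example. It is not the authors' implementation and does not cover BW-MUSIC or the DNN stage. The function name `das_power_map`, the 8-microphone linear array, the sampling rate, the scan grid, and the noise-free point-source model are all assumptions chosen for illustration.

```python
import numpy as np

C = 343.0  # assumed speed of sound in air, m/s


def das_power_map(signals, fs, mic_xy, grid_xy):
    """Nearfield delay-and-sum: steered output power at each candidate point.

    signals: (n_mics, n_samples) microphone recordings
    mic_xy:  (n_mics, 2) microphone positions in metres
    grid_xy: (n_points, 2) candidate source positions in metres
    """
    n_mics, n_samples = signals.shape
    spectra = np.fft.rfft(signals, axis=1)
    freqs = np.fft.rfftfreq(n_samples, d=1.0 / fs)
    # Propagation delay from every candidate point to every microphone.
    dists = np.linalg.norm(grid_xy[:, None, :] - mic_xy[None, :, :], axis=2)
    delays = dists / C
    power = np.empty(len(grid_xy))
    for p in range(len(grid_xy)):
        # Advance each channel by its delay (a phase shift), then average.
        phase = np.exp(2j * np.pi * freqs[None, :] * delays[p][:, None])
        summed = np.sum(spectra * phase, axis=0) / n_mics
        power[p] = np.sum(np.abs(summed) ** 2)
    return power


# --- toy simulation; every value below is an illustrative assumption ---
fs = 16000
n_samples = 2048
rng = np.random.default_rng(0)
mic_xy = np.column_stack([np.linspace(-0.5, 0.5, 8), np.zeros(8)])
source_xy = np.array([0.3, 1.0])

# Broadband source, delayed (circularly) and attenuated by 1/r per microphone.
source = rng.standard_normal(n_samples)
src_spec = np.fft.rfft(source)
src_spec[-1] = 0.0  # drop the Nyquist bin so fractional delays stay exact
freqs = np.fft.rfftfreq(n_samples, d=1.0 / fs)
r = np.linalg.norm(mic_xy - source_xy, axis=1)
shift = np.exp(-2j * np.pi * freqs[None, :] * r[:, None] / C)
signals = np.fft.irfft(src_spec[None, :] * shift / r[:, None], n=n_samples, axis=1)

# Scan a grid in front of the array and pick the point of maximum power.
gx, gy = np.meshgrid(np.linspace(-1.0, 1.0, 21), np.linspace(0.5, 1.5, 11))
grid_xy = np.column_stack([gx.ravel(), gy.ravel()])
power = das_power_map(signals, fs, mic_xy, grid_xy)
estimate = grid_xy[np.argmax(power)]
print("estimated source position:", estimate)
```

The scan evaluates every grid point, which is why DAS can be slow on fine grids; the abstract reports that replacing it with BW-MUSIC improves both run time and accuracy.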

Funder

National Natural Science Foundation of China

Natural Science Foundation of Guangdong

Foundation of Shenzhen

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes, Computer Science Applications, Process Chemistry and Technology, General Engineering, Instrumentation, General Materials Science

Link

https://www.mdpi.com/2076-3417/12/7/3428/pdf

References: 41 articles.

1. ADL-MVDR: All deep learning MVDR beamformer for target speech separation;Zhang;arXiv,2020

2. Analyzing Upper Bounds on Mean Absolute Errors for Deep Neural Network-Based Vector-to-Vector Regression

3. On Mean Absolute Error for Deep Neural Network Based Vector-to-Vector Regression

Cited by 12 articles.

1. Time Reverse Modeling of Acoustic Waves for Enhanced Mapping of Cracking Sound Events in Textile Reinforced Concrete;Journal of Nondestructive Evaluation;2024-08-26

2. A Wearable Assistive Listening Device With Immersive Function Using Sensors Fusion Method for the 3-D Space Perception;IEEE Sensors Journal;2024-01-15

3. Parametric Doppler correction for wayside array acoustic signal via short-time reconstruction;Mechanical Systems and Signal Processing;2024-01

4. Research on Sound Source Localization and Path Planning with SLAM Maps;2023 3rd International Conference on Robotics, Automation and Intelligent Control (ICRAIC);2023-11-24

5. Technology of sound source localization based on incoming sound intensity;2023 IEEE 18th International Conference on Computer Science and Information Technologies (CSIT);2023-10-19