Evaluation of localization precision by proposed quasi-spherical nested microphone array in combination with multiresolution adaptive steered response power-Reference-Cited by-同舟云学术

Evaluation of localization precision by proposed quasi-spherical nested microphone array in combination with multiresolution adaptive steered response power

Published:2020-06-01 Issue:3 Volume:71 Page:150-164
ISSN:1339-309X
Container-title:Journal of Electrical Engineering
language:en
Short-container-title:

Author:

Firoozabadi Ali Dehghan¹^ORCID,Irarrazaval Pablo²³⁴,Adasme Pablo⁵,Zabala-Blanco David⁶,Azurdia-Meza Cesar⁷

Affiliation:

1. Department of Electricity , Universidad Tecnológica Metropolitana , Av. José Pedro Alessandri 1242 , Santiago 7800002 , Chile

2. Electrical Engineering Department ,

3. Biomedical Imaging Center ,

4. Institute for Biological and Medical Engineering , Pontificia Universidad Católica de Chile , Santiago 7820436 , Chile

5. Electrical Engineering Department , Universidad de Santiago de Chile , Av. Ecuador 3519 , Santiago 9170124 , Chile

6. Department of Computing and Industries , Universidad Católica del Maule , Talca 3466706 , Chile

7. Department of Electrical Engineering , Universidad de Chile , Santiago 8370451 , Chile

Abstract

Abstract Multiple sound source localization in noisy and reverberant conditions is one of the important challenges in the speech signal processing. The aim of this article is three-dimensional sound source localization in undesirable scenarios. For the localization algorithms, the spatial aliasing is one of the destructive factors in reducing the accuracy. Firstly, a 3D quasi-spherical nested microphone array (QSNMA) is proposed for eliminating the spatial aliasing. Since the speech signal has the windowed-disjoint orthogonality property, the speech information differs in terms of the frequency bands. Then, the Gammatone filter bank is introduced for the speech subband processing. In the following, the multiresolution steered response power (SRP) algorithm is adaptively implemented on subbands with the phase transform (PHAT)/maximum likelihood (ML) weighted functions based on the levels of the noise and reverberation. The peaks of the multiresolution adaptive SRP (MASRP) algorithm are extracted in each subband based on the number of speakers for continuous time frames. Finally, the distribution of these peaks are calculated in each subband and they are merged by the use of weighted averaging method. The final 3D speakers locations are estimated by extracting the peaks in the final distribution. The proposed QSNMAMASRP(PHAT/ML) algorithm is evaluated on real and simulated data for 2 and 3 simultaneous speakers in noisy and reverberant conditions. The proposed method is compared with SRP-PHAT, spectral source model-deep neural network, and spherical harmonic temporal extension of multiple response model sparse Bayesian learning algorithms on different range of signal-to-noise ratio and reverberation time. The mean absolute estimation error, averaged standard deviation for absolute estimation error, and computational complexity results show the superiority of the proposed method.

Publisher

Walter de Gruyter GmbH

Link

https://www.sciendo.com/pdf/10.2478/jee-2020-0022

Reference35 articles.

1. [1] X. Sheng and Y.-H. Hu, “Maximum Likelihood Multiple-Source Localization Using Acoustic Energy Measurements with Wireless Sensor Networks”, IEEE Transactions on Signal Processing, vol. 53, pp. 44-53, 2005.

2. [2] A. Ikeda, H. Mizoguchi, Y. Sasaki, T. Enomoto, and S. Kagami, “2D Sound Source Localization in Azimuth & Elevation from Microphone Array by Using a Directional Pattern of Element”, IEEE SENSORS, Atlanta, GA, pp. 1213-1216, 2007.

3. [3] M. I. Mandel, R. J. Weiss, and D. P. Ellis, “Model-based expectation maximization source separation and localization”, IEEE Transactions on Audio, Speech, and Language Processing, vol. 18, no. 2, p. 382-394, 2010.

4. [4] F. Antonacci, M. Matteucci, D. Migliore, D. Riva, A. Sarti, M. Tagliasacchi, and S. Tubaro, “Tracking multiple acoustic sources in reverberant environments using regularized particle filter”, In Proceedings IEEE International Conference on Digital Signal Processing, Cardi, UK, pp. 99-102, 2017.

5. [5] Q. Yan, J. Chen, G. Ottoy, and L. D. Strycker, “Robust AOA based acoustic source localization method with unreliable measurements”, Signal Processing, vol. 152, pp. 13-21, 2018.