Direction-of-arrival and power spectral density estimation using a single directional microphone and group-sparse optimization
-
Published:2023-10-04
Issue:1
Volume:2023
Page:
-
ISSN:1687-4722
-
Container-title:EURASIP Journal on Audio, Speech, and Music Processing
-
language:en
-
Short-container-title:J AUDIO SPEECH MUSIC PROC.
Author:
Tengan ElisaORCID, Dietzen Thomas, Elvander Filip, van Waterschoot Toon
Abstract
AbstractIn this paper, two approaches are proposed for estimating the direction of arrival (DOA) and power spectral density (PSD) of stationary point sources by using a single, rotating, directional microphone. These approaches are based on a method previously presented by the authors, in which point source DOAs were estimated by using a broadband signal model and solving a group-sparse optimization problem, where the number of observations made by the rotating directional microphone can be lower than the number of candidate DOAs in an angular grid. The DOA estimation is followed by the estimation of the sources’ PSDs through the solution of an overdetermined least squares problem. The first approach proposed in this paper includes the use of an additional nonnegativity constraint on the residual noise term when solving the group-sparse optimization problem and is referred to as the Group Lasso Least Squares (GL-LS) approach. The second proposed approach, in addition to the new nonnegativity constraint, employs a narrowband signal model when building the linear system of equations used for formulating the group-sparse optimization problem, where the DOAs and PSDs can be jointly estimated by iterative, group-wise reweighting. This is referred to as the Group-Lasso with $$l_1$$
l
1
-reweighting (GL-L1) approach. Both proposed approaches are implemented using the alternating direction method of multipliers (ADMM), and their performance is evaluated through simulations in which different setup conditions are considered, ranging from different types of model mismatch to variations in the acoustic scene and microphone directivity pattern. The results obtained show that in a scenario involving a microphone response mismatch between observed data and the signal model used, having the additional nonnegativity constraint on the residual noise can improve the DOA estimation for the case of GL-LS and the PSD estimation for the case of GL-L1. Moreover, the GL-L1 approach can present an advantage over GL-LS in terms of DOA estimation performance in scenarios with low SNR or where multiple sources are closely located to each other. Finally, it is shown that having the least squares PSD re-estimation step is beneficial in most scenarios, such that GL-LS outperformed GL-L1 in terms of PSD estimation errors.
Funder
Fonds Wetenschappelijk Onderzoek HORIZON EUROPE European Research Council
Publisher
Springer Science and Business Media LLC
Subject
Electrical and Electronic Engineering,Acoustics and Ultrasonics
Reference61 articles.
1. S. Doclo, S. Gannot, M. Moonen, A. Spriet, S. Haykin, K.R. Liu, in Handbook on array processing and sensor networks, Acoustic beamforming for hearing aid applications, vol. 9 (Wiley, Hoboken, 2010), pp.269–302 2. P.C. Loizou, Speech Enhancement: Theory and Practice, 2nd edn. (CRC Press, Boca Raton, 2013) 3. P.A. Naylor, N.D. Gaubitch, Speech dereverberation, vol. 2 (Springer, New York, 2010) 4. M. Brandstein, D. Ward, Microphone arrays: signal processing techniques and applications (Springer, New York, 2013) 5. K. Kinoshita, M. Delcroix, S. Gannot, E.A.P. Habets, R. Haeb-Umbach, W. Kellermann, V. Leutnant, R. Maas, T. Nakatani, B. Raj et al., A summary of the reverb challenge: state-of-the-art and remaining challenges in reverberant speech processing research. EURASIP J. Adv. Signal Process. 2016, 1–19 (2016)
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|