Abstract
The Multiple Signal Classification (MUSIC) algorithm has become one of the most popular algorithms for estimating the direction-of-arrival (DOA) of multiple sources due to its simplicity and ease of implementation. Spherical microphone arrays can capture more sound field information than planar arrays. The collected multichannel speech signals can be transformed from the space domain to the spherical harmonic domain (SHD) for processing through spherical modal decomposition. The spherical harmonic domain MUSIC (SHD-MUSIC) algorithm reduces the dimensionality of the covariance matrix and achieves better DOA estimation performance than the conventional MUSIC algorithm. However, the SHD-MUSIC algorithm is prone to failure in low signal-to-noise ratio (SNR), high reverberation time (RT), and other multi-source environments. To address these challenges, we propose a novel joint spherical harmonic domain beam-space MUSIC (SHD-BMUSIC) algorithm in this paper. The advantage of decoupling the signal frequency and angle information in the SHD is exploited to improve the anti-reverberation property of the DOA estimation. In the SHD, the broadband beamforming matrix converts the SHD sound pressure to the beam domain output. Beamforming enhances the incoming signal in the desired direction and reduces the SNR threshold as well as the dimension of the signal covariance matrix. In addition, the 3D beam of the spherical array has rotational symmetry and its beam steering is decoupled from the beam shape. Therefore, the broadband beamforming constructed in this paper allows for the arbitrary adjustment of beam steering without the need to redesign the beam shape. Both simulation experiments and practical tests are conducted to verify that the proposed SHD-BMUSIC algorithm has a more robust adjacent source discrimination capability than the SHD-MUSIC algorithm.
Funder
National Natural Science Foundation of China
Guangxi Natural Science Foundation
Key Laboratory of Cognitive Radio and Information Processing, Ministry of Education
Subject
Physics and Astronomy (miscellaneous),General Mathematics,Chemistry (miscellaneous),Computer Science (miscellaneous)
Reference29 articles.
1. The LOCATA Challenge: Acoustic Source Localization and Tracking;Evers;IEEE/ACM Trans. Audio Speech Lang. Process.,2020
2. Time Difference of Arrival Estimation Exploiting Multichannel Spatio-Temporal Prediction;He;IEEE Trans. Audio Speech Lang. Process.,2013
3. Robust Sound Source Tracking Using SRP-PHAT and 3D Convolutional Neural Networks;Miguel;IEEE/ACM Trans. Audio Speech Lang. Process.,2021
4. Jaafer, Z., Goli, S., and Elameer, A.S. (2018, January 8–9). Performance Analysis of Beam Scan, MIN-NORM, Music and Mvdr DOA Estimation Algorithms. Proceedings of the 2018 International Conference on Engineering Technology and their Applications (IICETA), Al-Najaf, Iraq.
5. Xu, C., Xiao, X., Sun, S., Rao, W., Chng, E.S., and Li, H. (2017, January 20–24). Weighted Spatial Covariance Matrix Estimation for MUSIC Based TDOA Estimation of Speech Source. Proceedings of the Interspeech 2017, ISCA, Stockholm, Sweden.