Sound Source Localization Using Deep Learning Models-Reference-Cited by-同舟云学术

Sound Source Localization Using Deep Learning Models

Published:2017-02-20 Issue:1 Volume:29 Page:37-48
ISSN:1883-8049
Container-title:Journal of Robotics and Mechatronics
language:en
Short-container-title:J. Robot. Mechatron.

Author:

Yalta Nelson, ,Nakadai Kazuhiro,Ogata Tetsuya,

Abstract

[abstFig src='/00290001/04.jpg' width='300' text='Using a deep learning model, the robot locate the sound source from a multiple channel audio stream input' ] This study proposes the use of a deep neural network to localize a sound source using an array of microphones in a reverberant environment. During the last few years, applications based on deep neural networks have performed various tasks such as image classification or speech recognition to levels that exceed even human capabilities. In our study, we employ deep residual networks, which have recently shown remarkable performance in image classification tasks even when the training period is shorter than that of other models. Deep residual networks are used to process audio input similar to multiple signal classification (MUSIC) methods. We show that with end-to-end training and generic preprocessing, the performance of deep residual networks not only surpasses the block level accuracy of linear models on nearly clean environments but also shows robustness to challenging conditions by exploiting the time delay on power information.

Publisher

Fuji Technology Press Ltd.

Subject

Electrical and Electronic Engineering,General Computer Science

Reference35 articles.

1. K. Nakadai, T. Lourens, H. G. Okuno, and H. Kitano, “Active audition for humanoid,” Proc. of the National Conf. on Artificial Intelligence, pp. 832-839, 2000.

2. K. Nakadai, H. Nakajima, Y. Hasegawa, and H. Tsujino, “Sound source separation of moving speakers for robot audition,” Proc. of IEEE Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP), pp. 3685-3688, 2009.

3. R. O. Schmidt, “Multiple emitter location and signal parameter estimation,” IEEE Trans. on Antennas and Propagation, Vol.34, No.3, pp. 276-280, 1986.

4. K. Nakamura, K. Nakadai, F. Asano, Y. Hasegawa, and H. Tsujino, “Intelligent Sound Source Localization for Dynamic Environments,” 2009 IEEE/RSJ Int. Conf. on Intelligent Robots and Systems, pp. 664-669, 2009.

5. B. D. Rao and K. V. S. Hari, “Performance Analysis of Root-Music,” IEEE Trans. on Acoustics, Speech, and Signal Processing, Vol.37, No.12, pp. 1939-1949, 1989.

Cited by 78 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Online adaptation of fourier series-based acoustic transfer function model and its application to sound source localization and separation;Advanced Robotics;2024-07-15

2. Snn and sound: a comprehensive review of spiking neural networks in sound;Biomedical Engineering Letters;2024-07-11

3. An Underwater Wideband Sound Source Localization Method Based on Light Neural Network Structure;IEEE Sensors Journal;2024-07-01

4. Robots Saving Lives: A Literature Review About Search and Rescue (SAR) in Harsh Environments;2024 IEEE Intelligent Vehicles Symposium (IV);2024-06-02

5. Robust acoustic directional sensing enabled by synergy between resonator-based sensor and deep learning;Scientific Reports;2024-05-02