Author:
Yiwere Mariam, Rhee Eun Joo
Abstract
This paper presents a sound source distance estimation (SSDE) method using a convolutional recurrent neural network (CRNN). We approach the sound source distance estimation task as an image classification problem, and we aim to classify a given audio signal into one of three predefined distance classes—one meter, two meters, and three meters—irrespective of its orientation angle. For the purpose of training, we create a dataset by recording audio signals at the three different distances and three angles in different rooms. The CRNN is trained using time-frequency representations of the audio signals. Specifically, we transform the audio signals into log-scaled mel spectrograms, allowing the convolutional layers to extract the appropriate features required for the classification. When trained and tested with combined datasets from all rooms, the proposed model exhibits high classification accuracies; however, training and testing the model in separate rooms results in lower accuracies, indicating that further study is required to improve the method’s generalization ability. Our experimental results demonstrate that it is possible to estimate sound source distances in known environments by classification using the log-scaled mel spectrogram.
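As a rough illustration of the pipeline described in the abstract, the sketch below converts an audio clip into a log-scaled mel spectrogram and passes it through a small CRNN with three output classes (one, two, and three meters). The paper does not specify the software framework, spectrogram parameters, or network layer sizes, so librosa/PyTorch and every hyperparameter here (sample rate, 64 mel bands, two convolution blocks, a single GRU layer) are illustrative assumptions rather than the authors' configuration.

```python
# Minimal sketch of the described approach, NOT the authors' exact model:
# log-mel spectrogram features + CRNN classifier over three distance classes.
import librosa
import numpy as np
import torch
import torch.nn as nn

def log_mel_spectrogram(path, sr=16000, n_mels=64, n_fft=1024, hop=512):
    """Load an audio file and return a log-scaled mel spectrogram (n_mels x frames).
    All parameter values are assumptions for illustration."""
    y, _ = librosa.load(path, sr=sr, mono=True)
    mel = librosa.feature.melspectrogram(y=y, sr=sr, n_fft=n_fft,
                                         hop_length=hop, n_mels=n_mels)
    return librosa.power_to_db(mel, ref=np.max)

class CRNN(nn.Module):
    """Convolutional layers extract local time-frequency features,
    a GRU summarizes them over time, and a linear layer yields class scores."""
    def __init__(self, n_mels=64, n_classes=3):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv2d(1, 16, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(16, 32, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
        )
        self.gru = nn.GRU(input_size=32 * (n_mels // 4), hidden_size=64,
                          batch_first=True)
        self.fc = nn.Linear(64, n_classes)

    def forward(self, x):                 # x: (batch, 1, n_mels, frames)
        f = self.conv(x)                  # (batch, 32, n_mels/4, frames/4)
        b, c, m, t = f.shape
        f = f.permute(0, 3, 1, 2).reshape(b, t, c * m)   # time-major sequence
        _, h = self.gru(f)                # final hidden state summarizes the clip
        return self.fc(h.squeeze(0))      # logits over the three distance classes

# Hypothetical usage on a single recording ("clip.wav" is a placeholder path):
# spec = log_mel_spectrogram("clip.wav")
# x = torch.from_numpy(spec).float().unsqueeze(0).unsqueeze(0)
# pred_distance_m = int(torch.argmax(CRNN()(x))) + 1   # class index -> meters
```

Treating the spectrogram as a single-channel image is what allows the task to be framed as image classification, as the abstract describes; the recurrent layer then aggregates frame-level features so that clips of different lengths map to one fixed-size class prediction.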
Subject
Electrical and Electronic Engineering; Biochemistry; Instrumentation; Atomic and Molecular Physics, and Optics; Analytical Chemistry
Cited by
19 articles.