Synthetic data generation techniques for training deep acoustic siren identification networks-Reference-Cited by-同舟云学术

Synthetic data generation techniques for training deep acoustic siren identification networks

Published:2024-07-12 Issue: Volume:4 Page:
ISSN:2673-8198
Container-title:Frontiers in Signal Processing
language:
Short-container-title:Front. Signal Process.

Author:

Damiano Stefano,Cramer Benjamin,Guntoro Andre,van Waterschoot Toon

Abstract

Acoustic sensing has been widely exploited for the early detection of harmful situations in urban environments: in particular, several siren identification algorithms based on deep neural networks have been developed and have proven robust to the noisy and non-stationary urban acoustic scene. Although high classification accuracy can be achieved when training and evaluating on the same dataset, the cross-dataset performance of such models remains unexplored. To build robust models that generalize well to unseen data, large datasets that capture the diversity of the target sounds are needed, whose collection is generally expensive and time consuming. To overcome this limitation, in this work we investigate synthetic data generation techniques for training siren identification models. To obtain siren source signals, we either collect from public sources a small set of stationary, recorded siren sounds, or generate them synthetically. We then simulate source motion, acoustic propagation and Doppler effect, and finally combine the resulting signal with background noise. This way, we build two synthetic datasets used to train three different convolutional neural networks, then tested on real-world datasets unseen during training. We show that the proposed training strategy based on the use of recorded source signals and synthetic acoustic propagation performs best. In particular, this method leads to models that exhibit a better generalization ability, as compared to training and evaluating in a cross-dataset setting. Moreover, the proposed method loosens the data collection requirement and is entirely built using publicly available resources.

Publisher

Frontiers Media SA

Reference39 articles.

1. Large-scale audio dataset for emergency vehicle sirens and road noises;Asif;Sci. Data,2022

2. An automatic emergency signal recognition system for the hearing impaired;Beritelli,2006

3. Acoustic features for deep learning-based models for emergency Siren detection: an evaluation study;Cantarini,2021

4. Few-shot emergency Siren detection;Cantarini;Sensors,2022

5. Detection of alarm sounds in noisy environments;Carmel,2017