End-to-end sound field reproduction based on deep learning-Reference-Cited by-同舟云学术

End-to-end sound field reproduction based on deep learning

Published:2023-05-01 Issue:5 Volume:153 Page:3055
ISSN:0001-4966
Container-title:The Journal of the Acoustical Society of America
language:en
Short-container-title:

Author:

Hong Xi¹,Du Bokai²,Yang Shuang¹,Lei Menghui¹,Zeng Xiangyang¹

Affiliation:

1. School of Marine Science and Technology, Northwestern Polytechnical University 1 , Xi'An, 710072, China

2. Aircraft Strength Research Institute 2 , Xi'An, 710065, China

Abstract

Sound field reproduction, which attempts to create a virtual acoustic environment, is a fundamental technology in the achievement of virtual reality. In sound field reproduction, the driving signals of the loudspeakers are calculated by considering the signals collected by the microphones and working environment of the reproduction system. In this paper, an end-to-end reproduction method based on deep learning is proposed. The inputs and outputs of this system are the sound-pressure signals recorded by microphones and the driving signals of loudspeakers, respectively. A convolutional autoencoder network with skip connections in the frequency domain is used. Furthermore, sparse layers are applied to capture the sparse features of the sound field. Simulation results show that the reproduction errors of the proposed method are lower than those generated by the conventional pressure matching and least absolute shrinkage and selection operator methods, especially at high frequencies. Experiments were performed under conditions of single and multiple primary sources. The results in both cases demonstrate that the proposed method achieves better high-frequency performance than the conventional methods.

Funder

National Natural Science Foundation of China

Publisher

Acoustical Society of America (ASA)

Subject

Acoustics and Ultrasonics,Arts and Humanities (miscellaneous)

Link

https://pubs.aip.org/asa/jasa/article-pdf/153/5/3055/17795505/3055_1_10.0019575.pdf

Reference27 articles.

1. The theory of wave field synthesis revisited,2008

2. Acoustic control by wave field synthesis;J. Acoust. Soc. Am.,1993

3. An overview of machine learning and other data-based methods for spatial audio capture, processing, and reproduction;J. Audio Speech Music Process.,2022

4. Comanducci, L., Antonacci, F., and Sarti, A. (2022). “ Synthesis of soundfields through irregular loudspeaker arrays based on convolutional neural networks,” available at http://arxiv.org/abs/2205.12872 (Last viewed September 1, 2022).

5. Non-linear dimensionality reduction,1992

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Multizone sound field reproduction using pressure matching with sparse equivalent source;Journal of Sound and Vibration;2024-06

2. Synthesis of soundfields through irregular loudspeaker arrays based on convolutional neural networks;EURASIP Journal on Audio, Speech, and Music Processing;2024-03-28