Sampling the user controls in neural modeling of audio devices-Reference-Cited by-同舟云学术

Sampling the user controls in neural modeling of audio devices

Published:2024-05-20 Issue:1 Volume:2024 Page:
ISSN:1687-4722
Container-title:EURASIP Journal on Audio, Speech, and Music Processing
language:en
Short-container-title:J AUDIO SPEECH MUSIC PROC.

Author:

Mikkonen Otto^ORCID,Wright Alec,Välimäki Vesa

Abstract

AbstractThis work studies neural modeling of nonlinear parametric audio circuits, focusing on how the diversity of settings of the target device user controls seen during training affects network generalization. To study the problem, a large corpus of training datasets is synthetically generated using SPICE simulations of two distinct devices, an analog equalizer and an analog distortion pedal. A proven recurrent neural network architecture is trained using each dataset. The difference in the datasets is in the sampling resolution of the device user controls and in their overall size. Based on objective and subjective evaluation of the trained models, a sampling resolution of five for the device parameters is found to be sufficient to capture the behavior of the target systems for the types of devices considered during the study. This result is desirable, since a dense sampling grid can be impractical to realize in the general case when no automated way of setting the device parameters is available, while collecting large amounts of data using a sparse grid only incurs small additional costs. Thus, the result provides guidance for efficient collection of training data for neural modeling of other similar audio devices.

Funder

NordForsk

Publisher

Springer Science and Business Media LLC

Link

https://link.springer.com/content/pdf/10.1186/s13636-024-00347-5.pdf

Reference64 articles.

1. V. Välimäki, F. Fontana, J.O. Smith, U. Zolzer, Introduction to the special issue on virtual analog audio effects and musical instruments. IEEE Trans. Audio Speech Lang. Process. 18(4), 713–714 (2010). https://doi.org/10.1109/TASL.2010.2046449

2. J. Pakarinen, V. Välimäki, F. Fontana, V. Lazzarini, J.S. Abel, Recent advances in real-time musical effects, synthesis and virtual analog models. EURASIP J. Adv. Signal Process. 2011(1), 940784 (2011). https://doi.org/10.1155/2011/940784

3. J. Pakarinen, D.T. Yeh, A review of digital techniques for modeling vacuum-tube guitar amplifiers. Comput. Music J. 33(2), 85–100 (2009). https://doi.org/10.1162/comj.2009.33.2.85

4. T. Vanhatalo, P. Legrand, M. Desainte-Catherine, P. Hanna, A. Brusco, G. Pille, Y. Bayle, A review of neural network-based emulation of guitar amplifiers. Appl. Sci. 12(12), 5894 (2022). https://doi.org/10.3390/app12125894

5. O. Massi, A.I. Mezza, R. Giampiccolo, A. Bernardini, Deep learning-based wave digital modeling of rate-dependent hysteretic nonlinearities for virtual analog applications. EURASIP J. Audio Speech Music Process. 2023(1) (2023). https://doi.org/10.1186/s13636-023-00277-8