Assessing HRTF preprocessing methods for Ambisonics rendering through perceptual models-Reference-Cited by-同舟云学术

Assessing HRTF preprocessing methods for Ambisonics rendering through perceptual models

Published:2022 Issue: Volume:6 Page:4
ISSN:2681-4617
Container-title:Acta Acustica
language:
Short-container-title:Acta Acust.

Author:

Engel Isaac^ORCID,Goodman Dan F. M.^ORCID,Picinali Lorenzo^ORCID

Abstract

Binaural rendering of Ambisonics signals is a common way to reproduce spatial audio content. Processing Ambisonics signals at low spatial orders is desirable in order to reduce complexity, although it may degrade the perceived quality, in part due to the mismatch that occurs when a low-order Ambisonics signal is paired with a spatially dense head-related transfer function (HRTF). In order to alleviate this issue, the HRTF may be preprocessed so its spatial order is reduced. Several preprocessing methods have been proposed, but they have not been thoroughly compared yet. In this study, nine HRTF preprocessing methods were used to render anechoic binaural signals from Ambisonics representations of orders 1 to 44, and these were compared through perceptual hearing models in terms of localisation performance, externalisation and speech reception. This assessment was supported by numerical analyses of HRTF interpolation errors, interaural differences, perceptually-relevant spectral differences, and loudness stability. Models predicted that the binaural renderings’ accuracy increased with spatial order, as expected. A notable effect of the preprocessing method was observed: whereas all methods performed similarly at the highest spatial orders, some were considerably better at lower orders. A newly proposed method, BiMagLS, displayed the best performance overall and is recommended for the rendering of bilateral Ambisonics signals. The results, which were in line with previous literature, indirectly validate the perceptual models’ ability to predict listeners’ responses in a consistent and explicable manner.

Publisher

EDP Sciences

Subject

Electrical and Electronic Engineering,Speech and Hearing,Computer Science Applications,Acoustics and Ultrasonics

Link

https://acta-acustica.edpsciences.org/10.1051/aacus/2021055/pdf

Reference73 articles.

1. Headphone simulation of free‐field listening. I: Stimulus synthesis

2. 3D Tune-In Toolkit: An open-source library for real-time binaural spatialisation

3. Zotter F., Frank M.: Ambisonics: A practical 3D audio theory for recording, studio production, sound reinforcement, and virtual reality, in Vol. 19 of Springer Topics in Signal Processing, Springer International Publishing, Cham. 2019. https://link.springer.com/10.1007/978-3-030-17207-7.

4. Schissler C., Stirling P., Mehra R.: Efficient construction of the spatial room impulse response, in 2017 IEEE Virtual Reality (VR). 2017, pp. 122–130. https://doi.org/10.1109/VR.2017.7892239.

Cited by 12 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Comparison of Non-Parametric Interpolation Techniques for Sparsely Measured Binaural Room Impulse Responses;Journal of the Audio Engineering Society;2024-07-18

2. Analysis and Design of Head-Tracked Compensation for Bilateral Ambisonics;IEEE/ACM Transactions on Audio, Speech, and Language Processing;2024

3. HRTF Upsampling With a Generative Adversarial Network Using a Gnomonic Equiangular Projection;IEEE/ACM Transactions on Audio, Speech, and Language Processing;2024

4. Efficient representation of head-related transfer functions in continuous space–frequency domains;Journal of Sound and Vibration;2023-10

5. Application of different types of microphones in room impulse response measurements;2023 Immersive and 3D Audio: from Architecture to Automotive (I3DA);2023-09-05