Abstract
In this paper, we propose a global personalized head-related transfer function (HRTF) method based on anthropometric measurements and ear images. The model consists of two sub-networks. The first is the VGG-Ear Model, which extracts features from the ear images. The second sub-network uses anthropometric measurements, ear features, and frequency information to predict the spherical harmonic (SH) coefficients. Finally, the personalized HRTF is obtained through inverse spherical harmonic transform (SHT) reconstruction. With only one training, the HRTF in all directions can be obtained, which greatly reduces the parameters and training cost of the model. To objectively evaluate the proposed method, we calculate the spectral distance (SD) between the predicted HRTF and the actual HRTF. The results show that the SD provided by this method is 5.31 dB, which is better than the average HRTF of 7.61 dB. In particular, the SD value is only increased by 0.09 dB compared to directly using the pinna measurements.
Funder
Science and Technology Commission of Shanghai Municipality
Subject
Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science
Reference40 articles.
1. Individualized Interaural Feature Learning and Personalized Binaural Localization Model
2. The psychophysics of human sound localization;Blauert,1997
3. Head-Related Transfer Function and Virtual Auditory Display;Xie,2013
4. Acoustics and Psychoacoustics;Howard,2009
5. Binaural technique: Do we need individual recordings?;Møller;J. Audio Eng. Soc.,1996
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. HRTF Estimation in the Wild;Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology;2023-10-29
2. HRTF Field: Unifying Measured HRTF Magnitude Representation with Neural Fields;ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2023-06-04