Affiliation:
1. Signal Processing (SP), Universität Hamburg, Germany,
Abstract
Algorithmic latency in speech processing is dominated by the frame length used for Fourier analysis, which in turn limits the achievable performance of magnitude-centric approaches. As previous studies suggest the importance of phase grows with decreasing frame length, this work presents a systematic study on the contribution of phase and magnitude in modern deep neural network (DNN)-based speech enhancement at different frame lengths. Results indicate that DNNs can successfully estimate phase when using short frames, with similar or better overall performance compared to using longer frames. Thus, interestingly, modern phase-aware DNNs allow for low-latency speech enhancement at high quality.
Funder
Deutsche Forschungsgemeinschaft
Publisher
Acoustical Society of America (ASA)
Subject
Electrical and Electronic Engineering,Atomic and Molecular Physics, and Optics
Cited by
14 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献