Publisher
Springer Nature Singapore
Reference42 articles.
1. Fu, S.W., Wang, T.W., Tsao, Y., Lu, X., Kawai, H.: End-to-end waveform utterance enhancement for direct evaluation metrics optimization by fully convolutional neural networks. IEEE/ACM Trans. Audio Speech Lang. Process. 26(9), 1570–1584 (2018)
2. Plantinga, P., Bagchi, D., Fosler-Lussier, E.: Perceptual loss with recognition model for single-channel enhancement and robust ASR. arXiv preprint arXiv:2112.06068 (2021)
3. Turian, J., Henry, M.: I’m sorry for your loss: spectrally-based audio distances are bad at pitch. In: “I Can’t Believe It’s Not Better!” NeurIPS 2020 Workshop (2020)
4. Reddy, C.K., Beyrami, E., Pool, J., Cutler, R., Srinivasan, S., Gehrke, J.: A scalable noisy speech dataset and online subjective test framework. arXiv preprint arXiv:1909.08050 (2019)
5. Kolbæk, M., Tan, Z.H., Jensen, S.H., Jensen, J.: On loss functions for supervised monaural time-domain speech enhancement. IEEE/ACM Trans. Audio Speech Lang. Process. 28, 825–838 (2020)