Author:
Girfanov O. V.,Shishkin A. G.
Reference46 articles.
1. Radford, A., Kim, J.W., Xu, T., Brockman, G., Mcleave-y, C., and Sutskever, I., Robust speech recognition via large-scale weak supervision, Proc. Mach. Learn. Res., 2023, vol. 202, pp. 28492–28518.
2. Williamson, D.S., Wang, Yu., and Wang, D., Complex ratio masking for monaural speech separation, IEEE/ACM Trans. Audio, Speech, Lang. Process., 2015, vol. 24, no. 3, pp. 483–492. https://doi.org/10.1109/taslp.2015.2512042
3. Fu, S.-W., Hu, T.-Ya., Tsao, Yu., and Lu, X., Complex spectrogram enhancement by convolutional neural network with multi-metrics learning, 2017 IEEE 27th Int. Workshop on Machine Learning for Signal Processing (MLSP), Tokyo, 2017, IEEE, 2017, pp. 1–6. https://doi.org/10.1109/mlsp.2017.8168119
4. Fu, S.-W., Tsao, Yu., Lu, X., and Kawai, H., Raw waveform-based speech enhancement by fully convolutional networks, 2017 Asia-Pacific Signal and Information Processing Association Annu. Summit and Conf. (A-PSIPA ASC), Kuala-Lumpur, Malaysia, 2017, IEEE, 2017, pp. 6–12. https://doi.org/10.1109/apsipa.2017.8281993
5. Wang, P., Tan, K., and Wang, D.L., Bridging the gap between monaural speech enhancement and recognition with distortion-independent acoustic modeling, IEEE/ACM Trans. Audio, Speech, Lang. Process., 2020, vol. 28, pp. 39–48. https://doi.org/10.1109/taslp.2019.2946789
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献