Speech Enhancement with Generative Diffusion Models-Reference-Cited by-同舟云学术

Speech Enhancement with Generative Diffusion Models

Published:2023-10 Issue:5 Volume:57 Page:249-257
ISSN:0005-1055
Container-title:Automatic Documentation and Mathematical Linguistics
language:en
Short-container-title:Autom. Doc. Math. Linguist.

Author:

Girfanov O. V.,Shishkin A. G.

Publisher

Allerton Press

Subject

General Medicine

Link

https://link.springer.com/content/pdf/10.3103/S0005105523050035.pdf

Reference46 articles.

1. Radford, A., Kim, J.W., Xu, T., Brockman, G., Mcleave-y, C., and Sutskever, I., Robust speech recognition via large-scale weak supervision, Proc. Mach. Learn. Res., 2023, vol. 202, pp. 28492–28518.

2. Williamson, D.S., Wang, Yu., and Wang, D., Complex ratio masking for monaural speech separation, IEEE/ACM Trans. Audio, Speech, Lang. Process., 2015, vol. 24, no. 3, pp. 483–492. https://doi.org/10.1109/taslp.2015.2512042

3. Fu, S.-W., Hu, T.-Ya., Tsao, Yu., and Lu, X., Complex spectrogram enhancement by convolutional neural network with multi-metrics learning, 2017 IEEE 27th Int. Workshop on Machine Learning for Signal Processing (MLSP), Tokyo, 2017, IEEE, 2017, pp. 1–6. https://doi.org/10.1109/mlsp.2017.8168119

4. Fu, S.-W., Tsao, Yu., Lu, X., and Kawai, H., Raw waveform-based speech enhancement by fully convolutional networks, 2017 Asia-Pacific Signal and Information Processing Association Annu. Summit and Conf. (A-PSIPA ASC), Kuala-Lumpur, Malaysia, 2017, IEEE, 2017, pp. 6–12. https://doi.org/10.1109/apsipa.2017.8281993

5. Wang, P., Tan, K., and Wang, D.L., Bridging the gap between monaural speech enhancement and recognition with distortion-independent acoustic modeling, IEEE/ACM Trans. Audio, Speech, Lang. Process., 2020, vol. 28, pp. 39–48. https://doi.org/10.1109/taslp.2019.2946789

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Research on U-Net seismic signal denoising combined with residual dense blocks;Measurement Science and Technology;2024-02-06