1. Loizou, P.C.: Speech Enhancement: Theory and Practice. CRC Press, Boca Raton, FL, USA (2007)
2. Tan, K., Wang, D.A.: Convolutional recurrent neural network for real-time speech enhancement: In: Proc. Interspeech 2018, 2–6 September 2018, Hyderabad, India, pp. 3229–3233 (2018)
3. Umut, I., Giri, R., Phansalkar, N., Valin, J.-M., Helwani, K., Krishnaswamy, A.: PoCoNet: better speech enhancement with frequency-positional embeddings, semi-supervised conversational data, and biased loss. In: Proc. of Interspeech 2020, 25–29 October 2020, Shanghai, pp. 2487–2491 (2020)
4. Hao, X., Su, X., Horaud, R, Li, X.: FullSubNet: a full-band and sub-band fusion model for real-time single-channel speech enhancement. In: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 6–11 June 2021, Toronto, Canada, pp. 6633–6637 (2021)
5. Xu, R., Wu, R., Ishiwaka, Y., Vondrick, C., Zheng, C.: Listening to sounds of silence for speech denoising. In: Conference on Neural Information Processing Systems (NeurIPS 2020), 6–12 December 2020, Vancouver, Canada, pp. 9633–9648 (2020)