Authors:
Chernenkiy Ivan, Trufanov Nikita, Egorov Dobroslav, Kravchenko Oleg
References (19 articles):
1. Spectrogram based multi-task audio classification
2. Y. Xu, Q. Kong, W. Wang, and M. D. Plumbley, “Large-scale weakly supervised audio classification using gated convolutional neural network,” in 2018 IEEE international conference on acoustics, speech and signal processing (ICASSP) (IEEE, 2018) pp. 121–125.
3. C. Subakan, M. Ravanelli, S. Cornell, F. Grondin, and M. Bronzi, “On using transformers for speech-separation,” arXiv preprint arXiv:2202.02884 (2022).
4. Wavesplit: End-to-End Speech Separation by Speaker Clustering
5. A. Defossez, G. Synnaeve, and Y. Adi, “Real time speech enhancement in the waveform domain,” in Interspeech (2020).