1. StyleGAN-NADA
2. Look at what i'm doing: Self-supervised spa-tial grounding of narrations in instructional videos;tan;Advances in Neural Information Processing Systems 34 14476–14487,2021
3. Source-filter based clustering for monaural blind source separation;spiertz;Proceedings of the 12th International Conference on Digital Audio Ef-fects,0
4. Audio set: An ontology and human-labeled dataset for audio events;jort;2017 IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP),0