Authors:
Afouras Triantafyllos, Chung Joon Son, Zisserman Andrew
Cited by
58 articles.
1. Target Speech Extraction with Pre-trained AV-HuBERT and Mask-And-Recover Strategy;2024 International Joint Conference on Neural Networks (IJCNN);2024-06-30
2. Mini-3DCvT: a lightweight lip-reading method based on 3D convolution visual transformer;The Visual Computer;2024-06-11
3. Look Once to Hear: Target Speech Hearing with Noisy Examples;Proceedings of the CHI Conference on Human Factors in Computing Systems;2024-05-11
4. Voicevector: Multimodal Enrolment Vectors for Speaker Separation;2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW);2024-04-14
5. Training Strategies for Modality Dropout Resilient Multi-Modal Target Speaker Extraction;2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW);2024-04-14