Author:
Pan Zexu,Tao Ruijie,Xu Chenglin,Li Haizhou
Funder
National Research Foundation
Deutsche Forschungsgemeinschaft
Cited by
23 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Target Speech Extraction with Pre-trained AV-HuBERT and Mask-And-Recover Strategy;2024 International Joint Conference on Neural Networks (IJCNN);2024-06-30
2. Training Strategies for Modality Dropout Resilient Multi-Modal Target Speaker Extraction;2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW);2024-04-14
3. Late Audio-Visual Fusion for in-the-Wild Speaker Diarization;2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW);2024-04-14
4. MAVAR-SE: Multi-scale Audio-Visual Association Representation Network for End-to-End Speaker Extraction;Lecture Notes in Computer Science;2024
5. Scenario-Aware Audio-Visual TF-Gridnet for Target Speech Extraction;2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU);2023-12-16