1. Multi-Modal Knowledge Transfer for Target Speaker Lipreading with Improved Audio-Visual Pretraining and Cross-Lingual Fine-Tuning;2024 IEEE International Conference on Multimedia and Expo Workshops (ICMEW);2024-07-15
2. Enhancing GAN-based Vocoders with Contrastive Learning Under Data-Limited Condition;2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW);2024-04-14
3. BRAVEn: Improving Self-supervised pre-training for Visual and Auditory Speech Recognition;ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2024-04-14
4. Multilingual Audio-Visual Speech Recognition with Hybrid CTC/RNN-T Fast Conformer;ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2024-04-14