Author:
Schoburg Carrillo de Mira Rodrigo,Haliassos Alexandros,Petridis Stavros,Schuller Björn W.,Pantic Maja
Cited by
7 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. M3TTS: Multi-modal text-to-speech of multi-scale style control for dubbing;Pattern Recognition Letters;2024-03
2. Large-Scale Unsupervised Audio Pre-Training for Video-to-Speech Synthesis;IEEE/ACM Transactions on Audio, Speech, and Language Processing;2024
3. Towards Accurate Lip-to-Speech Synthesis in-the-Wild;Proceedings of the 31st ACM International Conference on Multimedia;2023-10-26
4. On the Audio-visual Synchronization for Lip-to-Speech Synthesis;2023 IEEE/CVF International Conference on Computer Vision (ICCV);2023-10-01
5. DiffV2S: Diffusion-based Video-to-Speech Synthesis with Vision-guided Speaker Embedding;2023 IEEE/CVF International Conference on Computer Vision (ICCV);2023-10-01