Talking Head from Speech Audio using a Pre-trained Image Generator-Reference-Cited by-同舟云学术

Talking Head from Speech Audio using a Pre-trained Image Generator

Published:2022-10-10 Issue: Volume: Page:
ISSN:
Container-title:Proceedings of the 30th ACM International Conference on Multimedia
language:
Short-container-title:

Author:

Alghamdi Mohammed M.¹,Wang He²,Bulpitt Andrew J.²,Hogg David C.²

Affiliation:

1. University of Leeds & Taif University, Leeds, United Kingdom

2. University of Leeds, Leeds, United Kingdom

Funder

Taif University

Publisher

ACM

Link

https://dl.acm.org/doi/pdf/10.1145/3503161.3548101

Reference41 articles.

1. Image2StyleGAN: How to Embed Images Into the StyleGAN Latent Space?

2. Lele Chen , Zhiheng Li , Ross K. Maddox , Zhiyao Duan , and Chenliang Xu. 2018. Lip Movements Generation at a Glance. CoRR abs/1803.10404 ( 2018 ). arXiv:1803.10404 http://arxiv.org/abs/1803.10404 Lele Chen, Zhiheng Li, Ross K. Maddox, Zhiyao Duan, and Chenliang Xu. 2018. Lip Movements Generation at a Glance. CoRR abs/1803.10404 (2018). arXiv:1803.10404 http://arxiv.org/abs/1803.10404

3. Hierarchical Cross-Modal Talking Face Generation With Dynamic Pixel-Wise Loss

4. Joon Son Chung , Amir Jamaludin , and Andrew Zisserman . 2017 . You said that? . In British Machine Vision Conference. Joon Son Chung, Amir Jamaludin, and Andrew Zisserman. 2017. You said that?. In British Machine Vision Conference.

5. Martin Cooke , Jon Barker , Stuart P. Cunningham , and Xu Shao . 2006. An audiovisual corpus for speech perception and automatic speech recognition. The Journal of the Acoustical Society of America 120 5 Pt 1 ( 2006 ), 2421--4. Martin Cooke, Jon Barker, Stuart P. Cunningham, and Xu Shao. 2006. An audiovisual corpus for speech perception and automatic speech recognition. The Journal of the Acoustical Society of America 120 5 Pt 1 (2006), 2421--4.

Cited by 9 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. 3D facial modeling, animation, and rendering for digital humans: A survey;Neurocomputing;2024-09

2. RADIO: Reference-Agnostic Dubbing Video Synthesis;2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV);2024-01-03

3. Modular Joint Training for Speech-Driven 3D Facial Animation;Communications in Computer and Information Science;2024

4. Application of a 3D Talking Head as Part of Telecommunication AR, VR, MR System: Systematic Review;Electronics;2023-11-26

5. Speech-Driven 3D Face Animation with Composite and Regional Facial Movements;Proceedings of the 31st ACM International Conference on Multimedia;2023-10-26