Talking Head from Speech Audio using a Pre-trained Image Generator

Authors:

Alghamdi Mohammed M. (1), Wang He (2), Bulpitt Andrew J. (2), Hogg David C. (2)

Affiliation:

1. University of Leeds & Taif University, Leeds, United Kingdom

2. University of Leeds, Leeds, United Kingdom

Funder

Taif University

Publisher

ACM

References (41 articles):

1. Image2StyleGAN: How to Embed Images Into the StyleGAN Latent Space?

2. Lele Chen, Zhiheng Li, Ross K. Maddox, Zhiyao Duan, and Chenliang Xu. 2018. Lip Movements Generation at a Glance. CoRR abs/1803.10404 (2018). arXiv:1803.10404 http://arxiv.org/abs/1803.10404

3. Hierarchical Cross-Modal Talking Face Generation With Dynamic Pixel-Wise Loss

4. Joon Son Chung, Amir Jamaludin, and Andrew Zisserman. 2017. You said that?. In British Machine Vision Conference.

5. Martin Cooke, Jon Barker, Stuart P. Cunningham, and Xu Shao. 2006. An audiovisual corpus for speech perception and automatic speech recognition. The Journal of the Acoustical Society of America 120 5 Pt 1 (2006), 2421--4.

Cited by 6 articles.

1. Modular Joint Training for Speech-Driven 3D Facial Animation;Computer Supported Cooperative Work and Social Computing;2024

2. Application of a 3D Talking Head as Part of Telecommunication AR, VR, MR System: Systematic Review;Electronics;2023-11-26

3. DAE-Talker: High Fidelity Speech-Driven Talking Face Generation with Diffusion Autoencoder;Proceedings of the 31st ACM International Conference on Multimedia;2023-10-26

4. Bio-Inspired Audiovisual Multi-Representation Integration via Self-Supervised Learning;Proceedings of the 31st ACM International Conference on Multimedia;2023-10-26

5. MFR-Net: Multi-faceted Responsive Listening Head Generation via Denoising Diffusion Model;Proceedings of the 31st ACM International Conference on Multimedia;2023-10-26
