Manitalk: manipulable talking head generation from single image in the wild

Published: 2024-06-08 Issue: 7 Volume: 40 Page: 4913-4925
ISSN: 0178-2789
Container-title: The Visual Computer
Language: en
Short-container-title: Vis Comput

Author

Fang Hui, Weng Dongdong, Tian Zeyu, Ma Yin

Funder

National Key Research and Development Program of China

National Natural Science Foundation of China

2022 major science and technology project "Yuelu·Multimodal Graph-Text-Sound-Semantic Gesture Big Model Research and Demonstration Application" in Changsha

Publisher

Springer Science and Business Media LLC

Link

https://link.springer.com/content/pdf/10.1007/s00371-024-03490-4.pdf

References (48 articles)

1. Baevski, A., Zhou, Y., Mohamed, A., Auli, M.: wav2vec 2.0: a framework for self-supervised learning of speech representations. Adv. Neural Inf. Process. Syst. 33, 12449–12460 (2020)

2. Baltrusaitis, T., Zadeh, A., Lim, Y.C., Morency, L.P.: Openface 2.0: facial behavior analysis toolkit. In: 2018 13th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2018), pp. 59–66. IEEE (2018)

3. Bookstein, F.L.: Principal warps: thin-plate splines and the decomposition of deformations. IEEE Trans. Pattern Anal. Mach. Intell. 11(6), 567–585 (1989)

4. Chatziagapi, A., Athar, S., Jain, A., Rohith, M., Bhat, V., Samaras, D.: Lipnerf: what is the right feature space to lip-sync a nerf? In: 2023 IEEE 17th International Conference on Automatic Face and Gesture Recognition (FG), pp. 1–8. IEEE (2023)

5. Cheng, K., Cun, X., Zhang, Y., Xia, M., Yin, F., Zhu, M., Wang, X., Wang, J., Wang, N.: Videoretalking: audio-based lip synchronization for talking head video editing in the wild. In: SIGGRAPH Asia 2022 Conference Papers (2022)