SliderGAN: Synthesizing Expressive Face Images by Sliding 3D Blendshape Parameters-Reference-Cited by-同舟云学术

SliderGAN: Synthesizing Expressive Face Images by Sliding 3D Blendshape Parameters

Published:2020-06-11 Issue:10-11 Volume:128 Page:2629-2650
ISSN:0920-5691
Container-title:International Journal of Computer Vision
language:en
Short-container-title:Int J Comput Vis

Author:

Ververas Evangelos^ORCID,Zafeiriou Stefanos

Abstract

AbstractImage-to-image (i2i) translation is the dense regression problem of learning how to transform an input image into an output using aligned image pairs. Remarkable progress has been made in i2i translation with the advent of deep convolutional neural networks and particular using the learning paradigm of generative adversarial networks (GANs). In the absence of paired images, i2i translation is tackled with one or multiple domain transformations (i.e., CycleGAN, StarGAN etc.). In this paper, we study the problem of image-to-image translation, under a set of continuous parameters that correspond to a model describing a physical process. In particular, we propose the SliderGAN which transforms an input face image into a new one according to the continuous values of a statistical blendshape model of facial motion. We show that it is possible to edit a facial image according to expression and speech blendshapes, using sliders that control the continuous values of the blendshape model. This provides much more flexibility in various tasks, including but not limited to face editing, expression transfer and face neutralisation, comparing to models based on discrete expressions or action units.

Funder

Imperial College London

Publisher

Springer Science and Business Media LLC

Subject

Artificial Intelligence,Computer Vision and Pattern Recognition,Software

Link

https://link.springer.com/content/pdf/10.1007/s11263-020-01338-7.pdf

Reference42 articles.

1. Alami Mejjati, Y., Richardt, C., Tompkin, J., Cosker, D., & Kim, K.I. (2018). Unsupervised attention-guided image-to-image translation (pp. 3693–3703).

2. Amos, B., Ludwiczuk, B., & Satyanarayanan, M. (2016). Openface: A general-purpose face recognition library with mobile applications. Technical report CMU-CS-16-118, CMU School of Computer Science.

3. Arjovsky, M., Chintala, S., Bottou, L. (2017) Wasserstein generative adversarial networks. In Proceedings of the 34th international conference on machine learning, ICML 2017, Sydney, NSW, Australia, 6–11 August 2017, pp. 214–223.

4. Bach, F., Jenatton, R., Mairal, J., & Obozinski, G. (2012). Optimization with sparsity-inducing penalties. Foundations and Trends in Machine Learning, 4(1), 1–106.

5. Benitez-Quiroz, C. F., Srinivasan, R., & Martinez, A. M. (2016). Emotionet: An accurate, real-time algorithm for the automatic annotation of a million facial expressions in the wild. In 2016 IEEE conference on computer vision and pattern recognition (CVPR) (pp. 5562–5570).

Cited by 13 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Attention-Based Image-to-Video Translation for Synthesizing Facial Expression Using GAN;Journal of Electrical and Computer Engineering;2023-11-14

2. Human Latent Metrics: Perceptual and Cognitive Response Correlates to Distance in GAN Latent Space for Facial Images;ACM Symposium on Applied Perception 2022;2022-09-22

3. DOTMUG: A Threat Model for Target Specific APT Attacks–Misusing Google Teachable Machine;2022 10th International Symposium on Digital Forensics and Security (ISDFS);2022-06-06

4. Semantic consistency generative adversarial network for cross-modality domain adaptation in ultrasound thyroid nodule classification;Applied Intelligence;2022-01-13

5. 3D-FM GAN: Towards 3D-Controllable Face Manipulation;Lecture Notes in Computer Science;2022