StyleGAN-NADA

Published:2022-07 Issue:4 Volume:41 Page:1-13
ISSN:0730-0301
Container-title:ACM Transactions on Graphics
language:en
Short-container-title:ACM Trans. Graph.

Author:

Gal Rinon¹, Patashnik Or², Maron Haggai³, Bermano Amit H.², Chechik Gal³, Cohen-Or Daniel²

Affiliation:

1. Tel Aviv University, NVIDIA, Israel

2. Tel Aviv University, Israel

3. NVIDIA, Israel

Abstract

Can a generative model be trained to produce images from a specific domain, guided only by a text prompt, without seeing any image? In other words: can an image generator be trained "blindly"? Leveraging the semantic power of large-scale Contrastive Language-Image Pre-training (CLIP) models, we present a text-driven method that allows shifting a generative model to new domains, without having to collect even a single image. We show that through natural language prompts and a few minutes of training, our method can adapt a generator across a multitude of domains characterized by diverse styles and shapes. Notably, many of these modifications would be difficult or infeasible to reach with existing methods. We conduct an extensive set of experiments across a wide range of domains. These demonstrate the effectiveness of our approach, and show that our models preserve the latent-space structure that makes generative models appealing for downstream tasks. Code and videos are available at: stylegan-nada.github.io/

Funder

Israel Science Foundation

BSF

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Graphics and Computer-Aided Design

Link

https://dl.acm.org/doi/pdf/10.1145/3528223.3530164

References: 68 articles.

1. Partitioning Around Medoids (Program PAM)

2. Image2StyleGAN: How to Embed Images Into the StyleGAN Latent Space?

3. StyleFlow: Attribute-conditioned Exploration of StyleGAN-Generated Images using Conditional Continuous Normalizing Flows

4. Yuval Alaluf, Or Patashnik, and Daniel Cohen-Or. 2021a. Only a Matter of Style: Age Transformation Using a Style-Based Regression Model. arXiv:2102.02754 [cs.CV]

5. Yuval Alaluf, Or Patashnik, and Daniel Cohen-Or. 2021b. ReStyle: A Residual-Based StyleGAN Encoder via Iterative Refinement. arXiv preprint arXiv:2104.02699 (2021).

Cited by 106 articles.

1. Conditional reiterative High-Fidelity GAN inversion for image editing;Pattern Recognition;2024-03

2. Dual-path hypernetworks of style and text for one-shot domain adaptation;Applied Intelligence;2024-02-06

3. SPGAN: Siamese projection Generative Adversarial Networks;Knowledge-Based Systems;2024-02

4. TextStyler: A CLIP-based Approach to text-guided style transfer;Computers & Graphics;2024-02

5. Automated data processing and feature engineering for deep learning and big data applications: a survey;Journal of Information and Intelligence;2024-01