Specialist Diffusion: Plug-and-Play Sample-Efficient Fine-Tuning of Text-to-Image Diffusion Models to Learn Any Unseen Style-Reference-Cited by-同舟云学术

Specialist Diffusion: Plug-and-Play Sample-Efficient Fine-Tuning of Text-to-Image Diffusion Models to Learn Any Unseen Style

Published:2023-06 Issue: Volume: Page:
ISSN:
Container-title:2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
language:
Short-container-title:

Author:

Lu Haoming¹,Tunanyan Hazarapet¹,Wang Kai²,Navasardyan Shant¹,Wang Zhangyang¹,Shi Humphrey¹

Affiliation:

1. Picsart AI Research (PAIR)

2. U of Oregon

Publisher

IEEE

Link

Reference36 articles.

1. Few-shot image generation with elastic weight consolidation;li;ArXiv Preprint,2020

2. On leveraging pretrained gans for generation with limited data;zhao;International Conference on Machine Learning,0

3. Blip: Bootstrapping language-image pre-training for unified vision-language understanding and generation;li;ICML,2022

4. Versatile diffusion: Text, images and variations all in one diffusion model;xu;ArXiv Preprint,2022

Cited by 6 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A Survey of Cross-Modal Visual Content Generation;IEEE Transactions on Circuits and Systems for Video Technology;2024-08

2. Capability-aware Prompt Reformulation Learning for Text-to-Image Generation;Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval;2024-07-10

3. A Survey of Multimodal Controllable Diffusion Models;Journal of Computer Science and Technology;2024-05

5. Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators;2023 IEEE/CVF International Conference on Computer Vision (ICCV);2023-10-01