Author:
Hattori Shun,Takahara Madoka
Publisher
Springer Nature Switzerland
Reference29 articles.
1. Saharia, C., et al.: Photorealistic text-to-image diffusion models with deep language understanding. arXiv:2205.11487 (2022)
2. Yu, J., et al.: Scaling autoregressive models for content-rich text-to-image generation. arXiv:2206.10789 (2022)
3. Li, R., Li, W., Yang, Y., Wei, H., Jiang, J., Bai, Q.: Swinv2-imagen: hierarchical vision transformer diffusion models for text-to-image generation. arXiv:2210.09549 (2022)
4. Balaji, Y., et al.: eDiff-I: text-to-image diffusion models with an ensemble of expert denoisers. arXiv:2211.01324 (2022)
5. Feng, Z., et al.: ERNIE-ViLG 2.0: improving text-to-image diffusion model with knowledge-enhanced mixture-of-denoising-experts. arXiv:2210.15257 (2022)
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献