1. Text2shape: Generating shapes from natural language by learning joint embeddings;Chen,2018
2. Fashion meets computer vision: A survey;Cheng;ACM Comput. Surv.,2021
3. Diffusion models beat GANs on image synthesis;Dhariwal;Adv. Neural Inf. Process. Syst.,2021
4. Cogview: Mastering text-to-image generation via transformers;Ding;Adv. Neural Inf. Process. Syst.,2021
5. Cogview2: Faster and better text-to-image generation via hierarchical transformers;Ding,2022