1. Alcantara KD, Calandria JP, Calupas JS, Echas JPR, Sagum RA (2014) Storvi (story visualization): a text-to-image conversion. Int J Future Comput Commun 3(5):363
2. Sauer A, Karras T, Laine S, Geiger A, Aila T (2023) StyleGAN-T: unlocking the power of GANS for fast large-scale text-to-image synthesis. arXiv preprint arXiv:2301.09515
3. Gilboa G, Sochen N, Zeevi YY (2002) Forward-and-backward diffusion processes for adaptive image enhancement and denoising. IEEE Trans Image Process 11(7):689–703. https://doi.org/10.1109/TIP.2002.800883
4. Ramesh A, Dhariwal P, Nichol A, Chu C, Chen M (2022) Hierarchical text-conditional image generation with clip latents. arXiv preprint arXiv:2204.06125
5. Li W, Xu X, Xiao X, Liu J, Yang H, Li G, Wang Z, Feng Z, She Q, Lyu Y et al (2022) Upainting: unified text-to-image diffusion generation with cross-modal guidance. arXiv preprint arXiv:2210.16031