1. Esmaeilpour, M., Gharibvand, F., Shiri, M.E.: Text-to-image synthesis: a comprehensive survey. IEEE Access 9, 28627–28651 (2021)
2. Zhang, X., Zhu, J.Y., Zhang, H., Huang, X., Metaxas, D.N.: StackGAN++: realistic image synthesis with stacked generative adversarial networks. IEEE Trans. Pattern Anal. Mach. Intell. (2019)
3. Zhang, H., et al.: StackGAN: text to photo-realistic image synthesis with stacked generative adversarial networks. In: Proceedings of the IEEE International Conference on Computer Vision (ICCV) (2017)
4. Cai, B., Xu, X., Zhang, K., Zhang, Y., Wang, G.: Stacked attention GAN for text-to-image synthesis with fine-grained expression manipulation. Neurocomputing 459, 94–104 (2021)
5. Chen, Y., Li, X., Zhang, S., Tang, X.: Text to image generation with semantic-spatial aware GAN. arXiv preprint arXiv:2104.00567 (2021)