1. Samira Abnar and Willem Zuidema. 2020. Quantifying Attention Flow in Transformers. ArXiv abs/2005.00928 (2020).
2. Specifying Object Attributes and Relations in Interactive Scene Generation
3. Omri Avrahami, Ohad Fried, and Dani Lischinski. 2022a. Blended Latent Diffusion. arXiv preprint arXiv:2206.02779 (2022).
4. Omri Avrahami, Thomas Hayes, Oran Gafni, Sonal Gupta, Yaniv Taigman, Devi Parikh, Dani Lischinski, Ohad Fried, and Xi Yin. 2022b. SpaText: Spatio-Textual Representation for Controllable Image Generation. arXiv preprint arXiv:2211.14305 (2022).
5. Yogesh Balaji, Seungjun Nah, Xun Huang, Arash Vahdat, Jiaming Song, Karsten Kreis, Miika Aittala, Timo Aila, Samuli Laine, Bryan Catanzaro, Tero Karras, and Ming-Yu Liu. 2022. eDiff-I: Text-to-Image Diffusion Models with an Ensemble of Expert Denoisers. ArXiv abs/2211.01324 (2022).