1. VQA: Visual Question Answering
2. Fan Bao , Shen Nie , Kaiwen Xue , Chongxuan Li , Shiliang Pu , Yaole Wang , Gang Yue , Yue Cao , Hang Su , and Jun Zhu . 2023. One Transformer Fits All Distributions in Multi-Modal Diffusion at Scale. ArXiv abs/2303.06555 ( 2023 ). Fan Bao, Shen Nie, Kaiwen Xue, Chongxuan Li, Shiliang Pu, Yaole Wang, Gang Yue, Yue Cao, Hang Su, and Jun Zhu. 2023. One Transformer Fits All Distributions in Multi-Modal Diffusion at Scale. ArXiv abs/2303.06555 (2023).
3. Omer Bar-Tal , Lior Yariv , Yaron Lipman , and Tali Dekel . 2023. MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation. arXiv preprint arXiv:2302.08113 2 ( 2023 ). Omer Bar-Tal, Lior Yariv, Yaron Lipman, and Tali Dekel. 2023. MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation. arXiv preprint arXiv:2302.08113 2 (2023).
4. Dmitry Baranchuk , Ivan Rubachev , Andrey Voynov , Valentin Khrulkov , and Artem Babenko . 2021. Label-Efficient Semantic Segmentation with Diffusion Models. ArXiv abs/2112.03126 ( 2021 ). Dmitry Baranchuk, Ivan Rubachev, Andrey Voynov, Valentin Khrulkov, and Artem Babenko. 2021. Label-Efficient Semantic Segmentation with Diffusion Models. ArXiv abs/2112.03126 (2021).
5. Georgios Batzolis Jan Stanczuk Carola-Bibiane Schonlieb and Christian Etmann. 2021. Conditional Image Generation with Score-Based Diffusion Models. Georgios Batzolis Jan Stanczuk Carola-Bibiane Schonlieb and Christian Etmann. 2021. Conditional Image Generation with Score-Based Diffusion Models.