1. Antony, V. N., & Huang, C. M. (2023). ID.8: Co-Creating visual stories with Generative AI. arXiv preprint arXiv:2309.14228.
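2. Baltrušaitis, T., Ahuja, C., & Morency, L.-P. (2019). Multimodal machine learning: A survey and taxonomy. IEEE Transactions on Pattern Analysis and Machine Intelligence, 41(2), 423-443.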
3. Bell, G., Burgess, J., Thomas, J., & Shadiq, S. (2023). Rapid Response Information Report: Generative AI-language models (LLMs) and multimodal foundation models (MFMs). Academic Press.
4. Bensaid, E., Martino, M., Hoover, B., & Strobelt, H. (2021). FairyTailor: A multimodal generative framework for storytelling. arXiv preprint arXiv:2108.04324.
5. Bewersdorff, A., Hartmann, C., Hornberger, M., Seßler, K., Bannert, M., Kasneci, E., . . . Nerdel, C. (2024). Taking the next step with Generative Artificial Intelligence: The transformative role of multimodal Large Language Models in science education. arXiv preprint arXiv:2401.00832.