1. Foundations and recent trends in multimodal machine learning: Principles, challenges, and open questions;Liang,2022
2. A comprehensive survey on segment anything model for vision and beyond;Zhang,2023
3. Bert: Pre-training of deep bidirectional transformers for language understanding;Devlin,2018
4. Exploring the limits of transfer learning with a unified text-to-text transformer;Raffel;J. Mach. Learn. Res.,2020
5. Gpt-4;OpenAI,2023