Funder
National Research Foundation of Korea
Ministry of Science, ICT and Future Planning
Reference58 articles.
1. GPT-4 technical report;Achiam,2024
2. Flamingo: a visual language model for few-shot learning;Alayrac,2022
3. Invariant risk minimization;Arjovsky,2020
4. Domain generalization by mutual-information regularization with pre-trained models;Cha,2022
5. Chen, C.-F. R., Fan, Q., & Panda, R. (2021). CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification. In Proceedings of the IEEE/CVF international conference on computer vision (pp. 357–366).