1. Flamingo: A visual language model for few-shot learning;Alayrac,2022
2. YOLOv5 by ultralytics;Anon,2020
3. Claude 3 family;Anthropic,2024
4. YOLOv4: Optimal speed and accuracy of object detection;Bochkovskiy,2020
5. How far are we to GPT-4V? closing the gap to commercial multimodal models with open-source suites;Chen,2024