1. Learning transferable visual models from natural language supervision;Radford,2021
2. Segment anything;Kirillov,2023
3. The dawn of lmms: Preliminary explorations with gpt-4v (ision);Yang,2023
4. Towards generic anomaly detection and understanding: Large-scale visual-linguistic model (GPT-4V) takes the lead;Cao,2023
5. Vision transformers for dense prediction;Ranftl,2021