1. Learning transferable visual models from natural language supervision;Radford
2. On the opportunities and risks of foundation models;Bommasani,2021
3. Segment Anything
4. Dinov2: Learning robust visual features without supervision;Oquab,2023
5. Prototypical networks for few-shot learning;Snell