1. Model soups: Averaging weights of multiple fine-tuned models improves accuracy without increasing inference time;Wortsman,2022
2. Scaling vision transformers;Zhai,2022
3. DINO: DETR with improved denoising anchor boxes for end-to-end object detection;Zhang,2022
4. Camembert: A tasty french language model;Martin,2019
5. FlauBERT: Unsupervised language model pre-training for French;Le,2019