1. nocaps: novel object captioning at scale
2. A Closer Look at Memorization in Deep Networks; Arpit
3. Improving language models by retrieving from trillions of tokens; Borgeaud
4. Language models are few-shot learners; Brown
5. COYO-700M: Image-Text Pair Dataset; Byeon, 2022