1. Better fine-tuning by reducing representational collapse;Aghajanyan
2. A closer look at memorization in deep networks;Arpit
3. Obfuscated gradients give a false sense of security: Circumventing defenses to adversarial examples;Athalye
4. Language models are few-shot learners;Brown;Advances in Neural Information Processing Systems,2020