1. Intrinsic dimensionality explains the effectiveness of language model fine-tuning;Aghajanyan,2021
2. Layer normalization;Ba,2016
3. The second PASCAL recognising textual entailment challenge;Bar-Haim,2006
4. Bentivogli, L., Clark, P., Dagan, I., & Giampiccolo, D. (2009). The Fifth PASCAL Recognizing Textual Entailment Challenge. In TAC.
5. Language models are few-shot learners;Brown,2020