1. OpenAI GPT: Improving language understanding by generative pre-training;Radford;OpenAI technical report,2018
2. GPT-3: Language models are few-shot learners;Brown;NeurIPS,2020
3. BERT: Pre-training of deep bidirectional transformers for language understanding;Devlin;NAACL-HLT,2019
4. Emergent Abilities of Large Language Models;Wei;arXiv preprint,2022