1. Language models are few-shot learners;Brown;arXiv,2020
2. GPT-4 technical report;Open;arXiv,2023
3. BERT: pre-training of deep bidirectional transformers for language understanding;Devlin;arXiv,2019
4. Exploring the limits of transfer learning with a unified text-to-text transformer;Raffel;arXiv],2023
5. LaMDA: language models for dialog applications;Thoppilan;arXiv,2022