1. Gpt-4 is coming soon. here’s what we know about it;romero,2022
2. Distilbert, a distilled version of bert: smaller, faster, cheaper and lighter;sanh,2019
3. Exploring the limits of transfer learning with a unified text-to-text transformer;raffel,2019
4. Galactica: A large language model for science;taylor,2022
5. Bert: Pre-training of deep bidirectional transformers for language understanding;devlin,2018