1. Language models are few-shot learners;T Brown;Advances in neural information processing systems,2020
2. Exploring the limits of transfer learning with a unified text-to-text transformer;C Raffel;The Journal of Machine Learning Research,2020
3. Attention is all you need;A Vaswani;Advances in neural information processing systems,2017