1. Exploring the limits of transfer learning with a unified text-to-text transformer;raffel;The Journal of Machine Learning Research,2020
2. Attention is all you need;vaswani;Advances in neural information processing systems,2017
3. Large language models and the perils of their hallucinations
4. Minds, brains, and programs
5. Bert: Pre-training of deep bidirectional transformers for language understanding;devlin,2018