1. Attention is all you need;vaswani;Advances in neural information processing systems,2017
2. Polygen: An autoregressive generative model of 3D meshes;nash;International Conference on Machine Learning,2020
3. Identity mappings in deep residual networks;he;European Conference on Computer Vision,2016
4. Thinking like transformers;weiss;International Conference on Machine Learning,2021
5. The curious case of neural text degeneration;holtzman;International Conference on Learning Representations,2019