1. The Evolved Transformer;so,2019
2. Reformer: The Efficient Transformer;kitaev,2020
3. Cross-lingual Language Model Pretraining;lample,2019
4. FlauBERT: Unsupervised Language Model Pre-training for French;le,2020
5. SOTA (state-of-the-art) machine learning for JAX, PyTorch and TensorFlow from the site,0