1. TT-REC: Tensor train compression for deep learning recommendation model embeddings;yin;arXiv preprint arXiv:2101.10955,2021
2. Nyströmformer: A Nyström-based algorithm for approximating self-attention;xiong;Association for the Advancement of Artificial Intelligence,2021
3. Linformer: Self-attention with linear complexity;wang;arXiv preprint arXiv:2006.04989,2020
4. FALCON: Honest-majority maliciously secure framework for private deep learning;wagh;Proceedings on Privacy Enhancing Technologies,2020
5. Gaussian error linear units (GELUs);hendrycks,2020