1. DeepSpeed Inference: Enabling Efficient Inference of Transformer Models at Unprecedented Scale;Aminabadi
2. Semantic Parsing on Freebase from Question-Answer Pairs;Berant
3. Language Models are Few-Shot Learners;Brown
4. TA-MoE: Topology-Aware Large Scale Mixture-of-Expert Training;Chen
5. PaLM: Scaling Language Modeling with Pathways;Chowdhery;in arxiv.org,2022