1. Reza Yazdani Aminabadi, Samyam Rajbhandari, Ammar Ahmad Awan, Cheng Li, Du Li, Elton Zheng, Olatunji Ruwase, Shaden Smith, Minjia Zhang, Jeff Rasley, and Yuxiong He. 2022. DeepSpeed-Inference: Enabling Efficient Inference of Transformer Models at Unprecedented Scale. In Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis (Dallas, Texas) (SC '22). IEEE Press, Article 46, 15 pages.
2. Léon Bottou. 1991. Stochastic Gradient Learning in Neural Networks. In Proceedings of Neuro-Nîmes 91. EC2, Nimes, France. http://leon.bottou.org/papers/bottou-91c