1. Abadi, TensorFlow: A System for Large-Scale Machine Learning, 2016.
2. Goyal, Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour, 2017.
3. Y. You, J. Li, S.J. Reddi, J. Hseu, S. Kumar, S. Bhojanapalli, X. Song, J. Demmel, K. Keutzer, C. Hsieh, Large Batch Optimization for Deep Learning: Training BERT in 76 minutes, in: 8th International Conference on Learning Representations, 2020.
4. Li, Communication Efficient Distributed Machine Learning with the Parameter Server, 2014.
5. Alqahtani, Performance Analysis and Comparison of Distributed Machine Learning Systems, 2019.