1. Tensorflow: a system for large-scale machine learning;Abadi,2016
2. Sparse communication for distributed gradient descent;Aji,2017
3. Qsgd: communication-efficient sgd via gradient quantization and encoding;Alistarh,2017
4. Stochastic gradient push for distributed deep learning;Assran,2019
5. S-caffe: co-designing mpi runtimes and caffe for scalable deep learning on modern gpu clusters;Awan,2017