1. M. Abadi, P. Barham, J. Chen, Z. Chen, A. Davis, J. Dean, M. Devin, S. Ghemawat, G. Irving, M. Isard, et al. Tensorflow: A system for large-scale machine learning, in: 12th {USENIX} Symposium on Operating Systems Design and Implementation, {OSDI} 16, 2016, pp. 265–283.
2. Sparse communication for distributed gradient descent;Aji,2017
3. QSGD: Communication-efficient SGD via gradient quantization and encoding;Alistarh,2017
4. What does fault tolerant deep learning need from MPI?;Amatya,2017
5. Large-scale machine learning with stochastic gradient descent;Bottou,2010