1. Large Scale Distributed Deep Networks; Dean, 2012
2. Optimal Distributed Online Prediction Using Mini-batches; Dekel; J. Mach. Learn. Res., 2012
3. Parallelized Stochastic Gradient Descent; Zinkevich, 2010
4. Scaling Distributed Machine Learning with the Parameter Server; Li, 2014