1. Stochastic reformulations of linear systems: algorithms and convergence theory;Richtárik;SIAM J. Matrix Anal. Appl.,2020
2. GPipe: efficient training of giant neural networks using pipeline parallelism;Huang;Adv. Neural Inf. Process. Syst.,2019
3. Pegasos: primal estimated sub-gradient solver for SVM;Shalev-Shwartz,2007
4. Distributed asynchronous online learning for natural language processing;Gimpel,2010
5. Optimal distributed online prediction using mini-batches;Dekel;J. Mach. Learn. Res.,2012