1. Communication-Efficient Learning of Deep Networks from Decentralized Data;McMahan
2. Information Theory, Mathematical Optimization, and Their Crossroads in 6G System Design
3. The Convergence of Sparsified Gradient Methods;Alistarh;Advances in Neural Information Processing Systems,2018
4. Sparsified SGD with memory;Stich;Advances in Neural Information Processing Systems,2018
5. QSGD: Communication-Efficient SGD via Gradient Quantization and Encoding;Alistarh;Advances in neural information processing systems,2017