1. Sparse communication for distributed gradient descent;Aji,2017
2. QSGD: communication-efficient SGD via gradient quantization and encoding;Alistarh;Adv. Neural Inf. Process. Syst.,2017
3. The convergence of sparsified gradient methods;Alistarh;Adv. Neural Inf. Process. Syst.,2018
4. Low overhead instruction latency characterization for nvidia gpgpus;Arafa,2019