1. Accelerating recommendation system training by leveraging popular choices
2. Dan Alistarh , Demjan Grubic , Jerry Z Li , Ryota Tomioka , and Milan Vojnovic . 2017 . QSGD: communication-efficient SGD via gradient quantization and encoding . In Proceedings of the 31st International Conference on Neural Information Processing Systems. 1707--1718 . Dan Alistarh, Demjan Grubic, Jerry Z Li, Ryota Tomioka, and Milan Vojnovic. 2017. QSGD: communication-efficient SGD via gradient quantization and encoding. In Proceedings of the 31st International Conference on Neural Information Processing Systems. 1707--1718.
3. TensorOpt: Exploring the Tradeoffs in Distributed DNN Training With Auto-Parallelism
4. Jianmin Chen , Rajat Monga , Samy Bengio , and Rafal Jó zefowicz. 2016. Revisiting Distributed Synchronous SGD. CoRR , Vol. abs/ 1604 .00981 ( 2016 ). Jianmin Chen, Rajat Monga, Samy Bengio, and Rafal Jó zefowicz. 2016. Revisiting Distributed Synchronous SGD. CoRR, Vol. abs/1604.00981 (2016).
5. Wenqiang Chen , Lizhang Zhan , Yuanlong Ci , and Chen Lin . 2019 . FLEN: Leveraging Field for Scalable CTR Prediction. CoRR , Vol. abs/ 1911 .04690 (2019). Wenqiang Chen, Lizhang Zhan, Yuanlong Ci, and Chen Lin. 2019. FLEN: Leveraging Field for Scalable CTR Prediction. CoRR, Vol. abs/1911.04690 (2019).