1. TensorFlow: large-scale machine learning on heterogeneous distributed systems;Abadi,2016
2. Deep learning-based job placement in distributed machine learning clusters;Bao,2019
3. Balancing stragglers against staleness in distributed deep learning;Basu,2018
4. Borg, Omega, and Kubernetes;Burns;Commun. ACM,2016
5. Semi-dynamic load balancing: efficient distributed learning in non-dedicated environments;Chen,2020