1. Abadi M, Barham P, Chen J, Chen Z, Davis A, Dean J, Devin M, Ghemawat S, Irving G, Isard M, Kudlur M, Levenberg J, Monga R, Moore S, Murray D, Steiner B, Tucker P, Vasudevan V, Warden P, Wicke M, Yu Y, Zheng X (2016) Tensorflow: a system for large-scale machine learning. In: 12th USENIX Symposium on Operating Systems Design and Implementation
2. Assran M, Loizou N, Ballas N, Rabbat M (2019) Stochastic gradient push for distributed deep learning. In: International Conference on Machine Learning. PMLR, pp 344–353
3. Assran MS, Rabbat MG (2020) Asynchronous gradient push. IEEE Trans Autom Control 66(1):168–183
4. Aybat N, Wang Z, Iyengar G (2015) An asynchronous distributed proximal gradient method for composite convex optimization. In: International Conference on Machine Learning. PMLR, pp 2454–2462
5. Bastianello N, Carli R, Schenato L, Todescato M (2020) Asynchronous distributed optimization over lossy networks via relaxed ADMM: stability and linear convergence. IEEE Trans Autom Control 66(6):2620–2635