1. Martín Abadi , Paul Barham , Jianmin Chen , Zhifeng Chen , 2016 . Tensorflow: A system for large-scale machine learning. In 12th {USENIX} Symposium on Operating Systems Design and Implementation ({OSDI} 16). 265–283. Martín Abadi, Paul Barham, Jianmin Chen, Zhifeng Chen, 2016. Tensorflow: A system for large-scale machine learning. In 12th {USENIX} Symposium on Operating Systems Design and Implementation ({OSDI} 16). 265–283.
2. Revisiting Asynchronous Linear Solvers
3. Lukas Balles Javier Romero and Philipp Hennig. 2016. Coupling adaptive batch sizes with learning rates. arXiv preprint arXiv:1612.05086(2016). Lukas Balles Javier Romero and Philipp Hennig. 2016. Coupling adaptive batch sizes with learning rates. arXiv preprint arXiv:1612.05086(2016).
4. Optimization Methods for Large-Scale Machine Learning
5. Thorsten Brants , Ashok C Popat , Peng Xu , Franz J Och , and Jeffrey Dean . 2007 . Large language models in machine translation . In Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL). 858–867 . Thorsten Brants, Ashok C Popat, Peng Xu, Franz J Och, and Jeffrey Dean. 2007. Large language models in machine translation. In Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL). 858–867.