1. Google Research Blog. 2016. Google supercharges machine learning tasks with TPU custom chip.
2. NVIDIA Blog. 2017. NVIDIA Tensor Cores. Retrieved 1 January 2020 from https://www.nvidia.com/en-us/data-center/tensorcore/
3. Martin Abadi, Paul Barham, Jianmin Chen, Zhifeng Chen, Andy Davis, Jeffrey Dean, Matthieu Devin, Sanjay Ghemawat, Geoffrey Irving, Michael Isard, Manjunath Kudlur, Josh Levenberg, Rajat Monga, Sherry Moore, Derek G. Murray, Benoit Steiner, Paul Tucker, Vijay Vasudevan, Pete Warden, Martin Wicke, Yuan Yu, and Xiaoqiang Zheng. 2016. TensorFlow: A system for large-scale machine learning. In Proceedings of the OSDI. 265–283.
4. 9.1 A 7nm 4-Core AI Chip with 25.6TFLOPS Hybrid FP8 Training, 102.4TOPS INT4 Inference and Workload-Aware Throttling
5. Byung Hoon Ahn, Jinwon Lee, Jamie Menjay Lin, Hsin-Pai Cheng, Jilei Hou, and Hadi Esmaeilzadeh. 2020. Ordering chaos: Memory-aware scheduling of irregularly wired neural networks for edge devices. In Proceedings of Machine Learning and Systems 2 (2020), 44–57.