1. J. Wells, B. Bland et al., “Announcing Supercomputer Summit,” Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States), Tech. Rep., Jun. 2016.
2. N. P. Jouppi, C. Young et al., “In-Datacenter Performance Analysis of a Tensor Processing Unit,” in Proceedings of the 44th Annual International Symposium on Computer Architecture, ser. ISCA ’17. Toronto, ON, Canada: Association for Computing Machinery, Jun. 2017, pp. 1–12.
3. A. Paszke, S. Gross et al., “PyTorch: An Imperative Style, High-Performance Deep Learning Library,” in Advances in Neural Information Processing Systems 32, H. Wallach, H. Larochelle et al., Eds. Curran Associates, Inc., 2019, pp. 8026–8037.
4. D. Kirk, “NVIDIA CUDA Software and GPU Parallel Computing Architecture,” in Proceedings of the 6th International Symposium on Memory Management, ser. ISMM ’07. Montreal, Quebec, Canada: Association for Computing Machinery, Oct. 2007, pp. 103–104.