1. In-datacenter performance analysis of a tensor processing unit;Jouppi,2017
2. A domain-specific supercomputer for training deep neural networks;Jouppi;Commun. ACM,2020
3. A configurable cloud-scale DNN processor for real-time AI;Fowers,2018
4. Y. Wu, M. Schuster, Z. Chen, Q.V. Le, M. Norouzi, W. Macherey, M. Krikun, Y. Cao, Q. Gao, K. Macherey, et al., Google’S neural machine translation system: bridging the gap between human and machine translation, arXiv preprint arXiv:1609.08144(2016).
5. Acceleration of deep recurrent neural networks with an FPGA cluster;Sun,2019