1. Carlos Carvalho . The gap between processor and memory speeds . In Proc. of IEEE International Conference on Control and Automation , 2002 . Carlos Carvalho. The gap between processor and memory speeds. In Proc. of IEEE International Conference on Control and Automation, 2002.
2. Zhe Jia , Marco Maggioni , Benjamin Staiger , and Daniele P Scarpazza . Dissecting the nvidia volta gpu architecture via microbenchmarking. arXiv preprint arXiv:1804.06826 , 2018 . Zhe Jia, Marco Maggioni, Benjamin Staiger, and Daniele P Scarpazza. Dissecting the nvidia volta gpu architecture via microbenchmarking. arXiv preprint arXiv:1804.06826, 2018.
3. In-Datacenter Performance Analysis of a Tensor Processing Unit
4. Low-Cost Epoch-Based Correlation Prefetching for Commercial Applications
5. Linearizing irregular memory accesses for improved correlated prefetching