1. José E. Moreira Kit Barton Steven Battle Peter Bergner Ramon Bertran Puneeth Bhat Pedro Caldeira David Edelsohn Gordon C. Fossum Brad Frey Nemanja Ivanovic Chip Kerchner Vincent Lim Shakti Kapoor Tulio Machado Filho Silvia Melitta Mueller Brett Olsson Satish Sadasivam Baptiste Saleil Bill Schmidt Rajalakshmi Srinivasaraghavan Shricharan Srivatsan Brian W. Thompto Andreas Wagner and Nelson Wu. 2021. A matrix math facility for Power ISA(TM) processors. CoRR abs/2104.03142(2021). arXiv:2104.03142 https://arxiv.org/abs/2104.03142 José E. Moreira Kit Barton Steven Battle Peter Bergner Ramon Bertran Puneeth Bhat Pedro Caldeira David Edelsohn Gordon C. Fossum Brad Frey Nemanja Ivanovic Chip Kerchner Vincent Lim Shakti Kapoor Tulio Machado Filho Silvia Melitta Mueller Brett Olsson Satish Sadasivam Baptiste Saleil Bill Schmidt Rajalakshmi Srinivasaraghavan Shricharan Srivatsan Brian W. Thompto Andreas Wagner and Nelson Wu. 2021. A matrix math facility for Power ISA(TM) processors. CoRR abs/2104.03142(2021). arXiv:2104.03142 https://arxiv.org/abs/2104.03142
2. Nicholai Tukanov , Rajalakshmi Srinivasaraghavan , José E Moreira , and Tze Meng Low . 2022 . Modeling Matrix Engines for Portability and Performance. In 2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS). IEEE, 1173–1183 . Nicholai Tukanov, Rajalakshmi Srinivasaraghavan, José E Moreira, and Tze Meng Low. 2022. Modeling Matrix Engines for Portability and Performance. In 2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS). IEEE, 1173–1183.
3. Nvidia Corporation . NVIDIA A100 Tensor Core GPU Architecture . Nvidia Corporation . https://images.nvidia.com/aem-dam/en-zz/Solutions/data-center/nvidia-ampere-architecture-whitepaper.pdf Nvidia Corporation. NVIDIA A100 Tensor Core GPU Architecture. Nvidia Corporation. https://images.nvidia.com/aem-dam/en-zz/Solutions/data-center/nvidia-ampere-architecture-whitepaper.pdf
4. Albert Reuther Peter Michaleas Michael Jones Vijay Gadepally Siddharth Samsi and Jeremy Kepner. 2020. Survey of Machine Learning Accelerators. CoRR abs/2009.00993(2020). arXiv:2009.00993 https://arxiv.org/abs/2009.00993 Albert Reuther Peter Michaleas Michael Jones Vijay Gadepally Siddharth Samsi and Jeremy Kepner. 2020. Survey of Machine Learning Accelerators. CoRR abs/2009.00993(2020). arXiv:2009.00993 https://arxiv.org/abs/2009.00993
5. Martín Abadi Ashish Agarwal Paul Barham Eugene Brevdo Zhifeng Chen Craig Citro Greg S. Corrado Andy Davis Jeffrey Dean Matthieu Devin Sanjay Ghemawat Ian Goodfellow Andrew Harp Geoffrey Irving Michael Isard Yangqing Jia Rafal Jozefowicz Lukasz Kaiser Manjunath Kudlur Josh Levenberg Dandelion Mané Rajat Monga Sherry Moore Derek Murray Chris Olah Mike Schuster Jonathon Shlens Benoit Steiner Ilya Sutskever Kunal Talwar Paul Tucker Vincent Vanhoucke Vijay Vasudevan Fernanda Viégas Oriol Vinyals Pete Warden Martin Wattenberg Martin Wicke Yuan Yu and Xiaoqiang Zheng. 2015. TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems. (2015). https://www.tensorflow.org/ Software available from tensorflow.org. Martín Abadi Ashish Agarwal Paul Barham Eugene Brevdo Zhifeng Chen Craig Citro Greg S. Corrado Andy Davis Jeffrey Dean Matthieu Devin Sanjay Ghemawat Ian Goodfellow Andrew Harp Geoffrey Irving Michael Isard Yangqing Jia Rafal Jozefowicz Lukasz Kaiser Manjunath Kudlur Josh Levenberg Dandelion Mané Rajat Monga Sherry Moore Derek Murray Chris Olah Mike Schuster Jonathon Shlens Benoit Steiner Ilya Sutskever Kunal Talwar Paul Tucker Vincent Vanhoucke Vijay Vasudevan Fernanda Viégas Oriol Vinyals Pete Warden Martin Wattenberg Martin Wicke Yuan Yu and Xiaoqiang Zheng. 2015. TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems. (2015). https://www.tensorflow.org/ Software available from tensorflow.org.