1. OpenAI. AI and Compute. https://openai.com/blog/ai-and-compute/. ([n. d.]). OpenAI. AI and Compute. https://openai.com/blog/ai-and-compute/. ([n. d.]).
2. Kunle Olukotun. 2020. Accelerating Software 2.0. ScaledML (2020). Kunle Olukotun. 2020. Accelerating Software 2.0. ScaledML (2020).
3. Zhihao Jia Matei Zaharia and Alex Aiken. 2018. Beyond data and model parallelism for deep neural networks. arXiv preprint arXiv:1807.05358(2018). Zhihao Jia Matei Zaharia and Alex Aiken. 2018. Beyond data and model parallelism for deep neural networks. arXiv preprint arXiv:1807.05358(2018).
4. Amazon AWS Inferentia . (accessed Sep 10, 2021). Achieve 12x higher throughput and lowest latency for PyTorch Natural Language Processing applications out-of-the-box on AWS Inferentia. https://tinyurl.com/3mbuetmr. ((accessed Sep 10, 2021 )). Amazon AWS Inferentia. (accessed Sep 10, 2021). Achieve 12x higher throughput and lowest latency for PyTorch Natural Language Processing applications out-of-the-box on AWS Inferentia. https://tinyurl.com/3mbuetmr. ((accessed Sep 10, 2021)).
5. Timeloop: A Systematic Approach to DNN Accelerator Evaluation