1. ArchiveTeam. 2018. Twitter Streaming Traces. https://archive.org/details/archiveteam-twitter-stream-2018-04.
2. AWS. 2019. Deliver high performance ML inference with AWS Inferentia. https://d1.awsstatic.com/events/reinvent/2019/REPEAT_1_Deliver_high_performance_ML_inference_with_AWS_Inferentia_CMP324-R1.pdf.
3. Optimal Control Policies for an M/M/1 Queue with a Removable Server and Dynamic Service Rates.
4. Benchmark Analysis of Representative Deep Neural Network Architectures
5. Marshall Choy. 2021. Accelerating the Modern Machine Learning Workhorse: Recommendation Inference. https://sambanova.ai/blog/accelerating-the-modern-ml-workhorse-recommendation-inference/.