1. 2023. MULTI-PROCESS SERVICE. pdf. https://docs.nvidia.com/deploy/pdf/CUDA_Multi_Process_Service_Overview.pdf.
2. 2023. NVIDIA Multi-Instance GPU User Guide. pdf. https://docs.nvidia.com/datacenter/tesla/pdf/NVIDIA_MIG_User_Guide.pdf.
3. 2023. Virtual GPU Software User Guide. pdf. https://docs.nvidia.com/grid/latest/pdf/grid-vgpu-user-guide.pdf.
4. Arnold O Allen. 1990. Probability, statistics, and queueing theory. Gulf Professional Publishing.
5. Martin F Arlitt and Carey L Williamson. 1997. Internet web servers: Workload characterization and performance implications. IEEE/ACM Transactions on networking 5, 5 (1997), 631–645.