1. 2018. NVIDIA TensorRT: Programmable Inference Accelerator. https://developer.nvidia.com/tensorrt.
2. AWS [n.d.]. AWS Neuron. https://github.com/aws/aws-neuron-sdk.
3. AWS 2018. AWS Inferentia. https://aws.amazon.com/machine-learning/inferentia/.
4. AWS 2019. Deliver high performance ML inference with AWS Inferentia. https://d1.awsstatic.com/events/reinvent/2019/REPEAT_1_Deliver_high_performance_ML_inference_with_AWS_Inferentia_CMP324-R1.pdf.