1. Amazon sagemaker. https://aws.amazon.com/sagemaker/. Accessed: 2023-07-07.
2. Host multiple models in one container behind one endpoint. https://docs.aws.amazon.com/sagemaker/latest/dg/multi-model-endpoints.html. Accessed: 2023-07-07.
3. Introducing the hugging face llm inference container for amazon sagemaker. https://huggingface.co/blog/sagemaker-huggingface-llm. Accessed: 2023-07-07.
4. Kserve documentation website. https://kserve.github.io/website/0.10/. Accessed: 2023-07-07.
5. Modelmesh. https://github.com/kserve/modelmesh. Accessed: 2023-07-07.