1. Model-switching: Dealing with fluctuating workloads in machine-learning-as-a-service systems;Zhang;USENIX HotCloud 2020
2. Cocktail: A multidimensional optimization for model serving in cloud;Gunasekaran;USENIX NSDI 2022
3. Infaas: Automated model-less inference serving;Romero;USENIX ATC 2021
4. Chatgpt,2024
5. Microsoft copilot,2022