Author:
Gupta Varun,Harchol Balter Mor,Sigman Karl,Whitt Ward
Subject
Computer Networks and Communications,Hardware and Architecture,Modelling and Simulation,Software
Cited by
135 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Queue-length-aware dispatching in large-scale heterogeneous systems;Queueing Systems;2024-08-03
2. Dynamically Balancing Load with Overload Control for Microservices;ACM Transactions on Autonomous and Adaptive Systems;2024-07-05
3. Splitwise: Efficient Generative LLM Inference Using Phase Splitting;2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture (ISCA);2024-06-29
4. A Study Comparing Waiting Times in Global and Local Queuing Systems with Heterogeneous Workers;Applied Sciences;2024-04-29
5. Efficient Microsecond-scale Blind Scheduling with Tiny Quanta;Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 2;2024-04-27