Affiliation:
1. Eindhoven University of Technology, Eindhoven, Netherlands
2. Nokia Bell Labs & Eindhoven University of Technology, Eindhoven, Netherlands
Abstract
Load balancing algorithms play a vital role in enhancing performance in data centers and cloud networks. Due to the massive size of these systems, scalability challenges, and especially the communication overhead associated with load balancing mechanisms, have emerged as major concerns. Motivated by these issues, we introduce and analyze a novel class of load balancing schemes where the various servers provide occasional queue updates to guide the load assignment.
We show that the proposed schemes strongly outperform JSQ( d ) strategies with comparable communication overhead per job, and can achieve a vanishing waiting time in the many-server limit with just one message per job, just like the popular JIQ scheme. The proposed schemes are particularly geared however towards the sparse feedback regime with less than one message per job, where they outperform corresponding sparsified JIQ versions.
We investigate fluid limits for synchronous updates as well as asynchronous exponential update intervals. The fixed point of the fluid limit is identified in the latter case, and used to derive the queue length distribution. We also demonstrate that in the ultra-low feedback regime the mean stationary waiting time tends to a constant in the synchronous case, but grows without bound in the asynchronous case.
Publisher
Association for Computing Machinery (ACM)
Subject
Computer Networks and Communications,Hardware and Architecture,Safety, Risk, Reliability and Quality,Computer Science (miscellaneous)
Reference31 articles.
1. Choice-memory tradeoff in allocations
2. J Anselmi and F Dufour. 2018. Power-of- d -Choices with Memory: Fluid Limit and Optimality. arXiv preprint arXiv:1802.06566 (2018). J Anselmi and F Dufour. 2018. Power-of- d -Choices with Memory: Fluid Limit and Optimality. arXiv preprint arXiv:1802.06566 (2018).
3. Dynamic pull-based load balancing for autonomic servers
4. Randomized load balancing with general service time distributions
5. Asymptotic independence of queues under randomized load balancing
Cited by
14 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献