Asymptotic Optimality of Power-of-d Load Balancing in Large-Scale Systems

Author:

Mukherjee Debankur1ORCID,Borst Sem C.23,van Leeuwaarden Johan S. H.45,Whiting Philip A.6

Affiliation:

1. H. Milton Stewart School of Industrial and Systems Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332;

2. Eindhoven University of Technology, 5612 AZ Eindhoven, Netherlands;

3. Nokia Bell Labs, Murray Hill, New Jersey 07974;

4. Eindhoven University of Technology, 5612 AZ Eindhoven, Netherlands

5. Tilburg University, 5037 AB Tilburg, Netherlands;

6. Macquarie University, North Ryde, New South Wales 2109, Australia

Abstract

We consider a system of N identical server pools and a single dispatcher in which tasks with unit-exponential service requirements arrive at rate [Formula: see text]. In order to optimize the experienced performance, the dispatcher aims to evenly distribute the tasks across the various server pools. Specifically, when a task arrives, the dispatcher assigns it to the server pool with the minimum number of tasks among d(N) randomly selected server pools. We construct a stochastic coupling to bound the difference in the system occupancy processes between the join-the-shortest-queue (JSQ) policy and a scheme with an arbitrary value of d(N). We use the coupling to derive the fluid limit in case [Formula: see text] and [Formula: see text] as [Formula: see text] along with the associated fixed point. The fluid limit turns out to be insensitive to the exact growth rate of d(N) and coincides with that for the JSQ policy. We further establish that the diffusion limit corresponds to that for the JSQ policy as well, as long as [Formula: see text], and characterize the common limiting diffusion process. These results indicate that the JSQ optimality can be preserved at the fluid and diffusion levels while reducing the overhead by nearly a factor O(N) and O([Formula: see text]), respectively.

Publisher

Institute for Operations Research and the Management Sciences (INFORMS)

Subject

Management Science and Operations Research,Computer Science Applications,General Mathematics

Reference13 articles.

1. Borovkov AA (1976) Stochastic Processes in Queueing Theory (Springer, New York).

2. A simple dynamic routing problem

3. Join the Shortest Queue with Many Servers. The Heavy-Traffic Asymptotics

4. Ethier SN, Kurtz TG (2009) Markov Processes: Characterization and Convergence (John Wiley & Sons, Hoboken, NJ).

Cited by 15 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Asynchronous Load Balancing and Auto-Scaling: Mean-Field Limit and Optimal Design;IEEE/ACM Transactions on Networking;2024-08

2. Learning and Balancing Unknown Loads in Large-Scale Systems;Mathematics of Operations Research;2024-05-03

3. The impact of load comparison errors on the power-of-d load balancing;Performance Evaluation;2024-05

4. Queueing analysis of imbalance between multiple server pools with an application to 3-phase EV charging;ACM SIGMETRICS Performance Evaluation Review;2024-02-22

5. The Power of Two Choices with Load Comparison Errors;Proceedings of the Twenty-fourth International Symposium on Theory, Algorithmic Foundations, and Protocol Design for Mobile Networks and Mobile Computing;2023-10-16

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3