Affiliation:
1. University of Minnesota, Minneapolis, MN, USA
2. Microsoft, Redmond, WA, USA
3. Stanford University, Stanford, CA, USA
Abstract
Randomized load balancing greatly improves the sharing of resources in a number of applications while being simple to implement. One model that has been extensively used to study randomized load balancing schemes is the supermarket model. In this model, jobs arrive according to a rate-nλ Poisson process at a bank of n rate-1 exponential server queues. A notable result, due to Vvedenskaya
et.al.
(1996), showed that when each arriving job is assigned to the shortest of d ≥ 2 randomly chosen queues, the equilibrium queue sizes decay doubly exponentially in the limit as n to ∞. This is a substantial improvement over the case d=1, where queue sizes decay exponentially.
The method of analysis used in the above paper and in the subsequent literature applies to jobs with exponential service time distributions and does not easily generalize. It is desirable to study load balancing models with more general, especially heavy-tailed, service time distributions since such service times occur widely in practice.
This paper describes a modularized program for treating randomized load balancing problems with general service time distributions and service disciplines. The program relies on an
ansatz
which asserts that any finite set of queues in a randomized load balancing scheme becomes independent as n to ∞. This allows one to derive queue size distributions and other performance measures of interest. We establish the
ansatz
when the service discipline is FIFO and the service time distribution has a decreasing hazard rate (this includes heavy-tailed service times). Assuming the
ansatz
, we also obtain the following results: (i) as n to ∞, the process of job arrivals at any fixed queue tends to a Poisson process whose rate depends on the size of the queue, (ii) when the service discipline at each server is processor sharing or LIFO with preemptive resume, the distribution of the number of jobs is insensitive to the service distribution, and (iii) the tail behavior of the queue-size distribution in terms of the service distribution for the FIFO service discipline.
Publisher
Association for Computing Machinery (ACM)
Subject
Computer Networks and Communications,Hardware and Architecture,Software
Reference17 articles.
1. SIAM Journal on Computing;Azar Y.,1994
2. Insensitivity in processor-sharing networks
3. M. Bramson. Stability of join the shortest queue networks. submitted to the Annals of Probability. M. Bramson. Stability of join the shortest queue networks. submitted to the Annals of Probability.
4. Chaoticity on path space for a queueing network with selection of the shortest queue among several
5. Chaos hypothesis for a system interacting through shared resources
Cited by
92 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Queue-length-aware dispatching in large-scale heterogeneous systems;Queueing Systems;2024-08-03
2. Learning and Balancing Unknown Loads in Large-Scale Systems;Mathematics of Operations Research;2024-05-03
3. The impact of load comparison errors on the power-of-d load balancing;Performance Evaluation;2024-05
4. Analysis of Fork-Join Scheduling on Heterogeneous Parallel Servers;IEEE/ACM Transactions on Networking;2024
5. SDSSE: A Self-Driven RPC Load Balancing Method in Datacenter;2023 IEEE International Conference on High Performance Computing & Communications, Data Science & Systems, Smart City & Dependability in Sensor, Cloud & Big Data Systems & Application (HPCC/DSS/SmartCity/DependSys);2023-12-17