Affiliation:
1. Univ. Grenoble Alpes, Inria, Grenoble, France
2. University of Antwerp, Antwerp, Belgium
Abstract
As job redundancy has been recognized as an effective means to improve performance of large-scale computer systems, queueing systems with redundancy have been studied by various authors. Existing results include methods to compute the queue length distribution and response time but only when the service discipline is First-Come-First-Served (FCFS). For other service disciplines, such as Processor Sharing (PS), or Last-Come-First-Served (LCFS), only the stability conditions are known. In this paper we develop the first methods to approximate the queue length distribution in a queueing system with redundancy under various service disciplines. We focus on a system with exponential job sizes, i.i.d. copies, and a large number of servers. We first derive a mean field approximation that is independent of the scheduling policy. In order to study the impact of service discipline, we then derive refinements of this approximation to specific scheduling policies. In the case of Processor Sharing, we provide a pair and a triplet approximation. The pair approximation can be regarded as a refinement of the classic mean field approximation and takes the service discipline into account, while the triplet approximation further refines the pair approximation. We also develop a pair approximation for three other service disciplines: First-Come-First-Served, Limited Processor Sharing and Last-Come-First-Served. We present numerical evidence that shows that all the approximations presented in the paper are highly accurate, but that none of them are asymptotically exact (as the number of servers goes to infinity). This makes these approximations suitable to study the impact of the service discipline on the queue length distribution. Our results show that FCFS yields the shortest queue length, and that the differences are more substantial at higher loads.
Funder
Agence Nationale de la Recherche
Publisher
Association for Computing Machinery (ACM)
Reference25 articles.
1. Ganesh Ananthanarayanan, Ali Ghodsi, Scott Shenker, and Ion Stoica. 2013. Effective Straggler Mitigation: Attack of the Clones. In 10th USENIX Symposium on Networked Systems Design and Implementation (NSDI 13). USENIX Association, Lombard, IL, 185--198. https://www.usenix.org/conference/nsdi13/technical-sessions/presentation/ananthanarayanan
2. On the Stability of Redundancy Models
3. Elene Anton, Urtzi Ayesta, Matthieu Jonckheere, and Ina Maria Verloop. 2021. A Survey of Stability Results for Redundancy Systems. In Modern Trends in Controlled Stochastic Processes:, Alexey Piunovskiy and Yi Zhang (Eds.). Springer International Publishing, Cham, 266--283.
4. The stationary distribution of the redundancy-d model with random order of service
5. U. Ayesta T. Bodas and I.M. Verloop. 2018. On a unifying product form framework for redundancy models. Performance Evaluation 127--128 (2018) 93--119. https://doi.org/10.1016/j.peva.2018.09.008
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. New directions in pass-and-swap queues;Queueing Systems;2024-07-16
2. Approximations to Study the Impact of the Service Discipline in Systems with Redundancy;ACM SIGMETRICS Performance Evaluation Review;2024-06-11
3. Approximations to Study the Impact of the Service Discipline in Systems with Redundancy;Abstracts of the 2024 ACM SIGMETRICS/IFIP PERFORMANCE Joint International Conference on Measurement and Modeling of Computer Systems;2024-06-10