Affiliation:
1. Eindhoven University of Technology & University of Amsterdam, Eindhoven, Netherlands
Abstract
We consider a large distributed service system consisting of n homogeneous servers with infinite-capacity FIFO queues. Jobs arrive as a Poisson process of rate λn/k_n (for some positive constant λ and integer k_n). Each incoming job consists of k_n identical tasks that can be executed in parallel, and that can be encoded into at least k_n "replicas" of the same size (by introducing redundancy) so that the job is considered to be completed when any k_n replicas associated with it finish their service. Moreover, we assume that servers can experience random slowdowns in their processing rate so that the service time of a replica is the product of its size and a random slowdown. First, we assume that the server slowdowns are shifted exponential and independent of the replica sizes. In this setting we show that the delay of a typical job is asymptotically minimized (as $n\to\infty$) when the number of replicas per task is a constant that only depends on the arrival rate λ, and on the expected slowdown of servers. Second, we introduce a new model for the server slowdowns in which larger tasks experience less variable slowdowns than smaller tasks. In this setting we show that, under the class of policies where all replicas start their service at the same time, the delay of a typical job is asymptotically minimized (as $n\to\infty$) when the number of replicas per task is made to depend on the actual size of the tasks being replicated, with smaller tasks being replicated more than larger tasks.
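The service-time model in the abstract (replica service time = size × slowdown, shifted-exponential slowdown, job done when any k of its replicas finish) can be illustrated with a small Monte Carlo sketch. This is not the paper's analysis: it assumes all replicas start at once on idle servers and ignores queueing entirely, so it does not capture the extra load that replication creates and cannot reproduce the optimal replication level. The function names and the parameters `shift`, `rate`, `trials`, and `seed` are hypothetical, chosen only for illustration.

```python
import random


def job_completion_time(k, m, size=1.0, shift=1.0, rate=1.0, rng=random):
    """Time until any k of m replicas finish (no queueing, simultaneous start).

    Assumption: each replica's slowdown is shift + Exp(rate), drawn
    independently of the replica size, as in the first setting of the abstract.
    """
    if m < k:
        raise ValueError("need at least k replicas")
    # Service time of a replica = size * slowdown.
    times = sorted(size * (shift + rng.expovariate(rate)) for _ in range(m))
    # The job completes at the k-th smallest replica finishing time.
    return times[k - 1]


def mean_completion_time(k, m, trials=20000, seed=0):
    """Monte Carlo estimate of the mean completion time for k tasks, m replicas."""
    rng = random.Random(seed)
    total = sum(job_completion_time(k, m, rng=rng) for _ in range(trials))
    return total / trials


if __name__ == "__main__":
    k = 4
    for m in (4, 6, 8):
        print(m, round(mean_completion_time(k, m), 3))
```

In this isolated view, adding replicas can only shorten the completion time, because the k-th smallest of more samples is stochastically smaller. The trade-off studied in the paper arises because extra replicas also increase server load and hence queueing delay, which this sketch deliberately leaves out.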
Publisher
Association for Computing Machinery (ACM)
Subject
Computer Networks and Communications; Hardware and Architecture; Safety, Risk, Reliability and Quality; Computer Science (miscellaneous)
Cited by
4 articles.
1. Design of fork-join networks of First-In-First-Out and infinite-server queues applied to clinical chemistry laboratories;European Journal of Operational Research;2023-11
2. Sharp waiting-time bounds for multiserver jobs;Proceedings of the Twenty-Third International Symposium on Theory, Algorithmic Foundations, and Protocol Design for Mobile Networks and Mobile Computing;2022-10-03
3. Efficient Replication for Fast and Predictable Performance in Distributed Computing;IEEE/ACM Transactions on Networking;2021-08
4. A Survey of Stability Results for Redundancy Systems;Modern Trends in Controlled Stochastic Processes;2021