Affiliation:
1. Carnegie Mellon University, Pittsburgh, PA, USA
2. Aalto University, Aalto, Finland
Abstract
Recent computer systems research has proposed using redundant requests to reduce latency. The idea is to run a request on multiple servers and wait for the first completion (discarding all remaining copies of the request). However there is no exact analysis of systems with redundancy.
This paper presents the first exact analysis of systems with redundancy. We allow for any number of classes of redundant requests, any number of classes of non-redundant requests, any degree of redundancy, and any number of heterogeneous servers. In all cases we derive the limiting distribution on the state of the system.
In small (two or three server) systems, we derive simple forms for the distribution of response time of both the redundant classes and non-redundant classes, and we quantify the "gain" to redundant classes and "pain" to non-redundant classes caused by redundancy. We find some surprising results. First, the response time of a fully redundant class follows a simple Exponential distribution and that of the non-redundant class follows a Generalized Hyperexponential. Second, fully redundant classes are "immune" to any pain caused by other classes becoming redundant.
We also compare redundancy with other approaches for reducing latency, such as optimal probabilistic splitting of a class among servers (Opt-Split) and Join-the-Shortest-Queue (JSQ) routing of a class. We find that, in many cases, redundancy outperforms JSQ and Opt-Split with respect to overall response time, making it an attractive solution.
Funder
Academy of Finland
Intel Science and Technology Center for Cloud Computing
National Science Foundation
Google
Publisher
Association for Computing Machinery (ACM)
Subject
Computer Networks and Communications,Hardware and Architecture,Software
Cited by
40 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Fault-Tolerant Parallel Integer Multiplication;Proceedings of the 36th ACM Symposium on Parallelism in Algorithms and Architectures;2024-06-17
2. Approximations to Study the Impact of the Service Discipline in Systems with Redundancy;ACM SIGMETRICS Performance Evaluation Review;2024-06-11
3. Approximations to Study the Impact of the Service Discipline in Systems with Redundancy;Abstracts of the 2024 ACM SIGMETRICS/IFIP PERFORMANCE Joint International Conference on Measurement and Modeling of Computer Systems;2024-06-10
4. Approximations to Study the Impact of the Service Discipline in Systems with Redundancy;Proceedings of the ACM on Measurement and Analysis of Computing Systems;2024-02-16
5. Multi-Listing for Horizontally Differentiated Services;2024