Abstract
In recent years, there has been an explosion of large-scale real-time analytics needs and a plethora of streaming systems have been developed to support such applications. These systems are able to continue stream processing even when faced with hardware and software failures. However, these systems do not address some crucial challenges facing their operators: the manual, time-consuming and error-prone tasks of tuning various configuration knobs to achieve service level objectives (SLO) as well as the maintenance of SLOs in the face of sudden, unpredictable load variation and hardware or software performance degradation.
In this paper, we introduce the notion of
self-regulating
streaming systems and the key properties that they must satisfy. We then present the design and evaluation of Dhalion, a system that provides self-regulation capabilities to underlying streaming systems. We describe our implementation of the Dhalion framework on top of Twitter Heron, as well as a number of policies that automatically reconfigure Heron topologies to meet throughput SLOs, scaling resource consumption up and down as needed. We experimentally evaluate our Dhalion policies in a cloud environment and demonstrate their effectiveness. We are in the process of open-sourcing our Dhalion policies as part of the Heron project.
Subject
General Earth and Planetary Sciences,Water Science and Technology,Geography, Planning and Development
Cited by
90 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Bayesian-Driven Automated Scaling in Stream Computing With Multiple QoS Targets;IEEE Transactions on Parallel and Distributed Systems;2024-07
2. Evaluating Stream Processing Autoscalers;Proceedings of the 18th ACM International Conference on Distributed and Event-based Systems;2024-06-24
3. Serverless-like platform for container-based YARN clusters;Future Generation Computer Systems;2024-06
4. Lorentz: Learned SKU Recommendation Using Profile Data;Proceedings of the ACM on Management of Data;2024-05-29
5. Emma: Elastic Multi-Resource Management for Realtime Stream Processing;IEEE INFOCOM 2024 - IEEE Conference on Computer Communications;2024-05-20