Affiliation:
1. Beijing University of Posts and Telecommunications
2. School of Computer Science, and National Engineering Laboratory for Big Data Analysis and Applications, Peking University
Abstract
The past decade has seen rapid growth of distributed stream data processing systems. Under these systems, a stream application is realized as a Directed Acyclic Graph (DAG) of operators, where the level of parallelism of each operator has a substantial impact on its overall performance. However, finding optimal levels of parallelism remains challenging. Most existing methods are heavily coupled with the topological graph of operators, unable to efficiently tune under-provisioned jobs. They either insufficiently use previous tuning experience by treating successively tuning independently, or explore the configuration space aggressively, violating the Service Level Agreements (SLA).To address the above problems, we propose ContTune, a continuous tuning system for stream applications. It is equipped with a novel Big-small algorithm, in which the Big phase decouples the tuning from the topological graph by decomposing the job tuning problem into sub-problems that can be solved concurrently. We propose a conservative Bayesian Optimization (CBO) technique in the Small phase to speed up the tuning process by utilizing the previous observations. It leverages the state-of-the-art (SOTA) tuning method as conservative exploration to avoid SLA violations. Experimental results show that ContTune reduces up to 60.75% number of reconfigurations under synthetic workloads and up to 57.5% number of reconfigurations under real workloads, compared to the SOTA method DS2.
Publisher
Association for Computing Machinery (ACM)
Subject
General Earth and Planetary Sciences,Water Science and Technology,Geography, Planning and Development
Reference76 articles.
1. 2019. Esper. Retrieved\from\https://www.espertech.com/. 2019. Esper. Retrieved\from\https://www.espertech.com/.
2. September 18 2023. ApacheBeamNexmarkbenchmarksuite. https://beam.apache.org/documentation/sdks/java/nexmark. September 18 2023. ApacheBeamNexmarkbenchmarksuite. https://beam.apache.org/documentation/sdks/java/nexmark.
3. September 18 2023. Network buffer. https://nightlies.apache.org/flink/flink-docs-release-1.15/docs/deployment/memory/network_mem_tuning/. September 18 2023. Network buffer. https://nightlies.apache.org/flink/flink-docs-release-1.15/docs/deployment/memory/network_mem_tuning/.
4. September 18 2023. NEXMark benchmark. http://datalab.cs.pdx.edu/niagaraST/NEXMark. September 18 2023. NEXMark benchmark. http://datalab.cs.pdx.edu/niagaraST/NEXMark.
5. Distributed operation in the Borealis stream processing engine
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献