Affiliation:
1. University of Pittsburgh, Pittsburgh, PA
Abstract
The emergence of monitoring applications has precipitated the need for Data Stream Management Systems (DSMSs), which constantly monitor incoming data feeds (through registered continuous queries), in order to detect events of interest. In this article, we examine the problem of how to schedule multiple Continuous Queries (CQs) in a DSMS to optimize different Quality of Service (QoS) metrics. We show that, unlike traditional online systems, scheduling policies in DSMSs that optimize for average response time will be different from policies that optimize for average slowdown, which is a more appropriate metric to use in the presence of a heterogeneous workload. Towards this, we propose policies to optimize for the average-case performance for both metrics. Additionally, we propose a hybrid scheduling policy that strikes a fine balance between performance and fairness, by looking at both the average- and worst-case performance, for both metrics. We also show how our policies can be adaptive enough to handle the inherent dynamic nature of monitoring applications. Furthermore, we discuss how our policies can be efficiently implemented and extended to exploit sharing in optimized multi-query plans and multi-stream CQs. Finally, we experimentally show using real data that our policies consistently outperform currently used ones.
Funder
National Science Foundation
Publisher
Association for Computing Machinery (ACM)
Cited by
40 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Accelerating Stream Processing Queries with Congestion-aware Scheduling and Real-time Linux Threads;Proceedings of the 20th ACM International Conference on Computing Frontiers;2023-05-09
2. The Metaverse Data Deluge: What Can We Do About It?;2023 IEEE 39th International Conference on Data Engineering (ICDE);2023-04
3. Runtime Adaptation of Data Stream Processing Systems: The State of the Art;ACM Computing Surveys;2022-01-31
4. Lachesis;Proceedings of the 22nd International Middleware Conference;2021-12-02
5. Klink: Progress-Aware Scheduling for Streaming Data Systems;Proceedings of the 2021 International Conference on Management of Data;2021-06-09