Managing operational business intelligence workloads

Author:

Dayal Umeshwar1,Kuno Harumi1,Wiener Janet L.1,Wilkinson Kevin1,Ganapathi Archana2,Krompass Stefan3

Affiliation:

1. HPL Hewlett-Packard Laboratories, Palo Alto, CA

2. UCB UC Berkeley, Berkeley, CA

3. TUM Technische Universität München,, Munich, Germany

Abstract

We explore how to manage database workloads that contain a mixture of OLTP-like queries that run for milliseconds as well as business intelligence queries and maintenance tasks that last for hours. As data warehouses grow in size to petabytes and complex analytic queries play a greater role in day-to-day business operations, factors such as inaccurate cardinality estimates, data skew, and resource contention all make it notoriously difficult to predict how such queries will behave before they start executing. However, traditional workload management assumes that accurate expectations for the resource requirements and performance characteristics of a workload are available at compile-time, and relies on such information in order to make critical workload management decisions. In this paper, we describe our approach to dealing with inaccurate predictions. First, we evaluate the ability of workload management algorithms to handle workloads that include unexpectedly long-running queries. Second, we describe a new and more accurate method for predicting the resource usage of queries before runtime. We have carried out an extensive set of experiments, and report on a few of our results.

Publisher

Association for Computing Machinery (ACM)

Reference29 articles.

1. Characterizing Web user sessions

2. F. R. Bach and M. I. Jordan. Kernel Independent Component Analysis. Journal of Machine Learning Research 3:1--48 2003. 10.1162/153244303768966085 F. R. Bach and M. I. Jordan. Kernel Independent Component Analysis. Journal of Machine Learning Research 3:1--48 2003. 10.1162/153244303768966085

3. When can we trust progress estimators for SQL queries?

Cited by 10 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Towards an Explicitation and a Conceptualization of Cost Models in Database Systems;Model and Data Engineering;2017

2. SQL Scorecard for Improved Stability and Performance of Data Warehouses;International Journal of Software Innovation;2016-07

3. A Unified View of Data-Intensive Flows in Business Intelligence Systems: A Survey;Lecture Notes in Computer Science;2016

4. Multi-core column-store parallelization under concurrent workload;Proceedings of the 12th International Workshop on Data Management on New Hardware - DaMoN '16;2016

5. Trends der Human Resource Intelligence und Analytics;Human Resource Intelligence und Analytics;2015

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3