Efficient computation of Iceberg cubes with complex measures

Published:2001-06 Issue:2 Volume:30 Page:1-12
ISSN:0163-5808
Container-title:ACM SIGMOD Record
language:en
Short-container-title:SIGMOD Rec.

Author:

Han Jiawei¹, Pei Jian¹, Dong Guozhu², Wang Ke¹

Affiliation:

1. School of Computing Science, Simon Fraser University, B.C., Canada

2. Department of Computer Science, Wright State University, Dayton, OH

Abstract

It is often too expensive to compute and materialize a complete high-dimensional data cube. Computing an iceberg cube, which contains only aggregates above certain thresholds, is an effective way to derive nontrivial multi-dimensional aggregations for OLAP and data mining. In this paper, we study efficient methods for computing iceberg cubes with some popularly used complex measures, such as average, and develop a methodology that adopts a weaker but anti-monotonic condition for testing and pruning search space. In particular, for efficient computation of iceberg cubes with the average measure, we propose a top-k average pruning method and extend two previously studied methods, Apriori and BUC, to Top-k Apriori and Top-k BUC. To further improve the performance, an interesting hypertree structure, called H-tree, is designed and a new iceberg cubing method, called Top-k H-Cubing, is developed. Our performance study shows that Top-k BUC and Top-k H-Cubing are two promising candidates for scalable computation, and Top-k H-Cubing has better performance in most cases.
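
The pruning idea in the abstract can be illustrated with a small sketch. It assumes (this is not stated in the abstract) that the iceberg condition has the form "average of the measure >= threshold, with at least k tuples in the cell", and it uses a simplified bottom-up recursion in the spirit of BUC rather than the paper's actual Top-k BUC or H-tree algorithms. The function names `top_k_average` and `iceberg_avg_cube` are hypothetical. The point is that a plain average is not anti-monotonic (a cell can fail while one of its descendants passes), whereas the average of the k largest values is: if it falls below the threshold, no descendant with at least k tuples can pass.

```python
def top_k_average(values, k):
    """Average of the k largest values, or None if there are fewer than k."""
    if len(values) < k:
        return None
    return sum(sorted(values, reverse=True)[:k]) / k


def iceberg_avg_cube(rows, num_dims, k, threshold):
    """Return {cell: avg} for cells with count >= k and avg >= threshold.

    rows is a list of (dims, measure) pairs, where dims is a tuple of
    num_dims dimension values. A cell is a tuple using "*" for "all".
    Illustrative sketch only, not the paper's algorithm.
    """
    result = {}

    def recurse(part, cell, start_dim):
        values = [m for _, m in part]
        bound = top_k_average(values, k)
        # Weaker, anti-monotonic test: if the top-k average fails (or the
        # cell has fewer than k tuples), no descendant cell can qualify.
        if bound is None or bound < threshold:
            return
        avg = sum(values) / len(values)
        if avg >= threshold:  # the real (non-anti-monotonic) condition
            result[cell] = avg
        # Refine the cell on each remaining dimension.
        for d in range(start_dim, num_dims):
            groups = {}
            for dims, m in part:
                groups.setdefault(dims[d], []).append((dims, m))
            for value, sub in groups.items():
                recurse(sub, cell[:d] + (value,) + cell[d + 1:], d + 1)

    recurse(rows, ("*",) * num_dims, 0)
    return result
```

With six toy tuples over two dimensions, k = 2 and threshold 10, the apex cell has average 40/6 and fails the real condition, yet its top-2 average is 15, so the search continues and still finds the qualifying cells ("a", "x") and ("*", "x"). Cells such as ("b", "*") are pruned immediately because their top-2 average is below the threshold.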

Publisher

Association for Computing Machinery (ACM)

Subject

Information Systems, Software

Link

https://dl.acm.org/doi/pdf/10.1145/376284.375664

Reference 16 articles.

1. S. Agarwal, R. Agrawal, P. M. Deshpande, A. Gupta, J. F. Naughton, R. Ramakrishnan, and S. Sarawagi. On the computation of multidimensional aggregates. VLDB'96.

2. R. Agrawal and R. Srikant. Fast algorithms for mining association rules. VLDB'94.

3. Bottom-up computation of sparse and Iceberg CUBE

4. An overview of data warehousing and OLAP technology

Cited by 45 articles.

1. Mining Interesting Aggregate Tuples;Lecture Notes in Networks and Systems;2024

2. Elucidating strategic patterns from target customers using multi-stage RFM analysis;Journal of Global Scholars of Marketing Science;2022-09-09

3. Enabling efficient and general subpopulation analytics in multidimensional data streams;Proceedings of the VLDB Endowment;2022-07

4. Timely Reporting of Heavy Hitters Using External Memory;ACM Transactions on Database Systems;2021-12-31

5. Big high-dimension data cube designs for hybrid memory systems;Knowledge and Information Systems;2020-08-26