Affiliation:
1. University of Bordeaux, Talence, France
2. University of Houston, TX, USA
3. University of Bordeaux-CNRS, Talence, France
Abstract
Given a table
T
(
Id
,
D
1
, …,
D
d
), the skycube of
T
is the set of skylines with respect to to all nonempty subsets (subspaces) of the set of all dimensions {
D
1
, …,
D
d
}. To optimize the evaluation of any skyline query, the solutions proposed so far in the literature either (i) precompute all of the skylines or (ii) use compression techniques so that the derivation of any skyline can be done with little effort. Even though solutions (i) are appealing because skyline queries have optimal execution time, they suffer from time and space scalability because the number of skylines to be materialized is exponential with respect to
d
. On the other hand, solutions (ii) are attractive in terms of memory consumption, but as we show, they also have a high time complexity. In this article, we make contributions to both kinds of solutions. We first observe that skyline patterns are monotonic. This property leads to a simple yet efficient solution for full and partial skycube materialization when the skyline with respect to all dimensions, the topmost skyline, is small. On the other hand, when the topmost skyline is large relative to the size of the input table, it turns out that functional dependencies, a fundamental concept in databases, uncover a monotonic property between skylines. Equipped with this information, we show that closed attributes sets are fundamental for partial and full skycube materialization. Extensive experiments with real and synthetic datasets show that our solutions generally outperform state-of-the-art algorithms.
Funder
French State
“Investments for the Future” Programme IdEx Bordeaux--CPU
CNRS under the MASTODONS-PETASKY initiative
SPEEDDATA research project
Publisher
Association for Computing Machinery (ACM)
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. A framework for multidimensional skyline queries over streaming data;Data & Knowledge Engineering;2020-05
2. The negative skycube;Information Systems;2020-02
3. Efficient Computation of Subspace Skyline over Categorical Domains;Proceedings of the 2017 ACM on Conference on Information and Knowledge Management;2017-11-06