Affiliation:
1. University of Poitiers, France
Abstract
Horizontal Partitioning has been largely adopted by the database community, where it took a significant part in the physical design process. Actually, it is supported by most commercial database systems (DBMS), where a native Data Definition Language for decomposing tables/materialized views using various modes is proposed. In traditional databases, horizontal partitioning has been largely studied, where several fragmentation algorithms were proposed to partition tables in isolation. In the relational data warehouse environment, horizontal partitioning consists in decomposing the whole warehouse schema into sub schemas, where each schema contains fragments of dimension and fact tables. Dimension tables are fragmented using the primary partitioning mode, whereas the fact table is divided using referential mode. In this article, the authors first focus on the evolution of horizontal partitioning in commercial DBMS motivated by decision support applications. Secondly, they give a formalization of the referential fragmentation schema selection problem in the data warehouse and they study its hardness to select an optimal solution. Due to its high complexity, they develop two algorithms: hill climbing and simulated annealing with several variants to select a near optimal partitioning schema. Finally, extensive experimental studies are conducted using the data set of APB1 benchmark to compare the quality the proposed algorithms using a mathematical cost model. Based on these experiments, some recommendations are given to advise database administrator for well using horizontal partitioning.
Subject
Hardware and Architecture,Software
Reference27 articles.
1. Bellatreche, L., Boukhalfa, K., & Richard, P. (2008). Horizontal Partitioning in Data Warehouse: Hardness Study, Selection Algorithms and Validation on ORACLE10G. 10th International Conference on Data Warehousing and Knowledge Discovery (DAWAK) (pp. 87-96).
2. A Data Mining Approach for Selecting Bitmap Join Indices.;L.Bellatreche;Journal of Computing Science and Engineering,2008
3. Boukhalfa, K., Bellatreche, L., & Richard, P. (2008). Fragmentation Primaire et Dérivée: Étude de Complexité, Algorithmes de Sélection et Validation sous Oracle10g. (Report No. 01 -2008), LISI/ENSMA, available at: http://www.lisi.ensma.fr/members/bellatreche
4. Breslau, L., Cao, P., Fan, L., Phillips, G., & Shenker, S. (1999). Web Caching and Zipf-like Distributions: Evidence and Implications. Eighteenth Annual Joint Conference of the IEEE Computer and Communications Societies (IEEE INFOCOM) (pp. 126-134).
Cited by
21 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献