Modeling and querying facts with period timestamps in data warehouses
Author:
Mahlknecht Giovanni1, Dignös Anton1, Kozmina Natalija2
Affiliation:
1. Faculty of Computer Science , Free University of Bozen-Bolzano , Dominikanerplatz 3, 39100 Bozen , Italy 2. Faculty of Computing , University of Latvia , Raiņa bulvāris 19, Riga , LV-1586 , Latvia
Abstract
Abstract
In this paper, we study various ways of representing and querying fact data that are time-stamped with a time period in a data warehouse. The main focus is on how to represent the time periods that are associated with the facts in order to support convenient and efficient aggregations over time. We propose three distinct logical models that represent time periods as sets of all time points in a period (instant model), as pairs of start and end time points of a period (period model), and as atomic units that are explicitly stored in a new period dimension (period∗ model). The period dimension is enriched with information about the days of each period, thereby combining the former two models. We use four different classes of aggregation queries to analyze query formulation, query execution, and query performance over the three models. An extensive empirical evaluation on synthetic and real-world datasets and the analysis of the query execution plans reveal that the period model is the best choice in terms of runtime and space for all four query classes.
Publisher
Walter de Gruyter GmbH
Subject
Applied Mathematics,Engineering (miscellaneous),Computer Science (miscellaneous)
Reference43 articles.
1. Ahmed, W., Zimányi, E. and Wrembel, R. (2014). A logical model for multiversion data warehouses, Proceedings of the 16th International Conference on Data Warehousing and Knowledge Discovery, DaWaK 2014, Munich, Germany, pp. 23–34. 2. Bebel, B., Cichowicz, T., Morzy, T., Rytwinski, F., Wrembel, R. and Koncilia, C. (2015). Sequential data analytics by means of Seq-SQL language, Proceedings of the 26th International Conference on Database and Expert Systems Applications, DEXA 2015, Valencia, Spain, Part I, pp. 416–431. 3. Ben-Gan, I., Machanic, A., Sarka, D. and Farlee, K. (2015). TSQL Querying, Microsoft Press, Redmond, WA. 4. Blaschka, M., Sapia, C. and Höfling, G. (1999). On schema evolution in multidimensional databases, Proceedings of the 1st International Conference on Data Warehousing and Knowledge Discovery, DaWaK 1999, Florence, Italy, pp. 153–164. 5. Bliujute, R., Saltenis, S., Slivinskas, G. and Jensen, C.S. (1998). Systematic change management in dimensional data warehousing, Proceedings of the 3rd International Baltic Workshop on DB and IS, Riga, Latvia, pp. 27–41.
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|