Affiliation:
1. Blaise Pascal University, France
Abstract
The schema of a data warehouse rests on two kinds of elements: facts and dimensions. Facts record measures about situations or events. Dimensions are used to analyse these measures, particularly through aggregation operations (count, sum, average, etc.). To illustrate, consider analysing the sales of a shop by product type and by month of the year. Each sale of a product is a fact, characterized by a quantity. An aggregation function can be computed over the quantities of several facts: for example, the sum of the quantities sold for the product type “mineral water” during January of 2001, 2002 and 2003. Product type is a criterion of the dimension Product; Month and Year are criteria of the dimension Time. A quantity is thus connected both with a product type and with a month of a given year. This kind of connection concerns the organization of facts with regard to dimensions. A month, in turn, is connected to one year. This kind of connection concerns the organization of criteria within a dimension. The possibilities for fact analysis depend on these two forms of connection and on the schema of the warehouse, which the designer chooses in accordance with the users' needs. Determining the schema of a data warehouse cannot be achieved without adequate modelling of dimensions and facts. In this article we present a general model for dimensions, facts and their relationships. This model will greatly facilitate the choice of the schema and its manipulation by users.
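The sales example in the abstract can be made concrete with a minimal sketch. It is not the model presented in the article: the `Sale` record, the `total_quantity` helper and all sample quantities are hypothetical, invented only to show a fact carrying a measure (quantity) and being aggregated along criteria of the Product and Time dimensions.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Sale:
    """One fact: a sale of a product (hypothetical structure)."""
    product_type: str  # criterion of the dimension Product
    month: int         # criterion of the dimension Time (1 to 12)
    year: int          # criterion of the dimension Time; a month belongs to one year
    quantity: int      # the measure recorded by the fact


# Sample facts invented purely for illustration.
SALES = [
    Sale("mineral water", 1, 2001, 120),
    Sale("mineral water", 1, 2002, 135),
    Sale("mineral water", 1, 2003, 150),
    Sale("mineral water", 2, 2003, 90),
    Sale("fruit juice", 1, 2002, 40),
]


def total_quantity(sales, product_type, month, years):
    """Sum the quantity measure over the facts matching the given criteria."""
    return sum(
        s.quantity
        for s in sales
        if s.product_type == product_type
        and s.month == month
        and s.year in years
    )


# Sum of quantities sold for "mineral water" during January 2001, 2002 and 2003.
january_water = total_quantity(SALES, "mineral water", 1, {2001, 2002, 2003})
print(january_water)
```

The filter on `product_type` reflects the connection between facts and dimensions, while the pairing of `month` with `year` reflects the connection between criteria within the Time dimension.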