Affiliation:
1. Intel Corporation, USA
Abstract
Incremental load is an important factor for successful data warehousing. Lack of standardized incremental refresh methodologies can lead to poor analytical results, which can be unacceptable to an organization’s analytical community. Successful data warehouse implementation depends on consistent metadata as well as incremental data load techniques. If consistent load timestamps are maintained and efficient transformation algorithms are used, it is possible to refresh databases with complete accuracy and with little or no manual checking. This paper proposes an Extract-Transform-Load (ETL) metadata model that archives load observation timestamps and other useful load parameters. The author also recommends algorithms and techniques for incremental refreshes that enable table loading while ensuring data consistency, integrity, and improving load performance. In addition to significantly improving quality in incremental load techniques, these methods will save a substantial amount of data warehouse systems resources.
Subject
Decision Sciences (miscellaneous),Information Systems
Reference54 articles.
1. Efficient view maintenance at data warehouses
2. Incremental Maintenance of Data Warehouses Based on Past Temporal Logic Operators.;S. D.Amo;Journal of Universal Computer Science,2004
3. Incremental computation of time-varying query expressions
4. Enhancing data quality in data warehouse environments
5. Bokun, M. & Taglienti, C. (1998). Incremental Data Warehouse Updates. DM Review, 1-5.
Cited by
87 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献