Affiliation:
1. National Technical University of Athens, Greece
2. New Jersey Institute of Technology, USA
Abstract
The back-end tools of a data warehouse are pieces of software responsible for the extraction of data from several sources, their cleansing, customization, and insertion into a data warehouse. In general, these tools are known as Extract – Transformation – Load (ETL) tools and the process that describes the population of a data warehouse from its sources is called ETL process. In all the phases of an ETL process (extraction and transportation, transformation and cleaning, and loading), individual issues arise, and, along with the problems and constraints that concern the overall ETL process, make its lifecycle a very complex task.
Reference32 articles.
1. Adzic, J., & Fiore, V. (2003). Data Warehouse Population Platform. In Proceedings of 5th International Workshop on the Design and Management of Data Warehouses (DMDW), Berlin, Germany.
2. Arktos, I. I. (2004). A Framework for Modeling and Managing ETL Processes. Available at: http://www.dblab.ece.ntua.gr/~asimi
3. Automatically Extracting Structure form Free Text Addresses.;V.Borkar;A Quarterly Bulletin of the Computer Society of the IEEE Technical Committee on Data Engineering,2000
4. Calì, A., et al. (2003). IBIS: Semantic data integration at work. In Proceedings of the 15th CAiSE, Vol. 2681 of Lecture Notes in Computer Science, pages 79-94, Springer.
5. Galhardas, H., Florescu, D., Shasha, D., & Simon, E. (2000). Ajax: An Extensible Data Cleaning Tool. In Proceedings ACM SIGMOD International Conference On the Management of Data, page 590, Dallas, Texas.