Affiliation:
1. EEDIS Laboratory, University of Djillali Liabes, Sidi Bel Abbes, Algeria
2. LIPN Laboratory, Paris13 University, Paris, France
Abstract
Before the arrival of the Big Data era, data warehouse (DW) systems were considered the best decision support systems (DSS). DW systems have always helped organizations around the world to analyse their stored data and use it in making decisive decisions. However, analyzing and mining data of poor quality can give the wrong conclusions. Several data quality (DQ) problems can appear during a data warehouse project like missing values, duplicates values, integrity constrains issues and more. As a result, organizations around the world are more aware of the importance of data quality and invest a lot of money in order to manage data quality in the DW systems. On the other hand, with the arrival of BD, new challenges have to be considered like the need for collecting the most recent data and the ability to make real-time decisions. This article provides a survey about the exiting techniques to control the quality of the stored data in the DW systems and the new solutions proposed in the literature to face the new Big Data requirements.
Reference43 articles.
1. Bala, M., Boussaid, O., Alimazighi, Z., & Bentayeb, F. (2014). Pfetl: vers l’intégration de données massives dans les fonctionnalités d’etl. In INFORSID (pp. 61–76). Academic Press.
2. Integrating Big Data: A Semantic Extract-Transform-Load Framework
3. Data and Information Quality
4. Social big data: Recent achievements and new challenges
5. Benkhaled, H. N., & Berrabah, D. (2019). Data Quality Management For Data Warehouse Systems: State Of The Art. In Proceedings of JERI 2019. Academic Press.
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献