Author:
Lu Lihua,Zhang Hengzhen,Gao Xiao-Zhi
Abstract
Purpose
– Data integration is to combine data residing at different sources and to provide the users with a unified interface of these data. An important issue on data integration is the existence of conflicts among the different data sources. Data sources may conflict with each other at data level, which is defined as data inconsistency. The purpose of this paper is to aim at this problem and propose a solution for data inconsistency in data integration.
Design/methodology/approach
– A relational data model extended with data source quality criteria is first defined. Then based on the proposed data model, a data inconsistency solution strategy is provided. To accomplish the strategy, fuzzy multi-attribute decision-making (MADM) approach based on data source quality criteria is applied to obtain the results. Finally, users feedbacks strategies are proposed to optimize the result of fuzzy MADM approach as the final data inconsistent solution.
Findings
– To evaluate the proposed method, the data obtained from the sensors are extracted. Some experiments are designed and performed to explain the effectiveness of the proposed strategy. The results substantiate that the solution has a better performance than the other methods on correctness, time cost and stability indicators.
Practical implications
– Since the inconsistent data collected from the sensors are pervasive, the proposed method can solve this problem and correct the wrong choice to some extent.
Originality/value
– In this paper, for the first time the authors study the effect of users feedbacks on integration results aiming at the inconsistent data.
Reference27 articles.
1. Agarwal, S.
,
Keller, A.M.
,
Wiederhold, G.
and
Saraswat, K.
(1995), “Flexible relation: an approach for integrating data from multiple possibly inconsistent databases”, Proceedings of the 11th International Conference on Data Engineering, pp. 495-504.
2. Andez, M.A.H.
,
Stolfo, S.J.
and
Fayyad, U.
(1998), “Real-world data is dirty: data cleansing and the merge/purge problem”,
Data Mining and Knowledge Discovery
, Vol. 2 No. 1, pp. 9-37.
3. Anokhin, P.
(2001), “Data inconsistency detection and resolution in the integration of heterogeneous information sources”, PhD thesis, School of Information Technology and Engineering, George Mason University, Fairfax, VA.
4. Belhajjame, K.
,
Paton, N.W.
,
Embury, S.M.
,
Fernandes, A.A.A.
and
Hedeler, C.
(2010), “Feedback-based annotation, selection and refinement of schema mappings for dataspaces”, EDBT, ACM International Conference Proceeding Series, pp. 573-584.
5. Beníte, J.M.
,
Martín, J.C.
and
Romían, C.
(2007), “Using fuzzy number for measuring quality of service in the hotel industry”,
Tourism Management
, Vol. 28 No. 2, pp. 544-555.
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献