Affiliation:
1. Univ. Adolfo Ibáñez, Santiago, Chile and RelationalAI Inc., Toronto, Canada
2. University of Antwerp, Antwerp, Belgium
Abstract
In this work, we provide some insights and develop some ideas, with few technical details, about the role of explanations in Data Quality in the context of data-based machine learning models (ML). In this direction, there are, as expected, roles for causality, and
explainable artificial intelligence
. The latter area not only sheds light on the models, but also on the data that support model construction. There is also room for defining, identifying, and explaining errors in data, in particular, in ML, and also for suggesting repair actions. More generally, explanations can be used as a basis for defining dirty data in the context of ML, and measuring or quantifying them. We think dirtiness as relative to the ML task at hand, e.g., classification.
Publisher
Association for Computing Machinery (ACM)
Subject
Information Systems and Management,Information Systems
Reference40 articles.
1. ERBlox: Combining matching dependencies with machine learning for entity resolution
2. Ontological Multidimensional Data Models and Contextual Data Quality
3. Data quality is context dependent. In Proc. of the Workshop on Enabling Real-Time Business Intelligence (BIRTE) Collocated with the International Conference on Very Large Data Bases (VLDB);Bertossi L.;Springer LNBIP,2011
4. From Causes for Database Queries to Repairs and Model-Based Diagnosis and Back
Cited by
27 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献