Affiliation:
1. University of Edinburgh, Scotland, UK
2. Hasselt University and Transnational University of Limburg, Diepenbeek, Belgium
Abstract
Information describing the origin of data, generally referred to as provenance, is important in scientific and curated databases where it is the basis for the trust one puts in their contents. Since such databases are constructed using operations of both query and update languages, it is of paramount importance to describe the effect of these languages on provenance.
In this article we study provenance for query and update languages that are closely related to SQL, and compare two ways in which they can manipulate provenance so that elements of the input are rearranged to elements of the output: implicit provenance, where a query or update only provides the rearranged output, and provenance is provided implicitly by a default provenance semantics; and explicit provenance, where a query or update provides both the output and the description of the provenance of each component of the output. Although explicit provenance is in general more expressive, we show that the classes of implicit provenance operations expressible by query and update languages correspond to natural semantic subclasses of the explicit provenance queries.
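The distinction between the two styles can be illustrated with a small sketch. This is not the formal model of the article: the `Cell` type, the location strings such as `R[0].name`, and the helper functions are hypothetical names chosen for illustration, and the default semantics assumed here is simply "a copied cell keeps the location it was copied from".

```python
from typing import Callable, NamedTuple, Optional


class Cell(NamedTuple):
    value: object
    source: Optional[str]  # input location, or None if no origin is recorded


def annotate(table_name, rows):
    """Tag every input cell with its own location, e.g. 'R[0].name' (hypothetical scheme)."""
    return [
        {col: Cell(val, f"{table_name}[{i}].{col}") for col, val in row.items()}
        for i, row in enumerate(rows)
    ]


def implicit_project(rows, columns):
    """Implicit style: the query only rearranges cells.

    Provenance is never mentioned; under the assumed default semantics
    each copied cell simply carries its source along.
    """
    return [{col: row[col] for col in columns} for row in rows]


def explicit_project(rows, columns, provenance_of: Callable[[Cell], Optional[str]]):
    """Explicit style: the query states the provenance of every output cell itself."""
    return [
        {col: Cell(row[col].value, provenance_of(row[col])) for col in columns}
        for row in rows
    ]


R = annotate("R", [{"name": "a", "dept": "x"}, {"name": "b", "dept": "y"}])

# Same output values, different provenance behaviour.
implicit = implicit_project(R, ["name"])
explicit = explicit_project(R, ["name"], lambda cell: None)  # deliberately drops the origin
```

In this sketch the explicit version can assign any provenance it likes, including none, while the implicit version can only produce what the default semantics dictates. That is the informal sense in which explicit provenance is the more expressive of the two.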
One of the consequences of this study is that provenance separates the expressive power of query and update languages. The model is also relevant to annotation propagation schemes in which annotations on the input to a query or update have to be transferred to the output or vice versa.
Publisher
Association for Computing Machinery (ACM)
Cited by
45 articles.
1. Modeling the Data Provenance of Relational Databases Supporting Full-Featured SQL and Procedural Languages;Applied Sciences;2022-12-21
2. On Optimizing the Trade-off between Privacy and Utility in Data Provenance;Proceedings of the 2021 International Conference on Management of Data;2021-06-09
3. Data Provenance;Foundations and Trends® in Databases;2021
4. Equivalence-Invariant Algebraic Provenance for Hyperplane Update Queries;Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data;2020-06-11
5. Hypothetical Reasoning via Provenance Abstraction;Proceedings of the 2019 International Conference on Management of Data;2019-06-25