On the expressiveness of implicit provenance in query and update languages

Author:

Buneman Peter1,Cheney James1,Vansummeren Stijn2

Affiliation:

1. University of Edinburgh, Scotland, UK

2. Hasselt University and Transnational University of Limburg, Diepenbeek, Belgium

Abstract

Information describing the origin of data, generally referred to as provenance , is important in scientific and curated databases where it is the basis for the trust one puts in their contents. Since such databases are constructed using operations of both query and update languages, it is of paramount importance to describe the effect of these languages on provenance. In this article we study provenance for query and update languages that are closely related to SQL, and compare two ways in which they can manipulate provenance so that elements of the input are rearranged to elements of the output: implicit provenance , where a query or update only provides the rearranged output, and provenance is provided implicitly by a default provenance semantics; and explicit provenance , where a query or update provides both the output and the description of the provenance of each component of the output. Although explicit provenance is in general more expressive, we show that the classes of implicit provenance operations expressible by query and update languages correspond to natural semantic subclasses of the explicit provenance queries. One of the consequences of this study is that provenance separates the expressive power of query and update languages. The model is also relevant to annotation propagation schemes in which annotations on the input to a query or update have to be transferred to the output or vice versa.

Publisher

Association for Computing Machinery (ACM)

Subject

Information Systems

Reference30 articles.

1. Abiteboul S. Hull R. and Vianu V. 1995. Foundations of Databases. Addison-Wesley New York. Abiteboul S. Hull R. and Vianu V. 1995. Foundations of Databases. Addison-Wesley New York.

2. On genericity and parametricity (extended abstract)

3. An annotation management system for relational databases

Cited by 45 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Modeling the Data Provenance of Relational Databases Supporting Full-Featured SQL and Procedural Languages;Applied Sciences;2022-12-21

2. On Optimizing the Trade-off between Privacy and Utility in Data Provenance;Proceedings of the 2021 International Conference on Management of Data;2021-06-09

3. Data Provenance;Foundations and Trends® in Databases;2021

4. Equivalence-Invariant Algebraic Provenance for Hyperplane Update Queries;Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data;2020-06-11

5. Hypothetical Reasoning via Provenance Abstraction;Proceedings of the 2019 International Conference on Management of Data;2019-06-25

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3