Affiliation:
1. The Hebrew University of Jerusalem, Jerusalem, Israel
2. IBM Almaden Research Center, San Jose, CA
Abstract
Constraints are important, not only for maintaining data integrity, but also because they capture natural probabilistic dependencies among data items. A
probabilistic XML database
(PXDB) is the probability subspace comprising the instances of a
p-document
that satisfy a set of constraints. In contrast to existing models that can express probabilistic dependencies, it is shown that query evaluation is tractable in PXDBs. The problems of sampling and determining well-definedness (i.e., whether the aforesaid subspace is nonempty) are also tractable. Furthermore, queries and constraints can include the aggregate functions
count, max, min,
and
ratio.
Finally, this approach can be easily extended to allow a probabilistic interpretation of constraints.
Funder
Israel Science Foundation
Publisher
Association for Computing Machinery (ACM)
Cited by
9 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. An approach of top-k keyword querying for fuzzy XML;Computing;2017-10-20
2. Structurally Tractable Uncertain Data;Proceedings of the 2015 ACM SIGMOD on PhD Symposium;2015-05-31
3. Consistency checking and querying in probabilistic databases under integrity constraints;Journal of Computer and System Sciences;2014-11
4. Quasi-SLCA Based Keyword Query Processing over Probabilistic XML Data;IEEE Transactions on Knowledge and Data Engineering;2014-04
5. Probabilistic XML: Models and Complexity;Advances in Probabilistic Databases for Uncertain Information Management;2013