Discovering denial constraints-Reference-Cited by-同舟云学术

Discovering denial constraints

Published:2013-08-29 Issue:13 Volume:6 Page:1498-1509
ISSN:2150-8097
Container-title:Proceedings of the VLDB Endowment
language:en
Short-container-title:Proc. VLDB Endow.

Author:

Chu Xu¹,Ilyas Ihab F.²,Papotti Paolo²

Affiliation:

1. University of Waterloo

2. QCRI

Abstract

Integrity constraints (ICs) provide a valuable tool for enforcing correct application semantics. However, designing ICs requires experts and time. Proposals for automatic discovery have been made for some formalisms, such as functional dependencies and their extension conditional functional dependencies. Unfortunately, these dependencies cannot express many common business rules. For example, an American citizen cannot have lower salary and higher tax rate than another citizen in the same state. In this paper, we tackle the challenges of discovering dependencies in a more expressive integrity constraint language, namely Denial Constraints (DCs). DCs are expressive enough to overcome the limits of previous languages and, at the same time, have enough structure to allow efficient discovery and application in several scenarios. We lay out theoretical and practical foundations for DCs, including a set of sound inference rules and a linear algorithm for implication testing. We then develop an efficient instance-driven DC discovery algorithm and propose a novel scoring function to rank DCs for user validation. Using real-world and synthetic datasets, we experimentally evaluate scalability and effectiveness of our solution.

Publisher

VLDB Endowment

Subject

General Earth and Planetary Sciences,Water Science and Technology,Geography, Planning and Development

Link

https://dl.acm.org/doi/pdf/10.14778/2536258.2536262

Cited by 148 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. An incremental algorithm for repairing denial constraint violations;Information Systems;2024-12

2. Efficient Validation of SHACL Shapes with Reasoning;Proceedings of the VLDB Endowment;2024-07

3. BUNNI: Learning Repair Actions in Rule-driven Data Cleaning;Journal of Data and Information Quality;2024-06-24

4. Cocoon: Semantic Table Profiling Using Large Language Models;Proceedings of the 2024 Workshop on Human-In-the-Loop Data Analytics;2024-06-14

5. CaFA: Cost-aware, Feasible Attacks With Database Constraints Against Neural Tabular Classifiers;2024 IEEE Symposium on Security and Privacy (SP);2024-05-19