Affiliation:
1. Meltwater
2. Arizona State University
3. EURECOM
Abstract
RuDiK is a system for the discovery of declarative rules over knowledge-bases (KBs). RuDiK discovers both
positive
rules, which identify relationships between entities, e.g., "if two persons have the same parent, they are siblings", and
negative
rules, which identify data contradictions, e.g., "if two persons are married, one cannot be the child of the other". Rules help domain experts to curate data in large KBs. Positive rules suggest new facts to mitigate incompleteness and negative rules detect erroneous facts. Also, negative rules are useful to generate negative examples for learning algorithms. RuDiK goes beyond existing solutions since it discovers rules with a more
expressive rule language
w.r.t. previous approaches, which leads to wide coverage of the facts in the KB, and its mining is robust to existing
errors and incompleteness in the KB.
The system has been deployed for multiple KBs, including Yago, DBpedia, Freebase and Wiki-Data, and identifies new facts and real errors with 85% to 97% accuracy, respectively. This demonstration shows how RuDiK can be used to interact with domain experts. Once the audience pick a KB and a predicate, they will add new facts, remove errors, and train a machine learning system with automatically generated examples.
Subject
General Earth and Planetary Sciences,Water Science and Technology,Geography, Planning and Development
Cited by
25 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献