Maestro: Automatic Generation of Comprehensive Benchmarks for Question Answering Over Knowledge Graphs-Reference-Cited by-同舟云学术

Maestro: Automatic Generation of Comprehensive Benchmarks for Question Answering Over Knowledge Graphs

Published:2023-06-13 Issue:2 Volume:1 Page:1-24
ISSN:2836-6573
Container-title:Proceedings of the ACM on Management of Data
language:en
Short-container-title:Proc. ACM Manag. Data

Author:

Orogat Abdelghny¹^ORCID,El-Roby Ahmed¹^ORCID

Affiliation:

1. Carleton University, Ottawa, ON, Canada

Abstract

Recently, there has been an upsurge in the number of knowledge graphs (KG) that can only be accessed by experts. Non-expert users lack an adequate understanding of the queried knowledge graph's vocabulary and structure, as well as the syntax of the structured query language used to express the user's information needs. To increase the user base of these KGs, a set of Question Answering (QA) systems that use natural language to query these knowledge graphs have been introduced. However, finding a benchmark that accurately evaluates the quality of a QA system is a difficult task due to (1) the high degree of variation in the fine-grained properties among the existing benchmarks, (2) the static nature of the existing benchmarks versus the evolving nature of KGs, and (3) the limited number of KGs targeted by existing benchmarks, which hinders the usability of QA systems in real-world deployment over KGs that are different from those that were used in the evaluation of the QA systems. In this paper, we introduce Maestro, a benchmark generation system for question answering over knowledge graphs. Maestro can generate a new benchmark for any KG given the KG and, optionally, a text corpus that covers this KG. The benchmark generated by Maestro is guaranteed to cover all the properties of the natural language questions and queries that were encountered in the literature as long as the targeted KG includes these properties. Maestro also generates high-quality natural language questions with various utterances that are on par with manually-generated ones to better evaluate QA systems.

Publisher

Association for Computing Machinery (ACM)

Link

https://dl.acm.org/doi/pdf/10.1145/3589322

Reference43 articles.

1. SPARQL 1.1 query language. http://www.w3.org/TR/sparql11-query/ , 2013 . SPARQL 1.1 query language. http://www.w3.org/TR/sparql11-query/, 2013.

2. RDF 1.1 concepts and abstract syntax. http://www.w3.org/TR/2014/REC-rdf11-concepts-20140225/ , 2014 . RDF 1.1 concepts and abstract syntax. http://www.w3.org/TR/2014/REC-rdf11-concepts-20140225/, 2014.

3. A. Abujabal , R. Saha Roy , M. Yahya , and G. Weikum . ComQA: A community-sourced dataset for complex factoid question answering with paraphrase clusters . In Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HL) , 2019 . A. Abujabal, R. Saha Roy, M. Yahya, and G. Weikum. ComQA: A community-sourced dataset for complex factoid question answering with paraphrase clusters. In Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HL), 2019.

4. Automated Template Generation for Question Answering over Knowledge Graphs

5. DBpedia: A Nucleus for a Web of Open Data

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A Multi-Task Learning Framework for Reading Comprehension of Scientific Tabular Data;2024 IEEE 40th International Conference on Data Engineering (ICDE);2024-05-13