Natural Questions: A Benchmark for Question Answering Research

Published:2019-11 Volume:7 Page:453-466
ISSN:2307-387X
Container-title:Transactions of the Association for Computational Linguistics
Language:en
Short-container-title:Transactions of the Association for Computational Linguistics

Author:

Kwiatkowski Tom¹,Palomaki Jennimaria²,Redfield Olivia²,Collins Michael²,Parikh Ankur²,Alberti Chris²,Epstein Danielle²,Polosukhin Illia²,Devlin Jacob²,Lee Kenton²,Toutanova Kristina²,Jones Llion²,Kelcey Matthew²,Chang Ming-Wei²,Dai Andrew M.²,Uszkoreit Jakob²,Le Quoc²,Petrov Slav²

Affiliation:

1. Google Research

2. Google Research

Abstract

We present the Natural Questions corpus, a question answering data set. Questions consist of real anonymized, aggregated queries issued to the Google search engine. An annotator is presented with a question along with a Wikipedia page from the top 5 search results, and annotates a long answer (typically a paragraph) and a short answer (one or more entities) if present on the page, or marks null if no long/short answer is present. The public release consists of 307,373 training examples with single annotations; 7,830 examples with 5-way annotations for development data; and a further 7,842 examples with 5-way annotations, sequestered as test data. We present experiments validating the quality of the data. We also describe an analysis of 25-way annotations on 302 examples, giving insights into human variability on the annotation task. We introduce robust metrics for the purposes of evaluating question answering systems; demonstrate high human upper bounds on these metrics; and establish baseline results using competitive methods drawn from related literature.

Publisher

MIT Press - Journals

Subject

Artificial Intelligence,Computer Science Applications,Linguistics and Language,Human-Computer Interaction,Communication

Link

https://www.mitpressjournals.org/doi/pdf/10.1162/tacl_a_00276

References: 25 articles.

1. A large annotated corpus for learning natural language inference

2. Reading Wikipedia to Answer Open-Domain Questions

3. QuAC: Question Answering in Context

4. Simple and Effective Multi-Paragraph Reading Comprehension

Cited by 415 articles.

1. Adversarial Entity Graph Convolutional Networks for multi-hop inference question answering;Expert Systems with Applications;2024-12

2. Large language models present new questions for decision support;International Journal of Information Management;2024-12

3. TPKE-QA: A gapless few-shot extractive question answering approach via task-aware post-training and knowledge enhancement;Expert Systems with Applications;2024-11

4. An Efficient Corpus Indexer for dynamic corpora retrieval;Expert Systems with Applications;2024-11

5. Event extraction as machine reading comprehension with question-context bridging;Knowledge-Based Systems;2024-09