MKQA: A Linguistically Diverse Benchmark for Multilingual Open Domain Question Answering-Reference-Cited by-同舟云学术

MKQA: A Linguistically Diverse Benchmark for Multilingual Open Domain Question Answering

Published:2021 Issue: Volume:9 Page:1389-1406
ISSN:2307-387X
Container-title:Transactions of the Association for Computational Linguistics
language:en
Short-container-title:

Author:

Longpre Shayne¹,Lu Yi²,Daiber Joachim³

Affiliation:

1. Apple Inc. slongpre@mit.edu

2. Apple Inc. ylu7@apple.com

3. Apple Inc. jodaiber@apple.com

Abstract

Abstract Progress in cross-lingual modeling depends on challenging, realistic, and diverse evaluation sets. We introduce Multilingual Knowledge Questions and Answers (MKQA), an open- domain question answering evaluation set comprising 10k question-answer pairs aligned across 26 typologically diverse languages (260k question-answer pairs in total). Answers are based on heavily curated, language- independent data representation, making results comparable across languages and independent of language-specific passages. With 26 languages, this dataset supplies the widest range of languages to-date for evaluating question answering. We benchmark a variety of state- of-the-art methods and baselines for generative and extractive question answering, trained on Natural Questions, in zero shot and translation settings. Results indicate this dataset is challenging even in English, but especially in low-resource languages.1

Publisher

MIT Press - Journals

Subject

Artificial Intelligence,Computer Science Applications,Linguistics and Language,Human-Computer Interaction,Communication

Link

https://direct.mit.edu/tacl/article-pdf/doi/10.1162/tacl_a_00433/1976187/tacl_a_00433.pdf

Reference38 articles.

1. Translation artifacts in cross-lingual transfer learning;Artetxe,2020

2. On the cross-lingual transferability of monolingual representations;Artetxe,2020

3. XOR QA: Cross-lingual open- retrieval question answering;Asai,2021

4. A thorough examination of the CNN/Daily Mail reading comprehension task;Chen,2016

5. Reading Wikipedia to answer open-domain questions;Chen,2017

Cited by 28 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Building efficient and effective OpenQA systems for low-resource languages;Knowledge-Based Systems;2024-10

2. EHMMQA: English, Hindi, and Marathi multilingual question answering framework using deep learning;Natural Language Processing;2024-05-24

3. Is word order considered by foundation models? A comparative task-oriented analysis;Expert Systems with Applications;2024-05

4. ArQuAD: An Expert-Annotated Arabic Machine Reading Comprehension Dataset;Cognitive Computation;2024-03-11

5. Semantic search as extractive paraphrase span detection;Language Resources and Evaluation;2024-02-01