A survey of methods for revealing and overcoming weaknesses of data-driven Natural Language Understanding

Published: 2022-04-22 Issue: 1 Volume: 29 Pages: 1-31
ISSN: 1351-3249
Container-title: Natural Language Engineering
Language: en
Short-container-title: Nat. Lang. Eng.

Author:

Schlegel Viktor, Nenadic Goran, Batista-Navarro Riza

Abstract

Recent years have seen a growing number of publications that analyse Natural Language Understanding (NLU) datasets for superficial cues, whether they undermine the complexity of the tasks underlying those datasets and how they impact those models that are optimised and evaluated on this data. This structured survey provides an overview of the evolving research area by categorising reported weaknesses in models and datasets and the methods proposed to reveal and alleviate those weaknesses for the English language. We summarise and discuss the findings and conclude with a set of recommendations for possible future research directions. We hope that it will be a useful resource for researchers who propose new datasets to assess the suitability and quality of their data to evaluate various phenomena of interest, as well as those who propose novel NLU approaches, to further understand the implications of their improvements with respect to their model’s acquired capabilities.

Publisher

Cambridge University Press (CUP)

Subject

Artificial Intelligence, Linguistics and Language, Language and Linguistics, Software

References: 194 articles.

1. SWAG: A Large-Scale Adversarial Dataset for Grounded Commonsense Inference

2. Chen, M., D’Arcy, M., Liu, A., Fernandez, J. and Downey, D. (2019). CODAH: An Adversarially-Authored Question Answering Dataset for Common Sense.

3. Möller, T., Reina, A., Jayakumar, R. and Pietsch, M. (2020). COVID-QA: A question answering dataset for COVID-19. In ACL 2020 Workshop on Natural Language Processing for COVID-19 (NLP-COVID).

4. emrQA: A Large Corpus for Question Answering on Electronic Medical Records

5. Look at the First Sentence: Position Bias in Question Answering