A Survey on Machine Reading Comprehension—Tasks, Evaluation Metrics and Benchmark Datasets

Published: 2020-10-29 Issue: 21 Volume: 10 Page: 7640
ISSN: 2076-3417
Container-title: Applied Sciences
Language: en
Short-container-title: Applied Sciences

Author:

Zeng Changchang, Li Shaobo, Li Qin, Hu Jie, Hu Jianjun

Abstract

Machine Reading Comprehension (MRC) is a challenging Natural Language Processing (NLP) research field with wide real-world applications. The great progress of this field in recent years is mainly due to the emergence of large-scale datasets and deep learning. At present, many MRC models have already surpassed human performance on various benchmark datasets, despite the obvious gap between existing MRC models and genuine human-level reading comprehension. This shows the need to improve existing datasets, evaluation metrics, and models in order to move current MRC models toward “real” understanding. To address the current lack of a comprehensive survey of existing MRC tasks, evaluation metrics, and datasets, we (1) analyze 57 MRC tasks and datasets and propose a more precise classification method for MRC tasks based on 4 different attributes; (2) summarize 9 evaluation metrics of MRC tasks, as well as 7 attributes and 10 characteristics of MRC datasets; and (3) discuss key open issues in MRC research and highlight future research directions. In addition, we have collected, organized, and published our data on the companion website, where MRC researchers can directly access each MRC dataset along with its papers, baseline projects, and leaderboard.

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes, Computer Science Applications, Process Chemistry and Technology, General Engineering, Instrumentation, General Materials Science

Link

https://www.mdpi.com/2076-3417/10/21/7640/pdf

References: 118 articles.

1. Recent Trends in Deep Learning Based Natural Language Processing [Review Article]

2. DeepPatent: patent classification with convolutional neural networks and word embedding

3. Tourism Review Sentiment Classification Using a Bidirectional Recurrent Neural Network with an Attention Mechanism and Topic-Enriched Word Vectors

4. Reading Pictures for Story Comprehension Requires Mental Imagery Skills

5. Assessing the Benchmarking Capacity of Machine Reading Comprehension Datasets

Cited by 42 articles.

1. SESAME - self-supervised framework for extractive question answering over document collections;Journal of Intelligent Information Systems;2024-07-30

2. QLSC: A Query Latent Semantic Calibrator for Robust Extractive Question Answering;2024 International Joint Conference on Neural Networks (IJCNN);2024-06-30

3. A comparative evaluation for question answering over Greek texts by using machine translation and BERT;Language Resources and Evaluation;2024-06-19

4. Neural models for semantic analysis of handwritten document images;International Journal on Document Analysis and Recognition (IJDAR);2024-06-06

5. Numerical reasoning reading comprehension on Vietnamese COVID-19 news: task, corpus, and challenges;Neural Computing and Applications;2024-05-03