Deep Learning Based Abstractive Text Summarization: Approaches, Datasets, Evaluation Measures, and Challenges

Published: 2020-08-24 Volume: 2020 Page: 1-29
ISSN: 1024-123X
Container-title: Mathematical Problems in Engineering
Language: en

Author:

Suleiman Dima¹, Awajan Arafat¹

Affiliation:

1. Princess Sumaya University for Technology, Amman, Jordan

Abstract

In recent years, the volume of textual data has rapidly increased, which has generated a valuable resource for extracting and analysing information. To retrieve useful knowledge within a reasonable time period, this information must be summarised. This paper reviews recent approaches for abstractive text summarisation using deep learning models. In addition, existing datasets for training and validating these approaches are reviewed, and their features and limitations are presented. The Gigaword dataset is commonly employed for single-sentence summary approaches, while the Cable News Network (CNN)/Daily Mail dataset is commonly employed for multisentence summary approaches. Furthermore, the measures that are utilised to evaluate the quality of summarisation are investigated, and Recall-Oriented Understudy for Gisting Evaluation 1 (ROUGE1), ROUGE2, and ROUGE-L are determined to be the most commonly applied metrics. The challenges that are encountered during the summarisation process and the solutions proposed in each approach are analysed. The analysis of these approaches shows that recurrent neural networks with an attention mechanism and long short-term memory (LSTM) are the most prevalent techniques for abstractive text summarisation. The experimental results show that text summarisation with a pretrained encoder model achieved the highest values for ROUGE1, ROUGE2, and ROUGE-L (43.85, 20.34, and 39.9, respectively). Furthermore, it was determined that most abstractive text summarisation models faced challenges such as the unavailability of a golden token at testing time, out-of-vocabulary (OOV) words, summary sentence repetition, inaccurate sentences, and fake facts.

Publisher

Hindawi Limited

Subject

General Engineering, General Mathematics

Link

http://downloads.hindawi.com/journals/mpe/2020/9365340.pdf

References: 38 articles.

1. Text Summarization Techniques: A Brief Survey

2. Automatic Arabic text summarization: a survey

3. A Hybrid Approach for Arabic Text Summarization Using Domain Knowledge and Genetic Algorithms

4. A Study on Abstractive Summarization Techniques in Indian Languages

Cited by 65 articles.

1. A systematic literature review of deep learning-based text summarization: Techniques, input representation, training strategies, mechanisms, datasets, evaluation, and challenges; Expert Systems with Applications; 2024-10

2. Abstractive text summarization: State of the art, challenges, and improvements; Neurocomputing; 2024-10

3. Encoder-Decoder Transformers for Textual Summaries on Social Media Content; Automation, Control and Intelligent Systems; 2024-08-15

4. Text summarization based on semantic graphs: an abstract meaning representation graph-to-text deep learning approach; Journal of Big Data; 2024-07-14

5. Neural natural language processing for long texts: A survey on classification and summarization; Engineering Applications of Artificial Intelligence; 2024-07