Neural Abstractive Text Summarization with Sequence-to-Sequence Models

Published:2021-02-28 Issue:1 Volume:2 Page:1-37
ISSN:2691-1922
Container-title:ACM/IMS Transactions on Data Science
language:en
Short-container-title:ACM/IMS Trans. Data Sci.

Author:

Shi Tian¹, Keneshloo Yaser¹, Ramakrishnan Naren¹, Reddy Chandan K.¹

Affiliation:

1. Virginia Tech, Arlington, VA

Abstract

In the past few years, neural abstractive text summarization with sequence-to-sequence (seq2seq) models has gained a lot of popularity. Many interesting techniques have been proposed to improve seq2seq models, making them capable of handling different challenges, such as saliency, fluency, and human readability, and of generating high-quality summaries. Generally speaking, most of these techniques differ in one of three categories: network structure, parameter inference, and decoding/generation. There are also other concerns, such as efficiency and parallelism in training a model. In this article, we provide a comprehensive literature survey of different seq2seq models for abstractive text summarization from the viewpoint of network structures, training strategies, and summary generation algorithms. Several models were first proposed for language modeling and generation tasks, such as machine translation, and later applied to abstractive text summarization. Hence, we also provide a brief review of these models. As part of this survey, we also develop an open source library, namely, the Neural Abstractive Text Summarizer (NATS) toolkit, for abstractive text summarization. An extensive set of experiments has been conducted on the widely used CNN/Daily Mail dataset to examine the effectiveness of several different neural network components. Finally, we benchmark two models implemented in NATS on two recently released datasets, namely, Newsroom and Bytecup.

Funder

National Science Foundation

Publisher

Association for Computing Machinery (ACM)

Link

https://dl.acm.org/doi/pdf/10.1145/3419106

Reference 158 articles.

1. Maximum mutual information estimation of hidden Markov model parameters for speech recognition

Cited by 111 articles.

1. A systematic literature review of deep learning-based text summarization: Techniques, input representation, training strategies, mechanisms, datasets, evaluation, and challenges;Expert Systems with Applications;2024-10

2. A comprehensive survey for automatic text summarization: Techniques, approaches and perspectives;Neurocomputing;2024-10

3. Abstractive text summarization: State of the art, challenges, and improvements;Neurocomputing;2024-10

4. KEMoS: A knowledge-enhanced multi-modal summarizing framework for Chinese online meetings;Neural Networks;2024-10

5. Bibliography;Deep Learning;2024-07-05