Boundary-Aware Abstractive Summarization with Entity-Augmented Attention for Enhancing Faithfulness

Authors:

Li Jiuyi (1), Liu Junpeng (1), Ma Jianjun (1), Yang Wei (1), Huang Degen (1)

Affiliation:

1. Dalian University of Technology, Dalian, China

Abstract

With the successful application of deep learning, document summarization systems can produce more readable results. However, abstractive summarization still suffers from unfaithful outputs and factual errors, especially in named entities. Current approaches tend to employ external knowledge to improve model performance while neglecting the boundary information and the semantics of the entities. In this article, we propose an entity-augmented method (EAM) that encourages the model to make full use of entity boundary information and to pay more attention to critical entities. Experimental results on three Chinese and English summarization datasets show that our method outperforms several strong baselines and achieves state-of-the-art performance on the CLTS dataset. Our method also improves the faithfulness of the generated summaries and generalizes well across different pre-trained language models. Moreover, we propose a method to evaluate the integrity of generated entities. In addition, we adapt the data augmentation method of the FactCC model to the grammatical differences between Chinese and English and train a new model for factual-consistency evaluation of Chinese summarization.
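The page gives no implementation details for the entity-augmented attention, so the following is only a minimal sketch of the general idea: entity boundaries are assumed to be supplied as a binary mask over source tokens and used to bias cross-attention scores toward entity tokens. The function name, the additive bias, and the `bias_weight` hyper-parameter are illustrative assumptions, not the authors' exact formulation.

```python
import torch


def entity_biased_attention(query, key, value, entity_mask, bias_weight=1.0):
    """Scaled dot-product cross-attention with an additive bias on entity tokens.

    query:       (batch, tgt_len, d)  decoder hidden states
    key, value:  (batch, src_len, d)  encoder hidden states
    entity_mask: (batch, src_len)     1.0 for source tokens inside an entity span, else 0.0
    bias_weight: assumed hyper-parameter controlling how strongly entities are favoured
    """
    d = query.size(-1)
    scores = torch.matmul(query, key.transpose(-2, -1)) / d ** 0.5  # (batch, tgt_len, src_len)
    scores = scores + bias_weight * entity_mask.unsqueeze(1)        # raise scores of entity tokens
    weights = torch.softmax(scores, dim=-1)
    return torch.matmul(weights, value), weights


# toy usage: 4 source tokens, of which tokens 1-2 form a named entity
q, k, v = torch.randn(1, 2, 8), torch.randn(1, 4, 8), torch.randn(1, 4, 8)
mask = torch.tensor([[0.0, 1.0, 1.0, 0.0]])
context, attn = entity_biased_attention(q, k, v, mask)
```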
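Likewise, the abstract only names the entity-integrity evaluation without defining it. A minimal sketch, assuming an entity in a generated summary counts as intact only when it exactly reproduces an entity string extracted from the source (the matching rule and the reliance on an external NER step are assumptions):

```python
def entity_integrity(source_entities, summary_entities):
    """Fraction of generated entities that exactly match a source entity string.

    Truncated or altered entities (e.g. "Dalian University" instead of
    "Dalian University of Technology") are counted as broken.
    """
    if not summary_entities:
        return 1.0  # convention assumed here: nothing generated, nothing broken
    source_set = set(source_entities)
    intact = sum(1 for entity in summary_entities if entity in source_set)
    return intact / len(summary_entities)


# toy usage with pre-extracted entity strings
print(entity_integrity(["Dalian University of Technology", "China"],
                       ["Dalian University", "China"]))  # 0.5
```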
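Finally, FactCC builds its training data by corrupting faithful claims with transformations such as entity and number swaps; how the authors adapt this to Chinese is not stated on this page. The sketch below shows one such transformation under the assumption that the adaptation replaces whole entity surface strings rather than whitespace-delimited tokens, since Chinese text has no word boundaries; the function name and data layout are illustrative.

```python
import random


def entity_swap_example(claim, claim_entities, corpus_entities, rng=None):
    """Build a FactCC-style negative example by swapping one entity in a faithful claim.

    claim:           a Chinese sentence that is faithful to its source document
    claim_entities:  list of (surface_string, entity_type) found in the claim
    corpus_entities: dict mapping entity_type -> replacement candidates from the corpus
    Returns (possibly corrupted claim, label).
    """
    rng = rng or random.Random(0)
    if not claim_entities:
        return claim, "CORRECT"
    surface, ent_type = rng.choice(claim_entities)
    candidates = [e for e in corpus_entities.get(ent_type, []) if e != surface]
    if not candidates:
        return claim, "CORRECT"
    # whole-string replacement instead of token replacement: Chinese has no word boundaries
    return claim.replace(surface, rng.choice(candidates), 1), "INCORRECT"


# toy usage
print(entity_swap_example("该方法由大连理工大学提出。",
                          [("大连理工大学", "ORG")],
                          {"ORG": ["清华大学", "北京大学"]}))
```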

Funder

Key Research and Development Program of Yunnan Province

National Natural Science Foundation of China

Publisher

Association for Computing Machinery (ACM)
