Unified Training for Cross-Lingual Abstractive Summarization by Aligning Parallel Machine Translation Pairs

Author:

Cheng Shaohuan 1 (ORCID), Chen Wenyu 1, Tang Yujia 1, Fu Mingsheng 1, Qu Hong 1 (ORCID)

Affiliation:

1. School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China

Abstract

Cross-lingual summarization (CLS) is essential for global communication, as it enables efficient information exchange across languages. However, owing to the scarcity of CLS data, recent studies have adopted multi-task frameworks that augment CLS training with parallel monolingual summarization pairs. Because the output languages of the two tasks differ, these methods often resort to independent decoders or models with non-shared parameters, which limits knowledge transfer between CLS and its parallel data. To address this issue, we propose a unified training method for CLS that pairs parallel machine translation (MT) examples with CLS examples and trains them jointly within a single model. This design keeps the input and output languages consistent and promotes knowledge sharing between the two tasks. To further strengthen the model's focus on key information, we introduce two additional loss terms that align the hidden representations and the output probability distributions of the parallel MT and CLS pairs. Experimental results demonstrate that our method outperforms competitive baselines in both full-dataset and low-resource scenarios on two benchmark datasets, Zh2EnSum and En2ZhSum.
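Read as a recipe, the abstract suggests a single shared encoder-decoder trained on both CLS and MT pairs, with two alignment penalties added on top of the usual generation losses. The sketch below is a minimal PyTorch-style illustration under stated assumptions, not the paper's implementation: the model interface (returning loss, hidden_states, and logits per pair), the mean-pooling over decoder positions, the choice of MSE and KL divergence for the two alignment terms, and the weights alpha/beta are all hypothetical.

```python
import torch.nn.functional as F

def unified_training_loss(model, cls_batch, mt_batch, alpha=1.0, beta=1.0):
    """Sketch of a joint CLS + MT objective with two alignment terms.

    Hypothetical interface: `model` is a shared encoder-decoder whose
    forward pass returns per-pair generation loss, decoder hidden states
    of shape (batch, seq, hidden), and token logits of shape
    (batch, seq, vocab). `cls_batch` and `mt_batch` hold parallel pairs
    whose targets are in the same output language, so one decoder
    serves both tasks.
    """
    # Standard generation losses for each task (shared parameters).
    cls_out = model(**cls_batch)   # cross-lingual summarization pair
    mt_out = model(**mt_batch)     # parallel machine translation pair
    loss_gen = cls_out.loss + mt_out.loss

    # Alignment term 1: pull the decoder hidden representations of the
    # parallel CLS and MT targets toward each other. Mean-pooling over
    # target positions is a simplification, since the two parallel
    # targets generally differ in length.
    h_cls = cls_out.hidden_states.mean(dim=1)   # (batch, hidden)
    h_mt = mt_out.hidden_states.mean(dim=1)
    loss_hidden = F.mse_loss(h_cls, h_mt)

    # Alignment term 2: match the output probability distributions of
    # the two parallel targets; logits are pooled the same way, then
    # compared with KL divergence (log-probs vs. probs, per F.kl_div).
    log_p_cls = F.log_softmax(cls_out.logits.mean(dim=1), dim=-1)
    p_mt = F.softmax(mt_out.logits.mean(dim=1), dim=-1)
    loss_dist = F.kl_div(log_p_cls, p_mt, reduction="batchmean")

    return loss_gen + alpha * loss_hidden + beta * loss_dist
```

Under these assumptions, each training step would draw one CLS pair and its parallel MT pair, compute this combined loss, and backpropagate through the single shared model; alpha and beta would be tuned on validation data.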

Funder

Young Scientists Fund of the Natural Science Foundation of Sichuan Province

Publisher

MDPI AG

