Cross-Lingual Natural Language Generation via Pre-Training-Reference-Cited by-同舟云学术

Cross-Lingual Natural Language Generation via Pre-Training

Published:2020-04-03 Issue:05 Volume:34 Page:7570-7577
ISSN:2374-3468
Container-title:Proceedings of the AAAI Conference on Artificial Intelligence
language:
Short-container-title:AAAI

Author:

Chi Zewen,Dong Li,Wei Furu,Wang Wenhui,Mao Xian-Ling,Huang Heyan

Abstract

In this work we focus on transferring supervision signals of natural language generation (NLG) tasks between multiple languages. We propose to pretrain the encoder and the decoder of a sequence-to-sequence model under both monolingual and cross-lingual settings. The pre-training objective encourages the model to represent different languages in the shared space, so that we can conduct zero-shot cross-lingual transfer. After the pre-training procedure, we use monolingual data to fine-tune the pre-trained model on downstream NLG tasks. Then the sequence-to-sequence model trained in a single language can be directly evaluated beyond that language (i.e., accepting multi-lingual input and producing multi-lingual output). Experimental results on question generation and abstractive summarization show that our model outperforms the machine-translation-based pipeline methods for zero-shot cross-lingual generation. Moreover, cross-lingual transfer improves NLG performance of low-resource languages by leveraging rich-resource language data. Our implementation and data are available at https://github.com/CZWin32768/xnlg.

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 32 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A systematic literature review of hate speech identification on Arabic Twitter data: research challenges and future directions;PeerJ Computer Science;2024-04-02

2. Product promotion copywriting from multimodal data: New benchmark and model;Neurocomputing;2024-03

3. Quantum Natural Language Processing: A Comprehensive Survey;IEEE Access;2024

4. Can Pretrained English Language Models Benefit Non-English NLP Systems in Low-Resource Scenarios?;IEEE/ACM Transactions on Audio, Speech, and Language Processing;2024

5. Oversea Cross-Lingual Summarization Service in Multilanguage Pre-Trained Model through Knowledge Distillation;Electronics;2023-12-14