Affiliation:
1. Foreign Language and Literature Institute, Xi’an International Studies University, China
2. School of Aeronautics, Northwestern Polytechnical University, China
Abstract
With the rapid development of deep learning methods, neural machine translation (NMT) has attracted increasing attention in recent years. However, the lack of bilingual resources severely degrades the performance of low-resource NMT models. To overcome this problem, several studies have focused on transferring knowledge from high-resource language pairs to low-resource language pairs. However, these methods usually consider only a single granularity of language, and parameter sharing across different granularities in NMT is not well studied. In this article, we propose to improve parameter sharing in low-resource NMT by introducing multi-granularity knowledge at the word, phrase, and sentence levels. This knowledge can be monolingual or bilingual. We build a knowledge-sharing model for low-resource NMT based on a multi-task learning framework, selecting three auxiliary tasks for the low-resource NMT task: syntax parsing, cross-lingual named entity recognition, and natural language generation. Experimental results show that the proposed method consistently outperforms six strong baseline systems on several low-resource language pairs.
Funder
National Natural Science Foundation of China
Construction of the International Communication Competences
Shaanxi Federation of Social Sciences Circles
Scientific Research Program
Shaanxi Provincial Education Department
Publisher
Association for Computing Machinery (ACM)