A Mongolian–Chinese Neural Machine Translation Method Based on Semantic-Context Data Augmentation

Published: 2024-04-19 Issue: 8 Volume: 14 Page: 3442
ISSN: 2076-3417
Container-title: Applied Sciences
Language: en
Short-container-title: Applied Sciences

Author:

Zhang Huinuan¹, Ji Yatu¹, Wu Nier¹, Lu Min¹

Affiliation:

1. School of Information Engineering, Inner Mongolia University of Technology, Hohhot 010051, China

Abstract

Neural machine translation (NMT) typically relies on large bilingual parallel corpora for effective training. Mongolian is a low-resource language with relatively few parallel corpora, which results in poor translation performance. Data augmentation (DA) is a practical and promising way to mitigate data sparsity and monotonous semantic structure by expanding the size and structural variety of the available data. To address data sparsity and semantic inconsistency in Mongolian–Chinese NMT, this paper proposes a new semantic-context DA method. The method adds a semantic encoder to the original translation model; the encoder uses both the source and target sentences to generate different semantic vectors that enhance each training instance. The results show that the method significantly improves Mongolian–Chinese translation quality, with a gain of approximately 2.5 BLEU points over the basic Transformer model. The method also matches the basic model's translation results with about half of the data, which greatly improves translation efficiency.

Funder

National Natural Science Foundation of China

Fundamental Research Fund

Research program of science and technology at Universities of Inner Mongolia Autonomous Region

Science Research Foundation of Inner Mongolia University of Technology

Fundamental Research Fund Project

Publisher

MDPI AG

Link

https://www.mdpi.com/2076-3417/14/8/3442/pdf
