Research on Mongolian-Chinese machine translation based on the end-to-end neural network

Published:2019-03-05 Issue:01 Volume:18 Page:1941003
ISSN:0219-6913
Container-title:International Journal of Wavelets, Multiresolution and Information Processing
language:en
Short-container-title:Int. J. Wavelets Multiresolut Inf. Process.

Author:

Qing-Dao-Er-Ji Ren¹, Su Yila¹, Wu Nier¹

Affiliation:

1. School of Information Engineering, Inner Mongolia University of Technology, 49 Aimin Street Xincheng District, Hohhot 010051, P. R. China

Abstract

With the development of natural language processing, neural machine translation based on end-to-end (E2E) neural network models has gradually become a research focus because of its high translation accuracy and its ability to preserve the semantics of the source text. However, problems remain, such as limited vocabulary and low translation fidelity. In this paper, the discriminant method and a Conditional Random Field (CRF) model were used to segment and label Mongolian stems and affixes in the preprocessing stage of the Mongolian-Chinese bilingual corpus. To address the low translation fidelity problem, a decoding model combining a Convolutional Neural Network (CNN) and a Gated Recurrent Unit (GRU) was constructed, with target-language decoding performed by the GRU. A global attention model was used to obtain bilingual word alignment information. Finally, translation quality was evaluated by Bilingual Evaluation Understudy (BLEU) and Perplexity (PPL) values. The improved model yields a BLEU value of 25.13 and a PPL value of [Formula: see text]. The experimental results show that the E2E Mongolian-Chinese neural machine translation model improves on traditional statistical methods and on machine translation models based on Recurrent Neural Networks (RNN) in terms of both translation quality and semantic confusion.
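
For readers unfamiliar with the PPL metric named in the abstract, the following is a minimal Python sketch of how perplexity is commonly computed from the probabilities a model assigns to the reference tokens. It is an illustration under stated assumptions, not the authors' evaluation code: the function name `perplexity` and the example probabilities are hypothetical, and the paper's exact normalization is not given in this record.

```python
import math

def perplexity(token_log_probs):
    """Perplexity from the natural-log probabilities a model assigned
    to each reference token: exp of the mean negative log-likelihood."""
    if not token_log_probs:
        raise ValueError("need at least one token")
    avg_nll = -sum(token_log_probs) / len(token_log_probs)
    return math.exp(avg_nll)

# Hypothetical per-token probabilities for a 4-token target sentence
# (illustrative values only, not taken from the paper).
probs = [0.5, 0.25, 0.5, 0.25]
ppl = perplexity([math.log(p) for p in probs])
```

Lower perplexity means the model is less uncertain about the reference translation, which is the sense in which the abstract pairs PPL with reduced semantic confusion.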

Funder

The Natural Science Foundation of Inner Mongolia

The Foundation of Autonomous regional civil committee of Inner Mongolia

The Inner Mongolia Science and Technology Plan Project

Publisher

World Scientific Pub Co Pte Lt

Subject

Applied Mathematics, Information Systems, Signal Processing

Link

https://www.worldscientific.com/doi/pdf/10.1142/S0219691319410030

References: 15 articles.

1. Describing Multimedia Content Using Attention-Based Encoder-Decoder Networks