Neural Machine Translation of Electrical Engineering with Fusion of Memory Information

Published:2023-09-13 Issue:18 Volume:13 Page:10279
ISSN:2076-3417
Container-title:Applied Sciences
language:en

Author:

Chen Yuan¹², Liu Zikang²³, Zhang Juwei²³

Affiliation:

1. School of Foreign Languages, Henan University of Science and Technology, Luoyang 471023, China

2. Henan Province New Energy Vehicle Power Electronics and Power Transmission Engineering Research Center, Luoyang 471023, China

3. School of Information Engineering, Henan University of Science and Technology, Luoyang 471023, China

Abstract

This paper proposes a neural machine translation model for the electrical engineering domain that combines a transformer with gated recurrent unit (GRU) networks. By fusing global information and memory information, the model improves the performance of low-resource neural machine translation. Unlike a traditional transformer, the proposed model includes two different encoders: a global information encoder, which focuses on contextual information, and a memory encoder, which captures recurrent memory information. With these two types of attention, the model can encode both global and memory information and learn richer semantic knowledge. Because a transformer computes global attention for every word position, its time and space complexity both grow quadratically with the length of the source-language sequence. When the source-language sequence becomes too long, the performance of the transformer declines sharply. We therefore propose a GRU-based memory information encoder to mitigate this drawback. In the low-resource electrical engineering domain, the proposed model improves on the baseline model by up to 2.04 BLEU points.
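
The dual-encoder idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify how the two encoders are fused, so the fixed-weight average in `fuse` is a placeholder assumption, as are the single attention head without learned projections, the random GRU weights, and every function name below. The sketch only shows the contrast the abstract draws: the global encoder computes an n × n set of attention scores, while the GRU memory encoder makes one sequential pass of n steps.

```python
import math
import random


def matvec(W, x):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]


def sigmoid(v):
    """Elementwise logistic function."""
    return [1.0 / (1.0 + math.exp(-a)) for a in v]


def global_encode(X):
    """Global-encoder sketch: single-head scaled dot-product self-attention.

    Assumption: no learned query/key/value projections. Every position
    scores every other position, so n tokens produce n * n scores.
    """
    d = len(X[0])
    out = []
    for q in X:
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(d) for k in X]
        top = max(scores)  # subtract the max for a numerically stable softmax
        exps = [math.exp(s - top) for s in scores]
        total = sum(exps)
        weights = [e / total for e in exps]
        out.append([sum(w * v[j] for w, v in zip(weights, X)) for j in range(d)])
    return out


def memory_encode(X, params):
    """Memory-encoder sketch: a standard GRU, one recurrent step per token."""
    Wz, Uz, Wr, Ur, Wh, Uh = params
    h = [0.0] * len(X[0])
    states = []
    for x in X:
        z = sigmoid([a + b for a, b in zip(matvec(Wz, x), matvec(Uz, h))])  # update gate
        r = sigmoid([a + b for a, b in zip(matvec(Wr, x), matvec(Ur, h))])  # reset gate
        rh = [ri * hi for ri, hi in zip(r, h)]
        cand = [math.tanh(a + b) for a, b in zip(matvec(Wh, x), matvec(Uh, rh))]
        h = [(1 - zi) * hi + zi * ci for zi, hi, ci in zip(z, h, cand)]
        states.append(h)
    return states


def fuse(G, M, alpha=0.5):
    """Hypothetical fusion rule (an assumption): fixed-weight average."""
    return [[alpha * g + (1 - alpha) * m for g, m in zip(g_row, m_row)]
            for g_row, m_row in zip(G, M)]


random.seed(0)
n, d = 5, 4  # toy sequence length and embedding size
X = [[random.uniform(-1, 1) for _ in range(d)] for _ in range(n)]
params = [[[random.uniform(-0.5, 0.5) for _ in range(d)] for _ in range(d)]
          for _ in range(6)]

global_states = global_encode(X)
memory_states = memory_encode(X, params)
fused = fuse(global_states, memory_states)
print(len(fused), len(fused[0]))
```

In a trained model the fusion weights and all projection matrices would be learned; the fixed `alpha` here only makes the data flow between the two encoders visible.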

Funder

J.Z.

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes, Computer Science Applications, Process Chemistry and Technology, General Engineering, Instrumentation, General Materials Science

Link

https://www.mdpi.com/2076-3417/13/18/10279/pdf
