Incorporating Source-Side Phrase Structures into Neural Machine Translation

Published:2019-06 Issue:2 Volume:45 Page:267-292
ISSN:0891-2017
Container-title:Computational Linguistics
language:en
Short-container-title:Computational Linguistics

Author:

Eriguchi Akiko¹, Hashimoto Kazuma², Tsuruoka Yoshimasa³

Affiliation:

1. Microsoft Research.

2. Salesforce Research.

3. The University of Tokyo, Department of Information and Communication Engineering.

Abstract

Neural machine translation (NMT) has shown great success as a new alternative to the traditional Statistical Machine Translation model in multiple languages. Early NMT models are based on sequence-to-sequence learning that encodes a sequence of source words into a vector space and generates another sequence of target words from the vector. In those NMT models, sentences are simply treated as sequences of words without any internal structure. In this article, we focus on the role of the syntactic structure of source sentences and propose a novel end-to-end syntactic NMT model, which we call a tree-to-sequence NMT model, extending a sequence-to-sequence model with the source-side phrase structure. Our proposed model has an attention mechanism that enables the decoder to generate a translated word while softly aligning it with phrases as well as words of the source sentence. We have empirically compared the proposed model with sequence-to-sequence models in various settings on Chinese-to-Japanese and English-to-Japanese translation tasks. Our experimental results suggest that the use of syntactic structure can be beneficial when the training data set is small, but is not as effective as using a bi-directional encoder. As the size of the training data set increases, the benefits of using a syntactic tree tend to diminish.
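The soft alignment over both words and phrases described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the dot-product scoring, the function name `soft_align`, and the toy vectors are all assumptions made for illustration, and this record does not specify the encoder that would produce the word and phrase vectors.

```python
import math

def soft_align(decoder_state, word_states, phrase_states):
    """Attend jointly over word-level and phrase-level source vectors.

    Hypothetical helper: dot-product scoring is an assumption, not
    necessarily the scoring function used in the article.
    """
    # Words and phrase nodes compete in a single softmax.
    source_states = word_states + phrase_states
    scores = [sum(s * h for s, h in zip(decoder_state, state))
              for state in source_states]
    # Numerically stable softmax over all source units.
    max_score = max(scores)
    exps = [math.exp(x - max_score) for x in scores]
    total = sum(exps)
    weights = [e / total for e in exps]
    # Context vector: attention-weighted sum of the source vectors.
    dim = len(decoder_state)
    context = [sum(w * state[i] for w, state in zip(weights, source_states))
               for i in range(dim)]
    return weights, context

# Toy example: three word vectors and two phrase vectors (made-up values).
word_states = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
phrase_states = [[0.5, 0.5], [2.0, 0.0]]
weights, context = soft_align([1.0, 0.0], word_states, phrase_states)
```

Because the word and phrase vectors share one softmax, the decoder can place most of its attention on a phrase node when that node matches the current decoder state better than any single word.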

Publisher

MIT Press - Journals

Subject

Artificial Intelligence, Computer Science Applications, Linguistics and Language, Language and Linguistics

Link

https://www.mitpressjournals.org/doi/pdf/10.1162/coli_a_00348

Reference: 51 articles.

1. Towards String-To-Tree Neural Machine Translation

2. Graph Convolutional Encoders for Syntax-aware Neural Machine Translation

3. Towards Neural Machine Translation with Latent Tree Attention

4. Improved Neural Machine Translation with a Syntax-Aware Encoder and Decoder

Cited by 8 articles.

1. Deep Learning Model-Based Machine Learning for Chinese and Japanese Translation;Wireless Communications and Mobile Computing;2022-03-09

2. Improving Neural Machine Translation by Efficiently Incorporating Syntactic Templates;Advances and Trends in Artificial Intelligence. Theory and Practices in Artificial Intelligence;2022

3. Augmenting training data with syntactic phrasal-segments in low-resource neural machine translation;Machine Translation;2021-12

4. Translation Mechanism of Neural Machine Algorithm for Online English Resources;Complexity;2021-04-05