Towards Making the Most of Context in Neural Machine Translation-Reference-Cited by-同舟云学术

Towards Making the Most of Context in Neural Machine Translation

Published:2020-07 Issue: Volume: Page:
ISSN:
Container-title:Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence
language:
Short-container-title:

Author:

Zheng Zaixiang¹,Yue Xiang¹,Huang Shujian¹,Chen Jiajun¹,Birch Alexandra²

Affiliation:

1. Nanjing University

2. University of Edinburgh

Abstract

Document-level machine translation manages to outperform sentence level models by a small margin, but have failed to be widely adopted. We argue that previous research did not make a clear use of the global context, and propose a new document-level NMT framework that deliberately models the local context of each sentence with the awareness of the global context of the document in both source and target languages. We specifically design the model to be able to deal with documents containing any number of sentences, including single sentences. This unified approach allows our model to be trained elegantly on standard datasets without needing to train on sentence and document level data separately. Experimental results demonstrate that our model outperforms Transformer baselines and previous document-level NMT models with substantial margins of up to 2.1 BLEU on state-of-the-art baselines. We also provide analyses which show the benefit of context far beyond the neighboring two or three sentences, which previous studies have typically incorporated.

Publisher

International Joint Conferences on Artificial Intelligence Organization

Cited by 17 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A survey of context in neural machine translation and its evaluation;Natural Language Processing;2024-05-17

2. Enhancing End-to-End Conversational Speech Translation Through Target Language Context Utilization;ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2024-04-14

3. Efficient Data Augmentation via lexical matching for boosting performance on Statistical Machine Translation for Indic and a Low-resource language;Multimedia Tools and Applications;2024-01-15

4. Context-Aware Machine Translation with Source Coreference Explanation;Transactions of the Association for Computational Linguistics;2024

5. Segmentation-Free Streaming Machine Translation;Transactions of the Association for Computational Linguistics;2024