Affiliation:
1. Faculty of Information Technology, Monash University, Clayton, VIC, Australia
Abstract
Machine translation (MT) is an important task in natural language processing (NLP), as it automates the translation process and reduces the reliance on human translators. With the resurgence of neural networks, the translation quality surpasses that of the translations obtained using statistical techniques for most language-pairs. Up until a few years ago, almost all of the neural translation models translated sentences
independently
, without incorporating the wider
document-context
and inter-dependencies among the sentences.
The aim of this survey article is to highlight the major works that have been undertaken in the space of document-level machine translation after the neural revolution, so researchers can recognize the current state and future directions of this field.
We provide an organization of the literature based on novelties in modelling and architectures as well as training and decoding strategies. In addition, we cover evaluation strategies that have been introduced to account for the improvements in document MT, including automatic metrics and discourse-targeted test sets. We conclude by presenting possible avenues for future exploration in this research field.
Funder
Google Faculty Research Award and ARC
Publisher
Association for Computing Machinery (ACM)
Subject
General Computer Science,Theoretical Computer Science
Reference144 articles.
1. Ruchit Agrawal Marco Turchi and Matteo Negri. 2018. Contextual handling in neural machine translation: Look behind ahead and on both sides. In Proceedings of the 21st Conference of the European Association for Machine Translation Juan Antonio Pérez-Ortiz Felipe Sánchez-Martínez Miquel Esplà-Gomis Maja Popović Celia Rico André Martins Joachim Van den Bogaert and Mikel L. Forcada (Eds.). EAMT 11--20. Ruchit Agrawal Marco Turchi and Matteo Negri. 2018. Contextual handling in neural machine translation: Look behind ahead and on both sides. In Proceedings of the 21st Conference of the European Association for Machine Translation Juan Antonio Pérez-Ortiz Felipe Sánchez-Martínez Miquel Esplà-Gomis Maja Popović Celia Rico André Martins Joachim Van den Bogaert and Mikel L. Forcada (Eds.). EAMT 11--20.
2. Globally Normalized Transition-Based Neural Networks
3. Linguistic Evaluation of German-English Machine Translation Using a Test Suite
Cited by
45 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献