Affiliation:
1. Department of Computer Science and Engineering, National Institute of Technology, Silchar, India
Abstract
Language translation is essential for bringing the world closer together and plays a significant part in building community among people of different linguistic backgrounds. Machine translation helps greatly in removing language barriers and allows easier communication among linguistically diverse communities. Because of the unavailability of resources, major languages of the world are regarded as low-resource languages, which makes automating translation among such languages, for the benefit of indigenous speakers, a challenging task. This article investigates neural machine translation (NMT) for the resource-poor English–Assamese language pair by tackling the insufficient-data and out-of-vocabulary problems. We also propose a data augmentation-based NMT approach that exploits synthetic parallel data; it shows significantly improved translation accuracy for both English-to-Assamese and Assamese-to-English translation and obtains state-of-the-art results.
Subject
Artificial Intelligence, General Engineering, Statistics and Probability