Affiliation:
1. National Institute of Information and Communications Technology
Abstract
Recent research on multilingual statistical machine translation focuses on the usage of pivot languages in order to overcome language resource limitations for certain language pairs. Due to the richness of available language resources, English is, in general, the pivot language of choice. However, factors like language relatedness can also effect the choice of the pivot language for a given language pair, especially for Asian languages, where language resources are currently quite limited. In this article, we provide new insights into what factors make a pivot language effective and investigate the impact of these factors on the overall pivot translation performance for translation between 22 Indo-European and Asian languages. Experimental results using state-of-the-art statistical machine translation techniques revealed that the translation quality of 54.8% of the language pairs improved when a non-English pivot language was chosen. Moreover, 81.0% of system performance variations can be explained by a combination of factors such as language family, vocabulary, sentence length, language perplexity, translation model entropy, reordering, monotonicity, and engine performance.
Publisher
Association for Computing Machinery (ACM)
Cited by
10 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Mixture-of-languages Routing for Multilingual Dialogues;ACM Transactions on Information Systems;2024-08-05
2. All Translation Tools Are Not Equal: Investigating the Quality of Language Translation for Forced Migration;2023 IEEE 10th International Conference on Data Science and Advanced Analytics (DSAA);2023-10-09
3. Research on Chinese-Lao Neural Machine Translation Based on Multi-Pivot;2023 2nd International Conference on Artificial Intelligence and Computer Information Technology (AICIT);2023-09-15
4. Low-resource Neural Machine Translation: Methods and Trends;ACM Transactions on Asian and Low-Resource Language Information Processing;2022-09-30
5. Word reordering on multiple pivots for the Japanese and Indonesian language pair;Machine Translation;2021-12