Unsupervised Multimodal Machine Translation for Low-resource Distant Language Pairs-Reference-Cited by-同舟云学术

Unsupervised Multimodal Machine Translation for Low-resource Distant Language Pairs

Published:2024-04-15 Issue:4 Volume:23 Page:1-22
ISSN:2375-4699
Container-title:ACM Transactions on Asian and Low-Resource Language Information Processing
language:en
Short-container-title:ACM Trans. Asian Low-Resour. Lang. Inf. Process.

Author:

Tayir Turghun¹^ORCID,Li Lin¹^ORCID

Affiliation:

1. School of Computer Science and Artificial Intelligence, Wuhan University of Technology, Wuhan, China

Abstract

Unsupervised machine translation (UMT) has recently attracted more attention from researchers, enabling models to translate when languages lack parallel corpora. However, the current works mainly consider close language pairs (e.g., English-German and English-French), and the effectiveness of visual content for distant language pairs has yet to be investigated. This article proposes an unsupervised multimodal machine translation model for low-resource distant language pairs. Specifically, we first employ adequate measures such as transliteration and re-ordering to bring distant language pairs closer together. We then use visual content to extend masked language modeling and generate visual masked language modeling for UMT. Finally, empirical experiments are conducted on our distant language pair dataset and the public Multi30k dataset. Experimental results demonstrate the superior performance of our model, with BLEU score improvements of 2.5 and 2.6 on translation for distant language pairs English-Uyghur and Chinese-Uyghur. Moreover, our model also brings remarkable results for close language pairs, improving 2.3 BLEU compared with the existing models in English-German.

Funder

NSFC, China

Publisher

Association for Computing Machinery (ACM)

Link

https://dl.acm.org/doi/pdf/10.1145/3652161

Reference57 articles.

1. Mikel Artetxe, Gorka Labaka, Eneko Agirre, and Kyunghyun Cho. 2018. Unsupervised neural machine translation. In Proceedings of the 6th International Conference on Learning Representations. 1–12.

2. Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. 2015. Neural machine translation by jointly learning to align and translate. In Proceedings of the 3rd International Conference on Learning Representations. 1–15.

3. NMTPY: A Flexible Toolkit for Advanced Neural Machine Translation Systems

4. Ozan Caglayan, Menekse Kuyu, Mustafa Sercan Amac, Pranava Madhyastha, Erkut Erdem, Aykut Erdem, and Lucia Specia. 2021. Cross-lingual visual pre-training for multimodal machine translation. In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics. 1317–1324.

Cited by 15 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. AI-based visual speech recognition towards realistic avatars and lip-reading applications in the metaverse;Applied Soft Computing;2024-10

2. ENIMNR: Enhanced node influence maximization through node representation in social networks;Chaos, Solitons & Fractals;2024-09

3. Detection of adversarial phishing attack using machine learning techniques;Sādhanā;2024-08-08

4. Advancements in intrusion detection: A lightweight hybrid RNN-RF model;PLOS ONE;2024-06-21

5. Virtual Reality and 6G Based Smart Classroom Teaching Using Artificial Intelligence;Wireless Personal Communications;2024-06-18