Human Evaluation of English–Irish Transformer-Based NMT-Reference-Cited by-同舟云学术

Human Evaluation of English–Irish Transformer-Based NMT

Published:2022-06-25 Issue:7 Volume:13 Page:309
ISSN:2078-2489
Container-title:Information
language:en
Short-container-title:Information

Author:

Lankford Séamus,Afli Haithem^ORCID,Way Andy^ORCID

Abstract

In this study, a human evaluation is carried out on how hyperparameter settings impact the quality of Transformer-based Neural Machine Translation (NMT) for the low-resourced English–Irish pair. SentencePiece models using both Byte Pair Encoding (BPE) and unigram approaches were appraised. Variations in model architectures included modifying the number of layers, evaluating the optimal number of heads for attention and testing various regularisation techniques. The greatest performance improvement was recorded for a Transformer-optimized model with a 16k BPE subword model. Compared with a baseline Recurrent Neural Network (RNN) model, a Transformer-optimized model demonstrated a BLEU score improvement of 7.8 points. When benchmarked against Google Translate, our translation engines demonstrated significant improvements. Furthermore, a quantitative fine-grained manual evaluation was conducted which compared the performance of machine translation systems. Using the Multidimensional Quality Metrics (MQM) error taxonomy, a human evaluation of the error types generated by an RNN-based system and a Transformer-based system was explored. Our findings show the best-performing Transformer system significantly reduces both accuracy and fluency errors when compared with an RNN-based model.

Funder

Science Foundation Ireland

Publisher

MDPI AG

Subject

Information Systems

Link

https://www.mdpi.com/2078-2489/13/7/309/pdf

Reference53 articles.

1. Dual learning for machine translation;He;Proceedings of the Advances in Neural Information Processing Systems 29 (NIPS 2016),2016

2. Augmenting Neural Machine Translation through Round-Trip Training Approach

3. SMT versus NMT: Preliminary comparisons for Irish;Dowling;Proceedings of the AMTA 2018 Workshop on Technologies for MT of Low Resource Languages (LoResMT 2018),2018

4. A call for prudent choice of subword merge operations in neural machine translation;Ding;arXiv,2019

5. Finding the optimal vocabulary size for neural machine translation;Gowda;arXiv,2020

Cited by 6 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. adaptMLLM: Fine-Tuning Multilingual Language Models on Low-Resource Languages with Integrated LLM Playgrounds;Information;2023-11-29

2. adaptNMT: an open-source, language-agnostic development environment for neural machine translation;Language Resources and Evaluation;2023-07-14

3. The neural machine translation models for the low-resource Kazakh–English language pair;PeerJ Computer Science;2023-02-08

4. Human Versus Automatic Evaluation of NMT for Low-Resource Indian Language;Lecture Notes in Electrical Engineering;2023

5. Exploiting Parts of Speech in Bangla-To-English Machine Translation Evaluation;Lecture Notes in Electrical Engineering;2023