Automatische Evaluation der Humanübersetzung: BLEU vs. METEOR-Reference-Cited by-同舟云学术

Automatische Evaluation der Humanübersetzung: BLEU vs. METEOR

Published:2020-05-06 Issue:1 Volume:65 Page:181-205
ISSN:1868-0267
Container-title:Lebende Sprachen
language:
Short-container-title:

Author:

Chung Hye-Yeon¹

Affiliation:

1. Graduate School of Interpretation and Translation, Hankuk University of Foreign Studies, Imun-Ro 107, Dongdaemun-Gu, Seoul, Republic of Korea (South)Korea (Republic of)

Abstract

AbstractHuman evaluation (HE) of translation is generally considered to be valid, but it requires a lot of effort. Automatic evaluation (AE) which assesses the quality of machine translations can be done easily, but it still requires validation. This study addresses the questions of whether and how AE can be used for human translations. For this purpose AE formulas and HE criteria were compared to each other in order to examine the validity of AE. In the empirical part of the study, 120 translations were evaluated by professional translators as well as by two representative AE-systems, BLEU/ METEOR, respectively. The correlations between AE and HE were relatively high at 0.849** (BLEU) and 0.862** (METEOR) in the overall analysis, but in the ratings of the individual texts, AE and ME exhibited a substantial difference. The AE-ME correlations were often below 0.3 or even in the negative range. Ultimately, the results indicate that neither METEOR nor BLEU can be used to assess human translation at this stage. But this paper suggests three possibilities to apply AE to compromise the weakness of HE.

Publisher

Walter de Gruyter GmbH

Subject

Linguistics and Language,Language and Linguistics

Link

https://www.degruyter.com/downloadpdf/journals/les/65/1/article-p181.xml

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Machine-learning based automatic assessment of communication in interpreting;Frontiers in Communication;2023-01-24

2. Contextualizing assessment feedback in translation education: A corpus-assisted ecological approach;Frontiers in Psychology;2022-12-19

3. Automatic assessment of spoken-language interpreting based on machine-translation evaluation metrics;Interpreting. International Journal of Research and Practice in Interpreting;2022-03-04

4. Research on chest radiography recognition model based on deep learning;Mathematical Biosciences and Engineering;2022

5. Can automated machine translation evaluation metrics be used to assess students’ interpretation in the language learning classroom?;Computer Assisted Language Learning;2021-08-28