Predictor-Estimator-Reference-Cited by-同舟云学术

Predictor-Estimator

Published:2018-03-31 Issue:1 Volume:17 Page:1-22
ISSN:2375-4699
Container-title:ACM Transactions on Asian and Low-Resource Language Information Processing
language:en
Short-container-title:ACM Trans. Asian Low-Resour. Lang. Inf. Process.

Author:

Kim Hyun¹^ORCID,Jung Hun-Young¹,Kwon Hongseok¹,Lee Jong-Hyeok¹,Na Seung-Hoon²

Affiliation:

1. Pohang University of Science and Technology (POSTECH), Pohang, Republic of Korea

2. Chonbuk National University, Jeonju, Republic of Korea

Abstract

Recently, quality estimation has been attracting increasing interest from machine translation researchers, aiming at finding a good estimator for the “quality” of machine translation output. The common approach for quality estimation is to treat the problem as a supervised regression/classification task using a quality-annotated noisy parallel corpus, called quality estimation data , as training data. However, the available size of quality estimation data remains small, due to the too-expensive cost of creating such data. In addition, most conventional quality estimation approaches rely on manually designed features to model nonlinear relationships between feature vectors and corresponding quality labels. To overcome these problems, this article proposes a novel neural network architecture for quality estimation task—called the predictor-estimator —that considers word prediction as an additional pre-task. The major component of the proposed neural architecture is a word prediction model based on a modified neural machine translation model—a probabilistic model for predicting a target word conditioned on all the other source and target contexts. The underlying assumption is that the word prediction model is highly related to quality estimation models and is therefore able to transfer useful knowledge to quality estimation tasks. Our proposed quality estimation method sequentially trains the following two types of neural models: (1) Predictor : a neural word prediction model trained from parallel corpora and (2) Estimator : a neural quality estimation model trained from quality estimation data. To transfer word a prediction task to a quality estimation task, we generate quality estimation feature vectors from the word prediction model and feed them into the quality estimation model. The experimental results on WMT15 and 16 quality estimation datasets show that our proposed method has great potential in the various sub-challenges.

Funder

ICT Consilience Creative Program of MSIP/IITP

ICT R8D Program of MSIP/IITP

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/3109480

Reference34 articles.

1. Confidence estimation for machine translation

Cited by 32 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Mismatching-aware unsupervised translation quality estimation for low-resource languages;Language Resources and Evaluation;2024-05-05

2. A Quality Prediction Model for the Parallel Corpora of Korean-Chinese and Chinese-Korean Translation Utilizing Sentence Similarity and Sentence Attributes;2024 5th International Seminar on Artificial Intelligence, Networking and Information Technology (AINIT);2024-03-29

3. A study on intelligent translation of English sentences by a semantic feature extractor;Journal of Intelligent Systems;2024-01-01

4. Linguistic Communication Channels Reveal Connections between Texts: The New Testament and Greek Literature;Information;2023-07-14

5. Readability Metrics for Machine Translation in Dutch: Google vs. Azure & IBM;Applied Sciences;2023-03-31