Language Modeling for Syntax-Based Machine Translation Using Tree Substitution Grammars
-
Published:2011-12
Issue:4
Volume:10
Page:1-29
-
ISSN:1530-0226
-
Container-title:ACM Transactions on Asian Language Information Processing
-
language:en
-
Short-container-title:ACM Transactions on Asian Language Information Processing
Author:
Xiao Tong1,
Zhu Jingbo1,
Zhu Muhua1
Affiliation:
1. Northeastern University
Abstract
The poor grammatical output of Machine Translation (MT) systems appeals syntax-based approaches within language modeling. However, previous studies showed that syntax-based language modeling using (Context-Free) Treebank Grammars was not very helpful in improving BLEU scores for Chinese-English machine translation. In this article we further study this issue in the context of Chinese-English syntax-based Statistical Machine Translation (SMT) where Synchronous Tree Substitution Grammars (STSGs) are utilized to model the translation process. In particular, we develop a Tree Substitution Grammar-based language model for syntax-based MT, and present three methods to efficiently integrate the proposed language model into MT decoding. In addition, we design a simple and effective method to adapt syntax-based language models for MT tasks. We demonstrate that the proposed methods are able to benefit a state-of-the-art syntax-based MT system. On the NIST Chinese-English MT evaluation corpora, we finally achieve an improvement of 0.6 BLEU points over the baseline.
Funder
National Natural Science Foundation of China
Research Grants Council, University Grants Committee, Hong Kong
Publisher
Association for Computing Machinery (ACM)
Subject
General Computer Science
Reference48 articles.
1. Beal M. 2003. Variational algorithms for approximate bayesian inference. Doctoral dissertation. University College London. Beal M. 2003. Variational algorithms for approximate bayesian inference. Doctoral dissertation. University College London.
2. Bonnema R. 2002. Probability models for DOP. Data-Oriented Parsing. CSLI publications. Bonnema R. 2002. Probability models for DOP. Data-Oriented Parsing . CSLI publications.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献