Modality and Negation in SIMT Use of Modality and Negation in Semantically-Informed Syntactic MT

Author:

Baker Kathryn1,Bloodgood Michael2,Dorr Bonnie J.2,Callison-Burch Chris3,Filardo Nathaniel W.3,Piatko Christine3,Levin Lori4,Miller Scott5

Affiliation:

1. U.S. Department of Defense

2. University of Maryland

3. Johns Hopkins University

4. Carnegie Mellon University

5. BBN Technologies

Abstract

This article describes the resource- and system-building efforts of an 8-week Johns Hopkins University Human Language Technology Center of Excellence Summer Camp for Applied Language Exploration (SCALE-2009) on Semantically Informed Machine Translation (SIMT). We describe a new modality/negation (MN) annotation scheme, the creation of a (publicly available) MN lexicon, and two automated MN taggers that we built using the annotation scheme and lexicon. Our annotation scheme isolates three components of modality and negation: a trigger (a word that conveys modality or negation), a target (an action associated with modality or negation), and a holder (an experiencer of modality). We describe how our MN lexicon was semi-automatically produced and we demonstrate that a structure-based MN tagger results in precision around 86% (depending on genre) for tagging of a standard LDC data set. We apply our MN annotation scheme to statistical machine translation using a syntactic framework that supports the inclusion of semantic annotations. Syntactic tags enriched with semantic annotations are assigned to parse trees in the target-language training texts through a process of tree grafting. Although the focus of our work is modality and negation, the tree grafting procedure is general and supports other types of semantic information. We exploit this capability by including named entities, produced by a pre-existing tagger, in addition to the MN elements produced by the taggers described here. The resulting system significantly outperformed a linguistically naive baseline model (Hiero), and reached the highest scores yet reported on the NIST 2009 Urdu–English test set. This finding supports the hypothesis that both syntactic and semantic information can improve translation quality.

Publisher

MIT Press - Journals

Subject

Artificial Intelligence,Computer Science Applications,Linguistics and Language,Language and Linguistics

Reference43 articles.

Cited by 20 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Sentiment induced phrase-based machine translation: Robustness analysis of PBSMT with senti-module;Engineering Applications of Artificial Intelligence;2023-11

2. Figuring Out Root and Epistemic Uses of Modals: The Role of the Input;Journal of Semantics;2022-08-26

3. Negation and Speculation in NLP: A Survey, Corpora, Methods, and Applications;Applied Sciences;2022-05-21

4. A Deep Learning Approach for Negation Detection from Product Reviews written in Spanish;2021 XLVII Latin American Computing Conference (CLEI);2021-10-25

5. Revisiting Negation in Neural Machine Translation;Transactions of the Association for Computational Linguistics;2021

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3