Interlingual annotation of parallel text corpora: a new framework for annotation and evaluation
-
Published:2010-06-15
Issue:3
Volume:16
Page:197-243
-
ISSN:1351-3249
-
Container-title:Natural Language Engineering
-
language:en
-
Short-container-title:Nat. Lang. Eng.
Author:
DORR BONNIE J.,PASSONNEAU REBECCA J.,FARWELL DAVID,GREEN REBECCA,HABASH NIZAR,HELMREICH STEPHEN,HOVY EDUARD,LEVIN LORI,MILLER KEITH J.,MITAMURA TERUKO,RAMBOW OWEN,SIDDHARTHAN ADVAITH
Abstract
AbstractThis paper focuses on an important step in the creation of a system of meaning representation and the development of semantically annotated parallel corpora, for use in applications such as machine translation, question answering, text summarization, and information retrieval. The work described below constitutes the first effort of any kind to annotate multiple translations of foreign-language texts with interlingual content. Three levels of representation are introduced: deep syntactic dependencies (IL0), intermediate semantic representations (IL1), and a normalized representation that unifies conversives, nonliteral language, and paraphrase (IL2). The resulting annotated, multilingually induced, parallel corpora will be useful as an empirical basis for a wide range of research, including the development and evaluation of interlingual NLP systems and paraphrase-extraction systems as well as a host of other research and development efforts in theoretical and applied linguistics, foreign language pedagogy, translation studies, and other related disciplines.
Publisher
Cambridge University Press (CUP)
Subject
Artificial Intelligence,Linguistics and Language,Language and Linguistics,Software
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献