Annotation projection for temporal information extraction

Published:2019-05 Issue:3 Volume:25 Page:385-403
ISSN:1351-3249
Container-title:Natural Language Engineering
language:en
Short-container-title:Nat. Lang. Eng.

Author:

Giannella Chris R., Winder Ransom K., Jubinski Joseph P.

Abstract

Approaches to building temporal information extraction systems typically rely on large, manually annotated corpora. Thus, porting these systems to new languages requires acquiring large corpora of manually annotated documents in the new languages. Acquiring such corpora is difficult owing to the complexity of temporal information extraction annotation. One strategy for addressing this difficulty is to reduce or eliminate the need for manually annotated corpora through annotation projection. This technique utilizes a temporal information extraction system for a source language (typically English) to automatically annotate the source language side of a parallel corpus. It then uses automatically generated word alignments to project the annotations, thereby creating noisily annotated target language training data. We developed an annotation projection technique for producing target language temporal information extraction systems. We carried out an English (source) to French (target) case study wherein we compared a French temporal information extraction system built using annotation projection with one built using a manually annotated French corpus. While annotation projection has been applied to building other kinds of Natural Language Processing tools (e.g., Named Entity Recognizers), to our knowledge, this is the first paper examining annotation projection as applied to temporal information extraction where no manual corrections of the target language annotations were made. We found that, even using manually annotated data to build a temporal information extraction system, F-scores were relatively low (<0.35), which suggests that the problem is challenging even with manually annotated data. Our annotation projection approach performed well (relative to the system built from manually annotated data) on some aspects of temporal information extraction (e.g., event–document creation time temporal relation prediction), but it performed poorly on the other kinds of temporal relation prediction (e.g., event–event and event–time).
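
The projection step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give details of the procedure. The sketch assumes token-level span annotations and word alignments given as (source index, target index) pairs. The function name `project_spans`, the labels `EVENT` and `TIMEX`, and the example sentence pair are all hypothetical.

```python
from typing import Dict, List, Set, Tuple

# A span is (first token index, last token index inclusive, label).
Span = Tuple[int, int, str]


def project_spans(source_spans: List[Span],
                  alignments: Set[Tuple[int, int]]) -> List[Span]:
    """Project source-side spans onto target tokens via word alignments.

    Assumption for illustration: a projected span covers the smallest
    contiguous target range containing every target token aligned to a
    token of the source span. Spans with no aligned tokens are dropped.
    """
    # Index the alignment pairs by source token.
    src_to_tgt: Dict[int, List[int]] = {}
    for src_idx, tgt_idx in alignments:
        src_to_tgt.setdefault(src_idx, []).append(tgt_idx)

    projected: List[Span] = []
    for start, end, label in source_spans:
        targets = [t
                   for s in range(start, end + 1)
                   for t in src_to_tgt.get(s, [])]
        if not targets:
            continue  # nothing to project onto; skip this annotation
        projected.append((min(targets), max(targets), label))
    return projected


# Hypothetical sentence pair:
#   English: He(0) arrived(1) on(2) Monday(3)
#   French:  Il(0) est(1) arrivé(2) lundi(3)
english_spans = [(1, 1, "EVENT"), (3, 3, "TIMEX")]
word_alignments = {(0, 0), (1, 1), (1, 2), (3, 3)}
french_spans = project_spans(english_spans, word_alignments)
```

In this example the English event token "arrived" is aligned to both "est" and "arrivé", so the projected French span covers both tokens. Alignment errors pass straight through to the projected annotations, which is one reason the resulting target language training data is noisy.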

Publisher

Cambridge University Press (CUP)

Subject

Artificial Intelligence,Linguistics and Language,Language and Linguistics,Software

Reference: 54 articles.

1. A Baseline Temporal Tagger for all Languages

2. Stacking Approach to Temporal Relation Classification with Temporal Inference

3. Using Syntactic-Based Kernels for Classifying Temporal Relations

Cited by 4 articles.

1. English Long Sentence Segmentation and Translation Optimization of Professional Literature Based on Hierarchical Network of Concepts;Mobile Information Systems;2022-08-25

2. Extraction of temporal information from social media messages using the BERT model;Earth Science Informatics;2022-01-10

3. Extraction of Temporal Events’ Frequency from Online News Channels;2020 30th International Conference on Computer Theory and Applications (ICCTA);2020-12-12

4. Extraction of temporal relations from clinical free text: A systematic review of current approaches;Journal of Biomedical Informatics;2020-08