Unsupervised Event Coreference Resolution-Reference-Cited by-同舟云学术

Unsupervised Event Coreference Resolution

Published:2014-06 Issue:2 Volume:40 Page:311-347
ISSN:0891-2017
Container-title:Computational Linguistics
language:en
Short-container-title:Computational Linguistics

Author:

Bejan Cosmin¹,Harabagiu Sanda²

Affiliation:

1. Vanderbilt University

2. University of Texas at Dallas

Abstract

The task of event coreference resolution plays a critical role in many natural language processing applications such as information extraction, question answering, and topic detection and tracking. In this article, we describe a new class of unsupervised, nonparametric Bayesian models with the purpose of probabilistically inferring coreference clusters of event mentions from a collection of unlabeled documents. In order to infer these clusters, we automatically extract various lexical, syntactic, and semantic features for each event mention from the document collection. Extracting a rich set of features for each event mention allows us to cast event coreference resolution as the task of grouping together the mentions that share the same features (they have the same participating entities, share the same location, happen at the same time, etc.). Some of the most important challenges posed by the resolution of event coreference in an unsupervised way stem from (a) the choice of representing event mentions through a rich set of features and (b) the ability of modeling events described both within the same document and across multiple documents. Our first unsupervised model that addresses these challenges is a generalization of the hierarchical Dirichlet process. This new extension presents the hierarchical Dirichlet process's ability to capture the uncertainty regarding the number of clustering components and, additionally, takes into account any finite number of features associated with each event mention. Furthermore, to overcome some of the limitations of this extension, we devised a new hybrid model, which combines an infinite latent class model with a discrete time series model. The main advantage of this hybrid model stands in its capability to automatically infer the number of features associated with each event mention from data and, at the same time, to perform an automatic selection of the most informative features for the task of event coreference. The evaluation performed for solving both within- and cross-document event coreference shows significant improvements of these models when compared against two baselines for this task.

Publisher

MIT Press - Journals

Subject

Artificial Intelligence,Computer Science Applications,Linguistics and Language,Language and Linguistics

Link

https://www.mitpressjournals.org/doi/pdf/10.1162/COLI_a_00174

Reference81 articles.

1. The stages of event extraction

2. Topic Detection and Tracking

3. Cross-document event coreference

Cited by 17 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Short Text Event Coreference Resolution Based on Context Prediction;Applied Sciences;2024-01-07

2. Discovering Emerging Threats in the Hacker Community: A Nonparametric Emerging Topic Detection Framework;MIS Quarterly;2022-12-01

3. Constructing a cross-document event coreference corpus for Dutch;Language Resources and Evaluation;2022-06-04

4. Generalizing Cross-Document Event Coreference Resolution Across Multiple Corpora;Computational Linguistics;2021-11

5. Anaphora and coreference resolution: A review;Information Fusion;2020-07