Event Extraction using Structured Learning and Rich Domain Knowledge

Published:2016-01-22 Issue:2 Volume:7 Page:1-34
ISSN:2157-6904
Container-title:ACM Transactions on Intelligent Systems and Technology
Language:en
Short-container-title:ACM Trans. Intell. Syst. Technol.

Author:

Minkov Einat¹

Affiliation:

1. University of Haifa, Haifa, Israel

Abstract

We consider the task of record extraction from text documents, where the goal is to automatically populate the fields of target relations, such as scientific seminars or corporate acquisition events. There are various inferences involved in the record-extraction process, including mention detection, unification, and field assignments. We use structured learning to find the appropriate field-value assignments. Unlike previous works, the proposed approach generates feature-rich models that enable the modeling of domain semantics and structural coherence at all levels and across fields. Given labeled examples, such an approach can, for instance, learn likely event durations and the fact that start times should come before end times. While the inference space is large, effective learning is achieved using a perceptron-style method and simple, greedy beam decoding. A main focus of this article is on practical aspects involved in implementing the proposed framework for real-world applications. We argue and demonstrate that this approach is favorable in conditions of data shift, a real-world setting in which models learned using a limited set of labeled examples are applied to examples drawn from a different data distribution. Much of the framework’s robustness is attributed to the modeling of domain knowledge. We describe design and implementation details for the case study of seminar event extraction from email announcements, and discuss design adaptations across different domains and text genres.
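The learning setup summarized in the abstract (a perceptron-style update combined with greedy beam decoding over field-value assignments) can be illustrated with a minimal sketch. This is not the article's implementation: the field names `stime` and `etime`, the feature templates, and the representation of times as minutes since midnight are all assumptions made for illustration.

```python
# Hypothetical field names; times are minutes since midnight (an assumption).
FIELDS = ["stime", "etime"]

def features(record):
    """Sparse features of a (possibly partial) record mapping field -> minutes."""
    f = {}
    for field, value in record.items():
        f[f"{field}_hour={value // 60}"] = 1.0
    # Cross-field features: these let the model learn domain regularities such
    # as "start precedes end" and typical event durations.
    if "stime" in record and "etime" in record:
        duration = record["etime"] - record["stime"]
        if duration > 0:
            f["start_before_end"] = 1.0
        f[f"duration_bucket={duration // 30}"] = 1.0
    return f

def score(w, record):
    """Linear score of a record under weight vector w."""
    return sum(w.get(k, 0.0) * v for k, v in features(record).items())

def beam_decode(w, candidates, beam_size=4):
    """Fill fields one at a time, keeping the beam_size best partial records."""
    beam = [{}]
    for field in FIELDS:
        expanded = [dict(r, **{field: v}) for r in beam for v in candidates[field]]
        expanded.sort(key=lambda r: score(w, r), reverse=True)
        beam = expanded[:beam_size]
    return beam[0]

def train(examples, epochs=5, beam_size=4):
    """Perceptron-style training: on a mistake, move w toward the gold record."""
    w = {}
    for _ in range(epochs):
        for candidates, gold in examples:
            pred = beam_decode(w, candidates, beam_size)
            if pred != gold:
                for k, v in features(gold).items():
                    w[k] = w.get(k, 0.0) + v
                for k, v in features(pred).items():
                    w[k] = w.get(k, 0.0) - v
    return w
```

Because the cross-field features do not depend on the specific clock times, a model trained on a few examples can prefer an assignment in which the start time precedes the end time even for times it has not seen, which is one way to read the abstract's claim about robustness under data shift.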

Publisher

Association for Computing Machinery (ACM)

Subject

Artificial Intelligence, Theoretical Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/2801131

References: 62 articles.

1. Automatic ontology-based knowledge extraction from Web documents

2. Galia Angelova. 2010. Use of domain knowledge in the automatic extraction of structured representations from patient-related texts. Conceptual Structures: From Information to Intelligence, Lecture Notes in Computer Science, vol. 6208, Springer, Berlin, 14--27.

3. Adaptive name matching in information integration

4. Mary Elaine Califf and Raymond J. Mooney. 1999. Relational learning of pattern-match rules for information extraction. In AAAI/IAAI.

Cited by 2 articles.

1. Relational social recommendation: Application to the academic domain;Expert Systems with Applications;2019-06

2. Prior Knowledge-Based Event Network for Chinese Text;International Journal of Digital Multimedia Broadcasting;2017