UIMA Ruta: Rapid development of rule-based information extraction applications-Reference-Cited by-同舟云学术

UIMA Ruta: Rapid development of rule-based information extraction applications

Published:2014-10-08 Issue:1 Volume:22 Page:1-40
ISSN:1351-3249
Container-title:Natural Language Engineering
language:en
Short-container-title:Nat. Lang. Eng.

Author:

KLUEGL PETER,TOEPFER MARTIN,BECK PHILIP-DANIEL,FETTE GEORG,PUPPE FRANK

Abstract

AbstractRule-based information extraction is an important approach for processing the increasingly available amount of unstructured data. The manual creation of rule-based applications is a time-consuming and tedious task, which requires qualified knowledge engineers. The costs of this process can be reduced by providing a suitable rule language and extensive tooling support. This paper presents UIMA Ruta, a tool for rule-based information extraction and text processing applications. The system was designed with focus on rapid development. The rule language and its matching paradigm facilitate the quick specification of comprehensible extraction knowledge. They support a compact representation while still providing a high level of expressiveness. These advantages are supplemented by the development environment UIMA Ruta Workbench. It provides, in addition to extensive editing support, essential assistance for explanation of rule execution, introspection, automatic validation, and rule induction. UIMA Ruta is a useful tool for academia and industry due to its open source license. We compare UIMA Ruta to related rule-based systems especially concerning the compactness of the rule representation, the expressiveness, and the provided tooling support. The competitiveness of the runtime performance is shown in relation to a popular and freely-available system. A selection of case studies implemented with UIMA Ruta illustrates the usefulness of the system in real-world scenarios.

Publisher

Cambridge University Press (CUP)

Subject

Artificial Intelligence,Linguistics and Language,Language and Linguistics,Software

Reference42 articles.

1. Entity annotation based on inverse index operations

2. Information Extraction: Past, Present and Future

Cited by 58 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. ICAD-MI: Interdisciplinary concept association discovery from the perspective of metaphor interpretation;Knowledge-Based Systems;2023-09

2. Research on Resume Recommendation of Employment Platform based on Decision Tree Algorithm;2023 IEEE 6th International Conference on Big Data and Artificial Intelligence (BDAI);2023-07-07

3. Programming techniques for improving rule readability for rule-based information extraction natural language processing pipelines of unstructured and semi-structured medical texts;Health Informatics Journal;2023-04

4. A model of integrating convolution and BiGRU dual-channel mechanism for Chinese medical text classifications;PLOS ONE;2023-03-16

5. Research on e-business requirement information resource extraction method in network big data;International Journal of Autonomous and Adaptive Communications Systems;2023