Affiliation:
1. University of Bayreuth & Hasselt University
2. Technion, Israel
3. University of Bayreuth
Abstract
A common conceptual view of text analysis is that of a two-step process, where we first extract relations from text documents and then apply a relational query over the result. Hence, text analysis shares technical challenges with, and can draw ideas from, relational databases. A framework that formally instantiates this connection is that of the document spanners. In this article, we review recent advances in various research efforts that adapt fundamental database concepts to text analysis through the lens of document spanners. Among others, we discuss aspects of query evaluation, aggregate queries, provenance, and distributed query planning.
Publisher
Association for Computing Machinery (ACM)
Subject
Information Systems, Software
Cited by
3 articles.
1. Modeling Regex Operators for Solving Regex Crossword Puzzles;Dependable Software Engineering. Theories, Tools, and Applications;2023-12-15
2. Enumerating grammar-based extractions;Discrete Applied Mathematics;2023-12
3. REmatch: A Novel Regex Engine for Finding All Matches;Proceedings of the VLDB Endowment;2023-07