Affiliation:
1. Goethe-Universität Frankfurt, Robert-Mayer-Straße 10, D-60325 Frankfurt am Main
2. Hochschule Luzern, Technikumstr. 21, 6048 Horw
Abstract
Abstract
We introduce a new text technology, called Wikidition, which automatically generates large
scale editions of corpora of natural language texts. Wikidition combines a wide range of
text mining tools for automatically linking lexical, sentential and textual units. This
includes the extraction of corpus-specific lexica down to the level of syntactic words and
their grammatical categories. To this end, we introduce a novel measure of text reuse and
exemplify Wikidition by means of the capitularies, that is, a corpus of Medieval Latin
texts.
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献