Strategies in tracing linguistic variation in a corpus of Old Irish texts (CorPH)

Published:2022-09-20 Issue:4 Volume:27 Page:529-553
ISSN:1384-6655
Container-title:Corpus studies of language through time
language:en
Short-container-title:IJCL

Author:

Stifter David¹, Qiu Fangzhe², Aquino-López Marco A.³, Bauer Bernhard⁴, Lash Elliott⁵, White Nora¹

Affiliation:

1. Maynooth University

2. University College Dublin

3. Centro de Investigación en Matemáticas

4. Karl-Franzens-Universität Graz

5. Georg-August-Universität Göttingen

Abstract

This article introduces Corpus PalaeoHibernicum (CorPH), a corpus currently consisting of 78 texts in Early Irish (c. 7th–10th cent.) created by the ERC-funded Chronologicon Hibernicum (ChronHib) project by bringing together pre-existing lexical and syntactic databases and adding further crucial texts from the period. In addition to being annotated for POS, morphological and syntactic information, another layer of annotation has been developed for CorPH – ‘Variation Tagging’, i.e. a tagset that numerically encodes synchronic language variation during the Early Irish period, thus allowing for much improved research on the chronological variation among the material. Another new pillar of studying linguistic variation is Bayesian Language Variation Analysis (BLaVA), in order to address the challenge that “not-so-big data” poses to statistical corpus methods. Instead of reflecting feature frequencies, BLaVA models language variation as probabilities of variation.

Publisher

John Benjamins Publishing Company

Subject

Linguistics and Language,Language and Linguistics

Link

http://www.jbe-platform.com/deliver/fulltext/ijcl.22018.sti.pdf

References: 45 articles.

Cited by 1 article.

1. Corpus linguistics and the social sciences; Corpus Linguistics and Linguistic Theory; 2024-04-25