Affiliation:
1. School of English, Arts Building, Trinity College Dublin , Dublin 2, Ireland
Abstract
Abstract
This article presents a methodology for obtaining large datasets for the spelling of individual phonological segments in Old English texts, based on searching the Dictionary of Old English Corpus for the attested spellings listed in the Dictionary of Old English A-H. It exemplifies this ‘corpus philology’ through a study of 216,526 spellings for words beginning with h followed by a vowel, using a variety of techniques to evaluate the methodology’s precision and recall, which are calculated as very high for <h->initial spellings (precision 100% precision, recall 92.1%) and moderate, but still usable, for <h->less spellings (precision 85.5%, recall 58.3%). Data for fourteen other segments related to the behaviour of h- in Old English is presented in the Supplementary Materials that complement the paper online. This dataset of 379,484 spellings from 2,605 Old English texts is shown to seriously problematize the findings of traditional philology, the conclusions of which are in contrast based on only a handful of spellings from a few texts, and to have the potential to radically enhance our understanding of the literary and linguistic histories of English.
Publisher
Oxford University Press (OUP)
Subject
Computer Science Applications,Linguistics and Language,Language and Linguistics,Information Systems