Affiliation:
1. University of California, Santa Barbara
2. JLU Giessen
Abstract
This paper discusses the degree to which most of the most widely used measures of dispersion in corpus linguistics
are not particularly valid, in the sense that they do not actually measure dispersion but rather an amalgam of a lot of frequency and a
little dispersion. The paper demonstrates these issues on the basis of data from a variety of corpora. I then outline how to
design a dispersion measure that measures only dispersion and show that it (i) indeed measures information that differs from
frequency in an intuitive way and (ii) has a higher degree of predictive power for lexical decision times from the MALD database
than nearly all other measures in nearly all corpora tested.
Publisher
John Benjamins Publishing Company
Cited by
6 articles.