1. See W.G. Cochran, “Some methods for strengthening the commonχ 2 tests,”Biometrics, 10 (1954), 417–451, especially page 420. This rule of thumb, according to Cochran, is adequate when the degrees of freedom are greater than one and less than thirty. A more conservative rule is often used, and is suggested for one degree of freedom: Choose cells so that the expected cell counts are not less than five except for at most 20 percent of the cells where the expected counts can be as low as one. For thirty or more degrees of freedom a normal approximation is often suggested when too many of the expected cell counts are lower than five.
2. Barron Brainerd, “An exploratory study of pronouns and articles as indices of genre in English,”Language and Style, 5 (1972), 239–259.
3. In general, the power of the test increases with the number of degrees of freedom. The power of a test is roughly speaking the chance of rejecting a hypothesis when it is false. This varies with the choice of critical level and the choice of alternative to the null hypothesis. A test is more powerful than another if no matter what the choice of critical level the first test has a higher probability of rejecting the hypothesis if it is false. Thus the chi-squared obtained in Table 3.3 should be given more consideration than that of Table 3.4.
4. Barron Brainerd, “Article use as an indicator of style among English-language authors” inLinguistik und Statistik, ed. S. Jäger (Braunschweig: Vieweg, 1972) pp. 11–32.
5. In some cases it can be shown that a certain modification of the Binomial distribution, based on a markov model of text generation, yields a better fit than the Poisson fit. However, one of the applications of the knowledge that a sample is Poisson lies in the remark that if a random variableX is Poisson, then the random variable $$Y = \sqrt {X + {3 \mathord{\left/ {\vphantom {3 8}} \right. \kern-\nulldelimiterspace} 8}} $$ is approximately normally distributed with variance 1/4. The classical hypothesis tests can then be applied toY. The sort of approximation that we achieve here is adequate for these purposes.