Affiliation:
1. University of Warwick, Coventry CV4 7AL, UK
2. University of Cambridge, Cambridge CB2 1TN, UK
Abstract
We present the U-statistic permutation (USP) test of independence in the context of discrete data displayed in a contingency table. Either Pearson’s χ²-test of independence or the G-test is typically used for this task, but we argue that these tests have serious deficiencies, both in their inability to control the size of the test and in their power properties. By contrast, the USP test is guaranteed to control the size of the test at the nominal level for all sample sizes, has no issues with small (or zero) cell counts, and is able to detect distributions that violate independence in only a minimal way. The test statistic is derived from a U-statistic estimator of a natural population measure of dependence, and we prove that this is the unique minimum variance unbiased estimator of this population quantity. The practical utility of the USP test is demonstrated both on simulated data, where its power can be dramatically greater than that of Pearson’s test, the G-test and Fisher’s exact test, and on real data. The USP test is implemented in the R package USP.
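The permutation idea behind the size guarantee can be illustrated with a minimal Python sketch. It is not the USP package and not the U-statistic from the paper: as a stand-in it uses a simple plug-in estimate of the sum of squared differences between joint cell probabilities and products of marginals, which is assumed here to be the kind of dependence measure the abstract refers to. The function names (`table_to_pairs`, `plug_in_dependence`, `permutation_pvalue`) and the defaults `n_perm=999` and `seed=0` are hypothetical choices made for illustration.

```python
import random


def table_to_pairs(table):
    """Expand a contingency table (list of rows of counts) into (row, col) observations."""
    pairs = []
    for i, row in enumerate(table):
        for j, count in enumerate(row):
            pairs.extend([(i, j)] * count)
    return pairs


def plug_in_dependence(xs, ys, n_rows, n_cols):
    """Plug-in estimate of sum_ij (p_ij - p_i+ * p_+j)^2.

    Assumption: a simple stand-in for the paper's U-statistic estimator,
    which is unbiased; this plug-in version is not.
    """
    n = len(xs)
    joint = [[0] * n_cols for _ in range(n_rows)]
    for x, y in zip(xs, ys):
        joint[x][y] += 1
    row_tot = [sum(r) for r in joint]
    col_tot = [sum(joint[i][j] for i in range(n_rows)) for j in range(n_cols)]
    return sum(
        (joint[i][j] / n - row_tot[i] * col_tot[j] / n**2) ** 2
        for i in range(n_rows)
        for j in range(n_cols)
    )


def permutation_pvalue(table, n_perm=999, seed=0):
    """Permutation p-value for independence in a contingency table."""
    rng = random.Random(seed)
    pairs = table_to_pairs(table)
    xs = [p[0] for p in pairs]
    ys = [p[1] for p in pairs]
    n_rows, n_cols = len(table), len(table[0])
    observed = plug_in_dependence(xs, ys, n_rows, n_cols)
    exceed = 0
    for _ in range(n_perm):
        # Shuffling the column labels breaks any dependence on the row labels
        # while keeping both sets of marginal counts fixed.
        rng.shuffle(ys)
        if plug_in_dependence(xs, ys, n_rows, n_cols) >= observed:
            exceed += 1
    # Counting the observed statistic among the permutations keeps the
    # p-value valid for any sample size.
    return (1 + exceed) / (n_perm + 1)
```

Because the p-value is computed by ranking the observed statistic among permuted copies of the same data, rejecting when it is at most the nominal level does not rely on large-sample approximations, and zero cells need no special handling. For example, `permutation_pvalue([[20, 0], [0, 20]])` should be small, whereas a table whose cells exactly match the product of its marginals gives a p-value of 1.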
Funder
H2020 European Research Council
EPSRC
Subject
General Physics and Astronomy, General Engineering, General Mathematics
Cited by
11 articles.