Author:
Bodenham Dean A., Kawahara Yoshinobu
Abstract
The maximum mean discrepancy (MMD) test is a nonparametric kernelised two-sample test that, when using a characteristic kernel, can detect any distributional change between two samples. However, when the total number of $d$-dimensional observations is $n$, direct computation of the test statistic is $\mathcal{O}(dn^2)$. While approximations with lower computational complexity are known, more efficient methods for computing the exact test statistic are unknown. This paper provides an exact method for computing the MMD test statistic for the univariate case in $\mathcal{O}(n\log n)$ using the Laplacian kernel. Furthermore, this exact method is extended to an approximate method for $d$-dimensional real-valued data, also with complexity log-linear in the number of observations. Experiments show that this approximate method can have good statistical performance when compared to the exact test, particularly in cases where $d > n$.
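To illustrate how a univariate Laplacian-kernel MMD statistic can be computed exactly in log-linear time, the following is a minimal Python sketch. It is not taken from the paper and may differ from the authors' algorithm. It assumes the kernel parametrisation $k(x, y) = \exp(-|x - y|/\sigma)$ and the biased (V-statistic) estimate of the squared MMD. The function names `laplacian_mmd2_fast` and `laplacian_mmd2_naive` and the parameter `sigma` are hypothetical. The idea is that, once the pooled sample is sorted, the kernel sum from each point to all earlier points can be updated with a single multiplicative decay, so the only super-linear cost is the sort.

```python
import math


def laplacian_mmd2_fast(xs, ys, sigma=1.0):
    """Biased squared MMD with k(u, v) = exp(-|u - v| / sigma).

    Sorting costs O(n log n); the single pass afterwards is O(n).
    Assumed parametrisation and estimator, chosen for illustration.
    """
    m, n = len(xs), len(ys)
    # Pool both samples, tagging each value with its sample of origin.
    pooled = sorted([(x, 0) for x in xs] + [(y, 1) for y in ys])

    a = 0.0  # sum of k(z, x_i) over x-points already visited
    b = 0.0  # sum of k(z, y_j) over y-points already visited
    xx = yy = xy = 0.0  # kernel sums over unordered pairs
    prev = pooled[0][0]
    for z, label in pooled:
        # Moving right from prev to z scales every earlier kernel
        # value by the same factor, because the data are sorted.
        decay = math.exp(-(z - prev) / sigma)
        a *= decay
        b *= decay
        if label == 0:
            xx += a   # pairs (z, earlier x)
            xy += b   # pairs (z, earlier y)
            a += 1.0  # z now counts as a visited x-point
        else:
            yy += b
            xy += a
            b += 1.0
        prev = z

    # Full double sums: diagonal terms k(u, u) = 1 plus twice the
    # off-diagonal pair sums; the cross sum counts each pair once.
    return (m + 2.0 * xx) / m**2 + (n + 2.0 * yy) / n**2 - 2.0 * xy / (m * n)


def laplacian_mmd2_naive(xs, ys, sigma=1.0):
    """Direct O(n^2) evaluation of the same quantity, for checking."""
    def k(u, v):
        return math.exp(-abs(u - v) / sigma)

    m, n = len(xs), len(ys)
    kxx = sum(k(u, v) for u in xs for v in xs)
    kyy = sum(k(u, v) for u in ys for v in ys)
    kxy = sum(k(u, v) for u in xs for v in ys)
    return kxx / m**2 + kyy / n**2 - 2.0 * kxy / (m * n)
```

On small inputs the two functions should agree up to floating-point rounding, which gives a simple way to check the fast version before using it on larger samples.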
Publisher
Springer Science and Business Media LLC
Subject
Computational Theory and Mathematics,Statistics, Probability and Uncertainty,Statistics and Probability,Theoretical Computer Science