Affiliation:
1. Department of Statistics and Data Science, and Biostatistics, University of California , Los Angeles, CA 90095 , USA
Abstract
Abstract
Traditional nonparametric estimation methods often lead to a slow convergence rate in large dimensions and require unrealistically large dataset sizes for reliable conclusions. We develop an approach based on partial derivatives, either observed or estimated, to effectively estimate the function at near-parametric convergence rates. This novel approach and computational algorithm could lead to methods useful to practitioners in many areas of science and engineering. Our theoretical results reveal behaviour universal to this class of nonparametric estimation problems. We explore a general setting involving tensor product spaces and build upon the smoothing spline analysis of variance framework. For d-dimensional models under full interaction, the optimal rates with gradient information on p covariates are identical to those for the (d−p)-interaction models without gradients and, therefore, the models are immune to the curse of interaction. For additive models, the optimal rates using gradient information are n, thus achieving the parametric rate. We demonstrate aspects of the theoretical results through synthetic and real data applications.
Funder
Eunice Kennedy Shriver National Institute of Child Health and Human Development
Publisher
Oxford University Press (OUP)
Reference48 articles.
1. Stochastic kriging for simulation metamodeling;Ankenman;Operations Research,2010
2. Theory of reproducing kernels;Aronszajn;Transactions of the American Mathematical Society,1950
3. A theorem on Fourier-Stieltjes integrals;Bochner;Bulletin of the American Mathematical Society,1934
4. On adaptive estimation of linear functionals;Cai;The Annals of Statistics,2005