Efficient basis selection for smoothing splines via rotated lattices
Author:
Diao Huaimin (1),
Ai Mengtong (2),
Tian Yubin (1),
Yu Jun (1)
Affiliation:
1. School of Mathematics and Statistics, Beijing Institute of Technology, Beijing, China
2. Department of Statistics, University of Michigan, Ann Arbor, Michigan, USA
Abstract
Fitting a smoothing spline model on a large‐scale dataset is daunting due to the high computational cost. In this study, we develop an efficient basis selection method for smoothing spline calculation. The key idea is to force a nonparametric function in an infinite‐dimensional functional space to reside in a relatively small, finite‐dimensional model space without losing much prediction accuracy. Such an approximation naturally allows for much faster numerical calculation, especially for large datasets. Among various basis selection methods, space‐filling basis selection has been proven to be more efficient since its model space dimension is smaller than that of the others. Despite these algorithmic benefits, most space‐filling selection methods take only the overall space‐filling property into account. These methods may be less efficient when the underlying response surface is not isomorphic. To overcome this obstacle, we develop an efficient algorithm to improve projective uniformity for space‐filling basis selection. We prove that the proposed estimator has the same convergence rate as the full‐basis estimator. Compared with the standard approach, the proposed method significantly reduces the computational cost. Simulation and real data studies demonstrate the efficiency and superiority of the proposed method.
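The general idea of basis selection described in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm: it assumes two-dimensional inputs on the unit square, uses a Gaussian kernel as a stand-in for the smoothing spline reproducing kernel, omits the unpenalized null space, and fixes the smoothing parameter instead of tuning it. All function names, the rotation angle, the bandwidth, and the simulated data are hypothetical choices made for illustration only.

```python
import numpy as np

def rotated_lattice(q, angle=np.arctan(0.5)):
    """Return roughly q points of a rotated square lattice inside [0, 1]^2."""
    h = 1.0 / np.sqrt(q)                    # spacing giving about q points per unit area
    k = int(np.ceil(np.sqrt(2) / h)) + 1    # grid wide enough to cover the square after rotation
    g = np.arange(-k, k + 1) * h
    grid = np.stack(np.meshgrid(g, g), axis=-1).reshape(-1, 2)
    c, s = np.cos(angle), np.sin(angle)
    rot = np.array([[c, -s], [s, c]])
    pts = grid @ rot.T + 0.5                # rotate about the origin, then centre on the square
    inside = np.all((pts >= 0) & (pts <= 1), axis=1)
    return pts[inside]

def select_basis(x, design):
    """Pick, for each design point, the index of the nearest observation."""
    d2 = ((design[:, None, :] - x[None, :, :]) ** 2).sum(axis=2)
    return np.unique(d2.argmin(axis=1))

def gaussian_kernel(a, b, bandwidth=0.1):
    """Assumed stand-in kernel; a spline reproducing kernel would be used in practice."""
    d2 = ((a[:, None, :] - b[None, :, :]) ** 2).sum(axis=2)
    return np.exp(-d2 / (2 * bandwidth ** 2))

def fit_selected_basis(x, y, idx, lam=1e-6):
    """Minimise (1/n)||y - R_q c||^2 + lam * c' R_qq c over the q selected bases."""
    n = len(y)
    R_q = gaussian_kernel(x, x[idx])        # n x q: every observation against selected bases
    R_qq = gaussian_kernel(x[idx], x[idx])  # q x q: penalty matrix on the selected bases
    A = R_q.T @ R_q + n * lam * R_qq
    return np.linalg.lstsq(A, R_q.T @ y, rcond=None)[0]

def predict(x_new, x, idx, coef):
    return gaussian_kernel(x_new, x[idx]) @ coef

# Simulated example (hypothetical data, not from the paper).
rng = np.random.default_rng(0)
n = 2000
x = rng.uniform(size=(n, 2))
truth = lambda z: np.sin(2 * np.pi * z[:, 0]) + z[:, 1]
y = truth(x) + rng.normal(scale=0.1, size=n)

design = rotated_lattice(q=50)
idx = select_basis(x, design)
coef = fit_selected_basis(x, y, idx)

x_test = rng.uniform(size=(500, 2))
rmse = np.sqrt(np.mean((predict(x_test, x, idx, coef) - truth(x_test)) ** 2))
print(len(idx), "bases selected out of", n, "observations; test RMSE:", rmse)
```

The linear system solved here has only as many unknowns as selected bases, which is where the computational saving over the full-basis fit comes from. Rotating the lattice keeps its one-dimensional projections from collapsing onto a few distinct values, which is one simple way to think about projective uniformity.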
Funder
National Natural Science Foundation of China
Natural Science Foundation of Beijing Municipality
Beijing Institute of Technology Research Fund Program for Young Scholars
Subject
Statistics, Probability and Uncertainty; Statistics and Probability
Cited by
1 article.
1. Nonparametric Additive Models for Billion Observations. Journal of Computational and Graphical Statistics. 2024-03-19.