The conditionality principle in high-dimensional regression-Reference-Cited by-同舟云学术

The conditionality principle in high-dimensional regression

Published:2019-05-20 Issue:3 Volume:106 Page:702-707
ISSN:0006-3444
Container-title:Biometrika
language:en
Short-container-title:

Author:

Azriel D¹

Affiliation:

1. Faculty of Industrial Engineering and Management, Technion - Israel Institute of Technology, Technion City, Haifa 3200003, Israel

Abstract

Summary Consider a high-dimensional linear regression problem, where the number of covariates is larger than the number of observations and the interest is in estimating the conditional variance of the response variable given the covariates. A conditional and an unconditional framework are considered, where conditioning is with respect to the covariates, which are ancillary to the parameter of interest. In recent papers, a consistent estimator was developed in the unconditional framework when the marginal distribution of the covariates is normal with known mean and variance. In the present work, a certain Bayesian hypothesis test is formulated under the conditional framework, and it is shown that the Bayes risk is a constant. This implies that no consistent estimator exists in the conditional framework. However, when the marginal distribution of the covariates is normal, the conditional error of the above consistent estimator converges to zero, with probability converging to one. It follows that even in the conditional setting, information about the marginal distribution of an ancillary statistic may have a significant impact on statistical inference. The practical implication in the context of high-dimensional regression models is that additional observations where only the covariates are given are potentially very useful and should not be ignored. This finding is most relevant to semi-supervised learning problems where covariate information is easy to obtain.

Publisher

Oxford University Press (OUP)

Subject

Applied Mathematics,Statistics, Probability and Uncertainty,General Agricultural and Biological Sciences,Agricultural and Biological Sciences (miscellaneous),General Mathematics,Statistics and Probability

Link

http://academic.oup.com/biomet/advance-article-pdf/doi/10.1093/biomet/asz015/28694280/asz015.pdf

Reference18 articles.

1. Semi-supervised linear regression;Azriel,,2018

2. An ancillarity paradox which appears in multiple linear regression;Brown,;Ann. Statist.,1990

3. Models as approximations, part I: A conspiracy of nonlinearity and random regressors in linear regression;Buja,,2016

4. Efficient and adaptive linear regression in semi-supervised settings;Chakrabortty,;Ann. Statist.,2018

5. Semi-Supervised Learning

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A zero-estimator approach for estimating the signal level in a high-dimensional model-free setting;Journal of Statistical Planning and Inference;2025-01

2. Semi-Supervised Linear Regression;Journal of the American Statistical Association;2021-05-18