Abstract
Deciding which predictors to use plays an integral role in deriving statistical models in a wide range of applications. Motivated by the challenges of predicting events across a telecommunications network, we propose a semi-automated, joint model-fitting and predictor selection procedure for linear regression models. Our approach can model and account for serial correlation in the regression residuals, produces sparse and interpretable models and can be used to jointly select models for a group of related responses. This is achieved through fitting linear models under constraints on the number of nonzero coefficients using a generalisation of a recently developed mixed integer quadratic optimisation approach. The resultant models from our approach achieve better predictive performance on the motivating telecommunications data than methods currently used by industry.
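For orientation, the constraint on the number of nonzero coefficients can be illustrated with the standard cardinality-constrained least-squares problem and one common mixed integer quadratic reformulation. This is only a sketch of the basic single-response, independent-error case, not the generalisation proposed in the paper. The symbols are assumptions introduced here: response vector $y$, design matrix $X$ with $p$ columns, coefficients $\beta$, sparsity budget $k$, binary indicators $z_j$ and a sufficiently large bound $M$.

```latex
% Best subset selection: least squares with at most k nonzero coefficients
\min_{\beta \in \mathbb{R}^{p}} \; \tfrac{1}{2}\,\lVert y - X\beta \rVert_2^2
\quad \text{subject to} \quad \lVert \beta \rVert_0 \le k

% A common big-M mixed integer quadratic reformulation (illustrative):
% z_j = 0 forces \beta_j = 0, and at most k indicators may equal 1
\min_{\beta,\, z} \; \tfrac{1}{2}\,\lVert y - X\beta \rVert_2^2
\quad \text{subject to} \quad
-M z_j \le \beta_j \le M z_j, \quad
z_j \in \{0, 1\}, \quad j = 1, \dots, p, \quad
\sum_{j=1}^{p} z_j \le k
```

Handling serial correlation in the residuals and selecting predictors jointly across several related responses, as described in the abstract, would require extending this basic form.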
Publisher
Springer Science and Business Media LLC
Subject
Computational Theory and Mathematics; Statistics, Probability and Uncertainty; Statistics and Probability; Theoretical Computer Science
Cited by
2 articles.