Affiliation:
1. Leonard N. Stern School of Business, New York University
Abstract
Longitudinal and clustered data, in which multiple observations are recorded for each individual, require special models that reflect their hierarchical structure. The most commonly used such model is the linear multilevel model, which combines a linear model for the population-level fixed effects, a linear model for normally distributed individual-level random effects, and normally distributed observation-level errors with constant variance. It has the advantage of simplicity of interpretation, but if the assumptions of the model do not hold, inferences drawn from it can be misleading. In this paper, we discuss the use of regression trees designed for multilevel data to construct goodness-of-fit tests for this model; these tests can be used to test for nonlinearity of the fixed effects or heteroscedasticity of the errors. Simulations show that the resultant tests are slightly conservative as 0.05 level tests and have good power to identify explainable model violations (that is, ones that are related to available covariate information in the data). Application of the tests is illustrated on two real datasets.
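For orientation, the linear multilevel model described in the abstract is commonly written in the form below. The notation is an assumption made here for illustration (it follows the usual linear mixed-effects convention and is not taken from the abstract): $y_i$ is the response vector for individual $i$, $X_i$ and $Z_i$ are the fixed-effects and random-effects design matrices, $\beta$ the fixed effects, $b_i$ the random effects with covariance $D$, and $\sigma^2$ the constant error variance.

```latex
\begin{aligned}
  y_i &= X_i \beta + Z_i b_i + \varepsilon_i, \qquad i = 1, \dots, N, \\
  b_i &\sim \mathcal{N}(0, D), \\
  \varepsilon_i &\sim \mathcal{N}(0, \sigma^2 I_{n_i}),
  \qquad b_i \text{ and } \varepsilon_i \text{ independent.}
\end{aligned}
```

In this notation, the two violations the tests target correspond to the mean of $y_i$ not being linear in the covariates (nonlinearity of the fixed effects) and to the variance of $\varepsilon_i$ not being the constant $\sigma^2$ (heteroscedasticity).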
Subject
Statistics, Probability and Uncertainty; Statistics and Probability
Cited by
3 articles.
1. Bibliography;Handbook of Regression Analysis With Applications in R;2020-09
2. Unbiased regression trees for longitudinal and clustered data;Computational Statistics & Data Analysis;2015-08
3. Unbiased Regression Trees for Longitudinal Data;SSRN Electronic Journal;2014