Affiliation:
1. EGE UNIVERSITY, FACULTY OF SCIENCE, DEPARTMENT OF STATISTICS
Abstract
In this study, we investigate whether the Tukey M robust regression method provides a solution for the data sets suffering from multicollinearity problem. It is observed that high values of variance inflation factors (VIF) which is a sign of the multiple linear link among the explanatory variables, cannot be controlled by the robust methods which work through the residual values. The reason for this fact is that multicollinearity and high values of VIF which is a result of multicollinearity do not produce extreme residuals. For this reason, the robust methods cannot provide a solution for the high VIF problem. This fact is shown by an extensive simulation study. In the simulation study, the explanatory variables were derived from trivariate normal distribution for three different correlation values. In this study, we also used two real-life data examples and we observed that the results support the findings of the simulation study. For all these reasons, we can conclude that specialized methods should be utilized in the case of multicollinearity.
Publisher
SDU Journal of Natural and Applied Sciences
Reference15 articles.
1. [1] Hocking, R.R., Pendleton, O.J. 1983. The regression dilemma. Commun. Stat. Theory Methods, 12(5), 497-527.
2. [2] Mansfield, E.R., Helms, B.P. 1982. Detecting multicollinearity. Am. Stat., 36, 158-160.
3. [3] Kutner, M.H., Nachtsheim, C.J., Neter, J., Li, W. 2004. Applied Linear Statistical Models, 5th edn. McGraw Hill, New York.
4. [4] Chatterjee, S., Hadi, A.S. 2012. Regression Analysis by Example, 5th edn. John Wiley and Sons, New Jersey.
5. [5] Marquaridt, D.W. 1970. Generalized inverses, ridge regression, biased linear estimation, and nonlinear estimation. Technometrics 12(3), 591-612.