Affiliation:
1. University of Waterloo
2. University of Toronto
Abstract
The authors analyze the efficiency of six missing data techniques for categorical item nonresponse under the assumption that data are missing at random or missing completely at random. By efficiency, the authors mean a procedure that produces an unbiased estimate of true sample properties that is also easy to implement. The investigated techniques include listwise deletion, mode substitution, random imputation, two regression imputations, and a Bayesian model-based procedure. The authors analyze efficiency under six experimental conditions for a survey-based data set. They find that listwise deletion is efficient for the data analyzed. If data loss due to listwise deletion is an issue, the analysis points to the Bayesian method. Regression imputation is also efficient, but the result is conditioned on the specific data structure and may not hold in general. Additional problems arise when using regression imputation, making it less appropriate.
Subject
Management of Technology and Innovation,Strategy and Management,General Decision Sciences
Reference19 articles.
1. Åstebro T. & Bernhardt, I. (in press). Start-up financing, owner characteristics and survival. Journal of Business and Economics.
2. A comparison of inclusive and restrictive strategies in modern missing data procedures.
3. Dempster, A. P., Laird, N. M. & Rubin, D. B. (1977). Maximum likelihood estimation from incomplete data via the EM algorithm. Journal of the Royal Statistical Society, 39(Series B), 1-38.
4. A Primer on Maximum Likelihood Algorithms Available for Use With Missing Data
5. The impact of nonnormality on full information maximum-likelihood estimation for structural equation models with missing data.
Cited by
28 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献