Affiliation:
1. Charles H. Dyson School of Applied Economics and Management, Cornell University, Ithaca, New York, USA
2. USDA Economic Research Service, Washington, DC, USA
Abstract
Suppressions in public data severely limit the usefulness of spatial data and hinder research applications. In this context, data imputation is necessary to deal with suppressed values. We present and validate a flexible data imputation method that can aid in the completion of under‐determined data systems. The validations use Monte Carlo and optimisation modelling techniques to recover suppressed data tables from the 2017 US Census of Agriculture. We then use econometric models to evaluate the accuracy of imputations from alternative models. Various metrics of forecast accuracy (e.g., MAPE and BIC) show the flexibility and capacity of this approach to accurately recover suppressed data. To illustrate the value of our method, we compare livestock water withdrawal estimates based on imputed data with those based on suppressed data, showing the bias that arises in research applications when suppressed values are simply dropped from the analysis.
Funder
Cornell University
Economic Research Service
Department of Agriculture, Australian Government
Subject
Economics and Econometrics, Agricultural and Biological Sciences (miscellaneous)
Cited by
3 articles.