Affiliation:
1. Minnesota Population Center, University of Minnesota
Abstract
Abstract
Virtually all quantitative microdata used by social scientists derive from samples that incorporate clustering, strati cation, and weighting adjustments (Kish 1965, 1992). Such data can yield standard error estimates that differ dramatically from those derived from a simple random sample of the same size. Researchers using historical U.S. census microdata, however, usually apply methods designed for simple random samples. The resulting p values and confidence intervals could be inaccurate and could lead to erroneous research conclusions. Because U.S. census microdata samples are among the most widely used sources for social science and policy research, the need for reliable standard error estimation is critical. We evaluate the historical microdata samples of the Integrated Public Use Microdata Series (IPUMS) project from 1850 to 1950 in order to determine (1) the impact of sample design on standard error estimates, and (2) how to apply modern standard error estimation software to historical census samples. We exploit a unique new data source from the 1880 census to validate our methods for standard error estimation, and then we apply this approach to the 1850–1870 and 1900–1950 decennial censuses. We conclude that Taylor series estimation can be used effectively with the historical decennial census microdata samples and should be applied in research analyses that have the potential for substantial clustering effects.
Reference30 articles.
1. Estimating Standard Errors for Regression Coefficients Using the Current Population Survey’s Public Use File;Davern;Inquiry,2007
2. A Comparison of Variance Estimators Using the Taylor Series Approximation;Dippo,1984
3. The 1880 United States Population Database;Goeken;Historical Methods,2003
4. Survey Inference for Subpopulations;Graubard;American Journal of Epidemiology,1996
Cited by
5 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献