Minmaxing of Bayesian Improved Surname Geocoding and Geography Level Ups in Predicting Race

Published: 2021-11-29
Pages: 1-7
ISSN: 1047-1987
Container-title: Political Analysis
Language: en
Short-container-title: Polit. Anal.

Author:

Clark Jesse T., Curiel John A., Steelman Tyler S.

Abstract

Racial identification is a critical factor in understanding a multitude of important outcomes in many fields. However, inferring an individual’s race from ecological data is prone to bias and error. This process was only recently improved via Bayesian improved surname geocoding (BISG). With surname and geographic-based demographic data, it is possible to more accurately estimate individual racial identification than ever before. However, the level of geography used in this process varies widely. Whereas some existing work makes use of geocoding to place individuals in precise census blocks, a substantial portion either skips geocoding altogether or relies on estimation using surname or county-level analyses. Presently, the trade-offs of such variation are unknown. In this letter, we quantify those trade-offs through a validation of BISG on Georgia’s voter file using both geocoded and nongeocoded processes and introduce a new level of geography (ZIP codes) to this method. We find that when estimating the racial identification of White and Black voters, nongeocoded ZIP code-based estimates are acceptable alternatives. However, census blocks provide the most accurate estimations when imputing racial identification for Asian and Hispanic voters. Our results document the most efficient means to sequentially conduct BISG analysis to maximize racial identification estimation while simultaneously minimizing data missingness and bias.
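
For readers unfamiliar with BISG, the core update can be sketched in a few lines. The sketch below is illustrative only and is not the authors' code: it applies the standard Bayes' rule form, P(race | surname, geography) proportional to P(race | surname) x P(geography | race), and falls back to the surname-only estimate when no geography is available. The function name `bisg_posterior`, the race categories, and every probability in the example are hypothetical placeholders, not figures from the letter or from census data.

```python
from typing import Dict, Optional

# Hypothetical category labels, chosen only for this illustration.
RACES = ("white", "black", "hispanic", "asian", "other")


def bisg_posterior(
    p_race_given_surname: Dict[str, float],
    p_geo_given_race: Optional[Dict[str, float]] = None,
) -> Dict[str, float]:
    """Return P(race | surname, geography) by Bayes' rule.

    p_race_given_surname: surname-based prior over RACES.
    p_geo_given_race: share of each group's population living in the
        person's geographic unit (block, ZIP code, county, ...).
        If None, the normalized surname-only prior is returned.
    """
    if p_geo_given_race is None:
        unnormalized = {r: p_race_given_surname[r] for r in RACES}
    else:
        unnormalized = {
            r: p_race_given_surname[r] * p_geo_given_race[r] for r in RACES
        }
    total = sum(unnormalized.values())
    if total == 0:
        raise ValueError("all category probabilities are zero")
    return {r: v / total for r, v in unnormalized.items()}


# Made-up inputs: a surname that is mostly White in the prior,
# located in a unit where Black residents are over-represented.
example_prior = {
    "white": 0.60, "black": 0.30, "hispanic": 0.05,
    "asian": 0.03, "other": 0.02,
}
example_geo = {
    "white": 0.001, "black": 0.004, "hispanic": 0.001,
    "asian": 0.001, "other": 0.001,
}

example_posterior = bisg_posterior(example_prior, example_geo)
surname_only = bisg_posterior(example_prior)  # no geography available
```

In this toy case the geographic term shifts the most likely category from White to Black, which is the kind of correction finer geography can supply. Passing a different `p_geo_given_race` table (block, ZIP code, or county) is how the level of geography would vary under this sketch.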

Publisher

Cambridge University Press (CUP)

Subject

Political Science and International Relations, Sociology and Political Science

References: 21 articles.

1. The Turnout Gap

2. Examining scientific writing styles from the perspective of linguistic complexity

3. Measuring the Rural Continuum in Political Science

4. Can Violent Protest Change Local Policy Support? Evidence from the Aftermath of the 1992 Los Angeles Riot

5. S-maup: Statistical test to measure the sensitivity to the modifiable areal unit problem

Cited by 5 articles.

1. Who Owns the Neighborhood? Ethnoracial Composition of Property Ownership and Neighborhood Trajectories in San Francisco; City & Community; 2024-06-24

2. The Power of Characters: Evaluating Machine Learning-Modified Bayesian Improved Surname Geocoding Inference of Race in Redistricting; State Politics & Policy Quarterly; 2024-05-22

3. It’s All in the Name: A Character-Based Approach to Infer Religion; Political Analysis; 2023-03-23

4. Methods for retrospectively improving race/ethnicity data quality: a scoping review; Epidemiologic Reviews; 2023

5. Validating the Applicability of Bayesian Inference with Surname and Geocoding to Congressional Redistricting; Political Analysis; 2022-05-20