Automated Linking of Historical Data

Published: 2021-09-01, Volume: 59, Issue: 3, Pages: 865-918
ISSN: 0022-0515
Container-title: Journal of Economic Literature
Language: en

Author:

Abramitzky Ran¹, Boustan Leah², Eriksson Katherine³, Feigenbaum James⁴, Pérez Santiago⁵

Affiliation:

1. Stanford University and NBER

2. Princeton University and NBER

3. UC Davis and NBER

4. Boston University and NBER

5. UC Davis and NBER

Abstract

The recent digitization of complete count census data is an extraordinary opportunity for social scientists to create large longitudinal datasets by linking individuals from one census to another or from other sources to the census. We evaluate different automated methods for record linkage, performing a series of comparisons across methods and against hand linking. We have three main findings that lead us to conclude that automated methods perform well. First, a number of automated methods generate very low (less than 5 percent) false positive rates. The automated methods trace out a frontier illustrating the trade-off between the false positive rate and the (true) match rate. Relative to more conservative automated algorithms, humans tend to link more observations but at a cost of higher rates of false positives. Second, when human linkers and algorithms use the same linking variables, there is relatively little disagreement between them. Third, across a number of plausible analyses, coefficient estimates and parameters of interest are very similar when using linked samples based on each of the different automated methods. We provide code and Stata commands to implement the various automated methods. (JEL C81, C83, N01, N31, N32)
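To make the trade-off between the false positive rate and the (true) match rate concrete, here is a minimal Python sketch. It is not the authors' code or any of the algorithms the paper evaluates: the linking rule (exact name match plus a unique candidate within a birth-year band), the record fields, the toy data, and the function names `link_records` and `evaluate` are all assumptions made for illustration. The true match rate is taken here as correct links divided by records to be linked, and the false positive rate as incorrect links divided by links made.

```python
def link_records(census_a, census_b, max_age_gap=0):
    """Hypothetical rule: link a record from census_a only when exactly one
    census_b record has the same name and a birth year within max_age_gap."""
    links = {}
    for rec_a in census_a:
        candidates = [
            rec_b for rec_b in census_b
            if rec_b["name"] == rec_a["name"]
            and abs(rec_b["birth_year"] - rec_a["birth_year"]) <= max_age_gap
        ]
        if len(candidates) == 1:  # skip records with zero or several candidates
            links[rec_a["id"]] = candidates[0]["id"]
    return links


def evaluate(links, truth, n_records):
    """Return (true match rate, false positive rate) against hand-checked truth."""
    correct = sum(1 for a, b in links.items() if truth.get(a) == b)
    wrong = len(links) - correct
    true_match_rate = correct / n_records
    false_positive_rate = wrong / len(links) if links else 0.0
    return true_match_rate, false_positive_rate


# Toy data (invented for illustration only).
census_a = [
    {"id": "a1", "name": "John Smith", "birth_year": 1850},
    {"id": "a2", "name": "Mary Jones", "birth_year": 1862},
    {"id": "a3", "name": "Peter Olsen", "birth_year": 1870},
]
census_b = [
    {"id": "b1", "name": "John Smith", "birth_year": 1850},
    {"id": "b3", "name": "Mary Jones", "birth_year": 1863},
    {"id": "b4", "name": "Peter Olsen", "birth_year": 1872},
]
# Assumed ground truth: a3 has no true match; b4 is a different Peter Olsen.
truth = {"a1": "b1", "a2": "b3"}

strict = evaluate(link_records(census_a, census_b, max_age_gap=0), truth, len(census_a))
loose = evaluate(link_records(census_a, census_b, max_age_gap=2), truth, len(census_a))
print("strict (true match rate, false positive rate):", strict)
print("loose  (true match rate, false positive rate):", loose)
```

In this toy setup, widening the birth-year band links more records correctly but also admits an incorrect link, which is the kind of frontier the abstract describes.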

Publisher

American Economic Association

Subject

Economics and Econometrics

Link

https://pubs.aeaweb.org/doi/pdf/10.1257/jel.20201599

Cited by 84 articles.

1. Public pensions and retirement: Evidence from the Railroad Retirement Act;Journal of Public Economics;2024-10

2. Elite persistence in Sierra Leone: What can names tell us?;Journal of Development Economics;2024-10

3. Institutional discrimination and assimilation: Evidence from the Chinese Exclusion Act of 1882;Explorations in Economic History;2024-10

4. Changing the pace of the melting pot: The effects of immigration restrictions on immigrant assimilation;Journal of Comparative Economics;2024-09

5. Asian American Diversity and Growth;Annual Review of Sociology;2024-08-12