Automated linkage of patient records from disparate sources

Published:2016-07-20 Issue:1 Volume:27 Page:172-184
ISSN:0962-2802
Container-title:Statistical Methods in Medical Research
language:en
Short-container-title:Stat Methods Med Res

Author:

Li Xiaochun¹, Xu Huiping¹, Shen Changyu¹, Grannis Shaun¹

Affiliation:

1. Indiana University School of Medicine, Indianapolis, USA

Abstract

We introduce an automated method of record linkage that has two key features: automated selection of match field interactions to include in the model for estimation, and automated threshold determination for classifying record pairs as matches or non-matches. We applied our method to two real-world examples. The first example demonstrated results consistent with our earlier work: when data quality is adequate and the match field discriminating power is high, matching algorithms exhibit similar performance. The second example demonstrated that our method yields a lower false positive rate and higher positive predictive value than the Fellegi-Sunter model in the face of low data quality. Simulation studies suggest that, compared with the Fellegi-Sunter model, our method exhibits better overall performance, as indicated by higher area under the curve, and less biased estimates of both the match prevalence rate and the m- and u-probabilities over a range of data scenarios, especially when the match prevalence is extreme. Computationally, our method is as efficient as the Fellegi-Sunter model. We recommend this method in situations where an unsupervised linking algorithm is needed.
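
The abstract does not give formulas, so as orientation only, here is a minimal sketch of how the comparator it names, the standard Fellegi-Sunter model, scores a record pair from m- and u-probabilities under conditional independence of the match fields. This is not the authors' method (which adds field interactions and automated threshold selection). The function name `fs_weight`, the three fields, and every probability value below are assumptions made up for illustration.

```python
import math

def fs_weight(agreements, m, u):
    """Fellegi-Sunter log2 match weight for one record pair.

    agreements: list of bools, True if the pair agrees on that field
    m: m-probabilities, P(field agrees | pair is a true match)
    u: u-probabilities, P(field agrees | pair is a non-match)
    Assumes the fields are conditionally independent given match status.
    """
    total = 0.0
    for agree, m_i, u_i in zip(agreements, m, u):
        if agree:
            # Agreement is evidence for a match when m_i > u_i.
            total += math.log2(m_i / u_i)
        else:
            # Disagreement is evidence against a match when m_i > u_i.
            total += math.log2((1 - m_i) / (1 - u_i))
    return total

# Illustrative (made-up) probabilities for three hypothetical match fields.
m = [0.95, 0.90, 0.85]
u = [0.01, 0.05, 0.10]

all_agree = fs_weight([True, True, True], m, u)
all_disagree = fs_weight([False, False, False], m, u)
```

A pair is then classified as a match when its weight exceeds a threshold. In the classical model that threshold is chosen by the analyst, whereas the abstract describes determining it automatically.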

Publisher

SAGE Publications

Subject

Health Information Management, Statistics and Probability, Epidemiology

Link

http://journals.sagepub.com/doi/pdf/10.1177/0962280215626180

Reference: 20 articles.

1. A Theory for Record Linkage

2. Linkage of patient records from disparate sources

3. Insights into latent class analysis of diagnostic test performance

4. A Probit Latent Class Model with General Correlation Structures for Evaluating Accuracy of Diagnostic Tests

Cited by 5 articles.

1. Synthetic data in health care: A narrative review;PLOS Digital Health;2023-01-06

2. Score Test for Assessing the Conditional Dependence in Latent Class Models and its Application to Record Linkage;Journal of the Royal Statistical Society Series C: Applied Statistics;2022-09-18

3. Syphilis testing adherence among women with livebirth deliveries: Indianapolis 2014-2016;BMC Pregnancy and Childbirth;2021-10-30

4. Daily Visualization of Statewide COVID-19 Healthcare Data;2020 Workshop on Visual Analytics in Healthcare (VAHC);2020-11

5. Incorporating conditional dependence in latent class models for probabilistic record linkage: Does it matter?;The Annals of Applied Statistics;2019-09-01