Design and implementation of a privacy preserving electronic health record linkage tool in Chicago

Author:

Kho Abel N1,Cashy John P12,Jackson Kathryn L1,Pah Adam R1,Goel Satyender1,Boehnke Jörn3,Humphries John Eric3,Kominers Scott Duke4,Hota Bala N5,Sims Shannon A5,Malin Bradley A6,French Dustin D7,Walunas Theresa L1,Meltzer David O2,Kaleba Erin O8,Jones Roderick C9,Galanter William L10

Affiliation:

1. Department of Medicine, and Center for Health Information Partnerships, Feinberg School of Medicine, Northwestern University, Chicago, IL, USA

2. Department of Veterans Affairs, Pittsburgh PA

3. Department of Economics, University of Chicago, Chicago, IL, USA

4. Society of Fellows Department of Economics, Business School, Program For Evolutionary Dynamics, and Center for Research on Computation and Society, Harvard University, Cambridge, MA, USA

5. Department of Medicine, Rush University Medical Center, Chicago, IL, USA

6. Department of Biomedical Informatics, School of Medicine, and Department of Electrical Engineering and Computer Science, School of Engineering, Vanderbilt University, Nashville, TN, USA

7. Center for Healthcare Studies and Department of Ophthalmology, Northwestern University Feinberg School of Medicine, Chicago, IL, USA

8. Alliance of Chicago Community Health Services, Chicago, IL, USA

9. Formerly of Chicago Department of Public Health, currently at Ann and Robert H. Lurie Children's Hospital, Chicago, IL, USA

10. University of Illinois Hospital and Health Sciences System, Chicago, IL, USA

Abstract

Abstract Objective To design and implement a tool that creates a secure, privacy preserving linkage of electronic health record (EHR) data across multiple sites in a large metropolitan area in the United States (Chicago, IL), for use in clinical research. Methods The authors developed and distributed a software application that performs standardized data cleaning, preprocessing, and hashing of patient identifiers to remove all protected health information. The application creates seeded hash code combinations of patient identifiers using a Health Insurance Portability and Accountability Act compliant SHA-512 algorithm that minimizes re-identification risk. The authors subsequently linked individual records using a central honest broker with an algorithm that assigns weights to hash combinations in order to generate high specificity matches. Results The software application successfully linked and de-duplicated 7 million records across 6 institutions, resulting in a cohort of 5 million unique records. Using a manually reconciled set of 11 292 patients as a gold standard, the software achieved a sensitivity of 96% and a specificity of 100%, with a majority of the missed matches accounted for by patients with both a missing social security number and last name change. Using 3 disease examples, it is demonstrated that the software can reduce duplication of patient records across sites by as much as 28%. Conclusions Software that standardizes the assignment of a unique seeded hash identifier merged through an agreed upon third-party honest broker can enable large-scale secure linkage of EHR data for epidemiologic and public health research. The software algorithm can improve future epidemiologic research by providing more comprehensive data given that patients may make use of multiple healthcare systems.

Publisher

Oxford University Press (OUP)

Subject

Health Informatics

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3