Abstract
Unprecedented quantities of data that could help social scientists understand and ameliorate the challenges of human society are presently locked away inside companies, governments, and other organizations, in part because of privacy concerns. We address this problem with a general-purpose data access and analysis system with mathematical guarantees of privacy for research subjects, and statistical validity guarantees for researchers seeking social science insights. We build on the standard of “differential privacy,” correct for biases induced by the privacy-preserving procedures, provide a proper accounting of uncertainty, and impose minimal constraints on the choice of statistical methods and quantities estimated. We illustrate by replicating key analyses from two recent published articles and show how we can obtain approximately the same substantive results while simultaneously protecting privacy. Our approach is simple to use and computationally efficient; we also offer open-source software that implements all our methods.
Publisher
Cambridge University Press (CUP)
Subject
Political Science and International Relations,Sociology and Political Science
Reference58 articles.
1. Models for Sample Selection Bias
2. Differential Privacy: A Primer for a Non-Technical Audience;Wood;Vanderbilt Journal of Entertainment and Technology Law,2018
3. Differentially Private Significance Tests for Regression Coefficients
4. A New Model for Industry–Academic Partnerships;King;PS: Political Science and Politics,2020
5. Abowd, John M. 2018. “Staring-Down the Database Reconstruction Theorem.” In Joint Statistical Meetings, Vancouver, BC. https://bit.ly/census-reid.
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献