Abstract
Differential privacy (DP) has emerged in the computer science literature as a measure of the impact on an individual’s privacy resulting from the publication of a statistical output such as a frequency table. This paper provides an introduction to DP for official statisticians and discusses its relevance, benefits and challenges from a National Statistical Organisation (NSO) perspective. We motivate our study by examining how privacy is evolving in the era of big data and how this might prompt a shift from the traditional statistical disclosure techniques used in official statistics – which are generally applied on a cell-by-cell or table-by-table basis – to formal privacy methods, like DP, which consider the totality of the outputs generated from a given dataset. We identify an important interplay between DP’s holistic privacy risk measure and the difficulty NSOs face in implementing DP, showing that DP’s major advantage is also its major challenge. This paper provides new work addressing two key DP research areas for NSOs: DP’s application to survey data and its incorporation within the Five Safes framework.
Subject
Statistics, Probability and Uncertainty; Economics and Econometrics; Management Information Systems