Affiliation:
1. Southern Methodist Univ., Dallas, TX
Abstract
The problem of user inference in statistical databases is discussed and illustrated with several examples. It is assumed that the database allows “total,” “average,” “count,” and “percentile” queries; a query may refer to any arbitrary subset of the database. Methods for protecting the security of such a database are considered; it is shown that any scheme which gives “statistically correct” answers is vulnerable to penetration. A precise definition of compromisability (in a statistical sense) is given. A general model of user inference is proposed; two special cases of this model appear to contain all previously published strategies for compromising a statistical database. A method for protecting the security of such a statistical database against these types of user inference is presented and discussed. It is shown that the number of queries required to compromise the database can be made arbitrarily large by accepting moderate increases in the variance of responses to queries. A numerical example is presented to illustrate the application of the techniques discussed.
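The vulnerability of exact answers can be illustrated with a minimal sketch of a difference attack. This is only a toy illustration and not the paper's general model of user inference; the table `SALARIES` and the functions `total` and `difference_attack` are hypothetical names introduced here.

```python
from typing import Callable, Dict, Set

# Hypothetical toy database (an assumption for illustration):
# record id -> confidential salary.
SALARIES: Dict[int, float] = {1: 52000.0, 2: 61000.0, 3: 48000.0, 4: 75000.0}


def total(ids: Set[int]) -> float:
    """Exact "total" query over an arbitrary subset of records."""
    return sum(SALARIES[i] for i in ids)


def difference_attack(
    query: Callable[[Set[int]], float], ids: Set[int], target: int
) -> float:
    """Infer one record's value from two subset totals that differ
    only in whether the target record is included."""
    return query(ids) - query(ids - {target})


# Two legitimate-looking aggregate queries reveal record 3 exactly.
inferred = difference_attack(total, {1, 2, 3, 4}, 3)
print(inferred)
```

If the responses were perturbed rather than exact, the same subtraction would return the target value plus noise, so a user would need repeated queries to narrow the estimate; that is the trade-off between response variance and the number of queries that the abstract describes.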
Publisher
Association for Computing Machinery (ACM)
Cited by
77 articles.