Privacy Preserving Feature Selection for Sparse Linear Regression-Reference-Cited by-同舟云学术

Privacy Preserving Feature Selection for Sparse Linear Regression

Published:2024-01 Issue:1 Volume:2024 Page:300-313
ISSN:2299-0984
Container-title:Proceedings on Privacy Enhancing Technologies
language:
Short-container-title:PoPETs

Author:

Akavia Adi¹,Galili Ben²,Shaul Hayim³,Weiss Mor⁴,Yakhini Zohar⁵

Affiliation:

1. University of Haifa

2. Technion

3. IBM Research

4. Bar-Ilan University

5. Reichman University and Technion

Abstract

Privacy-Preserving Machine Learning (PPML) provides protocols for learning and statistical analysis of data that may be distributed amongst multiple data owners (e.g., hospitals that own proprietary healthcare data), while preserving data privacy. The PPML literature includes protocols for various learning methods, including ridge regression. Ridge regression controls the L2 norm of the model, but does not aim to strictly reduce the number of non-zero coefficients, namely the L0 norm of the model. Reducing the number of non-zero coefficients (a form of feature selection) is important for avoiding overfitting, and for reducing the cost of using learnt models in practice. In this work, we develop a first privacy-preserving protocol for sparse linear regression under L0 constraints. The protocol addresses data contributed by several data owners (e.g., hospitals). Our protocol outsources the bulk of the computation to two non-colluding servers, using homomorphic encryption as a central tool. We provide a rigorous security proof for our protocol, where security is against semi-honest adversaries controlling any number of data owners and at most one server. We implemented our protocol, and evaluated performance with nearly a million samples and up to 40 features.

Publisher

Privacy Enhancing Technologies Symposium Advisory Board

Subject

General Medicine

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Privacy Preserving Epigenetic PaceMaker Stronger Privacy and Improved Efficiency;2024-02-20

2. Privacy Preserving Epigenetic PaceMaker: Stronger Privacy and Improved Efficiency;Lecture Notes in Computer Science;2024