Affiliation:
1. Laboratory for Data Security – EPFL
Abstract
Abstract
Tree-based models are among the most efficient machine learning techniques for data mining nowadays due to their accuracy, interpretability, and simplicity. The recent orthogonal needs for more data and privacy protection call for collaborative privacy-preserving solutions. In this work, we survey the literature on distributed and privacy-preserving training of tree-based models and we systematize its knowledge based on four axes: the learning algorithm, the collaborative model, the protection mechanism, and the threat model. We use this to identify the strengths and limitations of these works and provide for the first time a framework analyzing the information leakage occurring in distributed tree-based model learning.
Reference193 articles.
1. [1] “Amazon sagemaker - xgboost algorithm,” https://docs.aws.amazon.com/sagemaker/latest/dg/xgboost.html.
2. [2] “DBLP: Computer science bibliography,” https://dblp.org/.
3. [3] “Google scholar,” https://scholar.google.com/.
4. [4] “Microsoft academic,” https://academic.microsoft.com/home.
5. [5] M. Abspoel, D. Escudero, and N. Volgushev, “Secure training of decision trees with continuous attributes,” Proceedings on Privacy Enhancing Technologies, 2021.
Cited by
6 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献