Affiliation:
1. TU Darmstadt, Germany
2. KTH, Sweden
3. Johannes Gutenberg University Mainz, Germany
Abstract
Distributed query processing is an effective means for processing large amounts of data. To abstract from the technicalities of distributed systems, algorithms for operator placement automatically distribute sequential data queries over the available processing units. However, current algorithms for operator placement focus on performance and ignore privacy concerns that arise when handling sensitive data.
We present a new methodology for privacy-aware operator placement that both prevents leakage of sensitive information and improves performance. Crucially, our approach is based on an information-flow type system for data queries to reason about the sensitivity of query subcomputations. Our solution unfolds in two phases. First, placement space reduction generates deployment candidates based on privacy constraints using a syntax-directed transformation driven by the information-flow type system. Second, constraint solving selects the best placement among the candidates based on a cost model that maximizes performance. We verify that our algorithm preserves the sequential behavior of queries and prevents leakage of sensitive data. We implemented the type system and placement algorithm for a new query language SecQL and demonstrate significant performance improvements in benchmarks.
Publisher
Association for Computing Machinery (ACM)
Subject
Safety, Risk, Reliability and Quality,Software
Reference58 articles.
1. Akka. 2019. Akka toolkit and runtime. http://akka.io . Akka. 2019. Akka toolkit and runtime. http://akka.io .
2. Secure database-as-a-service with Cipherbase
3. AWS. 2019. AWS Fargate. https://aws.amazon.com/de/fargate/ . AWS. 2019. AWS Fargate. https://aws.amazon.com/de/fargate/ .
4. TrustedDB: A Trusted Hardware-Based Database with Privacy and Data Confidentiality
5. Explainable security for relational databases
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Object Graph Programming;Proceedings of the IEEE/ACM 46th International Conference on Software Engineering;2024-02-06
2. An authorization model for query execution in the cloud;The VLDB Journal;2021-11-06
3. Distributed Query Evaluation over Encrypted Data;Data and Applications Security and Privacy XXXV;2021
4. WIP: Pods: Privacy Compliant Scalable Decentralized Data Services;Heterogeneous Data Management, Polystores, and Analytics for Healthcare;2021