LLAMA: A Low Latency Math Library for Secure Inference-Reference-Cited by-同舟云学术

LLAMA: A Low Latency Math Library for Secure Inference

Published:2022-10 Issue:4 Volume:2022 Page:274-294
ISSN:2299-0984
Container-title:Proceedings on Privacy Enhancing Technologies
language:
Short-container-title:PoPETs

Author:

Gupta Kanav¹,Kumaraswamy Deepak¹,Chandran Nishanth¹,Gupta Divya¹

Affiliation:

1. Microsoft Research.

Abstract

Secure machine learning (ML) inference can provide meaningful privacy guarantees to both the client (holding sensitive input) and the server (holding sensitive weights of the ML model) while realizing inferenceas-a-service. Although many specialized protocols exist for this task, including those in the preprocessing model (where a majority of the overheads are moved to an input independent offline phase), they all still suffer from large online complexity. Specifically, the protocol phase that executes once the parties know their inputs, has high communication, round complexity, and latency. Function Secret Sharing (FSS) based techniques offer an attractive solution to this in the trusted dealer model (where a dealer provides input independent correlated randomness to both parties), and 2PC protocols obtained based on these techniques have a very lightweight online phase. Unfortunately, current FSS-based 2PC works (AriaNN, PoPETS 2022; Boyle et al. Eurocrypt 2021; Boyle et al. TCC 2019) fall short of providing a complete solution to secure inference. First, they lack support for math functions (e.g., sigmoid, and reciprocal square root) and hence, are insufficient for a large class of inference algorithms (e.g. recurrent neural networks). Second, they restrict all values in the computation to be of the same bitwidth and this prevents them from benefitting from efficient float-to-fixed converters such as Tensorflow Lite that crucially use low bitwidth representations and mixed bitwidth arithmetic. In this work, we present LLAMA – an end-to-end, FSS based, secure inference library supporting precise low bitwidth computations (required by converters) as well as provably precise math functions; thus, overcoming all the drawbacks listed above. We perform an extensive evaluation of LLAMA and show that when compared with non-FSS based libraries supporting mixed bitwidth arithmetic and math functions (SIRNN, IEEE S&P 2021), it has at least an order of magnitude lower communication, rounds, and runtimes. We integrate LLAMA with the EzPC framework (IEEE EuroS&P 2019) and demonstrate its robustness by evaluating it on large benchmarks (such as ResNet-50 on the ImageNet dataset) as well as on benchmarks considered in AriaNN – here too LLAMA outperforms prior work.

Publisher

Privacy Enhancing Technologies Symposium Advisory Board

Subject

General Medicine

Cited by 14 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. PIPO: Privacy-Preserving Convolutional Neural Network Inference with Plaintext Operations;2024 IEEE 44th International Conference on Distributed Computing Systems (ICDCS);2024-07-23

2. Communication-Efficient Secure Logistic Regression;2024 IEEE 9th European Symposium on Security and Privacy (EuroS&P);2024-07-08

3. Efficient Privacy-Preserving Approximation of the Kidney Exchange Problem;Proceedings of the 19th ACM Asia Conference on Computer and Communications Security;2024-07

4. Nomadic: Normalising Maliciously-Secure Distance with Cosine Similarity for Two-Party Biometric Authentication;Proceedings of the 19th ACM Asia Conference on Computer and Communications Security;2024-07

5. Make Split, not Hijack: Preventing Feature-Space Hijacking Attacks in Split Learning;Proceedings of the 29th ACM Symposium on Access Control Models and Technologies;2024-06-24