Paralinguistic Privacy Protection at the Edge-Reference-Cited by-同舟云学术

Paralinguistic Privacy Protection at the Edge

Published:2023-04-13 Issue:2 Volume:26 Page:1-27
ISSN:2471-2566
Container-title:ACM Transactions on Privacy and Security
language:en
Short-container-title:ACM Trans. Priv. Secur.

Author:

Aloufi Ranya¹^ORCID,Haddadi Hamed¹^ORCID,Boyle David¹^ORCID

Affiliation:

1. Imperial College London, UK

Abstract

Voice user interfaces and digital assistants are rapidly entering our lives and becoming singular touch points spanning our devices. These always-on services capture and transmit our audio data to powerful cloud services for further processing and subsequent actions. Our voices and raw audio signals collected through these devices contain a host of sensitive paralinguistic information that is transmitted to service providers regardless of deliberate or false triggers. As our emotional patterns and sensitive attributes like our identity, gender, and well-being are easily inferred using deep acoustic models, we encounter a new generation of privacy risks by using these services. One approach to mitigate the risk of paralinguistic-based privacy breaches is to exploit a combination of cloud-based processing with privacy-preserving, on-device paralinguistic information learning and filtering before transmitting voice data. In this article we introduce EDGY , a configurable, lightweight, disentangled representation learning framework that transforms and filters high-dimensional voice data to identify and contain sensitive attributes at the edge prior to offloading to the cloud. We evaluate EDGY’s on-device performance and explore optimization techniques, including model quantization and knowledge distillation, to enable private, accurate, and efficient representation learning on resource-constrained devices. Our results show that EDGY runs in tens of milliseconds with 0.2% relative improvement in “zero-shot” ABX score or minimal performance penalties of approximately 5.95% word error rate (WER) in learning linguistic representations from raw voice signals, using a CPU and a single-core ARM processor without specialized hardware.

Publisher

Association for Computing Machinery (ACM)

Subject

Safety, Risk, Reliability and Quality,General Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/3570161

Reference92 articles.

1. The faults in our ASRs: An overview of attacks against automatic speech recognition and speaker identification systems;Abdullah Hadi;arXiv preprint arXiv:2007.06622,2020

2. Privacy guarantees for de-identifying text transformations;Adelani David Ifeoluwa;arXiv preprint arXiv:2008.03101,2020

3. Shimaa Ahmed, Amrita Roy Chowdhury, Kassem Fawaz, and Parmesh Ramanathan. 2020. Preech: A system for privacy-preserving speech transcription. In 29th \(\lbrace\) USENIX \(\rbrace\) Security Symposium ( \(\lbrace\) USENIX \(\rbrace\) Security’20). 2703–2720.

4. Ranya Aloufi, Hamed Haddadi, and David Boyle. 2019. Emotion filtering at the edge. In Proceedings of the 1st Workshop on Machine Learning on Edge in Sensor Systems. Association for Computing Machinery. 10.1145/3362743.3362960

5. Privacy-preserving Voice Analysis via Disentangled Representations

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Privacy preservation in sensor-based Human Activity Recognition through autoencoders for low-power IoT devices;Internet of Things;2024-07

2. Privacy-Oriented Manipulation of Speaker Representations;IEEE Access;2024

3. Construction and Management of Doctor-Patient Privacy Protection System under Big Data Computing Environment;2023 3rd International Conference on Mobile Networks and Wireless Communications (ICMNWC);2023-12-04