Affiliation:
1. University of Illinois at Urbana-Champaign
2. George Mason University
Abstract
In this paper, we present a novel deep neural network architecture that reconstructs the high-frequency audio of selected spoken human words from low-sampling-rate signals of (ego-)motion sensors, such as accelerometer and gyroscope data, recorded on everyday mobile devices. As the sampling rate of such motion sensors is much lower than the Nyquist rate of ordinary human voice (around 6kHz+), these motion sensor recordings suffer from a significant frequency aliasing effect. In order to recover the original high-frequency audio signal, our neural network introduces a novel layer, called the alias unfolding layer, specialized in expanding the bandwidth of an aliased signal by reversing the frequency folding process in the time-frequency domain. While perfect unfolding is known to be unrealizable, we leverage the sparsity of the original signal to arrive at a sufficiently accurate statistical approximation. Comprehensive experiments show that our neural network significantly outperforms the state of the art in audio reconstruction from motion sensor data, effectively reconstructing a pre-trained set of spoken keywords from low-frequency motion sensor signals (with a sampling rate of 100-400 Hz). The approach demonstrates the potential risk of information leakage from motion sensors in smart mobile devices.
Publisher
Association for Computing Machinery (ACM)
Subject
Computer Networks and Communications,Hardware and Architecture,Human-Computer Interaction
Reference44 articles.
1. 2019. Voice frequency. https://en.wikipedia.org/wiki/Voice_frequency 2019. Voice frequency. https://en.wikipedia.org/wiki/Voice_frequency
2. A Simple Cepstral Domain DNN Approach to Artificial Speech Bandwidth Extension
3. Spearphone
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Towards Bone-Conducted Vibration Speech Enhancement on Head-Mounted Wearables;Proceedings of the 21st Annual International Conference on Mobile Systems, Applications and Services;2023-06-18
2. Accuth;Proceedings of the 20th ACM Conference on Embedded Networked Sensor Systems;2022-11-06
3. An Audio Frequency Unfolding Framework for Ultra-Low Sampling Rate Sensors;2022 23rd International Symposium on Quality Electronic Design (ISQED);2022-04-06