Affiliation:
1. University at Buffalo, State University of New York, Department of Computer Science and Engineering, Buffalo, NY, USA
Abstract
With the rapid growth of artificial intelligence and mobile computing, intelligent speech interface has recently become one of the prevalent trends and has already presented huge potentials to the public. To address the privacy leakage issue during the speech interaction or accommodate some special demands, silent speech interfaces have been proposed to enable people's communication without vocalizing their sound (e.g., lip reading, tongue tracking). However, most existing silent speech mechanisms require either background illuminations or additional wearable devices. In this study, we propose the EchoWhisper as a novel user-friendly, smartphone-based silent speech interface. The proposed technique takes advantage of the micro-Doppler effect of the acoustic wave resulting from mouth and tongue movements and assesses the acoustic features of beamformed reflected echoes captured by the dual microphones in the smartphone. Using human subjects who perform a daily conversation task with over 45 different words, our system can achieve a WER (word error rate) of 8.33%, which shows the effectiveness of inferring silent speech content. Moreover, EchoWhisper has also demonstrated its reliability and robustness to a variety of configuration settings and environmental factors, such as smartphone orientations and distances, ambient noises, body motions, and so on.
Publisher
Association for Computing Machinery (ACM)
Subject
Computer Networks and Communications,Hardware and Architecture,Human-Computer Interaction
Reference61 articles.
1. Lip2Audspec: Speech Reconstruction from Silent Lip Movements Video
2. Speech and Music Classification and Separation: A Review
3. Constantine A Balanis. 2016. Antenna theory: analysis and design. John wiley & sons. Constantine A Balanis. 2016. Antenna theory: analysis and design. John wiley & sons.
Cited by
32 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. EarSSR: Silent Speech Recognition via Earphones;IEEE Transactions on Mobile Computing;2024-08
2. Lipwatch: Enabling Silent Speech Recognition on Smartwatches using Acoustic Sensing;Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies;2024-05-13
3. MELDER: The Design and Evaluation of a Real-time Silent Speech Recognizer for Mobile Devices;Proceedings of the CHI Conference on Human Factors in Computing Systems;2024-05-11
4. Robust Dual-Modal Speech Keyword Spotting for XR Headsets;IEEE Transactions on Visualization and Computer Graphics;2024-05
5. UFace;Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies;2024-03-06