Scribe

Published: 2023-12-19 Issue: 4 Volume: 7 Page: 1-31
ISSN: 2474-9567
Container-title: Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies
Language: en
Short-container-title: Proc. ACM Interact. Mob. Wearable Ubiquitous Technol.

Author:

Bai Yang¹, Shahid Irtaza¹, Takawale Harshvardhan¹, Roy Nirupam¹

Affiliation:

1. University of Maryland, College Park, Maryland, USA

Abstract

This paper presents the design and implementation of Scribe, a comprehensive voice processing and handwriting interface for voice assistants. Unlike prior work, Scribe is a precise tracking interface that can coexist with the voice interface on voice assistants with low sampling rates. Scribe can be used for 3D free-form drawing, writing, and motion tracking for gaming. Taking handwriting as a specific application, it can also capture natural strokes and an individual's writing style while occupying only a single frequency. The core technique is an accurate acoustic ranging method called Cross Frequency Continuous Wave (CFCW) sonar, which lets voice assistants use ultrasound as a ranging signal while using their regular microphone system as the receiver. We also design a new optimization algorithm that requires only a single frequency for time difference of arrival estimation. The Scribe prototype achieves a median error of 73 μm for 1D ranging and 1.4 mm for 3D tracking of an acoustic beacon, using the microphone array found in voice assistants. Our implementation of an in-air handwriting interface achieves 94.1% accuracy with automatic handwriting-to-text software, close to that of writing on paper (96.6%). At the same time, the error rate of voice-based user authentication increases only from 6.26% to 8.28%.

Funder

NSF

Publisher

Association for Computing Machinery (ACM)

Link

https://dl.acm.org/doi/pdf/10.1145/3631411

References: 90 articles.

1. 2021. Roughly 1 in 4 U.S. adults now owns a smart speaker according to New Report. https://martech.org/roughly-1-in-4-u-s-adults-now-owns-a-smart-speaker-according-to-new-report/

2. 2022. 20kHz speaker. https://www.digikey.com/en/products/detail/pui-audio-inc./ASX05408-HD-R/7227653utm_adgroup=Speakers&utm_source=google&utm_medium=cpc&utm_campaign=Shopping_Product_Audio

3. 2022. 60kHz ultrasound speaker. https://www.steminc.com/PZT/en/ultrasonic-air-transducer-60-khz

4. 2022. 80kHz ultrasound speaker. https://www.steminc.com/PZT/en/ultrasonic-air-transducer-80-khz

5. 2022. Google Speech-To-Text API. https://cloud.google.com/speech-to-text