Comparing Speech and Keyboard Text Entry for Short Messages in Two Languages on Touchscreen Phones-Reference-Cited by-同舟云学术

Comparing Speech and Keyboard Text Entry for Short Messages in Two Languages on Touchscreen Phones

Published:2018-01-08 Issue:4 Volume:1 Page:1-23
ISSN:2474-9567
Container-title:Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies
language:en
Short-container-title:Proc. ACM Interact. Mob. Wearable Ubiquitous Technol.

Author:

Ruan Sherry¹,Wobbrock Jacob O.²,Liou Kenny³,Ng Andrew¹,Landay James A.¹

Affiliation:

1. Stanford University

2. University of Washington

3. Symantec Corp.

Abstract

With the ubiquity of mobile touchscreen devices like smartphones, two widely used text entry methods have emerged: small touch-based keyboards and speech recognition. Although speech recognition has been available on desktop computers for years, it has continued to improve at a rapid pace, and it is currently unknown how today's modern speech recognizers compare to state-of-the-art mobile touch keyboards, which also have improved considerably since their inception. To discover both methods' “upper-bound performance,” we evaluated them in English and Mandarin Chinese on an Apple iPhone 6 Plus in a laboratory setting. Our experiment was carried out using Baidu's Deep Speech 2, a deep learning-based speech recognition system, and the built-in Qwerty (English) or Pinyin (Mandarin) Apple iOS keyboards. We found that with speech recognition, the English input rate was 2.93 times faster (153 vs. 52 WPM), and the Mandarin Chinese input rate was 2.87 times faster (123 vs. 43 WPM) than the keyboard for short message transcription under laboratory conditions for both methods. Furthermore, although speech made fewer errors during entry (5.30% vs. 11.22% corrected error rate), it left slightly more errors in the final transcribed text (1.30% vs. 0.79% uncorrected error rate). Our results show that comparatively, under ideal conditions for both methods, upper-bound speech recognition performance has greatly improved compared to prior systems, and might see greater uptake in the future, although further study is required to quantify performance in non-laboratory settings for both methods.

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Networks and Communications,Hardware and Architecture,Human-Computer Interaction

Link

https://dl.acm.org/doi/pdf/10.1145/3161187

Reference67 articles.

1. Interactions, partial interactions, and interaction contrasts in the analysis of variance.

2. The human factors of speech-based interfaces

Cited by 68 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Melvin is a conversational voice interface for cancer genomics data;Communications Biology;2024-01-05

2. EyeClick:A Robust Two-Step Eye-Hand Interaction for Text Entry in Augmented Reality Glasses;Adjunct Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology;2023-10-29

3. STAR: Smartphone-analogous Typing in Augmented Reality;Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology;2023-10-29

4. Synergi: A Mixed-Initiative System for Scholarly Synthesis and Sensemaking;Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology;2023-10-29

5. GlassMessaging;Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies;2023-09-27