Expanding a Large Inclusive Study of Human Listening Rates-Reference-Cited by-同舟云学术

Expanding a Large Inclusive Study of Human Listening Rates

Published:2021-09-30 Issue:3 Volume:14 Page:1-26
ISSN:1936-7228
Container-title:ACM Transactions on Accessible Computing
language:en
Short-container-title:ACM Trans. Access. Comput.

Author:

Bragg Danielle¹,Reinecke Katharina²,Ladner Richard E.²

Affiliation:

1. Microsoft Research, Cambridge, MA, USA

2. University of Washington, Seattle, WA, USA

Abstract

As conversational agents and digital assistants become increasingly pervasive, understanding their synthetic speech becomes increasingly important. Simultaneously, speech synthesis is becoming more sophisticated and manipulable, providing the opportunity to optimize speech rate to save users time. However, little is known about people’s abilities to understand fast speech. In this work, we provide an extension of the first large-scale study on human listening rates, enlarging the prior study run with 453 participants to 1,409 participants and adding new analyses on this larger group. Run on LabintheWild, it used volunteer participants, was screen reader accessible, and measured listening rate by accuracy at answering questions spoken by a screen reader at various rates. Our results show that people who are visually impaired, who often rely on audio cues and access text aurally, generally have higher listening rates than sighted people. The findings also suggest a need to expand the range of rates available on personal devices. These results demonstrate the potential for users to learn to listen to faster rates, expanding the possibilities for human-conversational agent interaction.

Funder

NSF

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Science Applications,Human-Computer Interaction

Link

https://dl.acm.org/doi/pdf/10.1145/3461700

Reference83 articles.

1. Gerry T. M. Altmann (Ed.). 1995. Cognitive Models of Speech Processing: Psycholinguistic and Computational Perspectives. MIT Press. Gerry T. M. Altmann (Ed.). 1995. Cognitive Models of Speech Processing: Psycholinguistic and Computational Perspectives. MIT Press.

2. Emotional statistical parametric speech synthesis using LSTM-RNNs

3. Apple. 2017. VoiceOver. http://www.apple.com/accessibility/mac/vision/. (Accessed 2017-09-02). Apple. 2017. VoiceOver. http://www.apple.com/accessibility/mac/vision/. (Accessed 2017-09-02).

Cited by 6 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Uncovering Human Traits in Determining Real and Spoofed Audio: Insights from Blind and Sighted Individuals;Proceedings of the CHI Conference on Human Factors in Computing Systems;2024-05-11

2. Modality Synchronization When People With Aphasia Read With Text-to-Speech Support;American Journal of Speech-Language Pathology;2024-05

3. “Let the Volcano Erupt!”: Designing Sonification to Make Oceanography Accessible for Blind and Low Vision Students in Museum Environment;The 25th International ACM SIGACCESS Conference on Computers and Accessibility;2023-10-22

4. Assistive or Artistic Technologies? Exploring the Connections between Art, Disability and Wheelchair Use;The 24th International ACM SIGACCESS Conference on Computers and Accessibility;2022-10-22

5. Sonic Technologies of a Queer Breakup;Designing Interactive Systems Conference;2022-06-13