Comparing alignment toward American, British, and Indian English text-to-speech (TTS) voices: influence of social attitudes and talker guise-Reference-Cited by-同舟云学术

Comparing alignment toward American, British, and Indian English text-to-speech (TTS) voices: influence of social attitudes and talker guise

Published:2023-07-03 Issue: Volume:5 Page:
ISSN:2624-9898
Container-title:Frontiers in Computer Science
language:
Short-container-title:Front. Comput. Sci.

Author:

Dodd Nicole,Cohn Michelle,Zellou Georgia

Abstract

Text-to-speech (TTS) voices, which vary in their apparent native language and dialect, are increasingly widespread. In this paper, we test how speakers perceive and align toward TTS voices that represent American, British, and Indian dialects of English and the extent that social attitudes shape patterns of convergence and divergence. We also test whether top-down knowledge of the talker, manipulated as a “human” or “device” guise, mediates these attitudes and accommodation. Forty-six American English-speaking participants completed identical interactions with 6 talkers (2 from each dialect) and rated each talker on a variety of social factors. Accommodation was assessed with AXB perceptual similarity by a separate group of raters. Results show that speakers had the strongest positive social attitudes toward the Indian English voices and converged toward them more. Conversely, speakers rate the American English voices as less human-like and diverge from them. Finally, speakers overall show more accommodation toward TTS voices that were presented in a “human” guise. We discuss these results through the lens of the Communication Accommodation Theory (CAT).

Funder

National Science Foundation

Publisher

Frontiers Media SA

Subject

Computer Science Applications,Computer Vision and Pattern Recognition,Human-Computer Interaction,Computer Science (miscellaneous)

Reference65 articles.

1. The clear speech intelligibility benefit for text-to-speech voices: effects of speaking style and visual guise;Aoki;JASA Exp. Lett.,2022

2. Voice onset time in Indian English-accented speech;Awan;Clin. Ling. Phonetics,2011

3. Dialect divergence and convergence in New Zealand English;Babel;Lang. Soc.,2010

4. Evidence for phonetic and social selectivity in spontaneous phonetic imitation;Babel;J. Phon.,2012

5. Novelty and social preference in phonetic accommodation;Babel;Lab. Phonol.,2014

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Linguistic analysis of human-computer interaction;Frontiers in Computer Science;2024-05-21

2. Analysis of English Speech Learning Quality based on Speech Recognition Technology;2024 International Conference on Optimization Computing and Wireless Communication (ICOCWC);2024-01-29

3. Perceptual identification of oral and nasalized vowels across American English and British English listeners and TTS voices;Frontiers in Communication;2023-12-11