Affiliation:
1. University of Illinois at Urbana-Champaign
Abstract
Purpose
A critical issue in assessing speech recognition involves understanding the factors that cause listeners to make errors. Models like the articulation index show that average error decreases logarithmically with increases in signal-to-noise ratio (SNR). The authors investigated (a) whether this log-linear relationship holds across consonants and for individual tokens and (b) what accounts for differences in error rates at the across- and within-consonant levels.
Method
Listeners with normal hearing heard CV syllables (16 consonants and 4 vowels) spoken by 14 talkers, presented at 6 SNRs. Stimuli were presented randomly, and listeners indicated which syllable they heard.
Results
The log-linear relationship between error and SNR holds across consonants but breaks down at the token level. These 2 sources of variability (across- and within-consonant factors) explain the majority of listeners' errors. Moreover, simply adjusting for differences in token-level error thresholds explains 62% of the variability in listeners' responses.
Conclusions
These results demonstrate that speech tests must control for the large variability among tokens, not average across them, as is commonly done in clinical practice. Accounting for token-level differences in error thresholds with listeners with normal hearing provides a basis for tests designed to diagnostically evaluate individual differences with listeners with hearing impairment.
Publisher
American Speech Language Hearing Association
Subject
Speech and Hearing,Linguistics and Language,Language and Linguistics
Cited by
16 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献