Affiliation:
1. Department of Communicative Disorders and Deaf Education, Utah State University, Logan
2. Department of Speech and Hearing Science, The Ohio State University, Columbus
3. Department of Psychology, Utah State University, Logan
Abstract
Purpose:
Background noise reduces speech intelligibility. Time–frequency (T-F) masking is an established signal processing technique that improves intelligibility of neurotypical speech in background noise. Here, we investigated a novel application of T-F masking, assessing its potential to improve intelligibility of neurologically degraded speech in background noise.
Method:
Listener participants (
N
= 422) completed an intelligibility task either in the laboratory or online, listening to and transcribing audio recordings of neurotypical (control) and neurologically degraded (dysarthria) speech under three different processing types: speech in quiet (quiet), speech mixed with cafeteria noise (noise), and speech mixed with cafeteria noise and then subsequently processed by an ideal quantized mask (IQM) to remove the noise.
Results:
We observed significant reductions in intelligibility of dysarthric speech, even at highly favorable signal-to-noise ratios (+11 to +23 dB) that did not impact neurotypical speech. We also observed significant intelligibility improvements from speech in noise to IQM-processed speech for both control and dysarthric speech across a wide range of noise levels. Furthermore, the overall benefit of IQM processing for dysarthric speech was comparable with that of the control speech in background noise, as was the intelligibility data collected in the laboratory versus online.
Conclusions:
This study demonstrates proof of concept, validating the application of T-F masks to a neurologically degraded speech signal. Given that intelligibility challenges greatly impact communication, and thus the lives of people with dysarthria and their communication partners, the development of clinical tools to enhance intelligibility in this clinical population is critical.
Publisher
American Speech Language Hearing Association
Subject
Speech and Hearing,Linguistics and Language,Language and Linguistics
Reference59 articles.
1. Speech-to-noise levels and conversational intelligibility in hypophonia and Parkinson's disease;Adams S. G.;Journal of Medical Speech-Language Pathology,2008
2. American National Standards Institute. (1997). Methods for calculation of the Speech Intelligibility Index (ANSI/ASA S3.5-1997). Acoustical Society of America.
3. American National Standards Institute. (2004). Methods for manual pure-tone threshold audiometry (ANSI S3.21-2004 (R2009))
.
4. American National Standards Institute. (2010). Specification for audiometers (ANSI S3.6-2010)
.
5. Acoustic-Phonetic Contrasts and Intelligibility in the Dysarthria Associated With Mixed Cerebral Palsy
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献