The Application of Time–Frequency Masking To Improve Intelligibility of Dysarthric Speech in Background Noise

Published: 2023-05-09 Issue: 5 Volume: 66 Page: 1853-1866
ISSN: 1092-4388
Container-title: Journal of Speech, Language, and Hearing Research
Language: en
Short-container-title: J Speech Lang Hear Res

Author:

Borrie Stephanie A.¹, Yoho Sarah E.¹,², Healy Eric W.², Barrett Tyson S.³

Affiliation:

1. Department of Communicative Disorders and Deaf Education, Utah State University, Logan

2. Department of Speech and Hearing Science, The Ohio State University, Columbus

3. Department of Psychology, Utah State University, Logan

Abstract

Purpose: Background noise reduces speech intelligibility. Time–frequency (T-F) masking is an established signal processing technique that improves intelligibility of neurotypical speech in background noise. Here, we investigated a novel application of T-F masking, assessing its potential to improve intelligibility of neurologically degraded speech in background noise.

Method: Listener participants (N = 422) completed an intelligibility task either in the laboratory or online, listening to and transcribing audio recordings of neurotypical (control) and neurologically degraded (dysarthria) speech under three different processing types: speech in quiet (quiet), speech mixed with cafeteria noise (noise), and speech mixed with cafeteria noise and then subsequently processed by an ideal quantized mask (IQM) to remove the noise.

Results: We observed significant reductions in intelligibility of dysarthric speech, even at highly favorable signal-to-noise ratios (+11 to +23 dB) that did not impact neurotypical speech. We also observed significant intelligibility improvements from speech in noise to IQM-processed speech for both control and dysarthric speech across a wide range of noise levels. Furthermore, the overall benefit of IQM processing for dysarthric speech was comparable with that of the control speech in background noise, as was the intelligibility data collected in the laboratory versus online.

Conclusions: This study demonstrates proof of concept, validating the application of T-F masks to a neurologically degraded speech signal. Given that intelligibility challenges greatly impact communication, and thus the lives of people with dysarthria and their communication partners, the development of clinical tools to enhance intelligibility in this clinical population is critical.

Publisher

American Speech-Language-Hearing Association

Subject

Speech and Hearing, Linguistics and Language, Language and Linguistics

Link

http://pubs.asha.org/doi/pdf/10.1044/2023_JSLHR-22-00558

Cited by 2 articles.

1. Agricultural commercialization in the Mekong region: A meta-narrative review and policy implications;Journal of Land Use Science;2023-03-23

2. Frontier tourism development and inequality in the Nepal Himalaya;Journal of Sustainable Tourism;2023-02-01