The Application of Time–Frequency Masking To Improve Intelligibility of Dysarthric Speech in Background Noise

Author:

Borrie Stephanie A.1ORCID,Yoho Sarah E.12,Healy Eric W.2,Barrett Tyson S.3ORCID

Affiliation:

1. Department of Communicative Disorders and Deaf Education, Utah State University, Logan

2. Department of Speech and Hearing Science, The Ohio State University, Columbus

3. Department of Psychology, Utah State University, Logan

Abstract

Purpose: Background noise reduces speech intelligibility. Time–frequency (T-F) masking is an established signal processing technique that improves intelligibility of neurotypical speech in background noise. Here, we investigated a novel application of T-F masking, assessing its potential to improve intelligibility of neurologically degraded speech in background noise. Method: Listener participants ( N = 422) completed an intelligibility task either in the laboratory or online, listening to and transcribing audio recordings of neurotypical (control) and neurologically degraded (dysarthria) speech under three different processing types: speech in quiet (quiet), speech mixed with cafeteria noise (noise), and speech mixed with cafeteria noise and then subsequently processed by an ideal quantized mask (IQM) to remove the noise. Results: We observed significant reductions in intelligibility of dysarthric speech, even at highly favorable signal-to-noise ratios (+11 to +23 dB) that did not impact neurotypical speech. We also observed significant intelligibility improvements from speech in noise to IQM-processed speech for both control and dysarthric speech across a wide range of noise levels. Furthermore, the overall benefit of IQM processing for dysarthric speech was comparable with that of the control speech in background noise, as was the intelligibility data collected in the laboratory versus online. Conclusions: This study demonstrates proof of concept, validating the application of T-F masks to a neurologically degraded speech signal. Given that intelligibility challenges greatly impact communication, and thus the lives of people with dysarthria and their communication partners, the development of clinical tools to enhance intelligibility in this clinical population is critical.

Publisher

American Speech Language Hearing Association

Subject

Speech and Hearing,Linguistics and Language,Language and Linguistics

Reference59 articles.

1. Speech-to-noise levels and conversational intelligibility in hypophonia and Parkinson's disease;Adams S. G.;Journal of Medical Speech-Language Pathology,2008

2. American National Standards Institute. (1997). Methods for calculation of the Speech Intelligibility Index (ANSI/ASA S3.5-1997). Acoustical Society of America.

3. American National Standards Institute. (2004). Methods for manual pure-tone threshold audiometry (ANSI S3.21-2004 (R2009)) .

4. American National Standards Institute. (2010). Specification for audiometers (ANSI S3.6-2010) .

5. Acoustic-Phonetic Contrasts and Intelligibility in the Dysarthria Associated With Mixed Cerebral Palsy

Cited by 2 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3