Validity of Off-the-Shelf Automatic Speech Recognition for Assessing Speech Intelligibility and Speech Severity in Speakers With Amyotrophic Lateral Sclerosis-Reference-Cited by-同舟云学术

Validity of Off-the-Shelf Automatic Speech Recognition for Assessing Speech Intelligibility and Speech Severity in Speakers With Amyotrophic Lateral Sclerosis

Published:2022-06-08 Issue:6 Volume:65 Page:2128-2143
ISSN:1092-4388
Container-title:Journal of Speech, Language, and Hearing Research
language:en
Short-container-title:J Speech Lang Hear Res

Author:

Gutz Sarah E.¹^ORCID,Stipancic Kaila L.²^ORCID,Yunusova Yana³⁴⁵^ORCID,Berry James D.⁶,Green Jordan R.¹⁷^ORCID

Affiliation:

1. Program in Speech and Hearing Bioscience and Technology, Harvard Medical School, Boston, MA

2. Department of Communicative Disorders and Sciences, University at Buffalo, NY

3. Department of Speech-Language Pathology, University of Toronto, Ontario, Canada

4. Hurvitz Brain Sciences Program, Sunnybrook Research Institute, Toronto, Ontario, Canada

5. Toronto Rehabilitation Institute, University Health Network, Ontario, Canada

6. Sean M. Healey and AMG Center for ALS, Massachusetts General Hospital, Boston

7. Department of Communication Sciences and Disorders, MGH Institute of Health Professions, Boston, MA

Abstract

Purpose: There is increasing interest in using automatic speech recognition (ASR) systems to evaluate impairment severity or speech intelligibility in speakers with dysarthria. We assessed the clinical validity of one currently available off-the-shelf (OTS) ASR system (i.e., a Google Cloud ASR API) for indexing sentence-level speech intelligibility and impairment severity in individuals with amyotrophic lateral sclerosis (ALS), and we provided guidance for potential users of such systems in research and clinic. Method: Using speech samples collected from 52 individuals with ALS and 20 healthy control speakers, we compared word recognition rate (WRR) from the commercially available Google Cloud ASR API (Machine WRR) to clinician-provided judgments of impairment severity, as well as sentence intelligibility (Human WRR). We assessed the internal reliability of Machine and Human WRR by comparing the standard deviation of WRR across sentences to the minimally detectable change (MDC), a clinical benchmark that indicates whether results are within measurement error. We also evaluated Machine and Human WRR diagnostic accuracy for classifying speakers into clinically established categories. Results: Human WRR achieved better accuracy than Machine WRR when indexing speech severity, and, although related, Human and Machine WRR were not strongly correlated. When the speech signal was mixed with noise (noise-augmented ASR) to reduce a ceiling effect, Machine WRR performance improved. Internal reliability metrics were worse for Machine than Human WRR, particularly for typical and mildly impaired severity groups, although sentence length significantly impacted both Machine and Human WRRs. Conclusions: Results indicated that the OTS ASR system was inadequate for early detection of speech impairment and grading overall speech severity. While Machine and Human WRR were correlated, ASR should not be used as a one-to-one proxy for transcription speech intelligibility or clinician severity ratings. Overall, findings suggested that the tested OTS ASR system, Google Cloud ASR, has limited utility for grading clinical speech impairment in speakers with ALS.

Publisher

American Speech Language Hearing Association

Subject

Speech and Hearing,Linguistics and Language,Language and Linguistics

Link

http://pubs.asha.org/doi/pdf/10.1044/2022_JSLHR-21-00589

Reference58 articles.

1. The diagnostic utility of patient-report and speech-language pathologists’ ratings for detecting the early onset of bulbar symptoms due to ALS

2. Shorter Sentence Length Maximizes Intelligibility and Speech Motor Performance in Persons With Dysarthria Due to Amyotrophic Lateral Sclerosis

3. Feasibility of Automatic Speech Recognition for Providing Feedback During Tablet-Based Treatment for Apraxia of Speech Plus Aphasia

4. Fitting Linear Mixed-Effects Models Usinglme4

5. Automatic speech recognition and speech variability: A review

Cited by 16 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Accuracy of Speech Sound Analysis: Comparison of an Automatic Artificial Intelligence Algorithm With Clinician Assessment;Journal of Speech, Language, and Hearing Research;2024-09-12

2. Automatic Speech Recognition in Primary Progressive Apraxia of Speech;Journal of Speech, Language, and Hearing Research;2024-09-12

3. Maxillectomy patients' speech and performance of contemporary speaker‐independent automatic speech recognition platforms in Japanese;Journal of Oral Rehabilitation;2024-08-12

4. An automatic measure for speech intelligibility in dysarthrias—validation across multiple languages and neurological disorders;Frontiers in Digital Health;2024-07-23

5. Automatic Speech Recognition of Conversational Speech in Individuals With Disordered Speech;Journal of Speech, Language, and Hearing Research;2024-07-04