Characterizing Dysarthria Diversity for Automatic Speech Recognition: A Tutorial From the Clinical Perspective

Published: 2022-04-12 Volume: 4
ISSN: 2624-9898
Container-title: Frontiers in Computer Science
Short-container-title: Front. Comput. Sci.

Author:

Rowe Hannah P., Gutz Sarah E., Maffei Marc F., Tomanek Katrin, Green Jordan R.

Abstract

Despite significant advancements in automatic speech recognition (ASR) technology, even the best performing ASR systems are inadequate for speakers with impaired speech. This inadequacy may be, in part, due to the challenges associated with acquiring a sufficiently diverse training sample of disordered speech. Speakers with dysarthria, which refers to a group of divergent speech disorders secondary to neurologic injury, exhibit highly variable speech patterns both within and across individuals. This diversity is currently poorly characterized and, consequently, difficult to adequately represent in disordered speech ASR corpora. In this article, we consider the variable expressions of dysarthria within the context of established clinical taxonomies (e.g., Darley, Aronson, and Brown dysarthria subtypes). We also briefly consider past and recent efforts to capture this diversity quantitatively using speech analytics. Understanding dysarthria diversity from the clinical perspective and how this diversity may impact ASR performance could aid in (1) optimizing data collection strategies for minimizing bias; (2) ensuring representative ASR training sets; and (3) improving generalization of ASR for difficult-to-recognize speakers. Our overarching goal is to facilitate the development of robust ASR systems for dysarthric speech using clinical knowledge.

Funder

National Institute on Deafness and Other Communication Disorders

Publisher

Frontiers Media SA

Subject

General Medicine

Cited by 12 articles.

1. An automatic measure for speech intelligibility in dysarthrias—validation across multiple languages and neurological disorders;Frontiers in Digital Health;2024-07-23

2. Enhancing Communication Equity: Evaluation of an Automated Speech Recognition Application in Ghana;Proceedings of the CHI Conference on Human Factors in Computing Systems;2024-05-11

3. Self-Reported Voice and Swallow Questionnaires’ Alignment with Unified Parkinson’s Disease Rating Scale Questions: A Preliminary Study;Journal of Voice;2024-04

4. A Strategic Approach for Robust Dysarthric Speech Recognition;Wireless Personal Communications;2024-02

5. The Detection of Dysarthria Severity Levels Using AI Models: A Review;IEEE Access;2024