Affiliation:
1. University of Toronto, Ontario, Canada
2. Institute of Biomaterials and Biomedical Engineering, Toronto
3. Toronto Rehabilitation Institute
Abstract
PurposeIn this study, the authors explored articulatory information as a means of improving the recognition of dysarthric speech by machine.MethodData were derived chiefly from the TORGO database of dysarthric articulation (Rudzicz, Namasivayam, & Wolff, 2011) in which motions of various points in the vocal tract are measured during speech. In the 1st experiment, the authors provided a baseline model indicating a relatively low performance with traditional automatic speech recognition (ASR) using only acoustic data from dysarthric individuals. In the 2nd experiment, the authors used various measures of entropy (statistical disorder) to determine whether characteristics of dysarthric articulation can reduce uncertainty in features of dysarthric acoustics. These findings led to the 3rd experiment, in which recorded dysarthric articulation was directly encoded into the speech recognition process.ResultsThe authors found that 18.3% of the statistical disorder in the acoustics of speakers with dysarthria can be removed if articulatory parameters are known. Using articulatory models reduces phoneme recognition errors relatively by up to 6% for speakers with dysarthria in speaker-dependent systems.ConclusionsArticulatory knowledge is useful in reducing rates of error in ASR for speakers with dysarthria and in reducing statistical uncertainty of their acoustic signals. These findings may help to guide clinical decisions related to the use of ASR in the future.
Publisher
American Speech Language Hearing Association
Subject
Speech and Hearing,Linguistics and Language,Language and Linguistics
Reference60 articles.
1. Motor Impairment following Blockade of the Infraorbital Nerve: Implications for the Use of Anesthetization Techniques in Speech Research
2. American Speech-Language-Hearing Association. (2011). Speech-language pathology medical review guidelines. Retrieved from www.asha.org/practice/reimbursement/SLP-medical-review-guidelines
3. Augmentative Communication Inc. (2007). Section 3: Clinical aspects of AAC devices. Retrieved from www.augcominc.com/whatsnew/ncs3.html
4. Unsupervised learning;Barlow H. B.;Neural Computation,1989
5. Tongue control for speech and swallowing in healthy younger and older adults;Bennett J. W.;International Association of Orofacial Myology,2007
Cited by
10 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献