Reproducible Speech Research With the Artificial Intelligence–Ready PERCEPT Corpora-Reference-Cited by-同舟云学术

Reproducible Speech Research With the Artificial Intelligence–Ready PERCEPT Corpora

Published:2023-06-20 Issue:6 Volume:66 Page:1986-2009
ISSN:1092-4388
Container-title:Journal of Speech, Language, and Hearing Research
language:en
Short-container-title:J Speech Lang Hear Res

Author:

Benway Nina R.¹^ORCID,Preston Jonathan L.¹²^ORCID,Hitchcock Elaine³^ORCID,Rose Yvan⁴,Salekin Asif⁵,Liang Wendy⁶,McAllister Tara⁶^ORCID

Affiliation:

1. Department of Communication Sciences & Disorders, Syracuse University, NY

2. Haskins Laboratories, New Haven, CT

3. Department of Communication Sciences and Disorders, Montclair State University, NJ

4. Department of Linguistics, Memorial University, St. John's, Newfoundland and Labrador, Canada

5. Department of Electrical Engineering and Computer Science, Syracuse University, NY

6. Department of Communicative Sciences and Disorders, New York University, NY

Abstract

Background: Publicly available speech corpora facilitate reproducible research by providing open-access data for participants who have consented/assented to data sharing among different research teams. Such corpora can also support clinical education, including perceptual training and training in the use of speech analysis tools. Purpose: In this research note, we introduce the PERCEPT (Perceptual Error Rating for the Clinical Evaluation of Phonetic Targets) corpora, PERCEPT-R (Rhotics) and PERCEPT-GFTA (Goldman-Fristoe Test of Articulation), which together contain over 36 hr of speech audio (> 125,000 syllable, word, and phrase utterances) from children, adolescents, and young adults aged 6–24 years with speech sound disorder (primarily residual speech sound disorders impacting /ɹ/) and age-matched peers. We highlight PhonBank as the repository for the corpora and demonstrate use of the associated speech analysis software, Phon, to query PERCEPT-R. A worked example of research with PERCEPT-R, suitable for clinical education and research training, is included as an appendix. Support for end users and information/descriptive statistics for future releases of the PERCEPT corpora can be found in a dedicated Slack channel. Finally, we discuss the potential for PERCEPT corpora to support the training of artificial intelligence clinical speech technology appropriate for use with children with speech sound disorders, the development of which has historically been constrained by the limited representation of either children or individuals with speech impairments in publicly available training corpora. Conclusions: We demonstrate the use of PERCEPT corpora, PhonBank, and Phon for clinical training and research questions appropriate to child citation speech. Increased use of these tools has the potential to enhance reproducibility in the study of speech development and disorders.

Publisher

American Speech Language Hearing Association

Subject

Speech and Hearing,Linguistics and Language,Language and Linguistics

Link

http://pubs.asha.org/doi/pdf/10.1044/2023_JSLHR-22-00343

Reference42 articles.

1. Comparing Biofeedback Types for Children With Residual /ɹ/ Errors in American English: A Single-Case Randomization Design

2. Benway, N. R., Preston, J. L., Hitchcock, E. R., Salekin, A., Sharma, H., & McAllister, T. (2022). PERCEPT-R: An open-access American English child/clinical speech corpus specialized for the audio classification of /ɹ/. In H. Ko & J. H. L. Hansen (Eds.), Proceedings of Interspeech 2022 (pp. 3648–3652). International Speech Communication Association. https://doi.org/10.21437/Interspeech.2022-10785

3. Boersma P. & Weenink D. (2019). Praat: Doing phonetics by computer (Version 6.1.38) [Computer software]. https://www.fon.hum.uva.nl/praat/

4. The Articulatory Phonetics of /r/ for Residual Speech Errors

5. Selecting an acoustic correlate for automated measurement of /r/ production in children

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Evaluating acoustic representations and normalization for rhoticity classification in children with speech sound disorders;JASA Express Letters;2024-02-01