Evaluation of Acoustic Analyses of Voice in Nonoptimized Conditions-Reference-Cited by-同舟云学术

Evaluation of Acoustic Analyses of Voice in Nonoptimized Conditions

Published:2020-12-14 Issue:12 Volume:63 Page:3991-3999
ISSN:1092-4388
Container-title:Journal of Speech, Language, and Hearing Research
language:en
Short-container-title:J Speech Lang Hear Res

Author:

van der Woerd Benjamin¹^ORCID,Wu Min²,Parsa Vijay²³,Doyle Philip C.¹²⁴,Fung Kevin¹

Affiliation:

1. Department of Otolaryngology—Head and Neck Surgery, Western University, London, Ontario, Canada

2. School of Communication Sciences and Disorders, Western University, London, Ontario, Canada

3. Department of Electrical and Computer Engineering, Western University, London, Ontario, Canada

4. Department of Otolaryngology—Head and Neck Surgery, Stanford University School of Medicine, CA

Abstract

Objectives This study aimed to evaluate the fidelity and accuracy of a smartphone microphone and recording environment on acoustic measurements of voice. Method A prospective cohort proof-of-concept study. Two sets of prerecorded samples (a) sustained vowels (/a/) and (b) Rainbow Passage sentence were played for recording via the internal iPhone microphone and the Blue Yeti USB microphone in two recording environments: a sound-treated booth and quiet office setting. Recordings were presented using a calibrated mannequin speaker with a fixed signal intensity (69 dBA), at a fixed distance (15 in.). Each set of recordings (iPhone—audio booth, Blue Yeti—audio booth, iPhone—office, and Blue Yeti—office), was time-windowed to ensure the same signal was evaluated for each condition. Acoustic measures of voice including fundamental frequency ( f o ), jitter, shimmer, harmonic-to-noise ratio (HNR), and cepstral peak prominence (CPP), were generated using a widely used analysis program (Praat Version 6.0.50). The data gathered were compared using a repeated measures analysis of variance. Two separate data sets were used. The set of vowel samples included both pathologic ( n = 10) and normal ( n = 10), male ( n = 5) and female ( n = 15) speakers. The set of sentence stimuli ranged in perceived voice quality from normal to severely disordered with an equal number of male ( n = 12) and female ( n = 12) speakers evaluated. Results The vowel analyses indicated that the jitter, shimmer, HNR, and CPP were significantly different based on microphone choice and shimmer, HNR, and CPP were significantly different based on the recording environment. Analysis of sentences revealed a statistically significant impact of recording environment and microphone type on HNR and CPP. While statistically significant, the differences across the experimental conditions for a subset of the acoustic measures (viz., jitter and CPP) have shown differences that fell within their respective normative ranges. Conclusions Both microphone and recording setting resulted in significant differences across several acoustic measurements. However, a subset of the acoustic measures that were statistically significant across the recording conditions showed small overall differences that are unlikely to have clinical significance in interpretation. For these acoustic measures, the present data suggest that, although a sound-treated setting is ideal for voice sample collection, a smartphone microphone can capture acceptable recordings for acoustic signal analysis.

Publisher

American Speech Language Hearing Association

Subject

Speech and Hearing,Linguistics and Language,Language and Linguistics

Link

http://pubs.asha.org/doi/pdf/10.1044/2020_JSLHR-20-00212

Reference46 articles.

1. Evolution and current status of mhealth research: a systematic review

2. American Speech-Language-Hearing Association. (2002). Consensus Auditory-Perceptual Evaluation of Voice (CAPE-V). American Speech-Language-Hearing Association Special Interest Division 3.

3. Toward the development of an objective index of dysphonia severity: A four‐factor acoustic model

4. Outcomes Measurement in Voice Disorders: Application of an Acoustic Index of Dysphonia Severity

5. Common Practices of Voice Therapists in the Evaluation of Patients

Cited by 21 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Conducting high-quality and reliable acoustic analysis: A tutorial focused on training research assistants;The Journal of the Acoustical Society of America;2024-04-01

2. Classification research of TCM pulse conditions based on multi-label voice analysis;Journal of Traditional Chinese Medical Sciences;2024-04

3. VOT in English by bilinguals with 2L1s: different approaches to voiceless and voiced stops;Folia Linguistica;2024-03-08

4. Machine Learning-Assisted Speech Analysis for Early Detection of Parkinson’s Disease: A Study on Speaker Diarization and Classification Techniques;Sensors;2024-02-26

5. Assessment of Voice Disorders Using Machine Learning and Vocal Analysis of Voice Samples Recorded through Smartphones;BioMedInformatics;2024-02-19