Discrimination between Modal, Breathy and Pressed Voice for Single Vowels Using Neck-Surface Vibration Signals-Reference-Cited by-同舟云学术

Discrimination between Modal, Breathy and Pressed Voice for Single Vowels Using Neck-Surface Vibration Signals

Published:2019-04-11 Issue:7 Volume:9 Page:1505
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Lei Zhengdong,Kennedy Evan,Fasanella Laura,Li-Jessen Nicole Yee-Key,Mongeau Luc^ORCID

Abstract

The purpose of this study was to investigate the feasibility of using neck-surface acceleration signals to discriminate between modal, breathy and pressed voice. Voice data for five English single vowels were collected from 31 female native Canadian English speakers using a portable Neck Surface Accelerometer (NSA) and a condenser microphone. Firstly, auditory-perceptual ratings were conducted by five clinically-certificated Speech Language Pathologists (SLPs) to categorize voice type using the audio recordings. Intra- and inter-rater analyses were used to determine the SLPs’ reliability for the perceptual categorization task. Mixed-type samples were screened out, and congruent samples were kept for the subsequent classification task. Secondly, features such as spectral harmonics, jitter, shimmer and spectral entropy were extracted from the NSA data. Supervised learning algorithms were used to map feature vectors to voice type categories. A feature wrapper strategy was used to evaluate the contribution of each feature or feature combinations to the classification between different voice types. The results showed that the highest classification accuracy on a full set was 82.5%. The breathy voice classification accuracy was notably greater (approximately 12%) than those of the other two voice types. Shimmer and spectral entropy were the best correlated metrics for the classification accuracy.

Funder

Foundation for the National Institutes of Health

Canadian Institutes of Health Research

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science

Link

https://www.mdpi.com/2076-3417/9/7/1505/pdf

Reference36 articles.

1. Perception of Voice Quality;Kreiman,2008

2. Vocal quality factors: Analysis, synthesis, and perception

3. Evidence for Distinguishing Pressed, Normal, Resonant, and Breathy Voice Qualities by Laryngeal Resistance and Vocal Efficiency in Vocally Trained Subjects

4. Perceptual Evaluation of Voice Quality

5. Consensus Auditory-Perceptual Evaluation of Voice: Development of a Standardized Clinical Protocol

Cited by 12 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Investigation of self-supervised pre-trained models for classification of voice quality from speech and neck surface accelerometer signals;Computer Speech & Language;2024-01

2. Convolutional Neural Networks for Classification of Voice Qualities from Speech and Neck Surface Accelerometer Signals;Interspeech 2022;2022-09-18

3. Wearable Neck Surface Accelerometers for Occupational Vocal Health Monitoring: Instrument and Analysis Validation Study;JMIR Formative Research;2022-08-05

4. Efficient and Explainable Deep Neural Networks for Airway Symptom Detection in Support of Wearable Health Technology;Advanced Intelligent Systems;2022-05-17

5. Efficient and Explainable Deep Neural Networks for Airway Symptom Detection in Support of Wearable Health Technology;2021-12-30