Prediction of Voice Fundamental Frequency and Intensity from Surface Electromyographic Signals of the Face and Neck-Reference-Cited by-同舟云学术

Prediction of Voice Fundamental Frequency and Intensity from Surface Electromyographic Signals of the Face and Neck

Published:2022-10-13 Issue:4 Volume:5 Page:692-710
ISSN:2571-631X
Container-title:Vibration
language:en
Short-container-title:Vibration

Author:

Vojtech Jennifer M.^ORCID,Mitchell Claire L.,Raiff Laura,Kline Joshua C.,De Luca Gianluca

Abstract

Silent speech interfaces (SSIs) enable speech recognition and synthesis in the absence of an acoustic signal. Yet, the archetypal SSI fails to convey the expressive attributes of prosody such as pitch and loudness, leading to lexical ambiguities. The aim of this study was to determine the efficacy of using surface electromyography (sEMG) as an approach for predicting continuous acoustic estimates of prosody. Ten participants performed a series of vocal tasks including sustained vowels, phrases, and monologues while acoustic data was recorded simultaneously with sEMG activity from muscles of the face and neck. A battery of time-, frequency-, and cepstral-domain features extracted from the sEMG signals were used to train deep regression neural networks to predict fundamental frequency and intensity contours from the acoustic signals. We achieved an average accuracy of 0.01 ST and precision of 0.56 ST for the estimation of fundamental frequency, and an average accuracy of 0.21 dB SPL and precision of 3.25 dB SPL for the estimation of intensity. This work highlights the importance of using sEMG as an alternative means of detecting prosody and shows promise for improving SSIs in future development.

Funder

National Institutes of Health

The De Luca Foundation

Publisher

MDPI AG

Subject

General Medicine

Link

https://www.mdpi.com/2571-631X/5/4/41/pdf

Reference86 articles.

1. Mental disorders and psychosocial support during the first year after total laryngectomy: A prospective cohort study

2. Long-term Quality of Life After Treatment of Laryngeal Cancer

3. Self-expression and identity after total laryngectomy: Implications for support

4. The impact of speech disorders quality of life: a questionnaire proposal

5. Crowded minds: The implicit bystander effect.

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. The Characterization of Normal Male and Female Voice from Surface Electromyographic Parameters;Journal of Personalized Medicine;2024-06-01

2. Intelligent Pathological Voice Detection Based on Social Media Application using Conditional Random Field Contrasted and Support Vector Machine Calculation;2023 International Conference on Data Science, Agents & Artificial Intelligence (ICDSAAI);2023-12-21