Silent speech command word recognition using stepped frequency continuous wave radar-Reference-Cited by-同舟云学术

Silent speech command word recognition using stepped frequency continuous wave radar

Published:2022-03-09 Issue:1 Volume:12 Page:
ISSN:2045-2322
Container-title:Scientific Reports
language:en
Short-container-title:Sci Rep

Author:

Wagner Christoph,Schaffer Petr,Amini Digehsara Pouriya,Bärhold Michael,Plettemeier Dirk,Birkholz Peter

Abstract

AbstractRecovering speech in the absence of the acoustic speech signal itself, i.e., silent speech, holds great potential for restoring or enhancing oral communication in those who lost it. Radar is a relatively unexplored silent speech sensing modality, even though it has the advantage of being fully non-invasive. We therefore built a custom stepped frequency continuous wave radar hardware to measure the changes in the transmission spectra during speech between three antennas, located on both cheeks and the chin with a measurement update rate of 100 Hz. We then recorded a command word corpus of 40 phonetically balanced, two-syllable German words and the German digits zero to nine for two individual speakers and evaluated both the speaker-dependent multi-session and inter-session recognition accuracies on this 50-word corpus using a bidirectional long-short term memory network. We obtained recognition accuracies of 99.17% and 88.87% for the speaker-dependent multi-session and inter-session accuracy, respectively. These results show that the transmission spectra are very well suited to discriminate individual words from one another, even across different sessions, which is one of the key challenges for fully non-invasive silent speech interfaces.

Funder

Sächsische Aufbaubank

Technische Universität Dresden

Publisher

Springer Science and Business Media LLC

Subject

Multidisciplinary

Link

https://www.nature.com/articles/s41598-022-07842-9.pdf

Reference44 articles.

1. Gonzalez-Lopez, J. A. et al. Silent speech interfaces for speech restoration: A review. IEEE Access 8, 177995–178021. https://doi.org/10.1109/ACCESS.2020.3026579 (2020).

2. Schultz, T. et al. Biosignal-based spoken communication: A survey. IEEE/ACM Trans. Audio Speech Lang. Process. 25, 2257–2271. https://doi.org/10.1109/TASLP.2017.2752365 (2017).