A novel silent speech recognition approach based on parallel inception convolutional neural network and Mel frequency spectral coefficient-Reference-Cited by-同舟云学术

A novel silent speech recognition approach based on parallel inception convolutional neural network and Mel frequency spectral coefficient

Published:2022-09-02 Issue: Volume:16 Page:
ISSN:1662-5218
Container-title:Frontiers in Neurorobotics
language:
Short-container-title:Front. Neurorobot.

Author:

Wu Jinghan,Zhang Yakun,Xie Liang,Yan Ye,Zhang Xu,Liu Shuang,An Xingwei,Yin Erwei,Ming Dong

Abstract

Silent speech recognition breaks the limitations of automatic speech recognition when acoustic signals cannot be produced or captured clearly, but still has a long way to go before being ready for any real-life applications. To address this issue, we propose a novel silent speech recognition framework based on surface electromyography (sEMG) signals. In our approach, a new deep learning architecture Parallel Inception Convolutional Neural Network (PICNN) is proposed and implemented in our silent speech recognition system, with six inception modules processing six channels of sEMG data, separately and simultaneously. Meanwhile, Mel Frequency Spectral Coefficients (MFSCs) are employed to extract speech-related sEMG features for the first time. We further design and generate a 100-class dataset containing daily life assistance demands for the elderly and disabled individuals. The experimental results obtained from 28 subjects confirm that our silent speech recognition method outperforms state-of-the-art machine learning algorithms and deep learning architectures, achieving the best recognition accuracy of 90.76%. With sEMG data collected from four new subjects, efficient steps of subject-based transfer learning are conducted to further improve the cross-subject recognition ability of the proposed model. Promising results prove that our sEMG-based silent speech recognition system could have high recognition accuracy and steady performance in practical applications.

Funder

National Natural Science Foundation of China

Publisher

Frontiers Media SA

Subject

Artificial Intelligence,Biomedical Engineering

Reference63 articles.

1. Deep learning with convolutional neural networks applied to electromyography data: a resource for the classification of movements for prosthetic hands;Atzori;Front Neurorobot,2016

2. A maximum likelihood approach to continuous speech recognition;Bahl;IEEE Trans. Pattern Anal. Mach. Intell,1983

3. “Recognition and real time performances of a lightweight ultrasound based silent speech interface employing a language model,”;Cai,2011

4. “Xception: deep learning with depthwise separable convolutions,”;Chollet,2017

5. Surface electromyography signal processing and classification techniques;Chowdhury;Sensors

Cited by 8 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Speech synthesis from three-axis accelerometer signals using conformer-based deep neural network;Computers in Biology and Medicine;2024-11

2. A simplified adversarial architecture for cross-subject silent speech recognition using electromyography;Journal of Neural Engineering;2024-09-03

3. Design and implementation of a silent speech recognition system based on sEMG signals: A neural network approach;Biomedical Signal Processing and Control;2024-06

4. Artificial intelligence in head and neck surgery: Potential applications and future perspectives;Journal of Surgical Oncology;2024-02-28

5. Mordo2: A Personalization Framework for Silent Command Recognition;IEEE Transactions on Neural Systems and Rehabilitation Engineering;2024