NeuroVAD: Real-Time Voice Activity Detection from Non-Invasive Neuromagnetic Signals-Reference-Cited by-同舟云学术

NeuroVAD: Real-Time Voice Activity Detection from Non-Invasive Neuromagnetic Signals

Published:2020-04-16 Issue:8 Volume:20 Page:2248
ISSN:1424-8220
Container-title:Sensors
language:en
Short-container-title:Sensors

Author:

Dash Debadatta^ORCID,Ferrari Paul^ORCID,Dutta Satwik^ORCID,Wang Jun

Abstract

Neural speech decoding-driven brain-computer interface (BCI) or speech-BCI is a novel paradigm for exploring communication restoration for locked-in (fully paralyzed but aware) patients. Speech-BCIs aim to map a direct transformation from neural signals to text or speech, which has the potential for a higher communication rate than the current BCIs. Although recent progress has demonstrated the potential of speech-BCIs from either invasive or non-invasive neural signals, the majority of the systems developed so far still assume knowing the onset and offset of the speech utterances within the continuous neural recordings. This lack of real-time voice/speech activity detection (VAD) is a current obstacle for future applications of neural speech decoding wherein BCI users can have a continuous conversation with other speakers. To address this issue, in this study, we attempted to automatically detect the voice/speech activity directly from the neural signals recorded using magnetoencephalography (MEG). First, we classified the whole segments of pre-speech, speech, and post-speech in the neural signals using a support vector machine (SVM). Second, for continuous prediction, we used a long short-term memory-recurrent neural network (LSTM-RNN) to efficiently decode the voice activity at each time point via its sequential pattern-learning mechanism. Experimental results demonstrated the possibility of real-time VAD directly from the non-invasive neural signals with about 88% accuracy.

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/20/8/2248/pdf

Reference39 articles.

1. The locked-in syndrome: What is it like to be conscious but paralyzed and voiceless?;Laureys;Prog. Brain Res.,2005

2. Brain–computer interfaces for speech communication

3. Brain–computer interfaces for communication and control

4. Brain–computer-interface research: Coming of age

5. "Who" Is Saying "What"? Brain-Based Decoding of Human Voice and Speech

Cited by 17 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Simulation of optical image detection based on language activity detection algorithm in piano network teaching system;Optical and Quantum Electronics;2023-12-13

2. Recommendations for promoting user agency in the design of speech neuroprostheses;Frontiers in Human Neuroscience;2023-10-18

3. State-of-the-Art on Brain-Computer Interface Technology;Sensors;2023-06-28

4. Voice activity detection for piano online teaching based on digital network system;2023-06-05

5. Unmuting lucid dreams: Speech decoding and vocalization in real time.;Psychology of Consciousness: Theory, Research, and Practice;2023-03-13