Ultra-Low-Power Voice Activity Detection System Using Level-Crossing Sampling-Reference-Cited by-同舟云学术

Ultra-Low-Power Voice Activity Detection System Using Level-Crossing Sampling

Published:2023-02-05 Issue:4 Volume:12 Page:795
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Faghani Maral¹,Rezaee-Dehsorkh Hamidreza¹^ORCID,Ravanshad Nassim¹,Aminzadeh Hamed²^ORCID

Affiliation:

1. Department of Electrical Engineering, Sadjad University, Mashhad 9188148848, Iran

2. Department of Electrical Engineering, Payame Noor University (PNU), Tehran P.O. Box 19395-4697, Iran

Abstract

This paper presents an ultra-low-power voice activity detection (VAD) system to discriminate speech from non-speech parts of audio signals. The proposed VAD system uses level-crossing sampling for voice activity detection. The useless samples in the non-speech parts of the signal are eliminated due to the activity-dependent nature of this sampling scheme. A 40 ms moving window with a 30 ms overlap is exploited as a feature extraction block, within which the output samples of the level-crossing analog-to-digital converter (LC-ADC) are counted as the feature. The only variable used to distinguish speech and non-speech segments in the audio input signal is the number of LC-ADC output samples within a time window. The proposed system achieves an average of 91.02% speech hit rate and 82.64% non-speech hit rate over 12 noise types at −5, 0, 5, and 10 dB signal-to-noise ratios (SNR) over the TIMIT database. The proposed system including LC-ADC, feature extraction, and classification circuits was designed in 0.18 µm CMOS technology. Post-layout simulation results show a power consumption of 394.6 nW with a silicon area of 0.044 mm2, which makes it suitable as an always-on device in an automatic speech recognition system.

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering

Link

https://www.mdpi.com/2079-9292/12/4/795/pdf

Reference34 articles.

1. A 2.3 nJ/frame Voice Activity Detector-Based Audio Front-End for Context-Aware System-On-Chip Applications in 32-nm CMOS;Raychowdhury;IEEE J. Solid-State Circuits,2013

2. A Low-Power Speech Recognizer and Voice Activity Detector Using Deep Neural Networks;Price;IEEE J. Solid-State Circuits,2018

3. A 90 nm CMOS, 6 mW Power-Proportional Acoustic Sensing Frontend for Voice Activity Detection;Badami;IEEE J. Solid-State Circuits,2016

4. An Acoustic Signal Processing Chip with 142-nW Voice Activity Detection Using Mixer-Based Sequential Frequency Scanning and Neural Network Classification;Oh;IEEE J. Solid-State Circuits,2019

5. Voice Activity Detection Using Generalized Exponential Kernels for Time and Frequency Domains;Soares;IEEE Trans. Circuits Syst. I Regul. Pap.,2019

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A multimodal teacher speech emotion recognition method in the smart classroom;Internet of Things;2024-04

2. PA2BLO: Low-Power, Personalized Audio Badge;2024 IEEE International Conference on Pervasive Computing and Communications (PerCom);2024-03-11

3. A Reconfigurable Hybrid ADC Using a Jump Search Algorithm;Electronics;2024-02-01

4. A 34.7 µW Speech Keyword Spotting IC Based on Subband Energy Feature Extraction;Electronics;2023-07-31

5. Speech Enhancement Performance Based on the MANNER Network Using Feature Fusion;Electronics;2023-04-07