A comprehensive empirical review of modern voice activity detection approaches for movies and TV shows

Published: 2022-07
Volume: 494
Pages: 116-131
ISSN: 0925-2312
Container-title: Neurocomputing
Language: en

Author:

Sharma Mayank, Joshi Sandeep, Chatterjee Tamojit, Hamid Raffay

Publisher

Elsevier BV

Subject

Artificial Intelligence, Cognitive Neuroscience, Computer Science Applications

References: 95 articles.

1. Robust speech activity detection in movie audio: Data resources and experimental evaluation;Hebbar,2019

2. S. Chaudhuri, J. Roth, D.P. Ellis, A. Gallagher, L. Kaver, R. Marvin, C. Pantofaru, N. Reale, L.G. Reid, K. Wilson, et al., AVA-Speech: A densely labeled dataset of speech activity in movies, arXiv preprint arXiv:1808.00606.

3. R. Zazo, T.N. Sainath, G. Simko, C. Parada, Feature learning with raw-waveform CLDNNs for voice activity detection, in: Interspeech 2016, 17th Annual Conference of the International Speech Communication Association, San Francisco, CA, USA, September 8–12, 2016, pp. 3668–3672. doi:10.21437/Interspeech.2016-268.

4. A neural network approach to audio-assisted movie dialogue detection;Kotti;Neurocomputing,2007

5. Multi-scale and single-scale fully convolutional networks for sound event detection;Wang;Neurocomputing,2021

Cited by 11 articles.

1. Enabling Hands-Free Voice Assistant Activation on Earphones;Proceedings of the 22nd Annual International Conference on Mobile Systems, Applications and Services;2024-06-03

2. Investigating conversational dynamics in triads: Effects of noise, hearing impairment, and hearing aids;Frontiers in Psychology;2024-04-12

3. Robust voice activity detection using an auditory-inspired masked modulation encoder based convolutional attention network;Speech Communication;2024-02

4. Effects of Training and Calibration Data on Surface Electromyogram-Based Recognition for Upper Limb Amputees;Sensors;2024-01-31

5. Active Speaker Detection Using Audio, Visual, and Depth Modalities: A Survey;IEEE Access;2024