1. Juntae Kim, Jaeseok Kim, Seunghyung Lee, Jinuk Park, Minsoo Hahn, “Vowel based voice activity detection with LSTM recurrent neural network,” Proc. of 8th Int. Conf. on Signal Processing Systems, 21–24 Nov. 2016, Auckland, New Zealand (ACM, NY, 2016). DOI: 10.1145/3015166.3015207.
2. A. Benyassine, E. Shlomot, H.-Y. Su, D. Massaloux, C. Lamblin, J.-P. Petit, “ITU-T Recommendation G.729 Annex B: a silence compression scheme for use with G.729 optimized for V.70 digital simultaneous voice and data applications,” IEEE Commun. Mag.
35, No. 9, 64 (1997). DOI: 10.1109/35.620527.
3. L. Karray, A. Martin, “Towards improving speech detection robustness for speech recognition in adverse conditions,” Speech Commun.
40, No. 3, 261 (2003). DOI: 10.1016/S0167-6393(02)00066-3.
4. J. Alam, P. Kenny, P. Ouellet, T. Stafylakis, P. Dumouchel, “Supervised/unsupervised voice activity detectors for text-dependent speaker recognition on the RSR2015 corpus,” Proc. of Odyssey 2014: The Speaker and Language Recognition Workshop, 16–19 June 2014, Joensuu, Finland (Joensuu, 2014), pp. 123–130.
5. S. Graf, T. Herbig, M. Buck, G. Schmidt, “Features for voice activity detection: a comparative analysis,” EURASIP J. Advances Signal Processing
2015, 91 (2015). DOI: 10.1186/s13634-015-0277-z.