Discriminative Learning of Filterbank Layer within Deep Neural Network Based Speech Recognition for Speaker Adaptation-Reference-Cited by-同舟云学术

Discriminative Learning of Filterbank Layer within Deep Neural Network Based Speech Recognition for Speaker Adaptation

Published:2019-02-01 Issue:2 Volume:E102.D Page:364-374
ISSN:0916-8532
Container-title:IEICE Transactions on Information and Systems
language:en
Short-container-title:IEICE Trans. Inf. & Syst.

Author:

SEKI Hiroshi¹,YAMAMOTO Kazumasa²,AKIBA Tomoyosi¹,NAKAGAWA Seiichi¹²

Affiliation:

1. Department of Computer Science and Engineering, Toyohashi University of Technology

2. Department of Computer Science, Chubu University

Publisher

Institute of Electronics, Information and Communications Engineers (IEICE)

Subject

Artificial Intelligence,Electrical and Electronic Engineering,Computer Vision and Pattern Recognition,Hardware and Architecture,Software

Link

https://www.jstage.jst.go.jp/article/transinf/E102.D/2/E102.D_2018EDP7252/_pdf

Reference50 articles.

1. [1] G. Hinton, L. Deng, D. Yu, G.E. Dahl, A.-R. Mohamed, N. Jitaly, A. Senior, V. Vanhoucke, P. Nguyen, T.N. Sainath, and B. Kingsbury, “Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups,” IEEE Signal Process. Mag., vol.29, no.6, pp.82-97, 2012. 10.1109/msp.2012.2205597

2. [2] T.N. Sainath, R.J. Weiss, A. Senior, K.W. Wilson, and O. Vinyals, “Learning the speech front-end with raw waveform CLDNNs,” Proc. Interspeech, pp.1-5, 2015.

3. [3] H.B. Sailor and H.A. Patil, “Novel unsupervised auditory filterbank learning using convolutional RBM for speech recognition,” IEEE Trans. Audio, Speech, Language Process., vol.24, no.12, pp.2341-2353, 2016. 10.1109/taslp.2016.2607341

4. [4] Z. Zhu, J.H. Engel, and A. Hannun, “Learning multiscale features directly from waveforms,” Proc. Interspeech, pp.1305-1309, 2016. 10.21437/interspeech.2016-256

5. [5] Z. Chen, S. Watanabe, H. Erdogan, and J.R. Hershey, “Speech enhancement and recognition using multi-task learning of long short-term memory recurrent neural networks,” Proc. Interspeech, pp.3274-3278, 2015.

Cited by 6 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Speech Recognition Utilizing Deep Learning: A Systematic Review of the Latest Developments;HUM-CENT COMPUT INFO;2024

2. DNN controlled adaptive front-end for replay attack detection systems;Speech Communication;2023-10

3. Study on recognition and classification of English accents using deep learning algorithms;Journal of Intelligent Systems;2023-01-01

4. Low-Level Physiological Implications of End-to-End Learning for Speech Recognition;Interspeech 2022;2022-09-18

5. Partial Label Learning Based on Fully Connected Deep Neural Network;International Journal of Circuits, Systems and Signal Processing;2022-01-12