1. A.Y. Hannun, C. Case, J. Casper, B. Catanzaro, G. Diamos, E. Elsen, R. Prenger, S. Satheesh, S. Sengupta, A. Coates, A.Y. Ng, Deep speech: Scaling up end-to-end speech recognition, CoRR abs/1412.5567. arXiv:1412.5567. URL http://arxiv.org/abs/1412.5567.
2. D. Amodei, et al., Deep speech 2: End-to-end speech recognition in English and Mandarin, 2016.
3. Y. Wu, M. Schuster, Z. Chen, Q.V. Le, M. Norouzi, W. Macherey, M. Krikun, Y. Cao, Q. Gao, K. Macherey, J. Klingner, A. Shah, M. Johnson, X. Liu, L. Kaiser, S. Gouws, Y. Kato, T. Kudo, H. Kazawa, K. Stevens, G. Kurian, N. Patil, W. Wang, C. Young, J. Smith, J. Riesa, A. Rudnick, O. Vinyals, G. Corrado, M. Hughes, J. Dean, Google’s neural machine translation system: Bridging the gap between human and machine translation, CoRR abs/1609.08144. arXiv:1609.08144. URL http://arxiv.org/abs/1609.08144.
4. S. Hochreiter, J. Schmidhuber, Long short-term memory, Neural Computation 9 (1997).
5. S. Han, et al., ESE: Efficient speech recognition engine with sparse LSTM on FPGA, 2017.