1. DARPA TIMIT acoustic-phonetic continous speech corpus CD-ROM. NIST speech disc 1-1.1;Garofolo John S;NASA STI/Recon technical report n,1993
2. "Hidden Markov models for speech recognition;Juang Biing Hwang;Technometrics,1991
3. Graves , Alex , "Connectionist temporal classification : labelling unsegmented sequence data with recurrent neural networks." Proceedings of the 23rd international conference on Machine learning . 2006 . Graves, Alex, "Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks." Proceedings of the 23rd international conference on Machine learning. 2006.
4. Chan , William , " Listen , attend and spell: A neural network for large vocabulary conversational speech recognition." 2016 IEEE international conference on acoustics, speech and signal processing (ICASSP) . IEEE , 2016 . Chan, William, "Listen, attend and spell: A neural network for large vocabulary conversational speech recognition." 2016 IEEE international conference on acoustics, speech and signal processing (ICASSP). IEEE, 2016.
5. Molau , Sirko , " Computing mel-frequency cepstral coefficients on the power spectrum." 2001 IEEE international conference on acoustics, speech, and signal processing . Proceedings (cat. No. 01CH37221) . Vol. 1 . IEEE, 2001 . Molau, Sirko, "Computing mel-frequency cepstral coefficients on the power spectrum." 2001 IEEE international conference on acoustics, speech, and signal processing. Proceedings (cat. No. 01CH37221). Vol. 1. IEEE, 2001.