1. Katz, S.M.: Estimation of Probabilities from Sparse Data for the Language Model Component of a Speech Recognizer. IEEE Trans. ASSP 35(3), 400–401 (1987)
2. Eseen, H.N., Kneser, R.: On Structuring Probabilistic Dependencies in Stochastic Language Modeling. Computer, Speech, and Language 8, 1–38 (1994)
3. Kneser, R., Ney, H.: Improved Backing-off for M-gram Language Modeling. In: Proc. of ICASSP, vol. 1, pp. 181–184 (1995)
4. Chen, S.F., Goodman, J.: An Empirical Study of Smoothing Techniques for Language Modeling. Technical Report TR-10-98, Harvard University Center for Research in Computing Technology (1998)
5. Chelba, C., Acero, A.: Discriminative Training of N-gram Classifier for Speech and Text Routing. In: Proc. of Eurospeech, pp. 1–4 (2003)