1. Jelinek, F.: Statistical methods for speech recognition. MIT press (1997)
2. Jelinek, F.: Interpolated estimation of markov source parameters from sparse data. Pattern Recognition in Practice, 381–397 (1980)
3. Katz, S.: Estimation of probabilities from sparse data for the language model component of a speech recognizer. IEEE Transactions on Acoustics, Speech and Signal Processing 35, 400–401 (1987)
4. Kneser, R., Ney, H.: Improved clustering techniques for class-based statistical language modelling. In: Third European Conference on Speech Communication and Technology (1993)
5. Kneser, R., Ney, H.: Improved backing-off for m-gram language modeling. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1995, vol. 1, pp. 181–184 (1995)