1. A neural probabilistic language model;Bengio;J. Mach. Learn. Res.,2003
2. An empirical study of smoothing techniques for language modeling;Chen,1996
3. R. Collobert, Deep learning for efficient discriminative parsing, in: International Conference on Artificial Intelligence and Statistics, 2011, pp. 224–232.
4. A. Conneau, H. Schwenk, L. Barrault, Y. LeCun, Very deep convolutional networks for natural language processing, 2016.
5. Y.N. Dauphin, A. Fan, M. Auli, D. Grangier, Language modeling with gated convolutional networks, 2016.