1. Character-level language modeling with deeper self-attention;Al-Rfou,2019
2. Layer normalization;Ba;CoRR.,2016
3. LSTM language models for LVCSR in first-pass decoding and lattice-rescoring;Beck;CoRR.,2019
4. A neural probabilistic language model;Bengio;J. Mach. Learn. Res.,2003
5. Recurrent neural network language model adaptation for multi-genre broadcast speech recognition;Chen,2015