1. Layer normalization;Ba,2016
2. Bahdanau, D., Cho, K., & Bengio, Y. (2015). Neural machine translation by jointly learning to align and translate. In 3rd international conference on learning representations, ICLR 2015.
3. End-to-end attention-based large vocabulary speech recognition;Bahdanau,2016
4. NLTK: The natural language toolkit;Bird,2006
5. Listen, attend and spell: A neural network for large vocabulary conversational speech recognition;Chan,2016