1. Towards end-to-end speech recognition with recurrent neural networks;Graves
2. Attention-based models for speech recognition;Chorowski
3. Listen, attend and spell: A neural network for large vocabulary conversational speech recognition
4. Sequence to sequence learning with neural networks;Sutskever
5. Neural machine translation by jointly learning to align and translate;Bahdanau