1. Neural machine translation by jointly learning to align and translate;bahdanau;arXiv 1409 0473,2014
2. Learning Visual Question Answering by Bootstrapping Hard Attention
3. Sequence to sequence learning with neural networks;sutskever;Proc Adv Neural Inf Process Syst,2014
4. Latent alignment and variational attention;deng;Proc Adv Neural Inf Process Syst,2018
5. Attention is all you need;vaswani;Proc Adv Neural Inf Process Syst,2017