1. Probabilistic Grammars and their Applications;Johnson;Int. Encycl. Soc. Behav. Sci.,2001
2. Attention in Natural Language Processing;Galassi;IEEE Trans. Neural Netw. Learn. Syst.,2020
3. An attentive survey of attention models;Chaudhari;ACM Trans. Intell. Syst. Technol.,2021
4. Neural machine translation by jointly learning to align and translate;Bahdanau;arXiv preprint arXiv:1409.,2014
5. Context-aware neural machine translation learns anaphora resolution;Voita;arXiv preprint arXiv:1805.,2018