1. D. Bahdanau, et al., Neural machine translation by jointly learning to align and translate, arXiv, 2014.
2. A. Vaswani, N. Shazeer, N. Parmar, et al., Attention is all you need, Proceedings of the 31st International Conference on Neural Information Processing Systems, December 4–9, 2017, Long Beach, USA, pp. 6000–6010.
3. T. Lin, et al., A survey of transformers, AI Open, 2022.
4. N. Parmar, A. Vaswani, J. Uszkoreit, et al., Image transformer, Proceedings of the 35th International Conference on Machine Learning, July 10–15, 2018, Stockholm, Sweden, PMLR 80, pp. 4055–4064.
5. P. Schwaller, et al., Molecular transformer: A model for uncertainty-calibrated chemical reaction prediction, ACS Cent. Sci., 2019.