1. Turing, A. M. (1950). Computing machinery and intelligence. Mind. Vol. 59, no. 236. P. 433–460.
2. Attention is all you need / A. Vaswani, N. Shazeer, N. Parmar [et al.]. ArXiv.org, August 2. Available at: https://arxiv.org/abs/1706.03762 (accessed: 16.08.2023). arXiv 1706.03762v7. DOI 10.48550/arXiv.1706.03762.
3. Sutskever, I., Vinyals, O. and Le, Q. V. (2014). Sequence to sequence learning with neural networks. ArXiv.org, December 14. Available at: https://arxiv.org/abs/1409.3215 (accessed: 16.08.2023). arXiv 1409.3215v3. DOI 10.48550/arXiv.1409.3215.
4. Devlin, J., Chang, M.-W., Lee, K. and Toutanova, K. (2019). BERT: Pre-training of deep bidirectional transformers for language understanding. ArXiv.org, May 24. Available at: https://arxiv.org/abs/1810.04805 (accessed: 11.08.2023). arXiv 1810.04805v2. DOI 10.48550/arXiv.1810.04805.
5. Burton, W. K., Cabrera, N. and Frank, F. C. (1951). The growth of crystals and the equilibrium structure of their surfaces. Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences. Vol. 243, no. 866. P. 299–358. DOI 10.1098/rsta.1951.0006.