1. Vaswani A, et al. Attention is all you need. In: Guyon I, et al., editors. Advances in neural information processing systems, vol. 30. New York: Curran Associates Inc; 2017.
2. Oord A, et al. WaveNet: a generative model for raw audio. 2016; arXiv preprint arXiv:1609.03499.
3. Devlin J, Chang M-W, Lee K, Toutanova K. BERT: pre-training of deep bidirectional transformers for language understanding. 2018; arXiv preprint arXiv:1810.04805.
4. Douglas RJ, Martin KA. Recurrent neuronal circuits in the neocortex. Curr Biol. 2007;17(13):R496–500.
5. Lukoševičius M, Jaeger H. Reservoir computing approaches to recurrent neural network training. Comput Sci Rev. 2009;3(3):127–49.