1. Weston J, Chopra S, Bordes A (2015) Memory networks. In: 3rd International conference on learning representations, ICLR 2015, conference track proceedings
2. Sukhbaatar S, Szlam A, Weston J, Fergus R (2015) End-to-end memory networks. In: Advances in neural information processing systems
3. Daniluk M, Rocktäschel T, Welbl J, Riedel S (2017) Frustratingly short attention spans in neural language modeling. CoRR abs/1702.0
4. Vaswani A, Shazeer N, Parmar N et al (2017) Attention is all you need. In: Advances in neural information processing systems. pp 5999–6009
5. Radford A, Narasimhan K, Salimans T, Sutskever I (2018) Improving language understanding by generative pre-training. Technical report, OpenAI