1. Lightweight adaptive mixture of neural and n-gram language models;Bakhtin;arXiv preprint arXiv:1804.07705,2018
2. Nathanael Chambers and Dan Jurafsky. “Unsupervised learning of narrative event chains”. In: Proceedings of ACL-08: HLT. 2008, pp. 789–797.
3. Learning phrase representations using RNN encoder-decoder for statistical machine translation;Cho;arXiv preprint arXiv:1406.1078,2014
4. Xiao Ding et al. “Knowledge-driven event embedding for stock prediction”. In: Proceedings of coling 2016, the 26th international conference on computational linguistics: Technical papers. 2016, pp. 2133–2142.
5. Model-agnostic meta-learning for fast adaptation of deep networks;Finn,2017