1. Efficient estimation of word representations in vector space;mikolov;ArXiv Preprint,2013
2. A neural probabilistic language model;bengio;J Machine Learning Research,2003
3. Semi-supervised classification with graph convolutional networks;kipf;ArXiv Preprint,2016
4. A simple but tough-to-beat baseline for sentence embeddings;arora;Proc Int Conf Learning Representations (ICLR),2017
5. Schema induction and analogical transfer