1. A neural probabilistic language model;Bengio;J. Mach. Learn. Res. (JMLR),2003
2. Distributional memory explainable word embeddings in continuous space;Snidaro,2019
3. Efficient estimation of word representations in vector space;Mikolov,2013
4. GloVe: Global vectors for word representation;Pennington,2014
5. Enriching word vectors with subword information;Bojanowski;Trans. Assoc. Comput. Linguist. (TACL),2017