1. Distributed representations of words and phrases and their compositionality;Mikolov,2013
2. P. Bojanowski, E. Grave, A. Joulin, T. Mikolov, Enriching word vectors with subword information, arXiv preprint arXiv:1607.04606.
3. Deep contextualized word representations;Peters,2018
4. Universal language model fine-tuning for text classification;Howard,2018
5. B. McCann, J. Bradbury, C. Xiong, R. Socher, Learned in translation: Contextualized word vectors., in: I. Guyon, U. von Luxburg, S. Bengio, H.M. Wallach, R. Fergus, S.V.N. Vishwanathan, R. Garnett (Eds.), NIPS, 2017, pp. 6297–6308. URL:http://dblp.uni-trier.de/db/conf/nips/nips2017.html#McCannBXS17.