1. Foundations of Linear and Generalized Linear Models;Agresti,2015
2. Don't count, predict! A systematic comparison of context-counting vs. context-predicting semantic vectors;Baroni,2014
3. A neural probabilistic language model;Bengio;Journal of Machine Learning Research,2003
4. Pattern Recognition and Machine Learning;Bishop,2006
5. Open sourcing BERT: state-of-the-art pre-training for natural language processing;Devlin,2018