1. BERT: Pre-training of deep bidirectional transformers for language understanding;Devlin;arXiv preprint,2018
2. Language models are few-shot learners;Brown;Advances in neural information processing systems,2020
3. Supervised machine learning: A review of classification techniques;Kotsiantis;Emerging artificial intelligence applications in computer engineering,2007
4. A neural probabilistic language model;Bengio;Advances in neural information processing systems,2000