1. Distilbert, a distilled version of bert: smaller, faster, cheaper and lighter;sanh,2019
2. Bert: Pre-training of deep bidirectional transformers for language understanding;devlin,2018
3. What Supercomputers Say: A Study of Five System Logs
4. Term-weighting approaches in automatic text retrieval
5. Fasttext. zip: Compressing text classification models;joulin,2016