1. Scikit-learn: Machine learning in Python;pedregosa;J Mach Learn Res,2012
2. RoBERTa: A robustly optimized BERT pretraining approach;liu;arXiv 1907 11692,2019
3. DistilBERT, a distilled version of BERT: Smaller, faster, cheaper and lighter;sanh;arXiv 1910 01108,2019
4. BERT: Pre-training of deep bidirectional transformers for language understanding;devlin;Proc Conf North Amer Chapter Assoc Comput Linguistics Hum Lang Technol (NAACL-HLT),2019