1. Albert: A lite bert for self-supervised learning of language representations;lan;arXiv preprint arXiv 1909 11324,2019
2. Distilling the knowledge in a neural network;hinton;arXiv preprint arXiv 1503 02531,2015
3. The Probabilistic Relevance Framework: BM25 and Beyond
4. Know what you don’t know: Unanswerable questions for squad;rajpurkar;arXiv preprint arXiv 1806 03822,2018
5. Bleu: a method for automatic evaluation of machine translation;papineni;Proceedings of the 40th Annual Meeting on Association for Computational Linguistics - ACL '02,2002