1. On Losses for Modern Language Models
2. Elias Bassani . 2022. ranx: A Blazing-Fast Python Library for Ranking Evaluation and Comparison . In ECIR (2) (Lecture Notes in Computer Science , Vol. 13186). Springer, 259-- 264 . Elias Bassani. 2022. ranx: A Blazing-Fast Python Library for Ranking Evaluation and Comparison. In ECIR (2) (Lecture Notes in Computer Science, Vol. 13186). Springer, 259--264.
3. J. M. Bland and D. G. Altman . 1995. Multiple significance tests: the Bonferroni method . BMJ , Vol. 310 , 6973 ( Jan. 1995 ), 170. http://bmj.bmjjournals.com/cgi/content/full/310/6973/170 J. M. Bland and D. G. Altman. 1995. Multiple significance tests: the Bonferroni method. BMJ, Vol. 310, 6973 (Jan. 1995), 170. http://bmj.bmjjournals.com/cgi/content/full/310/6973/170
4. Wei-Cheng Chang , Felix X. Yu , Yin-Wen Chang , Yiming Yang , and Sanjiv Kumar . 2020 . Pre-training Tasks for Embedding-based Large-scale Retrieval. In International Conference on Learning Representations. https://openreview.net/forum?id=rkg-mA4FDr Wei-Cheng Chang, Felix X. Yu, Yin-Wen Chang, Yiming Yang, and Sanjiv Kumar. 2020. Pre-training Tasks for Embedding-based Large-scale Retrieval. In International Conference on Learning Representations. https://openreview.net/forum?id=rkg-mA4FDr
5. Jacob Devlin , Ming-Wei Chang , Kenton Lee , and Kristina Toutanova . 2019 . BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding . In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies , Volume 1 (Long and Short Papers). Association for Computational Linguistics, Minneapolis, Minnesota, 4171--4186. https://doi.org/10. 18653/v1/N19--1423 10.18653/v1 Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). Association for Computational Linguistics, Minneapolis, Minnesota, 4171--4186. https://doi.org/10.18653/v1/N19--1423