Authors:
Li Chenchen, Shi Linfeng, Zhou Chunyi, Huan Zhaoxin, Tang Chengfu, Zhang Xiaolu, Wang Xudong, Zhou Jun, Liu Song
Publisher:
Springer Nature Switzerland
References (20 articles)
1. Berry, K.J., Mielke, P.W., Jr.: Spearman’s footrule as a measure of agreement. Psychol. Rep. 80(3), 839–846 (1997)
2. Bommasani, R., et al.: On the opportunities and risks of foundation models. arXiv preprint arXiv:2108.07258 (2021)
3. Chang, Y., et al.: A survey on evaluation of large language models. ACM Trans. Intell. Syst. Technol. 15, 1–45 (2023)
4. OpenCompass Contributors: OpenCompass: a universal evaluation platform for foundation models (2023). https://github.com/open-compass/opencompass
5. Elo, A.E.: The proposed USCF rating system, its development, theory, and applications. Chess Life 22(8), 242–247 (1967)