Publisher
Springer Nature Switzerland