Affiliation:
1. Educational Testing Service
2. Duolingo, Inc.
3. National Board of Medical Examiners
4. Cambium Assessment, Inc.
5. Boston College
6. University of Iowa
Abstract
AbstractThe remarkable strides in artificial intelligence (AI), exemplified by ChatGPT, have unveiled a wealth of opportunities and challenges in assessment. Applying cutting‐edge large language models (LLMs) and generative AI to assessment holds great promise in boosting efficiency, mitigating bias, and facilitating customized evaluations. Conversely, these innovations raise significant concerns regarding validity, reliability, transparency, fairness, equity, and test security, necessitating careful thinking when applying them in assessments. In this article, we discuss the impacts and implications of LLMs and generative AI on critical dimensions of assessment with example use cases and call for a community effort to equip assessment professionals with the needed AI literacy to harness the potential effectively.
Reference118 articles.
1. ACT. (n.d.).Test Day. ACT. Retrieved fromhttps://www.act.org/content/act/en/products‐and‐services/the‐act/test‐day.html
2. AERA APA & NCME. (2014).Standards for educational and psychological testing.https://www.apa.org/science/programs/testing/standards
3. Automated essay scoring with e‐rater® V. 2;Attali Y.;The Journal of Technology, Learning and Assessment,2006
4. The interactive reading task: Transformer-based automatic item generation
5. Bapna A. Caswell I. Kreutzer J. Firat O. vanEsch D. Siddhant A. … &Hughes M.(2022).Building machine translation systems for the next thousand languages.arXiv preprint arXiv:2205.03983.
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献