1. Josh Achiam, Steven Adler, Sandhini Agarwal, Lama Ahmad, Ilge Akkaya, Florencia Leoni Aleman, Diogo Almeida, Janko Altenschmidt, Sam Altman, Shyamal Anadkat, et al. 2023. Gpt-4 technical report.
2. Automatic Assessment of Open Ended Questions with a Bleu-Inspired Algorithm and Shallow NLP
3. A reliable approach to automatic assessment of short answer free responses
4. Barbara, Ben Hamner, Jaison Morgan, lynnvandev, and Mark Shermis. 2012. The Hewlett Foundation: Short Answer Scoring. https://kaggle.com/competitions/asap-sas
5. Isaac I Bejar, David M Williamson, and Robert J Mislevy. 2006. Human scoring. Lawrence Erlbaum, Mahwah, NJ. 49--81 pages.