1. All that’s ‘human’ is not gold: evaluating human evaluation of generated text;Clark,2021
2. Meet GPT-3. It has learned to code (and blog and argue);Metz,2020
3. Towards automating healthcare question answering in a noisy multilingual low-resource setting;Daniel,2019
4. Improving access to justice with legal chatbots;Queudot;Stats,2020
5. Generative grading: near human-level accuracy for automated feedback on richly structured problems;Malik,2019