Music teachers’ labeling accuracy and quality ratings of lesson plans by artificial intelligence (AI) and humans-Reference-Cited by-同舟云学术

Music teachers’ labeling accuracy and quality ratings of lesson plans by artificial intelligence (AI) and humans

Published:2024-05-08 Issue: Volume: Page:
ISSN:0255-7614
Container-title:International Journal of Music Education
language:en
Short-container-title:International Journal of Music Education

Author:

Cooper Patrick K¹^ORCID

Affiliation:

1. Florida International University, USA

Abstract

This study explored the potential of artificial intelligence (ChatGPT) to generate lesson plans for music classes that were indistinguishable from music lesson plans created by humans, with current music teachers as assessors. Fifty-six assessors made a total of 410 ratings across eight lesson plans, assigning a quality score to each lesson plan and labeling if they believed each lesson plan was created by a human or generated by AI. Despite the human-made lesson plans being rated higher in quality as a group ( p < .01, d = 0.44), assessors were unable to accurately label if a lesson plan was created by a human or generated by AI (55% accurate overall). Labeling accuracy was positively predicted by quality scores on human-made lesson plans and previous personal use of AI, while accuracy was negatively predicted by quality scores on AI-generated lesson plans and perception of how useful AI will be in the future. Open-ended responses from 42 teachers suggested assessors used three factors when making evaluations: specific details, evidence of classroom knowledge, and wording. Implications provide suggestions for how music teachers can use prompt engineering with a GPT model to create a virtual assistant or Intelligent Tutor System (ITS) for their classroom.

Publisher

SAGE Publications

Link

https://journals.sagepub.com/doi/pdf/10.1177/02557614241249163

Reference28 articles.

1. Ariza C. (2009). The interrogator as critic: The Turing test and the evaluation of generative music systems. Computer Music Journal, 33(2), 48–70. https://doi.org/10.1162/comj.2009.33.2.48

2. Avdeeff M. (2019). Artificial intelligence & popular music: SKYGGE, flow machines, and the audio uncanny valley. Arts (Basel), 8(4), 130. https://doi.org/10.3390/arts8040130

3. Boden M. A. (2010). The Turing test and artistic creativity. Kybernetes, 39(3), 409–413. https://doi.org/10.1108/03684921011036132

4. Chen M. (2020). Imagination machines, Dartmouth-based Turing-tests, & a potted history of responses, AI & Society, 35, 283–287. https://doi.org/10.1007/s00146-018-0855-3

5. Clancey W. J., Hoffman R. R. (2021). Methods and standards for research on explainable artificial intelligence: Lessons from intelligent tutoring systems. Applied AI Letters, 2(4), e53. https://doi.org/10.1002/ail2.53