Rubric development for AI-enabled scoring of three-dimensional constructed-response assessment aligned to NGSS learning progression

Published: 2022-11-25 Volume: 7
ISSN: 2504-284X
Container-title: Frontiers in Education
Short-container-title: Front. Educ.

Author:

Kaldaras Leonora, Yoshida Nicholas R., Haudek Kevin C.

Abstract

Introduction: The Framework for K-12 Science Education (the Framework) and the Next Generation Science Standards (NGSS) define three dimensions of science (disciplinary core ideas, scientific and engineering practices, and crosscutting concepts) and emphasize the integration of the three dimensions (3D) to reflect deep science understanding. The Framework also emphasizes the importance of using learning progressions (LPs) as roadmaps to guide assessment development. Assessments capable of measuring the integration of NGSS dimensions should probe the ability to explain phenomena and solve problems. This calls for the development of constructed response (CR), or open-ended, assessments, even though they are expensive to score. Artificial intelligence (AI) technologies such as machine learning (ML)-based approaches have been used to score and provide feedback on open-ended NGSS assessments aligned to LPs. ML approaches can use classifications resulting from holistic and analytic coding schemes for scoring short CR assessments. With analytic rubrics, the validity of ML-based scores with respect to LP levels has been shown to be easier to evaluate. However, a possible drawback of using analytic rubrics for NGSS-aligned CR assessments is the potential for oversimplification of integrated ideas. Here we describe how to deconstruct a 3D holistic rubric for CR assessments probing the levels of an NGSS-aligned LP for high school physical sciences.

Methods: We deconstruct this rubric into seven analytic categories to preserve the 3D nature of the rubric and its resulting scores, and we provide the subsequent combinations of categories that correspond to LP levels.

Results: The resulting analytic rubric had excellent human-human inter-rater reliability across the seven categories (Cohen's kappa range 0.82–0.97). Overall scores assigned to responses using combinations of analytic rubric categories agreed very closely with scores assigned using the holistic rubric (99% agreement), suggesting that the 3D nature of the rubric and scores was maintained. We found differing levels of agreement between ML models using analytic rubric scores and human-assigned scores. ML models for categories with a low number of positive cases displayed the lowest level of agreement.

Discussion: We discuss these differences in bin performance, as well as the implications and further applications of this rubric deconstruction approach.
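The two agreement statistics named in the Results, Cohen's kappa and percent agreement, can be illustrated with a minimal Python sketch. This is not the authors' code: the function names and the sample codes below are hypothetical and chosen only for illustration, and the sketch assumes each rater assigns one label (for example, 1 for present and 0 for absent) per response for a single analytic category.

```python
from collections import Counter


def percent_agreement(rater_a, rater_b):
    """Fraction of responses on which two raters assigned the same label."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("Both raters must score the same non-empty set of responses.")
    matches = sum(a == b for a, b in zip(rater_a, rater_b))
    return matches / len(rater_a)


def cohen_kappa(rater_a, rater_b):
    """Cohen's kappa: observed agreement corrected for chance agreement."""
    observed = percent_agreement(rater_a, rater_b)
    n = len(rater_a)
    counts_a = Counter(rater_a)
    counts_b = Counter(rater_b)
    # Chance agreement: probability both raters pick the same label
    # if each labelled independently at their own marginal rates.
    expected = sum(counts_a[label] * counts_b[label] for label in counts_a) / (n * n)
    if expected == 1.0:
        # Both raters used a single identical label; agreement is trivially perfect.
        return 1.0
    return (observed - expected) / (1.0 - expected)


# Hypothetical codes for one analytic category (1 = present, 0 = absent).
human_1 = [1, 1, 0, 0, 1, 0, 1, 0, 0, 0]
human_2 = [1, 1, 0, 0, 1, 0, 0, 0, 0, 0]

agreement = percent_agreement(human_1, human_2)
kappa = cohen_kappa(human_1, human_2)
```

Because kappa discounts the agreement expected by chance, a category with few positive cases can show high percent agreement but a noticeably lower kappa, which is one reason the two statistics are usually reported together.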

Funder

National Science Foundation

Publisher

Frontiers Media SA

Subject

Education

References (33 articles)

1. Designing educational systems to support enactment of the next generation science standards; Anderson; J. Res. Sci. Teach., 2018

2. A coefficient of agreement for nominal scales; Cohen; Educ. Psychol. Meas., 1960

3. Designing knowledge-in-use assessments to promote deeper learning; Harris; Educ. Meas. Issues Pract., 2019

4. What are they thinking? Automated analysis of student writing about acid–base chemistry in introductory biology; Haudek; CBE—Life Sci. Educ., 2012

Cited by 7 articles.

1. Developing valid assessments in the era of generative artificial intelligence; Frontiers in Education; 2024-08-07

2. Employing technology-enhanced feedback and scaffolding to support the development of deep science understanding using computer simulations; International Journal of STEM Education; 2024-07-11

3. Potenziare il Giudizio Descrittivo nella Scuola Primaria con l’uso dell’IA generativa; IUL Research; 2024-06-28

4. Extending a Pretrained Language Model (BERT) using an Ontological Perspective to Classify Students’ Scientific Expertise Level from Written Responses; 2024-01-26

5. Supporting chemistry teachers’ formative assessment with a three-dimensional learning progression; International Journal of Science Education; 2024-01-19