Using automated analysis to assess middle school students' competence with scientific argumentation

Authors:

Wilson, Christopher D.¹; Haudek, Kevin C.²; Osborne, Jonathan F.³; Buck Bracey, Zoë E.¹; Cheuk, Tina⁴; Donovan, Brian M.¹; Stuhlsatz, Molly A. M.¹; Santiago, Marisol M.²; Zhai, Xiaoming⁵

Affiliations:

1. BSCS Science Learning, Colorado Springs, Colorado, USA

2. Michigan State University, East Lansing, Michigan, USA

3. Stanford University, Stanford, California, USA

4. California Polytechnic State University, San Luis Obispo, California, USA

5. University of Georgia, Athens, Georgia, USA

Abstract

Argumentation is fundamental to science education, both as a prominent feature of scientific reasoning and as an effective mode of learning—a perspective reflected in contemporary frameworks and standards. The successful implementation of argumentation in school science, however, requires a paradigm shift in science assessment from the measurement of knowledge and understanding to the measurement of performance and knowledge in use. Performance tasks requiring argumentation must capture the many ways students can construct and evaluate arguments in science, yet such tasks are both expensive and resource‐intensive to score. In this study, we explore how machine learning text classification techniques can be applied to develop efficient, valid, and accurate constructed‐response measures of students' competency with written scientific argumentation that are aligned with a validated argumentation learning progression. Data come from 933 middle school students in the San Francisco Bay Area and are based on three sets of argumentation items in three different science contexts. The findings demonstrate that we have been able to develop computer scoring models that achieve substantial to almost perfect agreement between human‐assigned and computer‐predicted scores. Model performance was slightly weaker for harder items targeting higher levels of the learning progression, largely due to the linguistic complexity of these responses and the sparsity of higher‐level responses in the training data set. Comparing the efficacy of different scoring approaches revealed that breaking down students' arguments into multiple components (e.g., the presence of an accurate claim or the provision of sufficient evidence), developing computer models for each component, and combining scores from these analytic components into a holistic score produced better results than holistic scoring approaches. However, this analytic approach was found to be differentially biased when scoring responses from English learner (EL) students as compared to responses from non‐EL students on some items. Differences in severity between human‐ and computer‐assigned scores for EL students across the two approaches are explored, and potential sources of bias in automated scoring are discussed.
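The analytic-then-holistic pipeline the abstract describes can be illustrated with a minimal sketch. The example below is an assumption-laden illustration, not the authors' actual implementation: it assumes scikit-learn, bag-of-words features with logistic regression, two hypothetical components ("claim" and "evidence"), a simple sum as the combination rule, and toy labeled responses.

```python
# Minimal sketch of analytic component scoring combined into a holistic
# score, with human-computer agreement measured by Cohen's kappa.
# All data, component labels, and the combination rule are illustrative.
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import cohen_kappa_score
from sklearn.pipeline import make_pipeline

# Toy training data: student responses with human-assigned binary labels
# per analytic component (1 = component present, 0 = absent).
responses = [
    "The plant grew taller because it got more light, as the data show.",
    "I think the plant grew.",
    "More light means more growth; the 10 cm difference supports this.",
    "Plants need water.",
]
labels = {
    "claim":    [1, 1, 1, 0],  # accurate claim present?
    "evidence": [1, 0, 1, 0],  # sufficient evidence cited?
}

# Train one text classifier per analytic component.
models = {}
for component, y in labels.items():
    model = make_pipeline(CountVectorizer(ngram_range=(1, 2)),
                          LogisticRegression(max_iter=1000))
    model.fit(responses, y)
    models[component] = model

def holistic_score(texts):
    """Combine per-component predictions into a holistic score (here: a sum)."""
    parts = [models[c].predict(texts) for c in labels]
    return [sum(scores) for scores in zip(*parts)]

# Compare computer-predicted holistic scores against human holistic scores.
human_holistic = [2, 1, 2, 0]  # sum of the human component labels above
computer_holistic = holistic_score(responses)
print("kappa:", cohen_kappa_score(human_holistic, computer_holistic))
```

In practice, agreement would be computed on held-out responses rather than the training set, and for ordinal holistic scores a weighted kappa (e.g., `cohen_kappa_score(..., weights="quadratic")`) is commonly reported.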

Funder

National Science Foundation

Publisher

Wiley

Subject

Education

Cited by 4 articles.