Comparison of Machine Learning Performance Using Analytic and Holistic Coding Approaches Across Constructed Response Assessments Aligned to a Science Learning Progression-Reference-Cited by-同舟云学术

Comparison of Machine Learning Performance Using Analytic and Holistic Coding Approaches Across Constructed Response Assessments Aligned to a Science Learning Progression

Published:2020-09-26 Issue:2 Volume:30 Page:150-167
ISSN:1059-0145
Container-title:Journal of Science Education and Technology
language:en
Short-container-title:J Sci Educ Technol

Author:

Jescovitch Lauren N.^ORCID,Scott Emily E.^ORCID,Cerchiara Jack A.^ORCID,Merrill John,Urban-Lurain Mark^ORCID,Doherty Jennifer H.^ORCID,Haudek Kevin C.^ORCID

Abstract

AbstractWe systematically compared two coding approaches to generate training datasets for machine learning (ML): (i) a holistic approach based on learning progression levels and (ii) a dichotomous, analytic approach of multiple concepts in student reasoning, deconstructed from holistic rubrics. We evaluated four constructed response assessment items for undergraduate physiology, each targeting five levels of a developing flux learning progression in an ion context. Human-coded datasets were used to train two ML models: (i) an 8-classification algorithm ensemble implemented in the Constructed Response Classifier (CRC), and (ii) a single classification algorithm implemented in LightSide Researcher’s Workbench. Human coding agreement on approximately 700 student responses per item was high for both approaches with Cohen’s kappas ranging from 0.75 to 0.87 on holistic scoring and from 0.78 to 0.89 on analytic composite scoring. ML model performance varied across items and rubric type. For two items, training sets from both coding approaches produced similarly accurate ML models, with differences in Cohen’s kappa between machine and human scores of 0.002 and 0.041. For the other items, ML models trained with analytic coded responses and used for a composite score, achieved better performance as compared to using holistic scores for training, with increases in Cohen’s kappa of 0.043 and 0.117. These items used a more complex scenario involving movement of two ions. It may be that analytic coding is beneficial to unpacking this additional complexity.

Funder

National Science Foundation

Publisher

Springer Science and Business Media LLC

Subject

General Engineering,Education

Link

https://link.springer.com/content/pdf/10.1007/s10956-020-09858-0.pdf

Reference75 articles.

1. Aggarwal, C. C., & Zhai, C. (2012). A survey of text classification algorithms. In C. Aggarwal & C. Zhai (Eds.), Mining text data. Springer.

2. Ali, K. M., & Pazzani, M. J. (1996). Error reduction through learning multiple descriptions. Mach Learn, 24(3), 173–202.

3. Allen, D., & Tanner, K. (2006). Rubrics: tools for making learning goals and evaluation criteria explicit for both teachers and learners. CBE Life Sciences Education, 5(3), 197–203. https://doi.org/10.1187/cbe.06-06-0168.

4. American Association for the Advancement of Science, AAAS. (2011). Vision and change in undergraduate biology education: a call to action. Washington, DC.

5. Anderson, C. W., de los Santos, E. X., Bodbyl, S., Covitt, B. A., Edwards, K. D., Hancock II, J. B., Lin, Q., Thomas, C. M., Penuel, W. R., & Welch, M. M. (2018). Designing educational systems to support enactment of the next generation science standards. J Res Sci Teach, 55(7), 1026–1052. https://doi.org/10.1002/tea.21484.

Cited by 43 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Examining and Supporting Mechanistic Explanations Across Chemistry and Biology Courses;CBE—Life Sciences Education;2024-09

2. FEW questions, many answers: using machine learning to assess how students connect food–energy–water (FEW) concepts;Humanities and Social Sciences Communications;2024-08-13

3. Enhancing engineering education through mini project-based learning in computer integrated manufacturing laboratory: A student-centric approach;Innovations in Education and Teaching International;2024-06-03

4. Applying large language models and chain-of-thought for automatic scoring;Computers and Education: Artificial Intelligence;2024-06

5. Automatic assessment of text-based responses in post-secondary education: A systematic review;Computers and Education: Artificial Intelligence;2024-06