Application of an Automated Essay Scoring engine to English writing assessment using Many-Facet Rasch Measurement-Reference-Cited by-同舟云学术

Application of an Automated Essay Scoring engine to English writing assessment using Many-Facet Rasch Measurement

Published:2022-02-26 Issue:1 Volume:40 Page:61-85
ISSN:0265-5322
Container-title:Language Testing
language:en
Short-container-title:Language Testing

Author:

Chan Kinnie Kin Yee¹^ORCID,Bond Trevor²,Yan Zi³

Affiliation:

1. Hong Kong Metropolitan University, Hong Kong

2. James Cook University, Australia

3. The Education University of Hong Kong, Hong Kong

Abstract

We investigated the relationship between the scores assigned by an Automated Essay Scoring (AES) system, the Intelligent Essay Assessor (IEA), and grades allocated by trained, professional human raters to English essay writing by instigating two procedures novel to written-language assessment: the logistic transformation of AES raw scores into hierarchically ordered grades, and the co-calibration of all essay scoring data in a single Rasch measurement framework. A total of 3453 essays were written by 589 US students (in Grades 4, 6, 8, 10, and 12), in response to 18 National Assessment of Educational Progress (NAEP) writing prompts at three grade levels (4, 8, & 12). We randomly assigned one of two versions of the assessment, A or B, to each student. Each version comprised a narrative (N), an informative (I), and a persuasive (P) prompt. Nineteen experienced assessors graded the essays holistically using NAEP scoring guidelines, using a rotating plan in which each essay was rated by four raters. Each essay was additionally scored using the IEA. We estimated the effects of rater, prompt, student, and rubric by using a Many-Facet Rasch Measurement (MFRM) model. Last, within a single Rasch measurement scale, we co-calibrated the students’ grades from human raters and their grades from the IEA to compare them. The AES machine maintained equivalence with human scored ratings and were more consistent than those from human raters.

Publisher

SAGE Publications

Subject

Linguistics and Language,Social Sciences (miscellaneous),Language and Linguistics

Link

http://journals.sagepub.com/doi/pdf/10.1177/02655322221076025

Reference60 articles.

1. Technology in Teaching English Language Learners: The Case of Three Middle School Teachers

2. Automated Essay Evaluation and the Computational Paradigm: Machine Scoring Enters the Classroom

3. Historical view of the influences of measurement and writing theories on the practice of writing assessment in the United States

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Assessing second-language academic writing: AI vs. Human raters;Journal of Educational Technology and Online Learning;2023-12-31

2. An Automated English Essay Scoring Engine Based on Neutrosophic Ontology for Electronic Education Systems;Applied Sciences;2023-07-26

3. Editorial: Learning analytics for supporting individualization: data-informed adaptation of learning;Frontiers in Education;2023-06-30

4. Experienced but detached from reality: Theorizing and operationalizing the relationship between experience and rater effects;Assessing Writing;2023-04

5. Automatic Scoring of English Essays Based on Machine Learning Technology in a Wireless Network Environment;Security and Communication Networks;2022-05-28