More efficient processes for creating automated essay scoring frameworks: A demonstration of two algorithms-Reference-Cited by-同舟云学术

More efficient processes for creating automated essay scoring frameworks: A demonstration of two algorithms

Published:2020-07-04 Issue:2 Volume:38 Page:247-272
ISSN:0265-5322
Container-title:Language Testing
language:en
Short-container-title:Language Testing

Author:

Shin Jinnie¹^ORCID,Gierl Mark J.¹

Affiliation:

1. University of Alberta, Canada

Abstract

Automated essay scoring (AES) has emerged as a secondary or as a sole marker for many high-stakes educational assessments, in native and non-native testing, owing to remarkable advances in feature engineering using natural language processing, machine learning, and deep-neural algorithms. The purpose of this study is to compare the effectiveness and the performance of two AES frameworks, each based on machine learning with deep language features, or complex language features, and deep neural algorithms. More specifically, support vector machines (SVMs) in conjunction with Coh-Metrix features were used for a traditional AES model development, and the convolutional neural networks (CNNs) approach was used for more contemporary deep-neural model development. Then, the strengths and weaknesses of the traditional and contemporary models under different circumstances (e.g., types of the rubric, length of the essay, and the essay type) were tested. The results were evaluated using the quadratic weighted kappa (QWK) score and compared with the agreement between the human raters. The results indicated that the CNNs model performs better, meaning that it produced more comparable results to the human raters than the Coh-Metrix + SVMs model. Moreover, the CNNs model also achieved state-of-the-art performance in most of the essay sets with a high average QWK score.

Publisher

SAGE Publications

Subject

Linguistics and Language,Social Sciences (miscellaneous),Language and Linguistics

Link

http://journals.sagepub.com/doi/pdf/10.1177/0265532220937830

Reference15 articles.

1. Automated scoring of junior and senior high essays using Coh-Metrix features: Implications for large-scale language testing

Cited by 24 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Demystifying large language models in second language development research;Computer Speech & Language;2025-01

2. Harnessing LLMs for multi-dimensional writing assessment: Reliability and alignment with human judgments;Heliyon;2024-07

3. Utilizing large language models for EFL essay grading: An examination of reliability and validity in rubric‐based assessments;British Journal of Educational Technology;2024-06-04

4. Study on Intelligent Scoring of English Composition Based on Machine Learning from the Perspective of Natural Language Processing;ACM Transactions on Asian and Low-Resource Language Information Processing;2024-06-04

5. Incorporating Fine-Grained Linguistic Features and Explainable AI into Multi-Dimensional Automated Writing Assessment;Applied Sciences;2024-05-15