Automated language essay scoring systems: a literature review-Reference-Cited by-同舟云学术

Automated language essay scoring systems: a literature review

Published:2019-08-12 Issue: Volume:5 Page:e208
ISSN:2376-5992
Container-title:PeerJ Computer Science
language:en
Short-container-title:

Author:

Hussein Mohamed Abdellatif¹,Hassan Hesham²,Nassef Mohammad²

Affiliation:

1. Information and Operations, National Center for Examination and Educational Evaluation, Cairo, Egypt

2. Faculty of Computers and Information, Computer Science Department, Cairo University, Cairo, Egypt

Abstract

Background Writing composition is a significant factor for measuring test-takers’ ability in any language exam. However, the assessment (scoring) of these writing compositions or essays is a very challenging process in terms of reliability and time. The need for objective and quick scores has raised the need for a computer system that can automatically grade essay questions targeting specific prompts. Automated Essay Scoring (AES) systems are used to overcome the challenges of scoring writing tasks by using Natural Language Processing (NLP) and machine learning techniques. The purpose of this paper is to review the literature for the AES systems used for grading the essay questions. Methodology We have reviewed the existing literature using Google Scholar, EBSCO and ERIC to search for the terms “AES”, “Automated Essay Scoring”, “Automated Essay Grading”, or “Automatic Essay” for essays written in English language. Two categories have been identified: handcrafted features and automatically featured AES systems. The systems of the former category are closely bonded to the quality of the designed features. On the other hand, the systems of the latter category are based on the automatic learning of the features and relations between an essay and its score without any handcrafted features. We reviewed the systems of the two categories in terms of system primary focus, technique(s) used in the system, the need for training data, instructional application (feedback system), and the correlation between e-scores and human scores. The paper includes three main sections. First, we present a structured literature review of the available Handcrafted Features AES systems. Second, we present a structured literature review of the available Automatic Featuring AES systems. Finally, we draw a set of discussions and conclusions. Results AES models have been found to utilize a broad range of manually-tuned shallow and deep linguistic features. AES systems have many strengths in reducing labor-intensive marking activities, ensuring a consistent application of scoring criteria, and ensuring the objectivity of scoring. Although many techniques have been implemented to improve the AES systems, three primary challenges have been identified. The challenges are lacking of the sense of the rater as a person, the potential that the systems can be deceived into giving a lower or higher score to an essay than it deserves, and the limited ability to assess the creativity of the ideas and propositions and evaluate their practicality. Many techniques have only been used to address the first two challenges.

Publisher

PeerJ

Subject

General Computer Science

Link

https://peerj.com/articles/cs-208.pdf

Reference33 articles.

1. Automatic text scoring using neural networks;Alikaniotis,2016

2. Automated essay scoring with E-Rater® V.2.0;Attali;ETS Research Report Series,2014

3. The e-rater scoring engine: automated essay scoring with natural language processing;Burstein,2003

4. Marine exploitation of Atlantic salmon (Salmo salar L.) from the River Bush, Northern Ireland;Crozier;Fisheries Research,1994

5. Augmenting textual qualitative features in deep convolution recurrent neural network for automatic essay scoring;Dasgupta,2018

Cited by 88 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Neural Networks or Linguistic Features? - Comparing Different Machine-Learning Approaches for Automated Assessment of Text Quality Traits Among L1- and L2-Learners’ Argumentative Essays;International Journal of Artificial Intelligence in Education;2024-09-13

2. Reliability of ChatGPT in automated essay scoring for dental undergraduate examinations;BMC Medical Education;2024-09-03

3. Linking essay-writing tests using many-facet models and neural automated essay scoring;Behavior Research Methods;2024-08-20

4. A multifaceted architecture to Automate Essay Scoring for assessing english article writing: Integrating semantic, thematic, and linguistic representations;Computers and Electrical Engineering;2024-08

5. Comparing ChatGPT's correction and feedback comments with that of educators in the context of primary students' short essays written in English and Greek;Education and Information Technologies;2024-07-27