Abstract
To analyze English discourse more accurately and provide more detailed feedback information, this study applies Rasch measurement and Conditional Random Field (CRF) models to English discourse analysis. The Rasch measurement model is widely used to evaluate and quantify the potential traits of individuals, and it has remarkable advantages in measurement and evaluation. By combining the CRF model, the Rasch model is employed to model the structural and semantic information in the discourse and use this model to carry out sequence labeling, to enhance the ability to capture the internal relations of the discourse. Finally, this study conducts comparative experiments on integrating the Rasch measurement and CRF models, comparing the outcomes against traditional scoring methods and the standalone CRF model. The research findings indicate that: (1) The discourse component syntactic analysis model on the Penn Treebank (PTB) database obtained Unlabeled Attachment Score (UAS) values of 94.07, 95.76, 95.67, and 95.43, and Labeled Attachment Score (LAS) values of 92.47, 92.33, 92.49, and 92.46 for the LOC, CRF, CRF2O, and MFVI models, respectively. After adding the Rasch measurement model, the UAS values of the four models on the PTB database are 96.85, 96.77, 96.92, and 96.78 for the LOC, CRF, CRF2O, and MFVI models, respectively, with LAS values of 95.33, 95.34, 95.39, and 95.32, all showing significant improvement. (2) By combining contextual information with CRF models, students can better understand their discourse expression, capture the connections between English discourse sentences, and analyze English discourse more comprehensively. This study provides new ideas and methods for researchers in English language education and linguistics.
Publisher
Public Library of Science (PLoS)
Reference35 articles.
1. A law of ordinal random error: The Rasch measurement model and random error distributions of ordinal assessments;D Andrich;Measurement,2019
2. An introduction to conditional random fields;C Sutton;Foundations and Trends® in Machine Learning,2012
3. Grammatical Cohesive Devices in Reading Text: A Discourse Analysis of English Test for Junior High School;D Jayanti;JET ADI BUANA,2021
4. STABC-IR:An air target intention recognition method based on bidirectional gated recurrent unit and conditional random field with space-time attention mechanism;S Wang;Journal of Aeronautics and Astronautics of China: English version,2023
5. Text segmentation for patent claim simplification via Bidirectional Long-Short Term Memory and Conditional Random Field;B. Geng;Computational Intelligence,2022