Development and validation of a rating scale for summarization as an integrated task-Reference-Cited by-同舟云学术

Development and validation of a rating scale for summarization as an integrated task

Published:2021-07-01 Issue:1 Volume:6 Page:
ISSN:2363-5169
Container-title:Asian-Pacific Journal of Second and Foreign Language Education
language:en
Short-container-title:Asian. J. Second. Foreign. Lang. Educ.

Author:

Li Jiuliang^ORCID,Wang Qian

Abstract

AbstractSummary writing is essential for academic success, and has attracted renewed interest in academic research and large-scale language test. However, less attention has been paid to the development and evaluation of the scoring scales of summary writing. This study reports on the validation of a summary rubric that represented an approach to scale development with limited resources out of consideration for practicality. Participants were 83 students and three raters. Diagnostic evaluation of the scale components and categories was based on raters’ perception of their use and the scores of students’ summaries which were analyzed using multifaceted Rasch measurement (MFRM). Correlation analysis revealed significant relationships among the scoring components, but the coefficients among some of the components were over high. MFRM analysis provided evidence in support of the usefulness of the scoring rubric, but also suggested the need of a refinement of the components and categories. According to the raters, the rubric was ambiguous in addressing some crucial text features. This study has implications for summarization task design, scoring scale development and validation in particular.

Funder

National Education Examinations Authority of China & British Council

Beijing Institute of Fashion Technology

Publisher

Springer Science and Business Media LLC

Subject

Linguistics and Language,Language and Linguistics,Education

Link

https://link.springer.com/content/pdf/10.1186/s40862-021-00113-6.pdf

Reference72 articles.

1. Andrich, D. (1996). Measurement criteria for choosing among models with graded responses. In A. von Eye, & C. C. Clogg (Eds.), Categorical variables in developmental research: Methods of analysis, (pp. 3–35). San Diego, CA: Academic Press.

2. Asención Delaney, Y. (2008). Investigating the reading-to-write construct. Journal of English for Academic Purposes, 7(3), 140–150. https://doi.org/10.1016/j.jeap.2008.04.001.

3. Asención, Y. (2004). Validation of reading-to-write assessment tasks performed by second language learners. Unpublished PhD thesis, Northern Arizona University.

4. Bachman, L. F., & Palmer, A. (1996). Language testing in practice. Oxford: Oxford University Press.

5. Bernhardt, E.B. (1991). Reading development in a second language: Theoretical, empirical, and classroom perspectives. Norwood, NJ: Ablex.

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Comparing Chinese L2 writing performance in paper-based and computer-based modes: Perspectives from the writing product and process;Assessing Writing;2024-07

2. Özetleme Stratejileri Eğitiminin 7. Sınıf Öğrencilerinin Özetleme Başarısına Etkisi;İnönü Üniversitesi Eğitim Fakültesi Dergisi;2024-05-15

3. The flipped learning perception scale: A validity and reliability study;Education and Information Technologies;2023-04-03