Abstract
In assessment programs where scores are reported for individual examinees, it is desirable to have responses to performance exercises graded by more than one rater. If more than one item on each test form is so graded, it is also desirable that different raters grade the responses of any one examinee. This gives rise to sampling designs in which raters are nested within items. These designs lead to simple methods for estimating variance components owing to examinees and to interactions of examinees by items and examinees by raters within items. The authors review here some useful results from generalizability analysis based on these estimates and show that they may be used to correct the item response information functions and standard errors for conditional dependence of multiple ratings. Examples based on data from two performance testing studies are presented.
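The abstract only sketches the estimation step. As a hedged illustration (the function name, array layout, and balanced-design assumption are mine, not the authors'), the following minimal numpy sketch solves the standard ANOVA expected-mean-square equations of generalizability theory for a balanced p × (r:i) design, yielding the three examinee-related variance components the abstract mentions: examinees, examinees-by-items, and examinees-by-raters-within-items (with error).

```python
import numpy as np

def variance_components_p_x_r_in_i(x):
    """Estimate examinee-related variance components for a balanced
    p x (r:i) design (raters nested within items), using the standard
    ANOVA expected-mean-square equations of generalizability theory.

    x : array of shape (n_persons, n_items, n_raters_per_item);
        x[p, i, r] is the rating given to examinee p on item i by the
        r-th rater assigned to item i.
    Returns estimates of sigma^2(p), sigma^2(pi), sigma^2(pr:i, e).
    """
    n_p, n_i, n_r = x.shape
    grand = x.mean()
    m_p = x.mean(axis=(1, 2))   # examinee means
    m_i = x.mean(axis=(0, 2))   # item means
    m_pi = x.mean(axis=2)       # examinee-by-item cell means
    m_ir = x.mean(axis=0)       # rater-within-item means

    # Mean squares for the examinee-related effects
    ms_p = n_i * n_r * np.sum((m_p - grand) ** 2) / (n_p - 1)
    ms_pi = n_r * np.sum(
        (m_pi - m_p[:, None] - m_i[None, :] + grand) ** 2
    ) / ((n_p - 1) * (n_i - 1))
    resid = x - m_pi[:, :, None] - m_ir[None, :, :] + m_i[None, :, None]
    ms_pri = np.sum(resid ** 2) / ((n_p - 1) * n_i * (n_r - 1))

    # Solve the expected-mean-square equations:
    #   E[MS_p]    = s2_pri + n_r*s2_pi + n_i*n_r*s2_p
    #   E[MS_pi]   = s2_pri + n_r*s2_pi
    #   E[MS_pr:i] = s2_pri
    var_pri = ms_pri
    var_pi = (ms_pi - ms_pri) / n_r
    var_p = (ms_p - ms_pi) / (n_i * n_r)
    return var_p, var_pi, var_pri
```

For example, calling the function on a (200, 4, 2) array of ratings (200 examinees, 4 items, 2 raters per item) returns the three component estimates; negative estimates, which can arise by chance, are conventionally truncated at zero. How these components are then used to adjust IRT information functions and standard errors for the conditional dependence of multiple ratings is developed in the article itself and is not reproduced here.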
Subject
Psychology (miscellaneous), Social Sciences (miscellaneous)
Cited by
24 articles.