Published Studies of Interrater Reliability Often Overestimate Reliability: Computing the Correct Coefficient-Reference-Cited by-同舟云学术

Published Studies of Interrater Reliability Often Overestimate Reliability: Computing the Correct Coefficient

Published:2000-08 Issue:4 Volume:60 Page:532-542
ISSN:0013-1644
Container-title:Educational and Psychological Measurement
language:en
Short-container-title:Educational and Psychological Measurement

Author:

Fan Xitao¹,Chen Michael²

Affiliation:

1. Utah State University

2. University of Mississippi

Abstract

It is erroneous to generalize the interrater reliability coefficient estimated from two or more raters rating only a (small) portion of the sample to the rest of the sample data for which only one rater is used for scoring, although such generalization is often made implicitly in practice. If the interrater reliability estimate from part of a sample is available, the score reliability for the rest of the sample data for which only one rater is used for scoring can be estimated both within the framework of classical reliability theory and that of generalizability theory. As intuitively expected, score reliability when only one rater is used for scoring is lower than the score reliability for which two raters are used. The authors provide a sample of published studies in different disciplines that inappropriately generalized reliability coefficients involving several raters to scores generated by a single rater.

Publisher

SAGE Publications

Subject

Applied Mathematics,Applied Psychology,Developmental and Educational Psychology,Education

Link

http://journals.sagepub.com/doi/pdf/10.1177/00131640021970709

Reference20 articles.

1. Solitary and Collaborative Pretense Play in Early Childhood: Sources of Individual Variation in the Development of Representational Competence

2. Activities and Interactions of Mothers and Their Firstborn Infants in the First Six Months of Life: Covariation, Stability, Continuity, Correspondence, and Prediction

3. Generalizability Theory

4. Interscorer Reliability for the Hand Test Administered to Children

Cited by 14 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A process for establishing and maintaining inter-rater reliability for two observation instruments as a fidelity of implementation measure: A large-scale randomized controlled trial perspective;Studies in Educational Evaluation;2019-09

2. Reliability analysis of the objective structured clinical examination using generalizability theory;Medical Education Online;2016-01-01

3. Putting Quality Indicators to the Test: An Examination of 30 Years of Research;Journal of Emotional and Behavioral Disorders;2010-12-21

4. The Edumetric Quality of New Modes of Assessment: Some Issues and Prospects;Assessment, Learning and Judgement in Higher Education;2008-11-13

5. Prospects for Group Processes and Intergroup Relations Research: A Review of 70 Years' Progress;Group Processes & Intergroup Relations;2008-10