Affiliation:
1. Hong Kong Examinations and Assessment Authority, Wan Chai, Hong Kong
2. TestDaF Institute, University of Bochum, Bochum, Germany
Abstract
Performance assessments rely heavily on human ratings. These ratings are typically subject to various forms of error and bias, threatening the validity and fairness of assessment outcomes. Differential rater functioning (DRF) is a special kind of threat to fairness, manifesting itself in unwanted interactions between raters and performance- or construct-irrelevant factors (e.g., examinee gender, rater experience, or time of rating). Most DRF studies have focused on whether raters show differential severity toward known groups of examinees. This study expands the DRF framework and investigates the more complex case of dual DRF effects, where DRF is simultaneously present in rater severity and centrality. Adopting a facets modeling approach, we propose the dual DRF model (DDRFM) for detecting and measuring these effects. In two simulation studies, we found that dual DRF effects (a) negatively affected measurement quality and (b) could reliably be detected and compensated for under the DDRFM. Using sample data from a large-scale writing assessment (N = 1,323), we demonstrate the practical measurement consequences of the dual DRF effects. Findings have implications for researchers and practitioners assessing the psychometric quality of ratings.
Subject
Applied Mathematics, Applied Psychology, Developmental and Educational Psychology, Education
Cited by
8 articles.