Abstract
Standard-setting methods are widely used to determine the cut scores on a test that examinees must meet to be classified at a given performance level. Because standard setting is a measurement procedure, it is important to evaluate the variability of the cut scores it produces. In this study, generalizability theory is used to estimate standard errors of cut scores resulting from two standard-setting methods: the item-rating (Angoff-based) and Mapmark (bookmark-based) methods. Two generalizability (G) study designs and four decision (D) study designs were examined, and the impact of varying aspects of the study design and the universe of generalization was assessed. Results suggest that cut scores were generally consistent across both methods. For the Mapmark method, the first round of standard setting contributed the most to the overall variability. The results also make clear that there is no single standard error associated with a given cut score.
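The kind of G-theory calculation the abstract describes can be sketched as follows. This is a minimal illustration, not the study's actual design: it assumes a fully crossed panelists × rounds layout with hypothetical, made-up ratings, estimates variance components from classical expected mean squares, and then computes a D-study standard error for the mean cut score.

```python
import numpy as np

# Hypothetical cut-score recommendations (scale-score points) from
# 5 panelists (rows) across 3 standard-setting rounds (columns).
# These numbers are illustrative only, not data from the study.
ratings = np.array([
    [150., 152., 151.],
    [148., 150., 150.],
    [155., 153., 152.],
    [149., 151., 151.],
    [152., 152., 153.],
])
n_p, n_r = ratings.shape

grand = ratings.mean()
p_means = ratings.mean(axis=1)   # panelist means
r_means = ratings.mean(axis=0)   # round means

# Sums of squares for a fully crossed p x r design, one observation per cell.
ss_p = n_r * ((p_means - grand) ** 2).sum()
ss_r = n_p * ((r_means - grand) ** 2).sum()
ss_tot = ((ratings - grand) ** 2).sum()
ss_pr = ss_tot - ss_p - ss_r      # residual (interaction) sum of squares

ms_p = ss_p / (n_p - 1)
ms_r = ss_r / (n_r - 1)
ms_pr = ss_pr / ((n_p - 1) * (n_r - 1))

# Variance components from expected mean squares
# (negative estimates truncated to zero, a common convention).
var_pr = ms_pr
var_p = max((ms_p - ms_pr) / n_r, 0.0)
var_r = max((ms_r - ms_pr) / n_p, 0.0)

def se_cut_score(n_p2, n_r2):
    """D-study absolute standard error of the mean cut score for a
    hypothetical panel of n_p2 panelists averaged over n_r2 rounds."""
    return float(np.sqrt(var_p / n_p2 + var_r / n_r2 + var_pr / (n_p2 * n_r2)))

print(round(se_cut_score(n_p, n_r), 3))
```

Changing the D-study sample sizes passed to `se_cut_score` shows the abstract's point that no single standard error attaches to a cut score: the estimate depends on the assumed panel size, number of rounds, and universe of generalization.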
Subject
Applied Mathematics, Applied Psychology, Developmental and Educational Psychology, Education
Cited by: 13 articles.