Re-conceptualising and accounting for examiner (cut-score) stringency in a ‘high frequency, small cohort’ performance test-Reference-Cited by-同舟云学术

Re-conceptualising and accounting for examiner (cut-score) stringency in a ‘high frequency, small cohort’ performance test

Published:2020-09-02 Issue:2 Volume:26 Page:369-383
ISSN:1382-4996
Container-title:Advances in Health Sciences Education
language:en
Short-container-title:Adv in Health Sci Educ

Author:

Homer Matt^ORCID

Abstract

AbstractVariation in examiner stringency is an ongoing problem in many performance settings such as in OSCEs, and usually is conceptualised and measured based on scores/grades examiners award. Under borderline regression, the standard within a station is set using checklist/domain scores and global grades acting in combination. This complexity requires a more nuanced view of what stringency might mean when considering sources of variation of cut-scores in stations. This study uses data from 349 administrations of an 18-station, 36 candidate single circuit OSCE for international medical graduates wanting to practice in the UK (PLAB2). The station-level data was gathered over a 34-month period up to July 2019. Linear mixed models are used to estimate and then separate out examiner (n = 547), station (n = 330) and examination (n = 349) effects on borderline regression cut-scores. Examiners are the largest source of variation in cut-scores accounting for 56% of variance in cut-scores, compared to 6% for stations, < 1% for exam and 37% residual. Aggregating to the exam level tends to ameliorate this effect. For 96% of examinations, a ‘fair’ cut-score, equalising out variation in examiner stringency that candidates experience, is within one standard error of measurement (SEM) of the actual cut-score. The addition of the SEM to produce the final pass mark generally ensures the public is protected from almost all false positives in the examination caused by examiner cut-score stringency acting in candidates’ favour.

Publisher

Springer Science and Business Media LLC

Subject

Education,General Medicine

Link

https://link.springer.com/content/pdf/10.1007/s10459-020-09990-x.pdf

Reference33 articles.

1. Bartman, I., Smee, S., & Roy, M. (2013). A method for identifying extreme OSCE examiners. The Clinical Teacher, 10(1), 27–31. https://doi.org/10.1111/j.1743-498X.2012.00607.x.

2. Bates, D., Mächler, M., Bolker, B., & Walker, S. (2015). Fitting linear mixed-effects models using lme4. Journal of Statistical Software, 67(1), 1–48. https://doi.org/10.18637/jss.v067.i01.

3. Cai, J., Morris, A., Hohensee, C., Hwang, S., Robison, V., & Hiebert, J. (2018). The role of replication studies in educational research. Journal for Research in Mathematics Education, 49(1), 2–8.

4. Chong, L., Taylor, S., Haywood, M., Adelstein, B.-A., & Shulruf, B. (2017). The sights and insights of examiners in objective structured clinical examinations. Journal of Educational Evaluation for Health Professions. https://doi.org/10.3352/jeehp.2017.14.34.

5. Cizek, G. J., & Bunch, M. B. (2007). Standard setting: A guide to establishing and evaluating performance standards on tests (1st ed.). Thousand Oaks, CA: SAGE Publications Inc.

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Exploring the use of Rasch modelling in “common content” items for multi-site and multi-year assessment;Advances in Health Sciences Education;2024-07-08

2. Inconsistencies in rater-based assessments mainly affect borderline candidates: but using simple heuristics might improve pass-fail decisions;Advances in Health Sciences Education;2024-04-23

3. Towards a more nuanced conceptualisation of differential examiner stringency in OSCEs;Advances in Health Sciences Education;2023-10-16

4. Pass/fail decisions and standards: the impact of differential examiner stringency on OSCE outcomes;Advances in Health Sciences Education;2022-03-01

5. Determining influence, interaction and causality of contrast and sequence effects in objective structured clinical exams;Medical Education;2022-01-11