Abstract
Introduction: Ensuring examiner equivalence across assessment locations is a priority within distributed Objective Structured Clinical Exams (OSCEs) but is challenging due to the lack of overlap in the performances judged by different groups of examiners. Yeates et al. have developed a methodology, Video-based Examiner Score Comparison and Adjustment (VESCA), to compare and (potentially) adjust for the influence of different groups of examiners within OSCEs. Whilst initial research has been promising, the accuracy of the adjusted scores produced by VESCA is unknown. As this is critical to VESCA's utility, we aimed to investigate the accuracy of adjusted scores produced by VESCA under a range of plausible operational parameters.
Methods: Using statistical simulation, we investigated how five factors influenced the accuracy of adjusted scores: 1) the proportion of participating examiners, 2) the number of linking videos, 3) baseline differences in examiner stringency between schools, 4) the number of OSCE stations, and 5) the degree of random error within examiners' judgements. We generated distributions of students' "true" performances across several stations, added examiner error, and simulated linking through crossed video-scoring, before using Many Facet Rasch Modelling to produce adjusted scores. We replicated this 1000 times for each permutation to determine the average error reduction and the proportion of students whose scores became more accurate.
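The simulation logic described above can be sketched as follows. This is an illustrative simplification, not the authors' implementation: the scale, parameter values, and function names are assumptions, and the Many Facet Rasch Model is replaced by a simple mean-difference estimate of the stringency gap from the linking videos.

```python
import numpy as np

def run_simulation(n_students=200, n_stations=8, baseline_diff=1.0,
                   noise_sd=0.5, n_link_videos=20, seed=0):
    """One replicate of a simplified VESCA-style simulation (illustrative only).

    Students are split across two schools; school 1's examiners are
    systematically more stringent by `baseline_diff` score points.
    Both examiner groups score a shared set of linking videos, the
    group difference is estimated from those scores (a crude stand-in
    for Many Facet Rasch Modelling), and school 1's scores are adjusted.
    Returns mean absolute error before and after adjustment.
    """
    rng = np.random.default_rng(seed)
    school = np.repeat([0, 1], n_students // 2)

    # Students' "true" performances per station (arbitrary 0-10 scale).
    true_scores = rng.normal(6.0, 1.0, size=(n_students, n_stations))

    # Systematic stringency offset plus random examiner error.
    stringency = np.where(school == 0, 0.0, -baseline_diff)[:, None]
    observed = true_scores + stringency + rng.normal(
        0.0, noise_sd, size=true_scores.shape)

    # Crossed video-scoring: both groups rate the same linking videos,
    # so their mean difference estimates the baseline stringency gap.
    link_true = rng.normal(6.0, 1.0, size=n_link_videos)
    scores_g0 = link_true + rng.normal(0.0, noise_sd, n_link_videos)
    scores_g1 = (link_true - baseline_diff
                 + rng.normal(0.0, noise_sd, n_link_videos))
    est_gap = scores_g0.mean() - scores_g1.mean()

    # Adjust the stricter school's scores by the estimated gap.
    adjusted = observed + np.where(school == 0, 0.0, est_gap)[:, None]

    err_before = np.abs(observed.mean(axis=1) - true_scores.mean(axis=1))
    err_after = np.abs(adjusted.mean(axis=1) - true_scores.mean(axis=1))
    return err_before.mean(), err_after.mean()
```

Replicating `run_simulation` across seeds and parameter permutations then yields the average error reduction; with `baseline_diff=0`, the adjustment only redistributes random error, mirroring the paper's finding that adjustment helps mainly when a systematic gap exists.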
Results: Under all conditions where no baseline difference existed between groups of examiners (i.e. random rather than systematic variance), score adjustment minimally improved, or slightly worsened, score accuracy. Conversely, as the modelled (systematic) baseline differences between schools increased, adjustment accuracy increased, reducing error by up to 71% and making scores more accurate for up to 93% of students in the 20% baseline-difference condition.
Conclusions: Score adjustment through VESCA will substantially enhance equivalence for candidates in distributed OSCEs when 10–20% baseline differences exist between examiners in different schools. As such differences are plausible in practice, consideration should be given to the use of VESCA in large-scale/national exams.
Publisher
Research Square Platform LLC