Measurement precision at the cut score in medical multiple choice exams: Theory matters-Reference-Cited by-同舟云学术

Measurement precision at the cut score in medical multiple choice exams: Theory matters

Published:2020-05-28 Issue:4 Volume:9 Page:220-228
ISSN:2212-2761
Container-title:Perspectives on Medical Education
language:en
Short-container-title:Perspect Med Educ

Author:

Lahner Felicitas-Maria^ORCID,Schauber Stefan,Lörwald Andrea Carolin,Kropf Roger,Guttormsen Sissel,Fischer Martin R.^ORCID,Huwendiek Sören^ORCID

Abstract

Abstract Introduction In high-stakes assessment, the measurement precision of pass-fail decisions is of great importance. A concept for analyzing the measurement precision at the cut score is conditional reliability, which describes measurement precision for every score achieved in an exam. We compared conditional reliabilities in Classical Test Theory (CTT) and Item Response Theory (IRT) with a special focus on the cut score and potential factors influencing conditional reliability at the cut score. Methods We analyzed 32 multiple-choice exams from three Swiss medical schools comparing conditional reliability at the cut score in IRT and CCT. Additionally, we analyzed potential influencing factors such as the range of examinees’ performance, year of study, and number of items using multiple regression. Results In CTT, conditional reliability was highest for very low and very high scores, whereas examinees with medium scores showed low conditional reliabilities. In IRT, the maximum conditional reliability was in the middle of the scale. Therefore, conditional reliability at the cut score was significantly higher in IRT compared with CTT. It was influenced by the range of examinees’ performance and number of items. This influence was more pronounced in CTT. Discussion We found that conditional reliability shows inverse distributions and conclusions regarding the measurement precision at the cut score depending on the theory used. As the use of IRT seems to be more appropriate for criterion-oriented standard setting in the framework of competency-based medical education, our findings might have practical implications for the design and quality assurance of medical education assessments.

Publisher

Springer Science and Business Media LLC

Subject

Education

Link

https://link.springer.com/content/pdf/10.1007/s40037-020-00586-0.pdf

Reference36 articles.

1. Downing SM. Validity: on the meaningful interpretation of assessment data. Med Educ. 2003;37(9):830–7.

2. Bandaranayake RC. Setting and maintaining standards in multiple choice examinations: AMEE Guide No. 37. Med Teach. 2008;30(9–10):836–45.

3. Kane M. The precision of measurements. Appl Meas Educ. 1996;9(4):355–79.

4. AERA, APA, NCME. Standards for educational and psychological testing. Washington, DC: American Educational Research Association; 2014.

5. Cronbach L. Coefficient alpha and the internal structure of tests. Psychometrika. 1951;16(3):297–334.

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Optimizing a national examination for medical undergraduates via modern automated test assembly approaches;BMC Medical Education;2024-08-25

2. Utility of a multimodal computer-based assessment format for assessment with a higher degree of reliability and validity;Medical Teacher;2022-10-28

3. Análise da adequação dos itens do Teste de Progresso em medicina;Revista Brasileira de Educação Médica;2022