Experts’ prediction of the actual item difficulty of multiple-choice questions in the Ethiopian Undergraduate Medicine Licensure Examination

Authors:

Gedamu Shewatatek1, Tadesse Tefera2, Moges Belay3, Schauber Stefan4

Affiliation:

1. Jimma University

2. Addis Ababa University

3. Dilla University

4. University of Oslo

Abstract

Background

The ability of expert ratings to predict item difficulty, and thereby anticipate test-taker performance, is an important aspect of licensure examinations. Expert judgment serves as a primary source of information for decisions about the pass rate of test takers, and the nature of the raters involved in predicting item difficulty is central to setting credible standards. This study therefore aimed to assess and compare raters' predicted and actual MCQ item difficulty in the 2022 Ethiopian Undergraduate Medicine Licensure Examination (UGMLE).

Method

Two hundred multiple-choice questions (MCQs) taken by the 2022 cohort of 815 UGMLE examinees were used, and seven physicians rated the difficulty level of each item. The analysis examined how experts' ratings varied in predicting the computed actual difficulty levels. Descriptive statistics profiled the raters' assessments, and ANOVA tested for differences between the raters' estimates of MCQ difficulty. Regression analysis was used to compare interrater variation in predicted versus actual difficulty levels and to examine the proportion of variance each rater explained in actual difficulty across the whole set of MCQs and all fourteen UGMLE domains.

Results

The study revealed statistically significant differences in the mean difficulty ratings of some raters, moderate to high positive linear relationships with some exam domains, and statistically nonsignificant relationships in others; the observed variability in difficulty ratings across raters and UGMLE domains therefore warrants attention. In the regression analysis, experts' ratings accounted for 33% of the variance in actual UGMLE difficulty levels, and the model showed a moderate positive linear correlation between expert-rated and actual difficulty (R = 0.57) that was statistically significant at p = .05.

Conclusion

This study demonstrated the complex nature of rating MCQ difficulty on the UGMLE and the benefit of obtaining prior expert ratings. Strategies designed around the changing nature of assessment methods are needed to improve raters' accuracy and maintain the required reliability and validity of the exam.
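The Method section describes computing the actual difficulty of each item and regressing expert predictions against it. The following is a minimal illustrative sketch, not the authors' code: it assumes classical item difficulty (proportion of examinees answering correctly) and a simple linear regression, and all variable names and the toy response data are hypothetical.

```python
# Sketch: classical item difficulty and expert-rating regression (toy data).
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
n_examinees, n_items = 815, 200  # cohort and item counts from the abstract

# Hypothetical 0/1 response matrix; 1 = item answered correctly.
responses = rng.integers(0, 2, size=(n_examinees, n_items))

# Actual difficulty of each item: proportion of examinees answering correctly.
actual_difficulty = responses.mean(axis=0)

# Hypothetical mean expert predictions on the same proportion-correct scale.
expert_rating = np.clip(actual_difficulty + rng.normal(0, 0.15, n_items), 0, 1)

# Simple linear regression of actual difficulty on the expert ratings.
result = stats.linregress(expert_rating, actual_difficulty)
print(f"R = {result.rvalue:.2f}, R^2 = {result.rvalue**2:.2f}, p = {result.pvalue:.3f}")
```

With real data, R^2 from such a regression corresponds to the "proportion of variance in actual difficulty explained by expert ratings" reported in the Results (33%, R = 0.57).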

Publisher

Research Square Platform LLC

