Affiliation:
1. University of Maryland, College Park
Abstract
In item response theory applications, item fit analysis is often performed for precalibrated items using response data from subsequent test administrations. Because this practice involves sampling variability from two distinct samples, both of which must be properly accounted for in statistical inference, conventional item fit analysis needs to be revisited and modified. This study extends the item fit analysis originally proposed by Haberman et al., which examines the discrepancy between the model-implied and empirical expected score curves. Within the framework of restricted recalibration, we analytically derive standard errors that accurately account for the sampling variability from both samples. We then present findings from a simulation study that evaluates the empirical Type I error rate and power of the proposed method for both dichotomous and polytomous items. An empirical example is also provided, in which we assess the item fit of a pediatric short-form scale in the Patient-Reported Outcome Measurement Information System.
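To illustrate the kind of discrepancy the abstract refers to, the following minimal sketch compares a model-implied expected score curve with an empirical one for a single dichotomous item. All values here (the 2PL item parameters, sample size, and ability bins) are hypothetical choices for illustration only; this is not the authors' estimation procedure and omits the standard errors the paper derives.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical 2PL item: discrimination a and difficulty b (illustrative values)
a, b = 1.2, 0.3

def irf(theta):
    """Model-implied probability of a correct response (2PL item response function)."""
    return 1.0 / (1.0 + np.exp(-a * (theta - b)))

# Simulate abilities and item responses under the model
theta = rng.normal(size=5000)
x = rng.binomial(1, irf(theta))

# Empirical expected score curve: mean observed response within ability bins
bins = np.linspace(-3, 3, 13)
centers = 0.5 * (bins[:-1] + bins[1:])
idx = np.digitize(theta, bins) - 1
emp = np.array([x[idx == k].mean() if np.any(idx == k) else np.nan
                for k in range(len(centers))])

# Model-implied expected score curve evaluated at the same bin centers
imp = irf(centers)

# Raw discrepancy between the two curves (the quantity whose sampling
# variability the paper's derived standard errors would quantify)
resid = emp - imp
print(np.nanmax(np.abs(resid)))
```

Because the data are generated from the same model being evaluated, the discrepancies here reflect only sampling noise; a misfitting item would show systematic deviations between the two curves.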
Funder
Institute of Education Sciences
Publisher
American Educational Research Association (AERA)