Sequential Bayesian Ability Estimation Applied to Mixed-Format Item Tests-Reference-Cited by-同舟云学术

Sequential Bayesian Ability Estimation Applied to Mixed-Format Item Tests

Published:2023-09 Issue:5-6 Volume:47 Page:402-419
ISSN:0146-6216
Container-title:Applied Psychological Measurement
language:en
Short-container-title:Applied Psychological Measurement

Author:

Xiong Jiawei¹^ORCID,Cohen Allan S.²^ORCID,Xiong Xinhui (Maggie)³

Affiliation:

1. Pearson, Athens, GA, USA

2. The University of Georgia, Athens, GA, USA

3. Educational Testing Service, Princeton, NJ, USA

Abstract

Large-scale tests often contain mixed-format items, such as when multiple-choice (MC) items and constructed-response (CR) items are both contained in the same test. Although previous research has analyzed both types of items simultaneously, this may not always provide the best estimate of ability. In this paper, a two-step sequential Bayesian (SB) analytic method under the concept of empirical Bayes is explored for mixed item response models. This method integrates ability estimates from different item formats. Unlike the empirical Bayes method, the SB method estimates examinees’ posterior ability parameters with individual-level sample-dependent prior distributions estimated from the MC items. Simulations were used to evaluate the accuracy of recovery of ability and item parameters over four factors: the type of the ability distribution, sample size, test length (number of items for each item type), and person/item parameter estimation method. The SB method was compared with a traditional concurrent Bayesian (CB) calibration method, EAPsum, that uses scaled scores for summed scores to estimate parameters from the MC and CR items simultaneously in one estimation step. From the simulation results, the SB method showed more accurate and reliable ability estimation than the CB method, especially when the sample size was small (150 and 500). Both methods presented similar recovery results for MC item parameters, but the CB method yielded a bit better recovery of the CR item parameters. The empirical example suggested that posterior ability estimated by the proposed SB method had higher reliability than the CB method.

Publisher

SAGE Publications

Subject

Psychology (miscellaneous),Social Sciences (miscellaneous)

Link

http://journals.sagepub.com/doi/pdf/10.1177/01466216231201986

Reference32 articles.

1. Some Observations on the Metric of PC-BILOG Results

2. Item Response Theory

3. Marginal maximum likelihood estimation of item parameters: Application of an EM algorithm

4. Adaptive EAP Estimation of Ability in a Microcomputer Environment