Affiliation:
1. Peking University Health Science Center, Beijing, China
2. University of South Carolina, Columbia, USA
3. University of California, Merced, USA
Abstract
In the nonequivalent groups with anchor test (NEAT) design, the responses each group never provides on the other form's unique items are missing by design and can therefore be treated as a planned missing data scenario. For small-sample settings, we present a machine learning (ML)-based imputation technique, chaining random forests (CRF), to perform equating tasks within the NEAT design. Specifically, seven CRF-based imputation equating methods are proposed, each based on a different data augmentation strategy. The equating performance of the proposed methods is examined through a simulation study. Five factors are considered: (a) test length (20, 30, 40, 50), (b) sample size per test form (50 versus 100), (c) ratio of common/anchor items (0.2 versus 0.3), (d) equivalent versus nonequivalent groups taking the two forms (no mean difference versus a mean difference of 0.5), and (e) three types of anchors (random, easy, and hard), resulting in 96 conditions. In addition to the seven CRF-based imputation equating methods, five traditional equating methods were considered: (1) the Tucker method; (2) the Levine observed score method; (3) the equipercentile equating method; (4) the circle-arc method; and (5) concurrent calibration based on the Rasch model, for a total of 12 methods. The findings suggest that, benefiting from the strengths of ML techniques, the CRF-based methods that incorporate the equating result of the Tucker method, namely the IMP_total_Tucker, IMP_pair_Tucker, and IMP_Tucker_circle methods, yield more robust and trustworthy estimates of the missing responses in an equating task and therefore produce more accurate equated scores than the other methods on short tests with small samples.
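To make the planned-missing framing concrete, the sketch below stacks the two NEAT forms into one response matrix in which each group's unseen items are coded as missing, then fills them with a chained random-forest imputer. It uses scikit-learn's IterativeImputer with a RandomForestRegressor as a stand-in for the paper's CRF procedure; the sample sizes, item counts, and simulated responses are illustrative assumptions, not the study's data or its exact algorithm.

```python
# A minimal sketch (not the authors' implementation) of treating the
# NEAT design's structurally missing responses as planned missing data
# and imputing them with chained random forests.
import numpy as np
from sklearn.experimental import enable_iterative_imputer  # noqa: F401
from sklearn.impute import IterativeImputer
from sklearn.ensemble import RandomForestRegressor

rng = np.random.default_rng(0)
n_per_form, n_unique, n_anchor = 50, 16, 4  # e.g., 20-item forms, 0.2 anchor ratio

# Simulate 0/1 responses; each group answers its unique items plus the anchor.
form_x = rng.integers(0, 2, size=(n_per_form, n_unique))
form_y = rng.integers(0, 2, size=(n_per_form, n_unique))
anchor = rng.integers(0, 2, size=(2 * n_per_form, n_anchor))

# Stack into one matrix: columns = X-unique items, Y-unique items, anchor items.
# Items a group never saw are missing by design (np.nan).
data = np.full((2 * n_per_form, 2 * n_unique + n_anchor), np.nan)
data[:n_per_form, :n_unique] = form_x
data[n_per_form:, n_unique:2 * n_unique] = form_y
data[:, 2 * n_unique:] = anchor

# Chain random forests over the columns to fill the planned missingness.
imputer = IterativeImputer(
    estimator=RandomForestRegressor(n_estimators=100, random_state=0),
    max_iter=10, random_state=0,
)
completed = imputer.fit_transform(data)
```

With the completed matrix, both groups have (imputed) scores on both forms, so equated scores can be derived as in a single-group design; rounding the imputed values to 0/1 would restore dichotomous item scores before computing totals.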
Funder
National Natural Science Foundation of China
Subject
Applied Mathematics, Applied Psychology, Developmental and Educational Psychology, Education