Performance of Polytomous IRT Models With Rating Scale Data: An Investigation Over Sample Size, Instrument Length, and Missing Data-Reference-Cited by-同舟云学术

Performance of Polytomous IRT Models With Rating Scale Data: An Investigation Over Sample Size, Instrument Length, and Missing Data

Published:2021-09-17 Issue: Volume:6 Page:
ISSN:2504-284X
Container-title:Frontiers in Education
language:
Short-container-title:Front. Educ.

Author:

Dai Shenghai,Vo Thao Thu,Kehinde Olasunkanmi James,He Haixia,Xue Yu,Demir Cihan,Wang Xiaolin

Abstract

The implementation of polytomous item response theory (IRT) models such as the graded response model (GRM) and the generalized partial credit model (GPCM) to inform instrument design and validation has been increasing across social and educational contexts where rating scales are usually used. The performance of such models has not been fully investigated and compared across conditions with common survey-specific characteristics such as short test length, small sample size, and data missingness. The purpose of the current simulation study is to inform the literature and guide the implementation of GRM and GPCM under these conditions. For item parameter estimations, results suggest a sample size of at least 300 and/or an instrument length of at least five items for both models. The performance of GPCM is stable across instrument lengths while that of GRM improves notably as the instrument length increases. For person parameters, GRM reveals more accurate estimates when the proportion of missing data is small, whereas GPCM is favored in the presence of a large amount of missingness. Further, it is not recommended to compare GRM and GPCM based on test information. Relative model fit indices (AIC, BIC, LL) might not be powerful when the sample size is less than 300 and the length is less than 5. Synthesis of the patterns of the results, as well as recommendations for the implementation of polytomous IRT models, are presented and discussed.

Publisher

Frontiers Media SA

Subject

Education

Reference47 articles.

1. The Impact of Omitted Responses on the Accuracy of Ability Estimation in Item Response Theory;Ayala;J. Educ. Meas.,2001

2. Psychometric Properties of Three New National Survey of Student Engagement Based Engagement Scales: An Item Response Theory Analysis;Carle;Res. High Educ.,2009

3. Some General Guidelines for Choosing Missing Data Handling Methods in Educational Research;Cheema;J. Mod. Appl. Stat. Methods,2014

4. Statistical Power Analysis for the Behavioral Sciences

Cited by 21 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Nine Versions of the Parent Financial Socialization Scale: Full, Short, and Minimal Versions for Emerging Adults, Adolescents, and Parents;Journal of Family and Economic Issues;2024-06-15

2. Extending the PROMIS item bank “ability to participate in social roles and activities”: a psychometric evaluation using IRT;Quality of Life Research;2024-05-23

3. The Patient Activation Measure-13 (PAM-13) in an oncology patient population: psychometric properties and dimensionality evaluation;Health and Quality of Life Outcomes;2024-05-20

4. Psychometric Assessment of an Item Bank for Adaptive Testing on Patient-Reported Experience of Care Environment for Severe Mental Illness: Validation Study;JMIR Mental Health;2024-05-16

5. Item Response Theory Analysis and Measurement Invariance Testing of the Cultural Humility and Enactment Scale;Measurement and Evaluation in Counseling and Development;2024-04-19