Methods for Estimating Item-Score Reliability-Reference-Cited by-同舟云学术

Methods for Estimating Item-Score Reliability

Published:2018-04-09 Issue:7 Volume:42 Page:553-570
ISSN:0146-6216
Container-title:Applied Psychological Measurement
language:en
Short-container-title:Applied Psychological Measurement

Author:

Zijlmans Eva A. O.¹,van der Ark L. Andries²,Tijmstra Jesper¹,Sijtsma Klaas¹

Affiliation:

1. Tilburg University, Tilburg, Netherlands

2. University of Amsterdam, Amsterdam, Netherlands

Abstract

Reliability is usually estimated for a test score, but it can also be estimated for item scores. Item-score reliability can be useful to assess the item’s contribution to the test score’s reliability, for identifying unreliable scores in aberrant item-score patterns in person-fit analysis, and for selecting the most reliable item from a test to use as a single-item measure. Four methods were discussed for estimating item-score reliability: the Molenaar–Sijtsma method (method MS), Guttman’s method [Formula: see text], the latent class reliability coefficient (method LCRC), and the correction for attenuation (method CA). A simulation study was used to compare the methods with respect to median bias, variability (interquartile range [IQR]), and percentage of outliers. The simulation study consisted of six conditions: standard, polytomous items, unequal [Formula: see text] parameters, two-dimensional data, long test, and small sample size. Methods MS and CA were the most accurate. Method LCRC showed almost unbiased results, but large variability. Method [Formula: see text] consistently underestimated item-score reliabilty, but showed a smaller IQR than the other methods.

Publisher

SAGE Publications

Subject

Psychology (miscellaneous),Social Sciences (miscellaneous)

Link

http://journals.sagepub.com/doi/pdf/10.1177/0146621618758290

Reference45 articles.

1. Item Response Theory

2. The Predictive Validity of Multiple-Item versus Single-Item Measures of the Same Constructs

3. Coefficient alpha and the internal structure of tests

4. The Influence of Multidimensionality on the Graded Response Model

Cited by 31 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Occupational self-efficacy scale: Validity in teachers;Acta Psychologica;2024-09

2. Classification of nomophobia among Chinese college students: Evidence from latent profile and ROC analysis;Journal of Behavioral Addictions;2024-06-26

3. Evidence of the validity of the child self‐regulation & behaviour questionnaire for the Brazilian context;Infant and Child Development;2024-06-26

4. Estimating Reliability for Tests With One Constructed‐Response Item in a Section;ETS Research Report Series;2024-06-24

5. Does employee engagement mediate the nexus of job resource and employee turnover intentions?;IIMT Journal of Management;2024-05-21