Affiliation:
1. Educational Testing Service, Princeton, NJ, USA
Abstract
We evaluated the use of the nominal response model (NRM) to score multiple-choice (also known as “select the best option”) situational judgment tests (SJTs). Using data from two large studies, we compared the reliability and external correlations of NRM scores with those from various classical and item response theory (IRT) scoring methods. The SJTs measured emotional management (Study 1) and teamwork and collaboration (Study 2). In Study 1, the NRM scoring method proved superior to three classical test theory–based and four other IRT-based methods, both in reliability and in correlations with external measures. In Study 2, only slight differences between scoring methods were observed. An explanation for the discrepancy in findings is that in cases where item keys are ambiguous (as in Study 1), the NRM accommodates that ambiguity, whereas in cases where item keys are clear (as in Study 2), the different methods provide interchangeable scores. We characterize ambiguous and clear keys using category response curves based on parameter estimates of the NRM and discuss the relationships between our findings and those from the wisdom-of-the-crowd literature.
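The abstract refers to category response curves derived from NRM parameter estimates. As a minimal sketch (not taken from the paper, and using entirely hypothetical slope and intercept values), the Bock nominal response model gives the probability that a respondent with latent trait theta selects option k as exp(a_k*theta + c_k) divided by the sum of that expression over all options; curves with one dominant option suggest a clear key, while crossing curves suggest an ambiguous key.

import numpy as np

def nrm_category_probabilities(theta, a, c):
    """Category response probabilities under the nominal response model.

    theta : array of latent trait values
    a, c  : slope and intercept parameters, one per response option
    """
    theta = np.asarray(theta, dtype=float).reshape(-1, 1)   # shape (n_theta, 1)
    z = theta * np.asarray(a, dtype=float) + np.asarray(c, dtype=float)
    z -= z.max(axis=1, keepdims=True)                       # numerical stability
    expz = np.exp(z)
    return expz / expz.sum(axis=1, keepdims=True)            # rows sum to 1

# Illustrative four-option SJT item with hypothetical parameters (not estimates
# from either study). Printing the probabilities across theta traces out the
# category response curves discussed in the abstract.
theta = np.linspace(-3, 3, 7)
a = [1.2, 0.4, -0.5, -1.1]   # hypothetical slopes
c = [0.5, 0.8, -0.2, -1.1]   # hypothetical intercepts
probs = nrm_category_probabilities(theta, a, c)
for t, row in zip(theta, probs):
    print(f"theta={t:+.1f}  " + "  ".join(f"{p:.2f}" for p in row))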
Funder
Educational Testing Service
Subject
Management of Technology and Innovation, Strategy and Management, General Decision Sciences
Cited by
5 articles.