A Comparative Study of Item Response Theory Models for Mixed Discrete-Continuous Responses

Author:

Zopluoglu Cengiz (1), Lockwood J. R. (2)

Affiliation:

1. College of Education, University of Oregon, Eugene, OR 97403, USA

2. Duolingo, Inc., Pittsburgh, PA 15206, USA

Abstract

Language proficiency assessments are pivotal in educational and professional decision-making. With the integration of AI-driven technologies, these assessments can more frequently use item types, such as dictation tasks, that produce response features with a mixture of discrete and continuous distributions. This study evaluates novel measurement models tailored to these response features. Specifically, we evaluated the performance of the zero-and-one-inflated extensions of the Beta, Simplex, and Samejima’s Continuous item response models and incorporated collateral information into the estimation using latent regression. Our findings show that while all models yielded highly correlated item and person parameter estimates, the Beta item response model had superior out-of-sample predictive accuracy. However, a significant challenge was the absence of established benchmarks for evaluating model and item fit for these novel item response models. Further research is needed to establish such benchmarks and thereby ensure the reliability and validity of these models in real-world applications.

Funder

Duolingo, Inc.

Publisher

MDPI AG
