Transformer-Based Deep Neural Language Modeling for Construct-Specific Automatic Item Generation-Reference-Cited by-同舟云学术

Transformer-Based Deep Neural Language Modeling for Construct-Specific Automatic Item Generation

Published:2021-12-14 Issue: Volume: Page:
ISSN:0033-3123
Container-title:Psychometrika
language:en
Short-container-title:Psychometrika

Author:

Hommel Björn E.^ORCID,Wollang Franz-Josef M.^ORCID,Kotova Veronika^ORCID,Zacher Hannes^ORCID,Schmukle Stefan C.^ORCID

Abstract

AbstractAlgorithmic automatic item generation can be used to obtain large quantities of cognitive items in the domains of knowledge and aptitude testing. However, conventional item models used by template-based automatic item generation techniques are not ideal for the creation of items for non-cognitive constructs. Progress in this area has been made recently by employing long short-term memory recurrent neural networks to produce word sequences that syntactically resemble items typically found in personality questionnaires. To date, such items have been produced unconditionally, without the possibility of selectively targeting personality domains. In this article, we offer a brief synopsis on past developments in natural language processing and explain why the automatic generation of construct-specific items has become attainable only due to recent technological progress. We propose that pre-trained causal transformer models can be fine-tuned to achieve this task using implicit parameterization in conjunction with conditional generation. We demonstrate this method in a tutorial-like fashion and finally compare aspects of validity in human- and machine-authored items using empirical data. Our study finds that approximately two-thirds of the automatically generated items show good psychometric properties (factor loadings above .40) and that one-third even have properties equivalent to established and highly curated human-authored items. Our work thus demonstrates the practical use of deep neural networks for non-cognitive automatic item generation.

Publisher

Springer Science and Business Media LLC

Subject

Applied Mathematics,General Psychology

Link

https://link.springer.com/content/pdf/10.1007/s11336-021-09823-9.pdf

Reference72 articles.

1. Abadi, M., Barham, P., Chen, J., Chen, Z., Davis, A., Dean, J., Devin, M., Ghemawat, S., Irving, G., Isard, M., Kudlur, M., Levenberg, J., Monga, R., Moore, S., Murray, D. G., Steiner, B., Tucker, P., Vasudevan, V., Warden, P., ...Zheng, X. (2016). TensorFlow: A system for large-scale machine learning. 12th USENIX symposium on operating systems design and implementation (OSDI 16), 265–283. https://www.usenix.org/system/files/conference/osdi16/osdi16-abadi.pdf

2. Angleitner, A., John, O. P., & Löhr, F.-J. (1986). It’s what you ask and how you ask it: An itemmetric analysis of personality questionnaires. In A. Angleitner & J. S. Wiggins (Eds.), Personality assessment via questionnaires (pp. 61–108). Springer. https://doi.org/10.1007/978-3-642-70751-3_5

3. Bejar, I. (2013). Item generation: Implications for a validity argument. In M. J. Gierl & T. M. Haladyna (Eds.), Automatic item generation: Theory and practice (pp. 40–55). Routledge.

4. Bengio, Y. (2008). Neural net language models. Scholarpedia, 3(1), 3881. https://doi.org/10.4249/scholarpedia.3881

5. Bengio, Y., Simard, P., & Frasconi, P. (1994). Learning long-term dependencies with gradient descent is difficult. IEEE Transactions on Neural Networks, 5(2), 157–166.

Cited by 17 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Investigating the capability of ChatGPT for generating multiple-choice reading comprehension items;System;2024-07

2. Efficiency of computerized adaptive testing with a cognitively designed item bank;Frontiers in Psychology;2024-06-26

3. Exploring quality criteria and evaluation methods in automated question generation: A comprehensive survey;Education and Information Technologies;2024-06-07

4. Analyzing Generative Models for Realistic Data Augmentation across Modalities and Applications;2024 11th International Conference on Computing for Sustainable Global Development (INDIACom);2024-02-28

5. MxML (Exploring the Relationship between Measurement and Machine Learning): Current State of the Field;Educational Measurement: Issues and Practice;2024-01-29