An Ensemble Learning Approach Based on TabNet and Machine Learning Models for Cheating Detection in Educational Tests-Reference-Cited by-同舟云学术

An Ensemble Learning Approach Based on TabNet and Machine Learning Models for Cheating Detection in Educational Tests

Published:2023-08-21 Issue: Volume: Page:
ISSN:0013-1644
Container-title:Educational and Psychological Measurement
language:en
Short-container-title:Educational and Psychological Measurement

Author:

Zhen Yang¹,Zhu Xiaoyan¹^ORCID

Affiliation:

1. Anhui Technical College of Industry and Economy, Hefei, China

Abstract

The pervasive issue of cheating in educational tests has emerged as a paramount concern within the realm of education, prompting scholars to explore diverse methodologies for identifying potential transgressors. While machine learning models have been extensively investigated for this purpose, the untapped potential of TabNet, an intricate deep neural network model, remains uncharted territory. Within this study, a comprehensive evaluation and comparison of 12 base models (naive Bayes, linear discriminant analysis, Gaussian process, support vector machine, decision tree, random forest, Extreme Gradient Boosting (XGBoost), AdaBoost, logistic regression, k-nearest neighbors, multilayer perceptron, and TabNet) was undertaken to scrutinize their predictive capabilities. The area under the receiver operating characteristic curve (AUC) was employed as the performance metric for evaluation. Impressively, the findings underscored the supremacy of TabNet (AUC = 0.85) over its counterparts, signifying the profound aptitude of deep neural network models in tackling tabular tasks, such as the detection of academic dishonesty. Encouraged by these outcomes, we proceeded to synergistically amalgamate the two most efficacious models, TabNet (AUC = 0.85) and AdaBoost (AUC = 0.81), resulting in the creation of an ensemble model christened TabNet-AdaBoost (AUC = 0.92). The emergence of this novel hybrid approach exhibited considerable potential in research endeavors within this domain. Importantly, our investigation has unveiled fresh insights into the utilization of deep neural network models for the purpose of identifying cheating in educational tests.

Funder

Natural Science Foundation of the Higher Education Institutions of Anhui Province, China

Innovative Foundation for Industry-University-Research of the Higher Education Institutions of China

Publisher

SAGE Publications

Subject

Applied Mathematics,Applied Psychology,Developmental and Educational Psychology,Education

Link

http://journals.sagepub.com/doi/pdf/10.1177/00131644231191298

Reference62 articles.

1. Alexandron G., Lee S., Chen Z., Pritchard D. E. (2016). Detecting cheaters in MOOCs using item response theory and learning analytics. In UMAP (extended proceedings). https://ceur-ws.org/Vol-1618/PALE9.pdf

2. Anguita D., Ghelardoni L., Ghio A., Oneto L., Ridella S. (2012). The “k” in k-fold cross validation. In ESANN (pp. 441–446). https://www.esann.org/sites/default/files/proceedings/legacy/es2012-62.pdf

3. Arik S. Ö., Pfister T. (2021). TabNet: Attentive interpretable tabular learning. In Proceedings of the AAAI conference on artificial intelligence (Vol. 35, pp. 6679–6687). https://ojs.aaai.org/index.php/AAAI/article/view/16826

4. A survey of cross-validation procedures for model selection

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. The Identification of Guessing Patterns in Progress Testing as a Machine Learning Classification Problem;2024-08-02

2. The Identification of Guessing Patterns in Progress Testing as a Machine Learning Classification Problem;2024-07-16