Affiliation:
1. Anhui Technical College of Industry and Economy, Hefei, China
Abstract
The pervasive issue of cheating in educational tests has emerged as a paramount concern within the realm of education, prompting scholars to explore diverse methodologies for identifying potential transgressors. While machine learning models have been extensively investigated for this purpose, the untapped potential of TabNet, an intricate deep neural network model, remains uncharted territory. Within this study, a comprehensive evaluation and comparison of 12 base models (naive Bayes, linear discriminant analysis, Gaussian process, support vector machine, decision tree, random forest, Extreme Gradient Boosting (XGBoost), AdaBoost, logistic regression, k-nearest neighbors, multilayer perceptron, and TabNet) was undertaken to scrutinize their predictive capabilities. The area under the receiver operating characteristic curve (AUC) was employed as the performance metric for evaluation. Impressively, the findings underscored the supremacy of TabNet (AUC = 0.85) over its counterparts, signifying the profound aptitude of deep neural network models in tackling tabular tasks, such as the detection of academic dishonesty. Encouraged by these outcomes, we proceeded to synergistically amalgamate the two most efficacious models, TabNet (AUC = 0.85) and AdaBoost (AUC = 0.81), resulting in the creation of an ensemble model christened TabNet-AdaBoost (AUC = 0.92). The emergence of this novel hybrid approach exhibited considerable potential in research endeavors within this domain. Importantly, our investigation has unveiled fresh insights into the utilization of deep neural network models for the purpose of identifying cheating in educational tests.
Funder
Natural Science Foundation of the Higher Education Institutions of Anhui Province, China
Innovative Foundation for Industry-University-Research of the Higher Education Institutions of China
Subject
Applied Mathematics,Applied Psychology,Developmental and Educational Psychology,Education
Reference62 articles.
1. Alexandron G., Lee S., Chen Z., Pritchard D. E. (2016). Detecting cheaters in MOOCs using item response theory and learning analytics. In UMAP (extended proceedings). https://ceur-ws.org/Vol-1618/PALE9.pdf
2. Anguita D., Ghelardoni L., Ghio A., Oneto L., Ridella S. (2012). The “k” in k-fold cross validation. In ESANN (pp. 441–446). https://www.esann.org/sites/default/files/proceedings/legacy/es2012-62.pdf
3. Arik S. Ö., Pfister T. (2021). TabNet: Attentive interpretable tabular learning. In Proceedings of the AAAI conference on artificial intelligence (Vol. 35, pp. 6679–6687). https://ojs.aaai.org/index.php/AAAI/article/view/16826
4. A survey of cross-validation procedures for model selection