Affiliation:
1. Department of Operations and Information Management, Aston University, Birmingham B4 7ET, UK
2. Department of Computing and Games, Teesside University, Middlesbrough TS1 3BX, UK
Abstract
In the cybersecurity industry, where legitimate transactions far outnumber fraudulent ones, detecting fraud is of paramount significance. In order to evaluate the accuracy of detecting fraudulent transactions in imbalanced real datasets, this study compares the efficacy of two approaches, random under-sampling and oversampling, using the synthetic minority over-sampling technique (SMOTE). Random under-sampling aims for fairness by excluding examples from the majority class, but this compromises precision in favor of recall. To strike a balance and ensure statistical significance, SMOTE was used instead to produce artificial examples of the minority class. Based on the data obtained, it is clear that random under-sampling achieves high recall (92.86%) at the expense of low precision, whereas SMOTE achieves a higher accuracy (86.75%) and a more even F1 score (73.47%) at the expense of a slightly lower recall. As true fraudulent transactions require at least two methods for verification, we investigated different machine learning methods and made suitable balances between accuracy, F1 score, and recall. Our comparison sheds light on the subtleties and ramifications of each approach, allowing professionals in the field of cybersecurity to better choose the approach that best meets the needs of their own firm. This research highlights the need to resolve class imbalances for effective fraud detection in cybersecurity, as well as the need for constant monitoring and the investigation of new approaches to increase applicability.
Funder
VC Research
Leverhulme Trust
International Science Partnerships Fund
Reference30 articles.
1. Digital payment fraud detection methods in digital ages and Industry 4.0;Chang;Comput. Electr. Eng.,2022
2. Dynamic Graph-Based Anomaly Detection in the Electrical Grid;Li;IEEE Trans. Power Syst.,2022
3. Ali, A., Razak, S.A., Othman, S.H., Eisa, T.A.E., AI-Dhaqm, A., Nasser, M., Elhassan, T., Elshafie, H., and Saif, A. (2022). Financial Fraud Detection Based on Machine Learning: A Systematic Literature Review. Appl. Sci., 12.
4. Khando, K., Islam, M.S., and Gao, S. (2022). The Emerging Technologies of Digital Payments and Associated Challenges: A Systematic Literature Review. Future Internet, 15.
5. Alsenaani, K. (2022). Fraud Detection in Financial Services using Machine Learning. [Master’s Thesis, RIT 1 Lomb Memorial Dr].