Unmasking Banking Fraud: Unleashing the Power of Machine Learning and Explainable AI (XAI) on Imbalanced Data
-
Published:2024-05-23
Issue:6
Volume:15
Page:298
-
ISSN:2078-2489
-
Container-title:Information
-
language:en
-
Short-container-title:Information
Author:
Nobel S. M. Nuruzzaman1ORCID, Sultana Shirin1, Singha Sondip Poul1ORCID, Chaki Sudipto1ORCID, Mahi Md. Julkar Nayeen2ORCID, Jan Tony3ORCID, Barros Alistair4, Whaiduzzaman Md34ORCID
Affiliation:
1. Department of Computer Science and Engineering, Bangladesh University of Business and Technology, Dhaka 1216, Bangladesh 2. Department of Software Engineering, Daffodil International University, Dhaka 1207, Bangladesh 3. Design and Creative Technologies, Torrens University, Brisbane, QLD 4006, Australia 4. School of Information Systems, Queensland University of Technology, Brisbane, QLD 4000, Australia
Abstract
Recognizing fraudulent activity in the banking system is essential due to the significant risks involved. When fraudulent transactions are vastly outnumbered by non-fraudulent ones, dealing with imbalanced datasets can be difficult. This study aims to determine the best model for detecting fraud by comparing four commonly used machine learning algorithms: Support Vector Machine (SVM), XGBoost, Decision Tree, and Logistic Regression. Additionally, we utilized the Synthetic Minority Over-sampling Technique (SMOTE) to address the issue of class imbalance. The XGBoost Classifier proved to be the most successful model for fraud detection, with an accuracy of 99.88%. We utilized SHAP and LIME analyses to provide greater clarity into the decision-making process of the XGBoost model and improve overall comprehension. This research shows that the XGBoost Classifier is highly effective in detecting banking fraud on imbalanced datasets, with an impressive accuracy score. The interpretability of the XGBoost Classifier model was further enhanced by applying SHAP and LIME analysis, which shed light on the significant features that contribute to fraud detection. The insights and findings presented here are valuable contributions to the ongoing efforts aimed at developing effective fraud detection systems for the banking industry.
Reference53 articles.
1. Awoyemi, J.O., Adetunmbi, A.O., and Oluwadare, S.A. (2017, January 29–31). Credit card fraud detection using machine learning techniques: A comparative analysis. Proceedings of the 2017 International Conference on Computing Networking and Informatics (ICCNI), Lagos, Nigeria. 2. Mytnyk, B., Tkachyk, O., Shakhovska, N., Fedushko, S., and Syerov, Y. (2023). Application of Artificial Intelligence for Fraudulent Banking Operations Recognition. Big Data Cogn. Comput., 7. 3. Credit card fraud detection using machine learning as data mining technique;Yee;J. Telecommun. Electron. Comput. Eng. (JTEC),2018 4. Raval, J., Bhattacharya, P., Jadav, N.K., Tanwar, S., Sharma, G., Bokoro, P.N., Elmorsy, M., Tolba, A., and Raboaca, M.S. (2023). RaKShA: A Trusted Explainable LSTM Model to Classify Fraud Patterns on Credit Card Transactions. Mathematics, 11. 5. Irénée, M., Wang, Y., Hei, X., Song, X., Turiho, J.C., and Nyesheja, E.M. (2023). XTS: A Hybrid Framework to Detect DNS-Over-HTTPS Tunnels Based on XGBoost and Cooperative Game Theory. Mathematics, 11.
|
|