Unmasking Banking Fraud: Unleashing the Power of Machine Learning and Explainable AI (XAI) on Imbalanced Data-Reference-Cited by-同舟云学术

Unmasking Banking Fraud: Unleashing the Power of Machine Learning and Explainable AI (XAI) on Imbalanced Data

Published:2024-05-23 Issue:6 Volume:15 Page:298
ISSN:2078-2489
Container-title:Information
language:en
Short-container-title:Information

Author:

Nobel S. M. Nuruzzaman¹^ORCID,Sultana Shirin¹,Singha Sondip Poul¹^ORCID,Chaki Sudipto¹^ORCID,Mahi Md. Julkar Nayeen²^ORCID,Jan Tony³^ORCID,Barros Alistair⁴,Whaiduzzaman Md³⁴^ORCID

Affiliation:

1. Department of Computer Science and Engineering, Bangladesh University of Business and Technology, Dhaka 1216, Bangladesh

2. Department of Software Engineering, Daffodil International University, Dhaka 1207, Bangladesh

3. Design and Creative Technologies, Torrens University, Brisbane, QLD 4006, Australia

4. School of Information Systems, Queensland University of Technology, Brisbane, QLD 4000, Australia

Abstract

Recognizing fraudulent activity in the banking system is essential due to the significant risks involved. When fraudulent transactions are vastly outnumbered by non-fraudulent ones, dealing with imbalanced datasets can be difficult. This study aims to determine the best model for detecting fraud by comparing four commonly used machine learning algorithms: Support Vector Machine (SVM), XGBoost, Decision Tree, and Logistic Regression. Additionally, we utilized the Synthetic Minority Over-sampling Technique (SMOTE) to address the issue of class imbalance. The XGBoost Classifier proved to be the most successful model for fraud detection, with an accuracy of 99.88%. We utilized SHAP and LIME analyses to provide greater clarity into the decision-making process of the XGBoost model and improve overall comprehension. This research shows that the XGBoost Classifier is highly effective in detecting banking fraud on imbalanced datasets, with an impressive accuracy score. The interpretability of the XGBoost Classifier model was further enhanced by applying SHAP and LIME analysis, which shed light on the significant features that contribute to fraud detection. The insights and findings presented here are valuable contributions to the ongoing efforts aimed at developing effective fraud detection systems for the banking industry.

Publisher

MDPI AG

Link

https://www.mdpi.com/2078-2489/15/6/298/pdf

Reference53 articles.

1. Awoyemi, J.O., Adetunmbi, A.O., and Oluwadare, S.A. (2017, January 29–31). Credit card fraud detection using machine learning techniques: A comparative analysis. Proceedings of the 2017 International Conference on Computing Networking and Informatics (ICCNI), Lagos, Nigeria.

2. Mytnyk, B., Tkachyk, O., Shakhovska, N., Fedushko, S., and Syerov, Y. (2023). Application of Artificial Intelligence for Fraudulent Banking Operations Recognition. Big Data Cogn. Comput., 7.

3. Credit card fraud detection using machine learning as data mining technique;Yee;J. Telecommun. Electron. Comput. Eng. (JTEC),2018

4. Raval, J., Bhattacharya, P., Jadav, N.K., Tanwar, S., Sharma, G., Bokoro, P.N., Elmorsy, M., Tolba, A., and Raboaca, M.S. (2023). RaKShA: A Trusted Explainable LSTM Model to Classify Fraud Patterns on Credit Card Transactions. Mathematics, 11.

5. Irénée, M., Wang, Y., Hei, X., Song, X., Turiho, J.C., and Nyesheja, E.M. (2023). XTS: A Hybrid Framework to Detect DNS-Over-HTTPS Tunnels Based on XGBoost and Cooperative Game Theory. Mathematics, 11.