Authors:
Hassan Hadiza, Ahmad Muhammad Aminu, Mustapha Rabi
Abstract
As the world moves toward a cashless society with increasing use of online transactions, the number of credit card users has also increased substantially. This growth has been accompanied by credit card fraud, which is among the major cybercrimes faced by users and causes consequential damage to financial institutions. Credit card fraud detection is therefore crucial. Machine learning based credit card fraud detection systems exist, but machine learning approaches struggle with imbalanced data and with the need to select the best features for effective classification. Class imbalance occurs when a dataset contains only a small number of observations of the minority class compared with the majority class. This study addresses the challenges of feature selection and data imbalance in credit card fraud detection through an enhanced feature engineering method. We propose a technique that uses a wrapper method to select the best features and mitigates data imbalance with a hybrid approach that combines SMOTE, random oversampling and random under-sampling. Five popular machine learning classifiers (Random Forest, Naïve Bayes, K Nearest Neighbor, Decision Tree and Support Vector Machine) are used with balanced and imbalanced datasets to evaluate the technique. The results show significant improvements in accuracy, precision, recall, F1-score, and Kappa score with the enhanced method. Specifically, K Nearest Neighbor, Random Forest and Support Vector Machine achieve perfect accuracy with the balanced data.
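The hybrid balancing idea and the role of the Kappa score can be illustrated with a small standard-library sketch. It is not the authors' implementation: the abstract does not say how the three resampling steps are combined, so the meeting-point class size, the 50/50 split between SMOTE and random duplication, the binary 0/1 labels, and the names `smote`, `hybrid_resample` and `cohen_kappa` are all assumptions made for illustration.

```python
import random
from collections import Counter


def smote(minority, n_new, k=3, rng=None):
    """Synthesize n_new points by interpolating between a minority sample
    and one of its k nearest minority-class neighbours (the SMOTE idea)."""
    rng = rng or random.Random(0)
    synthetic = []
    for _ in range(n_new):
        x = rng.choice(minority)
        neighbours = sorted(
            (p for p in minority if p is not x),
            key=lambda p: sum((a - b) ** 2 for a, b in zip(x, p)),
        )[:k]
        nb = rng.choice(neighbours)
        gap = rng.random()  # position along the segment from x to nb
        synthetic.append(tuple(a + gap * (b - a) for a, b in zip(x, nb)))
    return synthetic


def hybrid_resample(X, y, minority_label=1, majority_label=0, seed=0):
    """Hypothetical hybrid: under-sample the majority class and grow the
    minority class (SMOTE plus random duplication) to a common size."""
    rng = random.Random(seed)
    minority = [x for x, lab in zip(X, y) if lab == minority_label]
    majority = [x for x, lab in zip(X, y) if lab == majority_label]
    target = (len(minority) + len(majority)) // 2  # assumed meeting point
    kept = rng.sample(majority, target)  # random under-sampling
    deficit = target - len(minority)
    n_smote = deficit // 2  # assumed 50/50 split between the two methods
    duplicates = [rng.choice(minority) for _ in range(deficit - n_smote)]  # random oversampling
    grown = minority + smote(minority, n_smote, rng=rng) + duplicates
    labels = [majority_label] * len(kept) + [minority_label] * len(grown)
    return kept + grown, labels


def cohen_kappa(y_true, y_pred):
    """Cohen's kappa: agreement between labels and predictions, corrected
    for the agreement expected by chance."""
    n = len(y_true)
    observed = sum(t == p for t, p in zip(y_true, y_pred)) / n
    true_counts, pred_counts = Counter(y_true), Counter(y_pred)
    expected = sum(true_counts[c] * pred_counts[c] for c in true_counts) / n ** 2
    return 1.0 if expected == 1 else (observed - expected) / (1 - expected)


# Toy data: 95 "legitimate" points and 5 "fraud" points in two dimensions.
data_rng = random.Random(42)
X = [(data_rng.gauss(0, 1), data_rng.gauss(0, 1)) for _ in range(95)]
X += [(data_rng.gauss(3, 1), data_rng.gauss(3, 1)) for _ in range(5)]
y = [0] * 95 + [1] * 5

X_bal, y_bal = hybrid_resample(X, y)
print(Counter(y_bal))  # both classes end up with 50 samples

# A classifier that always predicts "legitimate" looks accurate but has zero kappa.
y_true = [0] * 990 + [1] * 10
y_pred = [0] * 1000
accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
print(accuracy, cohen_kappa(y_true, y_pred))
```

The last two lines show why accuracy alone is a weak signal on imbalanced data: a model that never flags fraud still scores 99% accuracy, while its Kappa score is zero. In practice, library implementations such as those in imbalanced-learn and scikit-learn would normally be preferred over hand-written versions like these.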
Publisher
Federal University Dutsin-Ma