Abstract
Credit card transactions may contain some categorical attributes with large domains, involving up to hundreds of possible values, also known as high-cardinality attributes. The inclusion of such attributes makes analysis harder, due to results with poorer generalization and higher resource usage. A common practice is, therefore, to ignore such attributes, removing them, albeit wasting the information they provided. Contrariwise, this paper reports our findings on the positive impacts of using high-cardinality attributes on credit card fraud detection. Thus, we present a new algorithm for domain reduction that preserves the fraud-detection capabilities. Experiments applying a deep feedforward neural network on real datasets from a major Brazilian financial institution have shown that, when measured by the F-1 metric, the inclusion of such attributes does improve fraud-detection quality. As a main contribution, this proposed algorithm was able to reduce attribute cardinality, improving the training times of a model while preserving its predictive capabilities.
Funder
Brazilian Aeronautics Institute of Technology
Casimiro Montenegro Filho Foundation
2RP Net Enterprise
Brazilian Ministry of Education
Subject
General Mathematics,Engineering (miscellaneous),Computer Science (miscellaneous)
Reference55 articles.
1. Sequence classification for credit-card fraud detection
2. Card Fraud Losses Reach $22.80 Billion,2017
3. 2016 Global Consumer Card Fraud: Where Card Fraud Is Coming From;Knieff,2016
4. Ensemble learning for credit card fraud detection
5. Credit Card Fraud Detection with Artificial Immune System;Gadi;Proceedings of the Artificial Immune Systems,2008
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献