Author:
Andrade João Paulo A.,Paulucio Leonardo S.,Paixão Thiago M.,Berriel Rodrigo F.,Carneiro Teresa Cristina Janes,Carneiro Raphael V.,Souza Alberto F. De,Badue Claudine,Oliveira-Santos Thiago
Abstract
Companies created for money-laundering or as a means for taxevasion are harmful to the country's economy and society. This problem is usually tackled by governmental agencies by having officials to pore over companies' financial data and to single out those that exhibit fraudulent behavior. Such work tends to be slow-paced and tedious. This paper proposes a machine learning-based system capable of classifying whether a company is likely to be involved in fraud or not. Based on financial and tax data from various companies, four different classifiers – k-Nearest Neighbors, Random Forest, Support Vector Machine (SVM), and a Neural Network – were trained and then used to indicate fraud. The best-performing model achieved a macro-averaged F1-score of 92.98% with the Random Forest.
Publisher
Sociedade Brasileira de Computação - SBC
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献