Author:
Ye Huanzhuo,Xiang Lin,Gan Yanping
Abstract
Abstract
The study explores the comparison of various classification models in detecting fraudulent financial statements (FFS). Due to the high-class imbalance in this unique domain, the samples chosen in existing researches tend to be processed not so realistically. Therefore Random Forest is adopted to learn imbalanced data, in addition, sampling with SMOTE. Some more effective measure metrics of performance are also added. The experimental dataset includes 11726 publicly available Chinese financial disclosures from 2007 to 2017, of which 1314 financial statements were accused of fraud by CSRC. The result shows that the Random Forest outperforms other algorithms: Artificial Neural Network (ANN), Logistics Regression (LR), Support Vector Machines (SVM), CART, Decision Trees, Bayesian Networks, Bagging, Stacking and Adaboost.
Reference34 articles.
1. Detection of Fraud in Financial Statements: French Companies as a Case Study [J];Amara;International Journal of Academic Research in Accounting Finance & Management Sciences,2013
2. Développement d’un modèle de prédiction du churn clientèle en télécommunication;Bekkar,2009
3. Enhanced SMOTE algorithm for classification of imbalanced big-data using Random Forest [C];Bhagat,2015
Cited by
12 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献