Author:
Khemakhem Sihem,Boujelbene Younes
Abstract
PurposeData mining for predicting credit risk is a beneficial tool for financial institutions to evaluate the financial health of companies. However, the ubiquity of selecting parameters and the presence of unbalanced data sets is a very typical problem of this technique. This study aims to provide a new method for evaluating credit risk, taking into account not only financial and non-financial variables, but also the class imbalance.Design/methodology/approachThe most significant financial and non-financial variables were determined to build a credit scoring model and identify the creditworthiness of companies. Moreover, the Synthetic Minority Oversampling Technique was used to solve the problem of class imbalance and improve the performance of the classifier. The artificial neural networks and decision trees were designed to predict default risk.FindingsResults showed that profitability ratios, repayment capacity, solvency, duration of a credit report, guarantees, size of the company, loan number, ownership structure and the corporate banking relationship duration turned out to be the key factors in predicting default. Also, both algorithms were found to be highly sensitive to class imbalance. However, with balanced data, the decision trees displayed higher predictive accuracy for the assessment of credit risk than artificial neural networks.Originality/valueClassification results depend on the appropriateness of data characteristics and the appropriate analysis algorithm for data sets. The selection of financial and non-financial variables, as well as the resolution of class imbalance allows companies to assess their credit risk successfully.
Subject
General Economics, Econometrics and Finance,Finance,Accounting
Reference102 articles.
1. Credit scoring, statistical techniques and evaluation criteria: a review of the literature;Intelligent Systems in Accounting, Finance and Management,2011
2. Bankruptcy forecasting: an empirical comparison of AdaBoost and neural networks;Decision Support Systems,2008
3. A genetic programming approach for bankruptcy prediction using a highly unbalanced database;EvoWorkshops 2007: Applications of Evolutionary Computing,2007
4. Financial ratios, discriminant analysis and the prediction of corporate bankruptcy;Journal of Finance,1968
5. ZETA analysis: a new model to identify bankruptcy risk of corporations;Journal of Banking and Finance,1977
Cited by
23 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献