Affiliation:
1. School of Computer Science and Technology, Shandong University of Finance and Economics, Jinan 250000, China
2. School of Mathematics, Shandong University, Jinan 250000, China
3. School of Management Science and Engineering, Shandong University of Finance and Economics, Jinan 250000, China
Abstract
Personal credit scoring is a challenging issue. In recent years, research has shown that machine learning has satisfactory performance in credit scoring. Because of the advantages of feature combination and feature selection, decision trees can match credit data which have high dimension and a complex correlation. Decision trees tend to overfitting yet. eXtreme Gradient Boosting is an advanced gradient enhanced tree that overcomes its shortcomings by integrating tree models. The structure of the model is determined by hyperparameters, which is aimed at the time-consuming and laborious problem of manual tuning, and the optimization method is employed for tuning. As particle swarm optimization describes the particle state and its motion law as continuous real numbers, the hyperparameter applicable to eXtreme Gradient Boosting can find its optimal value in the continuous search space. However, classical particle swarm optimization tends to fall into local optima. To solve this problem, this paper proposes an eXtreme Gradient Boosting credit scoring model that is based on adaptive particle swarm optimization. The swarm split, which is based on the clustering idea and two kinds of learning strategies, is employed to guide the particles to improve the diversity of the subswarms, in order to prevent the algorithm from falling into a local optimum. In the experiment, several traditional machine learning algorithms and popular ensemble learning classifiers, as well as four hyperparameter optimization methods (grid search, random search, tree-structured Parzen estimator, and particle swarm optimization), are considered for comparison. Experiments were performed with four credit datasets and seven KEEL benchmark datasets over five popular evaluation measures: accuracy, error rate (type I error and type II error), Brier score, and
score. Results demonstrate that the proposed model outperforms other models on average. Moreover, adaptive particle swarm optimization performs better than the other hyperparameter optimization strategies.
Funder
National Natural Science Foundation of China
Subject
General Engineering,General Mathematics
Reference45 articles.
1. Pattern Recognition and Neural Networks
2. Study on credit scoring model and forecasting based on probabilistic neural network;S.-L. Pang;Xitong Gongcheng Lilun Yu Shijian/System Engineering Theory and Practice,2005
3. Statistical learning theory;V. Vapnik,1998
4. A hybrid neural network approach for credit scoring
5. A two-stage fuzzy neural approach for credit risk assessment in a Brazilian credit card company
Cited by
54 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献