COMBINING B&B-BASED HYBRID FEATURE SELECTION AND THE IMBALANCE-ORIENTED MULTIPLE-CLASSIFIER ENSEMBLE FOR IMBALANCED CREDIT RISK ASSESSMENT

Author:

SUN Jie1,LEE Young-Chan2,LI Hui1,HUANG Qing-Hua1

Affiliation:

1. Zhejiang Normal University

2. Dongguk University

Abstract

An ideal model for credit risk assessment is supposed to select important features and process imbalanced data sets in an effective manner. This paper proposes an integrated method that combines B&B (branch and bound)-based hybrid feature selection (BBHFS) with the imbalanceoriented multiple-classifier ensemble (IOMCE) for imbalanced credit risk assessment and uses the support vector machine (SVM) and the multiple discriminant analysis (MDA) as the base predictor. BBHFS is a hybrid feature selection method that integrates the t-test and B&B with the k-fold crossvalidation method to search for a satisfactory feature subset. The IOMCE divides majority samples into several subsets and then combines them with minority samples to construct several training sets for constructing a multiple-classifier ensemble model. We conduct main experiments using a 1:3 imbalanced corporate credit risk data set with continuous features and extended experiments using a 1:5 imbalanced data set with continuous features and a 1:3 imbalanced data set with discrete and nominal features. We combine no feature selection and five feature selection methods (the pure B&B, the factor analysis, the pure t-test, t-test & correlation analysis, and BBHFS) with single-classifier and the IOMCE to construct SVM and MDA models for an empirical comparison. When all features are continuous, the BBHFS-IOMCE method generally outperforms all the other methods. More specifically, BBHFS provides more stable and satisfactory results than the other feature selection methods, and compared with single-classifier models, IOMCE models can significantly enhance the recognition rate for minority samples while incurring a small reduction in the recognition rate for majority samples and maintaining an acceptable overall accuracy. When the features are almost discrete or nominal, the IOMCE method retains its ability to deal with an imbalanced data set, although the five feature selection methods have no significant advantages over no feature selection. This suggests that BBHFS is effective in retaining useful information when reducing the dimensionality of continuous features and that the BBHFS-IOMCE method is an important tool for imbalanced credit risk assessment.

Publisher

Vilnius Gediminas Technical University

Subject

Finance

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3