Machine Learning for Credit Risk Prediction: A Systematic Literature Review-Reference-Cited by-同舟云学术

Machine Learning for Credit Risk Prediction: A Systematic Literature Review

Published:2023-11-07 Issue:11 Volume:8 Page:169
ISSN:2306-5729
Container-title:Data
language:en
Short-container-title:Data

Author:

Noriega Jomark Pablo¹²^ORCID,Rivera Luis Antonio¹³^ORCID,Herrera José Alfredo¹⁴^ORCID

Affiliation:

1. Departamento Académico de Ciencia de la Computacion, Universidad Nacional Mayor de San Marcos, Decana de América, Lima 15081, Peru

2. Financiera QAPAQ, Lima 150120, Peru

3. Centro de Ciências Exatas e Tecnologia, Universidade Estadual do Norte Fluminense Darcy Ribeiro, Campos dos Goytacazes 28013-602, Brazil

4. Programme in Biotechnology, Engineering and Chemical Technology, Universidad Pablo de Olavide, 41013 Sevilla, Spain

Abstract

In this systematic review of the literature on using Machine Learning (ML) for credit risk prediction, we raise the need for financial institutions to use Artificial Intelligence (AI) and ML to assess credit risk, analyzing large volumes of information. We posed research questions about algorithms, metrics, results, datasets, variables, and related limitations in predicting credit risk. In addition, we searched renowned databases responding to them and identified 52 relevant studies within the credit industry of microfinance. Challenges and approaches in credit risk prediction using ML models were identified; we had difficulties with the implemented models such as the black box model, the need for explanatory artificial intelligence, the importance of selecting relevant features, addressing multicollinearity, and the problem of the imbalance in the input data. By answering the inquiries, we identified that the Boosted Category is the most researched family of ML models; the most commonly used metrics for evaluation are Area Under Curve (AUC), Accuracy (ACC), Recall, precision measure F1 (F1), and Precision. Research mainly uses public datasets to compare models, and private ones to generate new knowledge when applied to the real world. The most significant limitation identified is the representativeness of reality, and the variables primarily used in the microcredit industry are data related to the Demographic, Operation, and Payment behavior. This study aims to guide developers of credit risk management tools and software towards the existing ability of ML methods, metrics, and techniques used to forecast it, thereby minimizing possible losses due to default and guiding risk appetite.

Publisher

MDPI AG

Subject

Information Systems and Management,Computer Science Applications,Information Systems

Link

https://www.mdpi.com/2306-5729/8/11/169/pdf

Reference67 articles.

1. Lombardo, G., Pellegrino, M., Adosoglou, G., Cagnoni, S., Pardalos, P.M., and Poggi, A. (2022). Machine Learning for Bankruptcy Prediction in the American Stock Market: Dataset and Benchmarks. Future Internet, 14.

2. Ziemba, P., Becker, J., Becker, A., Radomska-Zalas, A., Pawluk, M., and Wierzba, D. (2021). Credit decision support based on real set of cash loans using integrated machine learning algorithms. Electronics, 10.

3. Finding the next interesting loan for investors on a peer-to-peer lending platform;Liu;IEEE Access,2021

4. A holistic approach to interpretability in financial lending: Models, visualizations, and summary-explanations;Chen;Decis. Support Syst.,2022

5. Shih, D.H., Wu, T.W., Shih, P.Y., Lu, N.A., and Shih, M.H. (2022). A Framework of Global Credit-Scoring Modeling Using Outlier Detection and Machine Learning in a P2P Lending Platform. Mathematics, 10.

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Striking a Balance: Evaluating Credit Risk with Traditional and Machine Learning Models;Bulletin of Business and Economics (BBE);2024-08-28

2. LEVERAGING ARTIFICIAL INTELLIGENCE (AI) IN PUBLIC SECTOR FINANCIAL RISK MANAGEMENT: INNOVATIONS, CHALLENGES, AND FUTURE DIRECTIONS;EDPACS;2024-07-16

3. Artificial intelligence (AI) and strategic planning process within VUCA environments: a research agenda and guidelines;Management Decision;2024-07-15

4. Probability of Default. A Machine Learning Approach for Non-Financial Companies;European Journal of Theoretical and Applied Sciences;2024-07-01

5. Enhancing Supervised Model Performance in Credit Risk Classification Using Sampling Strategies and Feature Ranking;Big Data and Cognitive Computing;2024-03-06