Abstract
We propose a fuzzy random survival forest (FRSF) to model lapse rates in a life insurance portfolio containing imprecise or incomplete data such as missing, outlier, or noisy values. Following the random forest methodology, the FRSF is proposed as a new machine learning technique for solving time-to-event data using an ensemble of multiple fuzzy survival trees. In the learning process, the combination of methods such as the c-index, fuzzy sets theory, and the ensemble of multiple trees enable the automatic handling of imprecise data. We analyse the results of several experiments and test them statistically; they show the FRSF’s robustness, verifying that its generalisation capacity is not reduced when modelling imprecise data. Furthermore, the results obtained using a real portfolio of a life insurance company demonstrate that the FRSF has a better performance in comparison with other state-of-the-art algorithms such as the traditional Cox model and other tree-based machine learning techniques such as the random survival forest.
Subject
General Mathematics,Engineering (miscellaneous),Computer Science (miscellaneous)
Reference30 articles.
1. International Accounting Standards Board (2022, November 23). Available online: https://www.ifrs.org/supporting-implementation/supporting-materials-by-ifrs-standards/ifrs-17/.
2. EIOPA (2011). EIOPA Report on the Fifth Quantitative Impact Study (QIS5) for Solvency II.
3. Regression Models and Life-tables;Cox;J. R. Stat. Soc. Ser. B,1972
4. Random Forests;Breiman;Mach. Learn.,2001
5. Asymptotically Efficient Rank Invariant Test Procedures;Peto;J. R. Stat. Soc. Ser. A,1972
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献