On hybrid tree-based methods for short-term insurance claims
-
Published:2023-03-08
Issue:2
Volume:37
Page:597-620
-
ISSN:0269-9648
-
Container-title:Probability in the Engineering and Informational Sciences
-
language:en
-
Short-container-title:Prob. Eng. Inf. Sci.
Author:
Quan ZhiyuORCID,
Wang Zhiguo,
Gan Guojun,
Valdez Emiliano A.
Abstract
AbstractTwo-part framework and the Tweedie generalized linear model (GLM) have traditionally been used to model loss costs for short-term insurance contracts. For most portfolios of insurance claims, there is typically a large proportion of zero claims that leads to imbalances, resulting in lower prediction accuracy of these traditional approaches. In this article, we propose the use of tree-based methods with a hybrid structure that involves a two-step algorithm as an alternative approach. For example, the first step is the construction of a classification tree to build the probability model for claim frequency. The second step is the application of elastic net regression models at each terminal node from the classification tree to build the distribution models for claim severity. This hybrid structure captures the benefits of tuning hyperparameters at each step of the algorithm; this allows for improved prediction accuracy, and tuning can be performed to meet specific business objectives. An obvious major advantage of this hybrid structure is improved model interpretability. We examine and compare the predictive performance of this hybrid structure relative to the traditional Tweedie GLM using both simulated and real datasets. Our empirical results show that these hybrid tree-based methods produce more accurate and informative predictions.
Publisher
Cambridge University Press (CUP)
Subject
Industrial and Manufacturing Engineering,Management Science and Operations Research,Statistics, Probability and Uncertainty,Statistics and Probability
Reference27 articles.
1. Penalized regressions: The bridge versus the Lasso;Fu;Journal of Computational and Graphical Statistics,1998
2. Generalized regression trees;Chaudhuri;Statistica Sinica,1995
3. Multivariate Frequency-Severity Regression Models in Insurance
4. Exponential dispersion models;Jørgensen;Journal of the Royal Statistical Society: Series B (Methodological),1987
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献