Variable Selection Algorithm for a Mixture of Poisson Regression for Handling Overdispersion in Claims Frequency Modeling Using Telematics Car Driving Data-Reference-Cited by-同舟云学术

Variable Selection Algorithm for a Mixture of Poisson Regression for Handling Overdispersion in Claims Frequency Modeling Using Telematics Car Driving Data

Published:2022-04-12 Issue:4 Volume:10 Page:83
ISSN:2227-9091
Container-title:Risks
language:en
Short-container-title:Risks

Author:

Chan Jennifer S. K.^ORCID,Choy S. T. Boris,Makov Udi,Shamir Ariel^ORCID,Shapovalov Vered

Abstract

In automobile insurance, it is common to adopt a Poisson regression model to predict the number of claims as part of the actuarial pricing process. The Poisson assumption can rarely be justified, often due to overdispersion, and alternative modeling is often considered, typically zero-inflated models, which are special cases of finite mixture distributions. Finite mixture regression modeling of telematics data is challenging to implement since the huge number of covariates computationally prohibits the essential variable selection needed to attain a model with desirable predictive power devoid of overfitting. This paper aims at devising an algorithm that can carry the task of variable selection in the presence of a large number of covariates. This is achieved by generating sub-samples of the data corresponding to each component of the Poisson mixture, and wherein variable selection is applied following the enhancement of the Poisson assumption by means of controlling the number of zero claims. The resulting algorithm is assessed by measuring the out-of-sample AUC (Area Under the Curve), a Machine Learning tool for quantifying predictive power. Finally, the application of the algorithm is demonstrated by using data of claim history and telematics data describing driving behavior. It transpires that unlike alternative algorithms related to Poisson regression, the proposed algorithm is both implementable and enjoys an improved AUC (0.71). The proposed algorithm allows more accurate pricing in an era where telematics data is used for automobile insurance.

Funder

The Society of Actuaries’Committee on Knowledge and Extension Research (CKER) and the Casualty Actuarial Society

Publisher

MDPI AG

Subject

Strategy and Management,Economics, Econometrics and Finance (miscellaneous),Accounting

Link

https://www.mdpi.com/2227-9091/10/4/83/pdf

Reference54 articles.

1. Improving automobile insurance ratemaking using telematics: incorporating mileage and driver behaviour data

2. A new approach to categorising continuous variables in prediction models: Proposal and validation

3. Evaluation measures for models assessment over imbalanced data sets;Bekkar;Journal of Information Engineering and Applications,2013

4. Modelling Unobserved Heterogeneity in Claim Counts Using Finite Mixture Models

5. Experience rating with Poisson mixtures

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Claim Prediction and Premium Pricing for Telematics Auto Insurance Data Using Poisson Regression with Lasso Regularisation;Risks;2024-08-28

2. Weather Conditions and Telematics Panel Data in Monthly Motor Insurance Claim Frequency Models;Risks;2023-03-09

3. Research on CBRN Practical Assessment Technology Based on Artificial Intelligence Technology;Advanced Intelligent Technologies for Information and Communication;2023