Insurance Analytics with Clustering Techniques-Reference-Cited by-同舟云学术

Insurance Analytics with Clustering Techniques

Published:2024-09-05 Issue:9 Volume:12 Page:141
ISSN:2227-9091
Container-title:Risks
language:en
Short-container-title:Risks

Author:

Jamotton Charlotte¹^ORCID,Hainaut Donatien¹,Hames Thomas²

Affiliation:

1. Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA), Université Catholique de Louvain (UCLouvain), 1348 Louvain-la-Neuve, Belgium

2. Detralytics, Rue Belliard 2-B, 1040 Brussels, Belgium

Abstract

The K-means algorithm and its variants are well-known clustering techniques. In actuarial applications, these partitioning methods can identify clusters of policies with similar attributes. The resulting partitions provide an actuarial framework for creating maps of dominant risks and unsupervised pricing grids. This research article aims to adapt well-established clustering methods to complex insurance datasets containing both categorical and numerical variables. To achieve this, we propose a novel approach based on Burt distance. We begin by reviewing the K-means algorithm to establish the foundation for our Burt distance-based framework. Next, we extend the scope of application of the mini-batch and fuzzy K-means variants to heterogeneous insurance data. Additionally, we adapt spectral clustering, a technique based on graph theory that accommodates non-convex cluster shapes. To mitigate the computational complexity associated with spectral clustering’s O(n3) runtime, we introduce a data reduction method for large-scale datasets using our Burt distance-based approach.

Funder

Excellence of Science

Publisher

MDPI AG

Link

https://www.mdpi.com/2227-9091/12/9/141/pdf

Reference34 articles.

1. Customer segmentation and profiling for life insurance using k-modes clustering and decision tree classifier;Arifin;International Journal of Advanced Computer Science and Applications,2021

2. Belkin, Mikhail, and Niyogi, Partha (2001). Laplacian eigenmaps and spectral techniques for embedding and clustering. Advances in Neural Information Processing Systems, 14. Available online: https://proceedings.neurips.cc/paper_files/paper/2001/file/f106b7f99d2cb30c3db1c3cc0fde9ccb-Paper.pdf.

3. Fcm: The fuzzy c-means clustering algorithm;Bezdek;Computers & Geosciences,1984

4. The factorial analysis of qualitative data;Burt;British Journal of Statistical Psychology,1950

5. Campo, Bavo D. C., and Antonio, Katrien (2024). On clustering levels of a hierarchical categorical risk factor. Annals of Actuarial Science, 1–39. Available online: https://www.cambridge.org/core/journals/annals-of-actuarial-science/article/on-clustering-levels-of-a-hierarchical-categorical-risk-factor/1D8A7F6E50B9BFA70478815ABEA1B128#article.