Abstract
AbstractWe present the La Mobilière insurance customers dataset: a 12-year-long longitudinal collection of data on policies of customers of the Swiss insurance company La Mobilière. To preserve the privacy of La Mobilière customers, we propose the data aggregated at two geographical levels, based on the place of residence of the customer: postal areas and municipalities. For each geographical area, the data provides summary statistics on: (i) the demographic characteristics of the customer base, (ii) characteristics of vehicles insurance policies and (iii) characteristics of housing and building insurance policies. To assess the validity of the data, we investigate the temporal consistency of the data and the representativeness of La Mobilière customer base along several dimensions (total population, percentage of foreigners, etc.). We also show how the insurance data can reliably model the spatial patterns of socio-economic indicators at a high geographical resolution. We believe that the reuse of this data provides an opportunity for researchers to broaden the socio-economic characterization of Swiss areas beyond the use of official data sources.
Publisher
Springer Science and Business Media LLC
Subject
Library and Information Sciences,Statistics, Probability and Uncertainty,Computer Science Applications,Education,Information Systems,Statistics and Probability
Reference20 articles.
1. Sivarajah, U., Kamal, M. M., Irani, Z. & Weerakkody, V. Critical analysis of big data challenges and analytical methods. Journal of Business Research 70, 263–286 (2017).
2. Florescu, D. et al. Will ‘big data’transform official statistics. In European Conference on the QualityofOfficial Statistics. Vienna, Austria, 2–5 (2014).
3. Augustin, M., Reich, K., Glaeske, G., Schaefer, I. & Radtke, M. Co-morbidity and age-related prevalence of psoriasis: analysis of health insurance data in germany. Acta dermato-venereologica 90, 147–151 (2010).
4. Heyer, K., Herberger, K., Protz, K., Glaeske, G. & Augustin, M. Epidemiology of chronic wounds in germany: Analysis of statutory health insurance data. Wound Repair and Regeneration 24, 434–442 (2016).
5. Rawte, V. & Anuradha, G. Fraud detection in health insurance using data mining techniques. In 2015 International Conference on Communication, Information & Computing Technology (ICCICT), 1–5 (IEEE, 2015).