Abstract
AbstractIn applications of predictive modeling, such as insurance pricing, indirect or proxy discrimination is an issue of major concern. Namely, there exists the possibility that protected policyholder characteristics are implicitly inferred from non-protected ones by predictive models and are thus having an undesirable (and possibly illegal) impact on prices. A technical solution to this problem relies on building a best-estimate model using all policyholder characteristics (including protected ones) and then averaging out the protected characteristics for calculating individual prices. However, such an approach requires full knowledge of policyholders’ protected characteristics, which may in itself be problematic. Here, we address this issue by using a multi-task neural network architecture for claim predictions, which can be trained using only partial information on protected characteristics and produces prices that are free from proxy discrimination. We demonstrate the proposed method on both synthetic data and a real-world motor claims dataset, in which proxy discrimination can be observed. In both examples we find that the predictive accuracy of the multi-task network is comparable to a conventional feed-forward neural network, when the protected information is available for at least half of the insurance policies. However, the multi-task network has superior performance in the case when the protected information is known for less than half of the insurance policyholders.
Publisher
Springer Science and Business Media LLC
Subject
Statistics, Probability and Uncertainty,Economics and Econometrics,Statistics and Probability
Reference26 articles.
1. Abbas A, Sutter D, Zoufal C, Lucchi A, Figalli A, Woerner S (2021) The power of quantum neural networks. Nat Comput Sci 1:403–409
2. Araiza Iturria CA, Hardy M, Marriott P (2022) A discrimination-free premium under a causal framework. SSRN Manuscript ID 4079068
3. Batista GEAPA, Monard MC (2002) A study of $$k$$-nearest neighbour as an imputation method. In: Abraham A, Ruiz-del-Solar J, Köppen M (eds) Soft computing systems—design, management and applications, Frontiers in Artificial Intelligence and Applications, vol 87. IOS Press, Amsterdam, pp 251–260
4. Buolamwini J, Gebru T (2018) Gender shades: intersectional accuracy disparities in commercial gender classification. In: Conference on fairness, accountability and transparency, proceedings of machine learning research, vol 81, pp 77–91
5. Chaudhuri A, Christofides TC (2013) Indirect questioning in sample surveys. Springer, Berlin
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献