The use of autoencoders for training neural networks with mixed categorical and numerical features

Published:2023-04-24 Issue:2 Volume:53 Page:213-232
ISSN:0515-0361
Container-title:ASTIN Bulletin
language:en
Short-container-title:ASTIN Bull.

Author:

Delong Łukasz, Kozak Anna

Abstract

We focus on modelling categorical features and improving predictive power of neural networks with mixed categorical and numerical features in supervised learning tasks. The goal of this paper is to challenge the current dominant approach in actuarial data science with a new architecture of a neural network and a new training algorithm. The key proposal is to use a joint embedding for all categorical features, instead of separate entity embeddings, to determine the numerical representation of the categorical features which is fed, together with all other numerical features, into hidden layers of a neural network with a target response. In addition, we postulate that we should initialize the numerical representation of the categorical features and other parameters of the hidden layers of the neural network with parameters trained with (denoising) autoencoders in unsupervised learning tasks, instead of using random initialization of parameters. Since autoencoders for categorical data play an important role in this research, they are investigated in more depth in the paper. We illustrate our ideas with experiments on a real data set with claim numbers, and we demonstrate that we can achieve a higher predictive power of the network.
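The joint-embedding idea in the abstract can be illustrated with a minimal forward-pass sketch. It is not the authors' implementation: the function names, the toy feature cardinalities, the random weights (standing in for parameters that the paper would take from a pretrained autoencoder), and the level-resampling corruption scheme are all assumptions made for illustration.

```python
import random


def one_hot(index, n_levels):
    """Return a one-hot list of length n_levels with 1.0 at position index."""
    vec = [0.0] * n_levels
    vec[index] = 1.0
    return vec


def encode_categoricals(levels, cardinalities):
    """Concatenate the one-hot encodings of all categorical features."""
    joint = []
    for index, n_levels in zip(levels, cardinalities):
        joint.extend(one_hot(index, n_levels))
    return joint


def corrupt(levels, cardinalities, rate, rng):
    """Assumed denoising corruption: with probability `rate`, replace a
    feature's level by a level drawn uniformly at random."""
    return [rng.randrange(n) if rng.random() < rate else lvl
            for lvl, n in zip(levels, cardinalities)]


def joint_embedding(levels, cardinalities, weights, biases):
    """Map the concatenated one-hot vector to a dense representation using
    ONE linear layer shared by all categorical features (as opposed to a
    separate entity embedding per feature)."""
    x = encode_categoricals(levels, cardinalities)
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weights, biases)]


def network_input(levels, numerical, cardinalities, weights, biases):
    """Joint representation of the categoricals followed by the numerical
    features; this vector would feed the hidden layers."""
    return joint_embedding(levels, cardinalities, weights, biases) + list(numerical)


# Hypothetical toy setup: two categorical features with 3 and 2 levels.
rng = random.Random(0)
cardinalities = [3, 2]
embedding_dim = 2
n_inputs = sum(cardinalities)
# Random placeholder weights; the paper proposes autoencoder-trained ones.
weights = [[rng.gauss(0.0, 0.1) for _ in range(n_inputs)]
           for _ in range(embedding_dim)]
biases = [0.0] * embedding_dim

levels = [2, 0]          # observed level index of each categorical feature
numerical = [0.35, 1.2]  # numerical features, passed through unchanged
features = network_input(levels, numerical, cardinalities, weights, biases)
noisy_levels = corrupt(levels, cardinalities, rate=0.5, rng=rng)
```

Because the one-hot blocks are concatenated before the linear layer, each embedding coordinate is the sum of one weight per categorical feature, so the representation of one feature's level depends on parameters learned jointly across all features.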

Publisher

Cambridge University Press (CUP)

Subject

Economics and Econometrics, Finance, Accounting

Reference: 33 articles.

1. Hespe, N. (2020) Building autoencoders on sparse, one-hot encoded data. https://towardsdatascience.com/building-autoencoders-on-sparse-one-hot-encoded-data-53eefdfdbcc7.

2. Extracting and composing robust features with denoising autoencoders

3. Grari, V., Charpentier, A., Lamprier, S. and Detyniecki, M. (2022) A fair pricing model via adversarial learning. https://arxiv.org/abs/2202.12008.

4. Lei, L., Petterson, A. and White, M. (2018) Supervised autoencoders: Improving generalization performance with unsupervised regularizers. Proceedings of the 32nd International Conference on Neural Information Processing Systems, pp. 107–117.

Cited by 2 articles.

1. Autoencoders and AutoML for intrusion detection;2023 15th International Conference on Electronics, Computers and Artificial Intelligence (ECAI);2023-06-29

2. Conditional Expectation Network for SHAP;SSRN Electronic Journal;2023