Variational autoencoder for synthetic insurance data
Charlotte Jamotton and
Donatien Hainaut ()
Additional contact information
Charlotte Jamotton: Université catholique de Louvain, LIDAM/ISBA, Belgium
Donatien Hainaut: Université catholique de Louvain, LIDAM/ISBA, Belgium
No 2023025, LIDAM Discussion Papers ISBA from Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA)
Abstract:
This article explores the application of variational autoencoders (VAEs) to insurance data. Previous research has demonstrated the successful use of generative models, particularly VAEs, in various domains such as image recognition, text classification, and recommender systems. However, their application to insurance data, specifically heterogeneous insurance portfolios with mixed continuous and discrete attributes, remains unexplored. This study introduces novel insights into utilizing VAEs for unsupervised learning tasks in the actuarial field, including dimensionality reduction and synthetic data generation. We propose a VAE model with a quantile transformation of continuous data and a reconstruction loss that combines categorical cross-entropy and mean squared error, along with a KL divergence-based regularization term. The architecture of our VAE model eliminates the need for pre-training layers to fine-tune categorical features representations. We analyze our VAE's ability to reconstruct complex insurance data and generate synthetic insurance policies using a motor portfolio. Our experimental results and analysis highlight the potential of VAEs for addressing challenges related to privacy and anti-discriminatory regulations, bias correction, and data availability in the insurance industry.
Keywords: Autoencoder; variational inference; synthetic data generation; heterogeneous insurance data; dimensionality reduction (search for similar items in EconPapers)
Pages: 36
Date: 2023-06-29
New Economics Papers: this item is included in nep-big
References: Add references at CitEc
Citations:
Downloads: (external link)
https://dial.uclouvain.be/pr/boreal/en/object/bore ... tastream/PDF_01/view (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:aiz:louvad:2023025
Access Statistics for this paper
More papers in LIDAM Discussion Papers ISBA from Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA) Voie du Roman Pays 20, 1348 Louvain-la-Neuve (Belgium). Contact information at EDIRC.
Bibliographic data for series maintained by Nadja Peiffer ().