Non-Differentiable Loss Function Optimization and Interaction Effect Discovery in Insurance Pricing Using the Genetic Algorithm
Robin Van Oirbeek (),
Félix Vandervorst,
Thomas Bury,
Gireg Willame,
Christopher Grumiau and
Tim Verdonck
Additional contact information
Robin Van Oirbeek: DKV Belgium, Data Science and Governance, Rue de Loxum 25, 1000 Brussels, Belgium
Félix Vandervorst: Allianz Benelux, Data Office, Koning Albert II Laan 32, 1000 Brussels, Belgium
Thomas Bury: Allianz Benelux, Data Office, Koning Albert II Laan 32, 1000 Brussels, Belgium
Gireg Willame: Detralytics, Avenue du Boulevard 21, 1210 Brussels, Belgium
Christopher Grumiau: Department of Mathematics, University of Antwerp-imec, Middelheimlaan 1, 2020 Antwerp, Belgium
Tim Verdonck: Faculty of Economics and Business, KU Leuven, Naamsestraat 69, 3000 Leuven, Belgium
Risks, 2024, vol. 12, issue 5, 1-19
Abstract:
Insurance pricing is the process of determining the premiums that policyholders pay in exchange for insurance coverage. In order to estimate premiums, actuaries use statistical based methods, assessing various factors such as the probability of certain events occurring (like accidents or damages), where the Generalized Linear Models (GLMs) are the industry standard method. Traditional GLM approaches face limitations due to non-differentiable loss functions and expansive variable spaces, including both main and interaction terms. In this study, we address the challenge of selecting relevant variables for GLMs used in non-life insurance pricing both for frequency or severity analyses, amidst an increasing volume of data and variables. We propose a novel application of the Genetic Algorithm (GA) to efficiently identify pertinent main and interaction effects in GLMs, even in scenarios with a high variable count and diverse loss functions. Our approach uniquely aligns GLM predictions with those of black box machine learning models, enhancing their interpretability and reliability. Using a publicly available non-life motor data set, we demonstrate the GA’s effectiveness by comparing its selected GLM with a Gradient Boosted Machine (GBM) model. The results show a strong consistency between the main and interaction terms identified by GA for the GLM and those revealed in the GBM analysis, highlighting the potential of our method to refine and improve pricing models in the insurance sector.
Keywords: Genetic Algorithm; non-life insurance; variable selection; model explainability (search for similar items in EconPapers)
JEL-codes: C G0 G1 G2 G3 K2 M2 M4 (search for similar items in EconPapers)
Date: 2024
References: View complete reference list from CitEc
Citations:
Downloads: (external link)
https://www.mdpi.com/2227-9091/12/5/79/pdf (application/pdf)
https://www.mdpi.com/2227-9091/12/5/79/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jrisks:v:12:y:2024:i:5:p:79-:d:1394505
Access Statistics for this article
Risks is currently edited by Mr. Claude Zhang
More articles in Risks from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().