Delta Boosting Implementation of Negative Binomial Regression in Actuarial Pricing
Simon CK Lee
Additional contact information
Simon CK Lee: Department of Statistics and Actuarial Science, The University of Hong Kong, Pokfulam Road, Hong Kong
Risks, 2020, vol. 8, issue 1, 1-21
Abstract:
This study proposes an efficacious approach to analyze the over-dispersed insurance frequency data as it is imperative for the insurers to have decisive informative insights for precisely underwriting and pricing insurance products, retaining existing customer base and gaining an edge in the highly competitive retail insurance market. The delta boosting implementation of the negative binomial regression, both by one-parameter estimation and a novel two-parameter estimation, was tested on the empirical data. Accurate parameter estimation of the negative binomial regression is complicated with considerations of incomplete insurance exposures, negative convexity, and co-linearity. The issues mainly originate from the unique nature of insurance operations and the adoption of distribution outside the exponential family. We studied how the issues could significantly impact the quality of estimation. In addition to a novel approach to simultaneously estimate two parameters in regression through boosting, we further enrich the study by proposing an alteration of the base algorithm to address the problems. The algorithm was able to withstand the competition against popular regression methodologies in a real-life dataset. Common diagnostics were applied to compare the performance of the relevant candidates, leading to our conclusion to move from light-tail Poisson to negative binomial for over-dispersed data, from generalized linear model (GLM) to boosting for non-linear and interaction patterns, from one-parameter to two-parameter estimation to reflect more closely the reality.
Keywords: boosting trees; gradient boosting; predictive modeling; insurance; machine learning; negative binomial (search for similar items in EconPapers)
JEL-codes: C G0 G1 G2 G3 K2 M2 M4 (search for similar items in EconPapers)
Date: 2020
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
https://www.mdpi.com/2227-9091/8/1/19/pdf (application/pdf)
https://www.mdpi.com/2227-9091/8/1/19/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jrisks:v:8:y:2020:i:1:p:19-:d:322684
Access Statistics for this article
Risks is currently edited by Mr. Claude Zhang
More articles in Risks from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().