Cyber-Risk Forecasting using Machine Learning Models and Generalized Extreme Value Distributions
Jules Sadefo Kamdem and
Danielle Selambi
Additional contact information
Danielle Selambi: African Institute for Mathematical Sciences (AIMS-Cameroon)
Working Papers from HAL
Abstract:
In this paper, we estimate the cost of a data breach using the number of compromised records. The number of such records is predicted by means of a machine learning model, particularly the Random Forest. We further analyse the fat tail phenomena which capture the underlying dynamics in the number of affected records. The objective is to calculate the maximum loss in order to answer the question of the insurability of cyber risk. Our results show that the total number of affected records follow a Frechet distribution, and we then estimate the Generalized Extreme Value (GEV) parameters to calculate the value at risk (VaR). This analysis is critical because it gives an idea of the maximum loss that can be generated by an enterprise data breach. These results are usable in anticipating the premiums for cyber risk coverage in the insurance markets.
Keywords: Cyber insurance; Cyber risk; Machine Learning; Regression Trees; Random Forest; Generalized Extreme Value (search for similar items in EconPapers)
Date: 2022-10-13
New Economics Papers: this item is included in nep-big, nep-cmp, nep-dcm, nep-fmk and nep-rmg
Note: View the original document on HAL open archive server: https://hal.science/hal-03814979v1
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
https://hal.science/hal-03814979v1/document (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:hal:wpaper:hal-03814979
Access Statistics for this paper
More papers in Working Papers from HAL
Bibliographic data for series maintained by CCSD ().