Optimal dividend and capital injection under self-exciting claims

Aubert, Paulin; Chevalier, Etienne; Ly Vath, Vathana

Additional contact information
Paulin Aubert: LaMME - Laboratoire de Mathématiques et Modélisation d'Evry - ENSIIE - Ecole Nationale Supérieure d'Informatique pour l'Industrie et l'Entreprise - UEVE - Université d'Évry-Val-d'Essonne - Université Paris-Saclay - CNRS - Centre National de la Recherche Scientifique - INRAE - Institut National de Recherche pour l’Agriculture, l’Alimentation et l’Environnement
Etienne Chevalier: LaMME - Laboratoire de Mathématiques et Modélisation d'Evry - ENSIIE - Ecole Nationale Supérieure d'Informatique pour l'Industrie et l'Entreprise - UEVE - Université d'Évry-Val-d'Essonne - Université Paris-Saclay - CNRS - Centre National de la Recherche Scientifique - INRAE - Institut National de Recherche pour l’Agriculture, l’Alimentation et l’Environnement
Vathana Ly Vath: LaMME - Laboratoire de Mathématiques et Modélisation d'Evry - ENSIIE - Ecole Nationale Supérieure d'Informatique pour l'Industrie et l'Entreprise - UEVE - Université d'Évry-Val-d'Essonne - Université Paris-Saclay - CNRS - Centre National de la Recherche Scientifique - INRAE - Institut National de Recherche pour l’Agriculture, l’Alimentation et l’Environnement, ENSIIE - Ecole Nationale Supérieure d'Informatique pour l'Industrie et l'Entreprise

Working Papers from HAL

Abstract: In this paper, we study an optimal dividend and capital-injection problem in a Cramér-Lundberg model where claim arrivals follow a Hawkes process, capturing clustering effects often observed in insurance portfolios. We establish key analytical properties of the value function and characterise the optimal capital-injection strategy through an explicit threshold. We also show that the value function is the unique viscosity solution of the associated HJB variational inequality. For numerical purposes, we first compute a benchmark solution via a monotone finite-difference scheme with Howard's policy iteration. We then develop a reinforcement learning approach based on policy-gradient and actor-critic methods. The learned strategies closely match the PDE benchmark and remain stable across initial conditions. The results highlight the relevance of policy-gradient techniques for dividend optimisation under self-exciting claim dynamics and point toward scalable methods for higher-dimensional extensions.
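
To make the claim dynamics described in the abstract concrete, the following is a minimal simulation sketch of an uncontrolled Cramér-Lundberg surplus whose claims arrive according to a Hawkes process. It is not taken from the paper: the exponential excitation kernel, the exponential claim sizes, the function name `simulate_hawkes_surplus`, and all parameter names and values are illustrative assumptions. Dividends and capital injections are deliberately left out. Arrivals are drawn with Ogata's thinning method, which is valid here because the intensity only decays between claims.

```python
import math
import random


def simulate_hawkes_surplus(x0, premium_rate, base_intensity, jump, decay,
                            mean_claim, horizon, seed=0):
    """Simulate one uncontrolled surplus path (no dividends, no injections).

    Assumed intensity (hypothetical, not from the paper):
        lambda_t = base_intensity + sum_i jump * exp(-decay * (t - T_i))
    Returns (claim_times, final_surplus, ruined).
    """
    rng = random.Random(seed)
    t = 0.0
    surplus = x0
    excess = 0.0  # part of the intensity above the baseline
    claim_times = []
    while True:
        # The intensity only decays until the next claim, so its current
        # value is a valid upper bound for the thinning step.
        lam_bar = base_intensity + excess
        wait = rng.expovariate(lam_bar)
        if t + wait >= horizon:
            # No further claim before the horizon: collect premiums and stop.
            surplus += premium_rate * (horizon - t)
            return claim_times, surplus, False
        t += wait
        surplus += premium_rate * wait
        excess *= math.exp(-decay * wait)
        # Accept the candidate time with probability lambda_t / lam_bar.
        if rng.random() * lam_bar <= base_intensity + excess:
            claim_times.append(t)
            surplus -= rng.expovariate(1.0 / mean_claim)  # exponential claim size
            excess += jump  # self-excitation: each claim raises the intensity
            if surplus < 0.0:
                return claim_times, surplus, True


if __name__ == "__main__":
    times, final_surplus, ruined = simulate_hawkes_surplus(
        x0=10.0, premium_rate=1.5, base_intensity=1.0,
        jump=0.5, decay=1.0, mean_claim=1.0, horizon=50.0, seed=1)
    print(len(times), round(final_surplus, 3), ruined)
```

Under this assumed kernel, the ratio `jump / decay` is the average number of follow-on claims triggered by one claim; keeping it below 1 keeps the arrival process from exploding. Clustering shows up as bursts of closely spaced entries in `claim_times`.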

Date: 2026-06-02
Note: View the original document on HAL open archive server: https://hal.science/hal-05382935v2

Downloads:
https://hal.science/hal-05382935v2/document (application/pdf)

Persistent link: https://EconPapers.repec.org/RePEc:hal:wpaper:hal-05382935
