EconPapers    
Economics at your fingertips  
 

Stochastic Policy Gradient Methods in the Uncertain Volatility Model

Lokman A Abbas-Turki, Jean-Fran\c{c}ois Chassagneux, Jean-Philippe Lemor, Gr\'egoire Loeper and Simon Sananes
Additional contact information
Lokman A Abbas-Turki: LPSM
Jean-Fran\c{c}ois Chassagneux: ENSAE Paris
Jean-Philippe Lemor: LPSM
Gr\'egoire Loeper: LPSM
Simon Sananes: LPSM

Papers from arXiv.org

Abstract: The multidimensional Uncertain Volatility Model leads to robust option pricing problems under joint volatility and correlation uncertainty. Their numerical resolution quickly becomes challenging because the associated stochastic control problem is high-dimensional. We propose a backward actor-critic stochastic policy gradient scheme tailored to this setting. The method combines a discrete dynamic programming principle with Proximal Policy Optimization and shallow neural-network approximations of both the value function and the control policy. A key ingredient is the policy parameterization: continuous controls are represented through a squashed Gaussian policy built on a C-vine representation of correlation matrices, which enforces positive semidefiniteness by construction. Numerical experiments on a range of multidimensional derivatives show that the method yields accurate prices, remains computationally efficient, and compares favorably with existing Monte Carlo and machine-learning-based benchmarks for robust pricing in the Uncertain Volatility Model.

Date: 2026-04
References: Add references at CitEc
Citations:

Downloads: (external link)
http://arxiv.org/pdf/2605.06670 Latest version (application/pdf)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:arx:papers:2605.06670

Access Statistics for this paper

More papers in Papers from arXiv.org
Bibliographic data for series maintained by arXiv administrators ().

 
Page updated 2026-05-11
Handle: RePEc:arx:papers:2605.06670