Deep learning for high-dimensional continuous-time stochastic optimal control without explicit solution
Jean-Loup Dupret and
Donatien Hainaut ()
Additional contact information
Jean-Loup Dupret: Université catholique de Louvain, LIDAM/ISBA, Belgium
Donatien Hainaut: Université catholique de Louvain, LIDAM/ISBA, Belgium
No 2024016, LIDAM Discussion Papers ISBA from Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA)
Abstract:
This paper introduces the GPI-PINN algorithm, a novel numerical scheme for solving continuous-time stochastic optimal control problems in high dimensions when the optimal control does not admit an explicit solution. Combining Physics-Informed Neural Networks with an Actor-Critic structure built upon the Generalized Policy Iteration technique, this successive deep learning algorithm employs two separate neural networks to approximate both the value function and the multidimensional optimal control. This way, the GPI-PINN algorithm manages to achieve a global approximation of the optimal solution across all time and space, which can be evaluated online rapidly. The optimality and convergence of the scheme are demonstrated theoretically and its accuracy and efficacy are shown empirically based on two numerical examples. In particular, we generalize the standard Almgren-Chriss model arising from optimal liquidation in finance by allowing for a price impact model with fully nonlinear temporary and permanent impact functions and by considering a multidimensional setting with numerous co-integrated assets.
Keywords: Machine learning; Stochastic optimal control; Deep learning; Physics-Informed Neural Networks; Optimal liquidation (search for similar items in EconPapers)
Pages: 31
Date: 2024-05-30
New Economics Papers: this item is included in nep-big and nep-cmp
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
https://dial.uclouvain.be/pr/boreal/en/object/bore ... tastream/PDF_01/view (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:aiz:louvad:2024016
Access Statistics for this paper
More papers in LIDAM Discussion Papers ISBA from Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA) Voie du Roman Pays 20, 1348 Louvain-la-Neuve (Belgium). Contact information at EDIRC.
Bibliographic data for series maintained by Nadja Peiffer ().