Deep neural networks algorithms for stochastic control problems on finite horizon: numerical applications

Bachouch, Achref; Huré, Côme; Langrené, Nicolas; Pham, Huyen

Deep neural networks algorithms for stochastic control problems on finite horizon: numerical applications

Achref Bachouch, Côme Huré, Nicolas Langrené () and Huyen Pham ()
Additional contact information
Achref Bachouch: UiO - University of Oslo
Côme Huré: LPSM (UMR_8001) - Laboratoire de Probabilités, Statistique et Modélisation - UPD7 - Université Paris Diderot - Paris 7 - SU - Sorbonne Université - CNRS - Centre National de la Recherche Scientifique
Nicolas Langrené: CSIRO - Data61 [Canberra] - ANU - Australian National University - CSIRO - Commonwealth Scientific and Industrial Research Organisation [Australia]
Huyen Pham: LPSM (UMR_8001) - Laboratoire de Probabilités, Statistique et Modélisation - UPD7 - Université Paris Diderot - Paris 7 - SU - Sorbonne Université - CNRS - Centre National de la Recherche Scientifique

Post-Print from HAL

Abstract: This paper presents several numerical applications of deep learning-based algorithms that have been introduced in [HPBL18]. Numerical and comparative tests using TensorFlow illustrate the performance of our different algorithms, namely control learning by performance iteration (algorithms NNcontPI and ClassifPI), control learning by hybrid iteration (algorithms Hybrid-Now and Hybrid-LaterQ), on the 100-dimensional nonlinear PDEs examples from [EHJ17] and on quadratic backward stochastic differential equations as in [CR16]. We also performed tests on low-dimension control problems such as an option hedging problem in finance, as well as energy storage problems arising in the valuation of gas storage and in microgrid management. Numerical results and comparisons to quantization-type algorithms Qknn, as an efficient algorithm to numerically solve low-dimensional control problems, are also provided; and some corresponding codes are available on https://github.com/comeh/.

Keywords: reinforcement learning; Policy iteration algorithm; Deep learning; value iteration; quantization (search for similar items in EconPapers)
Date: 2020
New Economics Papers: this item is included in nep-big, nep-cmp and nep-ore
Note: View the original document on HAL open archive server: https://hal.science/hal-01949221v3
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (1)

Published in Methodology and Computing in Applied Probability, In press

Downloads: (external link)
https://hal.science/hal-01949221v3/document (application/pdf)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:hal:journl:hal-01949221

Access Statistics for this paper

More papers in Post-Print from HAL
Bibliographic data for series maintained by CCSD ().