Continuous‐time mean–variance portfolio selection: A reinforcement learning framework

Wang, Haoran; Zhou, Xun Yu

Mathematical Finance, 2020, vol. 30, issue 4, 1273-1308

Abstract: We approach the continuous‐time mean–variance portfolio selection with reinforcement learning (RL). The problem is to achieve the best trade‐off between exploration and exploitation, and is formulated as an entropy‐regularized, relaxed stochastic control problem. We prove that the optimal feedback policy for this problem must be Gaussian, with time‐decaying variance. We then prove a policy improvement theorem, based on which we devise an implementable RL algorithm. We find that our algorithm and its variant outperform both traditional and deep neural network based algorithms in our simulation and empirical studies.
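As a rough illustration of what a Gaussian feedback policy with time-decaying variance could look like in practice, the sketch below samples a risky-asset allocation from a normal distribution whose variance shrinks as time t approaches the horizon T. The abstract states only that the optimal policy is Gaussian with time-decaying variance; the specific mean and variance formulas used here, and all names and parameter values (`lam` for the exploration weight, `sigma` for volatility, `rho` for the Sharpe ratio, `w` for a target wealth level), are assumptions made for illustration and are not taken from this listing.

```python
import math
import random


def policy_variance(t, T, lam, sigma, rho):
    """Exploration variance at time t.

    Assumed schedule: lam / (2 sigma^2) * exp(rho^2 (T - t)),
    which decreases monotonically as t approaches the horizon T.
    """
    return lam / (2.0 * sigma ** 2) * math.exp(rho ** 2 * (T - t))


def policy_mean(x, w, sigma, rho):
    """Assumed linear feedback mean: steers current wealth x toward level w."""
    return -(rho / sigma) * (x - w)


def sample_allocation(t, x, w, T=1.0, lam=2.0, sigma=0.2, rho=0.5, rng=random):
    """Draw one allocation from the assumed Gaussian exploratory policy."""
    mean = policy_mean(x, w, sigma, rho)
    std = math.sqrt(policy_variance(t, T, lam, sigma, rho))
    return rng.gauss(mean, std)


# Variance at the start, middle, and end of a unit horizon (hypothetical values).
variances = [policy_variance(t, 1.0, 2.0, 0.2, 0.5) for t in (0.0, 0.5, 1.0)]
```

Under these assumptions the policy explores most at the start of the horizon and becomes progressively more concentrated around its mean, which is one way to read the exploration and exploitation trade-off described in the abstract.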

Date: 2020

Downloads:
https://doi.org/10.1111/mafi.12281

Persistent link: https://EconPapers.repec.org/RePEc:bla:mathfi:v:30:y:2020:i:4:p:1273-1308

Mathematical Finance is currently edited by Jerome Detemple
