EconPapers    
Economics at your fingertips  
 

Learning equilibrium mean‐variance strategy

Min Dai, Yuchao Dong and Yanwei Jia

Mathematical Finance, 2023, vol. 33, issue 4, 1166-1212

Abstract: We study a dynamic mean‐variance portfolio optimization problem under the reinforcement learning framework, where an entropy regularizer is introduced to induce exploration. Due to the time–inconsistency involved in a mean‐variance criterion, we aim to learn an equilibrium policy. Under an incomplete market setting, we obtain a semi‐analytical, exploratory, equilibrium mean‐variance policy that turns out to follow a Gaussian distribution. We then focus on a Gaussian mean return model and propose a reinforcement learning algorithm to find the equilibrium policy. Thanks to a thoroughly designed policy iteration procedure in our algorithm, we prove the convergence of our algorithm under mild conditions, despite that dynamic programming principle and the usual policy improvement theorem failing to hold for an equilibrium policy. Numerical experiments are given to demonstrate our algorithm. The design and implementation of our reinforcement learning algorithm apply to a general market setup.

Date: 2023
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (1)

Downloads: (external link)
https://doi.org/10.1111/mafi.12402

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:bla:mathfi:v:33:y:2023:i:4:p:1166-1212

Ordering information: This journal article can be ordered from
http://www.blackwell ... bs.asp?ref=0960-1627

Access Statistics for this article

Mathematical Finance is currently edited by Jerome Detemple

More articles in Mathematical Finance from Wiley Blackwell
Bibliographic data for series maintained by Wiley Content Delivery ().

 
Page updated 2025-03-19
Handle: RePEc:bla:mathfi:v:33:y:2023:i:4:p:1166-1212