The QLBS Q-Learner goes NuQLear: fitted Q iteration, inverse RL, and option portfolios
Igor Halperin
Quantitative Finance, 2019, vol. 19, issue 9, 1543-1553
Abstract:
The QLBS model is a discrete-time option hedging and pricing model that is based on Dynamic Programming (DP) and Reinforcement Learning (RL). It combines the famous Q-Learning method for RL with the Black–Scholes (–Merton) (BSM) model's idea of reducing the problem of option pricing and hedging to the problem of optimal rebalancing of a dynamic replicating portfolio for the option, which is made of a stock and cash. Here we expand on several NuQLear (Numerical Q-Learning) topics with the QLBS model. First, we investigate the performance of Fitted Q Iteration for an RL (data-driven) solution to the model, and benchmark it versus a DP (model-based) solution, as well as versus the BSM model. Second, we develop an Inverse Reinforcement Learning (IRL) setting for the model, where we only observe prices and actions (re-hedges) taken by a trader, but not rewards. Third, we outline how the QLBS model can be used for pricing portfolios of options, rather than a single option in isolation, thus providing its own, data-driven and model-independent solution to the (in)famous volatility smile problem of the Black–Scholes model.
Date: 2019
References: Add references at CitEc
Citations: View citations in EconPapers (14)
Downloads: (external link)
http://hdl.handle.net/10.1080/14697688.2019.1622302 (text/html)
Access to full text is restricted to subscribers.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:taf:quantf:v:19:y:2019:i:9:p:1543-1553
Ordering information: This journal article can be ordered from
http://www.tandfonline.com/pricing/journal/RQUF20
DOI: 10.1080/14697688.2019.1622302
Access Statistics for this article
Quantitative Finance is currently edited by Michael Dempster and Jim Gatheral
More articles in Quantitative Finance from Taylor & Francis Journals
Bibliographic data for series maintained by Chris Longhurst ().