Deep Reinforcement Learning in Non-Markov Market-Making

Lalor, Luca; Swishchuk, Anatoliy

Deep Reinforcement Learning in Non-Markov Market-Making

Luca Lalor () and Anatoliy Swishchuk
Additional contact information
Luca Lalor: Department of Mathematics and Statistics, University of Calgary, 2500 University Dr NW, Calgary, AB T2N 1N4, Canada
Anatoliy Swishchuk: Department of Mathematics and Statistics, University of Calgary, 2500 University Dr NW, Calgary, AB T2N 1N4, Canada

Risks, 2025, vol. 13, issue 3, 1-27

Abstract: We develop a deep reinforcement learning (RL) framework for an optimal market-making (MM) trading problem, specifically focusing on price processes with semi-Markov and Hawkes Jump-Diffusion dynamics. We begin by discussing the basics of RL and the deep RL framework used; we deployed the state-of-the-art Soft Actor–Critic (SAC) algorithm for the deep learning part. The SAC algorithm is an off-policy entropy maximization algorithm more suitable for tackling complex, high-dimensional problems with continuous state and action spaces, like those in optimal market-making (MM). We introduce the optimal MM problem considered, where we detail all the deterministic and stochastic processes that go into setting up an environment to simulate this strategy. Here, we also provide an in-depth overview of the jump-diffusion pricing dynamics used and our method for dealing with adverse selection within the limit order book, and we highlight the working parts of our optimization problem. Next, we discuss the training and testing results, where we provide visuals of how important deterministic and stochastic processes such as the bid/ask prices, trade executions, inventory, and the reward function evolved. Our study includes an analysis of simulated and real data. We include a discussion on the limitations of these results, which are important points for most diffusion style models in this setting.

Keywords: algorithmic and high-frequency trading; limit order books; deep reinforcement learning; Hawkes process; semi-Markov process; market simulation (search for similar items in EconPapers)
JEL-codes: C G0 G1 G2 G3 K2 M2 M4 (search for similar items in EconPapers)
Date: 2025
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
https://www.mdpi.com/2227-9091/13/3/40/pdf (application/pdf)
https://www.mdpi.com/2227-9091/13/3/40/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jrisks:v:13:y:2025:i:3:p:40-:d:1598238

Access Statistics for this article

Risks is currently edited by Mr. Claude Zhang

More articles in Risks from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().