Practical Application of Deep Reinforcement Learning to Optimal Trade Execution

Byun, Woo Jae; Choi, Bumkyu; Kim, Seongmin; Jo, Joohyun

Practical Application of Deep Reinforcement Learning to Optimal Trade Execution

Woo Jae Byun (), Bumkyu Choi (), Seongmin Kim () and Joohyun Jo ()
Additional contact information
Woo Jae Byun: Qraft Technologies, Inc., 3040 Three IFC, 10 Gukjegeumyung-ro, Yeongdeungpo-gu, Seoul 07326, Republic of Korea
Bumkyu Choi: Qraft Technologies, Inc., 3040 Three IFC, 10 Gukjegeumyung-ro, Yeongdeungpo-gu, Seoul 07326, Republic of Korea
Seongmin Kim: Qraft Technologies, Inc., 3040 Three IFC, 10 Gukjegeumyung-ro, Yeongdeungpo-gu, Seoul 07326, Republic of Korea
Joohyun Jo: Qraft Technologies, Inc., 3040 Three IFC, 10 Gukjegeumyung-ro, Yeongdeungpo-gu, Seoul 07326, Republic of Korea

FinTech, 2023, vol. 2, issue 3, 1-16

Abstract: Although deep reinforcement learning (DRL) has recently emerged as a promising technique for optimal trade execution, two problems still remain unsolved: (1) the lack of a generalized model for a large collection of stocks and execution time horizons; and (2) the inability to accurately train algorithms due to the discrepancy between the simulation environment and real market. In this article, we address the two issues by utilizing a widely used reinforcement learning (RL) algorithm called proximal policy optimization (PPO) with a long short-term memory (LSTM) network and by building our proprietary order execution simulation environment based on historical level 3 market data of the Korea Stock Exchange (KRX). This paper, to the best of our knowledge, is the first to achieve generalization across 50 stocks and across an execution time horizon ranging from 165 to 380 min along with dynamic target volume. The experimental results demonstrate that the proposed algorithm outperforms the popular benchmark, the volume-weighted average price (VWAP), highlighting the potential use of DRL for optimal trade execution in real-world financial markets. Furthermore, our algorithm is the first commercialized DRL-based optimal trade execution algorithm in the South Korea stock market.

Keywords: deep reinforcement learning; optimal trade execution; artificial intelligence; market microstructure; financial application (search for similar items in EconPapers)
JEL-codes: C6 F3 G O3 (search for similar items in EconPapers)
Date: 2023
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
https://www.mdpi.com/2674-1032/2/3/23/pdf (application/pdf)
https://www.mdpi.com/2674-1032/2/3/23/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jfinte:v:2:y:2023:i:3:p:23-429:d:1182401

Access Statistics for this article

FinTech is currently edited by Ms. Lizzy Zhou

More articles in FinTech from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().