Practical Application of Deep Reinforcement Learning to Optimal Trade Execution
Woo Jae Byun, Bumkyu Choi, Seongmin Kim and Joohyun Jo
Additional contact information 
Woo Jae Byun: Qraft Technologies, Inc., 3040 Three IFC, 10 Gukjegeumyung-ro, Yeongdeungpo-gu, Seoul 07326, Republic of Korea
Bumkyu Choi: Qraft Technologies, Inc., 3040 Three IFC, 10 Gukjegeumyung-ro, Yeongdeungpo-gu, Seoul 07326, Republic of Korea
Seongmin Kim: Qraft Technologies, Inc., 3040 Three IFC, 10 Gukjegeumyung-ro, Yeongdeungpo-gu, Seoul 07326, Republic of Korea
Joohyun Jo: Qraft Technologies, Inc., 3040 Three IFC, 10 Gukjegeumyung-ro, Yeongdeungpo-gu, Seoul 07326, Republic of Korea
FinTech, 2023, vol. 2, issue 3, 1-16
Abstract:
Although deep reinforcement learning (DRL) has recently emerged as a promising technique for optimal trade execution, two problems remain unsolved: (1) the lack of a generalized model for a large collection of stocks and execution time horizons; and (2) the inability to train algorithms accurately because of the discrepancy between the simulation environment and the real market. In this article, we address both issues by combining a widely used reinforcement learning (RL) algorithm, proximal policy optimization (PPO), with a long short-term memory (LSTM) network, and by building a proprietary order execution simulation environment based on historical level 3 market data from the Korea Stock Exchange (KRX). To the best of our knowledge, this paper is the first to achieve generalization across 50 stocks and across execution time horizons ranging from 165 to 380 min, along with a dynamic target volume. The experimental results demonstrate that the proposed algorithm outperforms a popular benchmark, the volume-weighted average price (VWAP), highlighting the potential of DRL for optimal trade execution in real-world financial markets. Furthermore, our algorithm is the first commercialized DRL-based optimal trade execution algorithm in the South Korean stock market.
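For readers unfamiliar with the benchmark named in the abstract, the following is a minimal sketch of how VWAP is computed and how an execution could be scored against it. The function names `vwap` and `slippage_bps`, the sign convention, and the sample numbers are illustrative assumptions; the abstract does not specify the exact evaluation metric used in the paper.

```python
from typing import Sequence


def vwap(prices: Sequence[float], volumes: Sequence[float]) -> float:
    """Volume-weighted average price: sum(p_i * v_i) / sum(v_i)."""
    if len(prices) != len(volumes):
        raise ValueError("prices and volumes must have the same length")
    total_volume = sum(volumes)
    if total_volume <= 0:
        raise ValueError("total volume must be positive")
    return sum(p * v for p, v in zip(prices, volumes)) / total_volume


def slippage_bps(exec_vwap: float, market_vwap: float, side: str) -> float:
    """Performance versus market VWAP in basis points.

    Assumed convention (not taken from the paper): positive means the
    execution beat the benchmark, i.e. bought below or sold above it.
    """
    if side not in ("buy", "sell"):
        raise ValueError("side must be 'buy' or 'sell'")
    sign = 1.0 if side == "sell" else -1.0
    return sign * (exec_vwap - market_vwap) / market_vwap * 1e4


# Hypothetical market trades over the execution horizon.
market = vwap(prices=[100.0, 102.0], volumes=[1.0, 3.0])
# Hypothetical fills obtained by an execution algorithm on a buy order.
fills = vwap(prices=[100.0, 101.0], volumes=[2.0, 2.0])
print(market, fills, slippage_bps(fills, market, side="buy"))
```

Under this convention, a buy order filled at an average price below the market VWAP yields a positive score, which is one common way to read "outperforms VWAP".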
Keywords: deep reinforcement learning; optimal trade execution; artificial intelligence; market microstructure; financial application
JEL-codes: C6 F3 G O3
Date: 2023
Downloads: (external link)
https://www.mdpi.com/2674-1032/2/3/23/pdf (application/pdf)
https://www.mdpi.com/2674-1032/2/3/23/ (text/html)
Persistent link: https://EconPapers.repec.org/RePEc:gam:jfinte:v:2:y:2023:i:3:p:23-429:d:1182401
FinTech is currently edited by Ms. Lizzy Zhou
Bibliographic data for series maintained by MDPI Indexing Manager.