Distributed Dynamic Pricing Strategy Based on Deep Reinforcement Learning Approach in a Presale Mechanism
Yilin Liang,
Yuping Hu (),
Dongjun Luo,
Qi Zhu,
Qingxuan Chen and
Chunmei Wang
Additional contact information
Yilin Liang: School of Informatics, Guangdong University of Finance & Economics, Guangzhou 510320, China
Yuping Hu: School of Informatics, Guangdong University of Finance & Economics, Guangzhou 510320, China
Dongjun Luo: School of Informatics, Guangdong University of Finance & Economics, Guangzhou 510320, China
Qi Zhu: School of Informatics, Guangdong University of Finance & Economics, Guangzhou 510320, China
Qingxuan Chen: School of Informatics, Guangdong University of Finance & Economics, Guangzhou 510320, China
Chunmei Wang: College of Internet Finance & Information Engineering, Guangdong University of Finance, Guangzhou 510521, China
Sustainability, 2023, vol. 15, issue 13, 1-20
Abstract:
Despite the emergence of a presale mechanism that reduces manufacturing and ordering risks for retailers, optimizing the real-time pricing strategy in this mechanism and unknown demand environment remains an unsolved issue. Consequently, we propose an automatic real-time pricing system for e-retailers under the inventory backlog impact in the presale mode, using deep reinforcement learning technology based on the Dueling DQN algorithm. This system models the multicycle pricing problem with a finite sales horizon as a Markov decision process (MDP) to cope with the uncertain environment. We train and evaluate the proposed environment and agent in a simulation environment and compare it with two tabular reinforcement learning algorithms (Q-learning and SARSA). The computational results demonstrate that our proposed real-time pricing learning framework for joint inventory impact can effectively maximize retailers’ profits and has universal applicability to a wide range of presale models. Furthermore, according to a series of experiments, we find that retailers should not neglect the impact of the presale or previous prices on consumers’ purchase behavior. If consumers pay more attention to past prices, the retailer must decrease the current price. When the cost of inventory backlog increases, they need to offer deeper discounts in the early selling period. Additionally, introducing blockchain technology can improve the transparency of commodity traceability information, thus increasing consumer demand for purchase.
Keywords: presale; dynamic pricing; deep reinforcement learning; revenue management; blockchain (search for similar items in EconPapers)
JEL-codes: O13 Q Q0 Q2 Q3 Q5 Q56 (search for similar items in EconPapers)
Date: 2023
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
https://www.mdpi.com/2071-1050/15/13/10480/pdf (application/pdf)
https://www.mdpi.com/2071-1050/15/13/10480/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jsusta:v:15:y:2023:i:13:p:10480-:d:1186000
Access Statistics for this article
Sustainability is currently edited by Ms. Alexandra Wu
More articles in Sustainability from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().