EconPapers    
Economics at your fingertips  
 

Self-learning Agents for Recommerce Markets

Jan Groeneveld (), Judith Herrmann (), Nikkel Mollenhauer (), Leonard Dreeßen (), Nick Bessin (), Johann Schulze Tast (), Alexander Kastius (), Johannes Huegle () and Rainer Schlosser ()
Additional contact information
Jan Groeneveld: University of Potsdam
Judith Herrmann: University of Potsdam
Nikkel Mollenhauer: University of Potsdam
Leonard Dreeßen: University of Potsdam
Nick Bessin: University of Potsdam
Johann Schulze Tast: University of Potsdam
Alexander Kastius: University of Potsdam
Johannes Huegle: University of Potsdam
Rainer Schlosser: University of Potsdam

Business & Information Systems Engineering: The International Journal of WIRTSCHAFTSINFORMATIK, 2024, vol. 66, issue 4, No 3, 463 pages

Abstract: Abstract Nowadays, customers as well as retailers look for increased sustainability. Recommerce markets – which offer the opportunity to trade-in and resell used products – are constantly growing and help to use resources more efficiently. To manage the additional prices for the trade-in and the resale of used product versions challenges retailers as substitution and cannibalization effects have to be taken into account. An unknown customer behavior as well as competition with other merchants regarding both sales and buying back resources further increases the problem’s complexity. Reinforcement learning (RL) algorithms offer the potential to deal with such tasks. However, before being applied in practice, self-learning algorithms need to be tested synthetically to examine whether they and which work in different market scenarios. In the paper, the authors evaluate and compare different state-of-the-art RL algorithms within a recommerce market simulation framework. They find that RL agents outperform rule-based benchmark strategies in duopoly and oligopoly scenarios. Further, the authors investigate the competition between RL agents via self-play and study how performance results are affected if more or less information is observable (cf. state components). Using an ablation study, they test the influence of various model parameters and infer managerial insights. Finally, to be able to apply self-learning agents in practice, the authors show how to calibrate synthetic test environments from observable data to be used for effective pre-training.

Keywords: Recommerce; Dynamic pricing; Competition; Reinforcement learning; Market simulation; Sustainability (search for similar items in EconPapers)
Date: 2024
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
http://link.springer.com/10.1007/s12599-023-00841-8 Abstract (text/html)
Access to the full text of the articles in this series is restricted.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:binfse:v:66:y:2024:i:4:d:10.1007_s12599-023-00841-8

Ordering information: This journal article can be ordered from
http://www.springer.com/economics/journal/12599

DOI: 10.1007/s12599-023-00841-8

Access Statistics for this article

Business & Information Systems Engineering: The International Journal of WIRTSCHAFTSINFORMATIK is currently edited by Martin Bichler

More articles in Business & Information Systems Engineering: The International Journal of WIRTSCHAFTSINFORMATIK from Springer, Gesellschaft für Informatik e.V. (GI)
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2025-03-20
Handle: RePEc:spr:binfse:v:66:y:2024:i:4:d:10.1007_s12599-023-00841-8