The beer game bullwhip effect mitigation: a deep reinforcement learning approach
Maxim Rozhkov,
Nataliya Alyamovskaya and
Gleb Zakhodiakin
International Journal of Production Research, 2025, vol. 63, issue 18, 6630-6647
Abstract:
This article investigates the application of reinforcement learning (RL) methods to optimise a four-echelon linear supply chain model with stochastic demand. The proposed supply chain configuration is largely based on the production-distribution supply chain of the MIT Supply Chain Beer Game. We show that RL can significantly improve ordering efficiency and overall supply chain performance. The model environment is adapted for the OpenAI ‘gymnasium’ interface with the usage of reward shaping (reward engineering) in the model training process. The algorithm employs two reward function components: costs and order variance metric. We evaluate the effectiveness of RL against Order-Up-To inventory management policies for several supply chain configurations and assess the impact on the overall supply chain stability. An algorithm based on a recurrent proximal policy optimisation (RPPO) is effective for the beer game setup and outperforms Order-Up-To approaches. This RL algorithm generates different ordering patterns and tends to narrow the action space for the agent and thus, to mitigate the bullwhip effect in a more effective way. Our findings suggest that an improvement in the reduction of the bullwhip effect impact is present even if only one agent in the supply chain uses the algorithm as an ordering policy.
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
http://hdl.handle.net/10.1080/00207543.2025.2479831 (text/html)
Access to full text is restricted to subscribers.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:taf:tprsxx:v:63:y:2025:i:18:p:6630-6647
Ordering information: This journal article can be ordered from
http://www.tandfonline.com/pricing/journal/TPRS20
DOI: 10.1080/00207543.2025.2479831
Access Statistics for this article
International Journal of Production Research is currently edited by Professor A. Dolgui
More articles in International Journal of Production Research from Taylor & Francis Journals
Bibliographic data for series maintained by Chris Longhurst ().