A multimodal deep reinforcement learning framework for multi-period inventory decision-making under demand uncertainty

Tian, Yu-Xin; Zhang, Chuan

A multimodal deep reinforcement learning framework for multi-period inventory decision-making under demand uncertainty

Yu-Xin Tian and Chuan Zhang ()
Additional contact information
Yu-Xin Tian: Northeastern University, School of Business Administration
Chuan Zhang: Northeastern University, School of Business Administration

Fuzzy Optimization and Decision Making, 2025, vol. 24, issue 4, No 6, 723-750

Abstract: Abstract We address multi-period inventory decision-making using multisource multimodal data and propose a deep reinforcement learning (DRL) method—Word Embedding and Transformer-enhanced Twin Delayed Deep Deterministic Policy Gradient (WET-TD3). This method integrates multimodal environmental perception with policy optimization to produce end-to-end replenishment decisions for each period. First, we design multimodal feature-aware agent neural networks that incorporate word embeddings and Transformer modules to process structured demand-related features and unstructured customer reviews from multiple sources. This design constructs a state space responsive to dynamic markets. Second, we integrate into the TD3 algorithm a multimodal Actor-Critic architecture tailored for high-dimensional heterogeneous inputs. Additionally, we introduce delayed policy updates, experience replay, and exploration noise mechanisms to improve training stability. Experiments on real-world data show WET-TD3 outperforms benchmarks, reducing average cost by over 53.69%. It dynamically adjusts replenishment strategies based on the relative magnitudes of holding and underage costs, maintaining stable performance across cost structures. These results underscore the value of deeply integrating textual reviews and structured data, and demonstrate the DRL framework’s effectiveness for long-term optimization goals under demand uncertainty.

Keywords: Inventory optimization; Deep reinforcement learning; Multimodal data fusion; Decision-making under uncertainty; Transformer (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:

Downloads: (external link)
http://link.springer.com/10.1007/s10700-025-09462-0 Abstract (text/html)
Access to the full text of the articles in this series is restricted.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:fuzodm:v:24:y:2025:i:4:d:10.1007_s10700-025-09462-0

Ordering information: This journal article can be ordered from
http://www.springer.com/journal/10700

DOI: 10.1007/s10700-025-09462-0

Access Statistics for this article

Fuzzy Optimization and Decision Making is currently edited by Shu-Cherng Fang and Boading Liu

More articles in Fuzzy Optimization and Decision Making from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().