A multi-agent deep reinforcement learning approach for solving the multi-depot vehicle routing problem
Ali Arishi and
Krishna Krishnan
Journal of Management Analytics, 2023, vol. 10, issue 3, 493-515
Abstract:
The multi-depot vehicle routing problem (MDVRP) is one of the most essential and useful variants of the traditional vehicle routing problem (VRP) in supply chain management (SCM) and logistics studies. Many supply chains (SC) choose the joint distribution of multiple depots to cut transportation costs and delivery times. However, the ability to deliver quality and fast solutions for MDVRP remains a challenging task. Traditional optimization approaches in operation research (OR) may not be practical to solve MDVRP in real-time. With the latest developments in artificial intelligence (AI), it becomes feasible to apply deep reinforcement learning (DRL) for solving combinatorial routing problems. This paper proposes a new multi-agent deep reinforcement learning (MADRL) model to solve MDVRP. Extensive experiments are conducted to evaluate the performance of the proposed approach. Results show that the developed MADRL model can rapidly capture relative information embedded in graphs and effectively produce quality solutions in real-time.
Date: 2023
References: Add references at CitEc
Citations:
Downloads: (external link)
http://hdl.handle.net/10.1080/23270012.2023.2229842 (text/html)
Access to full text is restricted to subscribers.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:taf:tjmaxx:v:10:y:2023:i:3:p:493-515
Ordering information: This journal article can be ordered from
http://www.tandfonline.com/pricing/journal/tjma20
DOI: 10.1080/23270012.2023.2229842
Access Statistics for this article
Journal of Management Analytics is currently edited by Li Xu
More articles in Journal of Management Analytics from Taylor & Francis Journals
Bibliographic data for series maintained by Chris Longhurst ().