Dynamic scheduling of flexible bus services with hybrid requests and fairness: Heuristics-guided multi-agent reinforcement learning with imitation learning

Wu, Weitiao; Zhu, Yanchen; Liu, Ronghui

Dynamic scheduling of flexible bus services with hybrid requests and fairness: Heuristics-guided multi-agent reinforcement learning with imitation learning

Weitiao Wu, Yanchen Zhu and Ronghui Liu

Transportation Research Part B: Methodological, 2024, vol. 190, issue C

Abstract: Flexible bus is a class of demand-responsive transit that provides door-to-door service. It is gaining popularity now but also encounters many challenges, such as high dynamism, immediacy requirements, and financial sustainability. Scientific literature designs flexible bus services only for reservation demand, overlooking the potential market for immediate demand that can improve ride pooling and financial sustainability. The increasing availability of historical travel demand data provides opportunities for leveraging future demand prediction in optimizing fleet utilization. This study investigates prediction failure risk-aware dynamic scheduling flexible bus services with hybrid requests allowing for both reservation and immediate demand. Equity in request waiting time for immediate demand is emphasized as a key objective. We model this problem as a multi-objective Markov decision process to jointly optimize vehicle routing, timetable, holding control and passenger assignment. To solve this problem, we develop a novel heuristics-guided multi-agent reinforcement learning (MARL) framework entailing three salient features: 1) incorporating the demand forecasting and prediction error correction modules into the MARL framework; 2) combining the benefits of MARL, local search algorithm, and imitation learning (IL) to improve solution quality; 3) incorporating an improved strategy in action selection with time-related information about spatio-temporal relationships between vehicles and passengers to enhance training efficiency. These enhancements are general methodological contributions to the artificial intelligence and operations research communities. Numerical experiments show that our proposed method is comparable to prevailing benchmark methods both with respect to training stability and solution quality. The benefit of demand prediction is significant even when the prediction is imperfect. Our model and algorithm are applied to a real-world case study in Guangzhou, China. Managerial insights are also provided.

Keywords: Flexible bus; Multi-agent reinforcement learning; Imitation learning; Local search; Demand prediction (search for similar items in EconPapers)
Date: 2024
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
http://www.sciencedirect.com/science/article/pii/S0191261524001930
Full text for ScienceDirect subscribers only

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:eee:transb:v:190:y:2024:i:c:s0191261524001930

Ordering information: This journal article can be ordered from
http://www.elsevier.com/wps/find/supportfaq.cws_home/regional
https://shop.elsevie ... _01_ooc_1&version=01

DOI: 10.1016/j.trb.2024.103069

Access Statistics for this article

Transportation Research Part B: Methodological is currently edited by Fred Mannering

More articles in Transportation Research Part B: Methodological from Elsevier
Bibliographic data for series maintained by Catherine Liu ().