EconPapers    
Economics at your fingertips  
 

Risk-Aware Travel Path Planning Algorithm Based on Reinforcement Learning during COVID-19

Zhijian Wang (), Jianpeng Yang, Qiang Zhang and Li Wang
Additional contact information
Zhijian Wang: School of Electrical and Control Engineering, North China University of Technology, Beijing 100144, China
Jianpeng Yang: School of Electrical and Control Engineering, North China University of Technology, Beijing 100144, China
Qiang Zhang: Beijing Aerospace Measurement & Control Technology Co. Ltd., Beijing 100041, China
Li Wang: Beijing Aerospace Measurement & Control Technology Co. Ltd., Beijing 100041, China

Sustainability, 2022, vol. 14, issue 20, 1-25

Abstract: The outbreak of COVID-19 brought great inconvenience to people’s daily travel. In order to provide people with a path planning scheme that takes into account both safety and travel distance, a risk aversion path planning model in urban traffic scenarios was established through reinforcement learning. We have designed a state and action space of agents in a “point-to-point” way. Moreover, we have extracted the road network model and impedance matrix through SUMO simulation, and have designed a Restricted Reinforcement Learning-Artificial Potential Field (RRL-APF) algorithm, which can optimize the Q-table initialization operation before the agent learning and the action selection strategy during learning. The greedy coefficient is dynamically adjusted through the improved greedy strategy. Finally, according to different scenarios, our algorithm is verified by the road network model and epidemic historical data in the surrounding areas of Xinfadi, Beijing, China, and comparisons are made with common Q-Learning, the Sarsa algorithm and the artificial potential field-based reinforcement learning (RLAFP) algorithm. The results indicate that our algorithm improves convergence speed by 35% on average and the travel distance is reduced by 4.3% on average, while avoiding risk areas, compared with the other three algorithms.

Keywords: urban traffic; path generation; reinforcement learning; resident travel; COVID-19; restricted search (search for similar items in EconPapers)
JEL-codes: O13 Q Q0 Q2 Q3 Q5 Q56 (search for similar items in EconPapers)
Date: 2022
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
https://www.mdpi.com/2071-1050/14/20/13364/pdf (application/pdf)
https://www.mdpi.com/2071-1050/14/20/13364/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jsusta:v:14:y:2022:i:20:p:13364-:d:944781

Access Statistics for this article

Sustainability is currently edited by Ms. Alexandra Wu

More articles in Sustainability from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().

 
Page updated 2025-03-19
Handle: RePEc:gam:jsusta:v:14:y:2022:i:20:p:13364-:d:944781