Reinforcement Learning for Optimizing Driving Policies on Cruising Taxis Services
Kun Jin,
Wei Wang,
Xuedong Hua and
Wei Zhou
Additional contact information
Kun Jin: Jiangsu Key Laboratory of Urban ITS, Southeast University, Nanjing 211189, China
Wei Wang: Jiangsu Key Laboratory of Urban ITS, Southeast University, Nanjing 211189, China
Xuedong Hua: Jiangsu Key Laboratory of Urban ITS, Southeast University, Nanjing 211189, China
Wei Zhou: Jiangsu Key Laboratory of Urban ITS, Southeast University, Nanjing 211189, China
Sustainability, 2020, vol. 12, issue 21, 1-19
Abstract:
As the key element of urban transportation, taxis services significantly provide convenience and comfort for residents’ travel. However, the reality has not shown much efficiency. Previous researchers mainly aimed to optimize policies by order dispatch on ride-hailing services, which cannot be applied in cruising taxis services. This paper developed the reinforcement learning (RL) framework to optimize driving policies on cruising taxis services. Firstly, we formulated the drivers’ behaviours as the Markov decision process (MDP) progress, considering the influences after taking action in the long run. The RL framework using dynamic programming and data expansion was employed to calculate the state-action value function. Following the value function, drivers can determine the best choice and then quantify the expected future reward at a particular state. By utilizing historic orders data in Chengdu, we analysed the function value’s spatial distribution and demonstrated how the model could optimize the driving policies. Finally, the realistic simulation of the on-demand platform was built. Compared with other benchmark methods, the results verified that the new model performs better in increasing total revenue, answer rate and decreasing waiting time, with the relative percentages of 4.8%, 6.2% and −27.27% at most.
Keywords: Markov decision process; reinforcement learning; optimizing driving policies; cruising taxis services (search for similar items in EconPapers)
JEL-codes: O13 Q Q0 Q2 Q3 Q5 Q56 (search for similar items in EconPapers)
Date: 2020
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
https://www.mdpi.com/2071-1050/12/21/8883/pdf (application/pdf)
https://www.mdpi.com/2071-1050/12/21/8883/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jsusta:v:12:y:2020:i:21:p:8883-:d:434973
Access Statistics for this article
Sustainability is currently edited by Ms. Alexandra Wu
More articles in Sustainability from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().