An Optimal Scheduling Strategy of a Microgrid with V2G Based on Deep Q-Learning
Yuxin Wen,
Peixiao Fan (),
Jia Hu,
Song Ke,
Fuzhang Wu and
Xu Zhu
Additional contact information
Yuxin Wen: School of Electrical Engineering and Automation, Wuhan University, Wuhan 430072, China
Peixiao Fan: School of Electrical Engineering and Automation, Wuhan University, Wuhan 430072, China
Jia Hu: State Grid Hubei Electric Power Co., Ltd., Wuhan 430072, China
Song Ke: School of Electrical Engineering and Automation, Wuhan University, Wuhan 430072, China
Fuzhang Wu: School of Electrical Engineering and Automation, Wuhan University, Wuhan 430072, China
Xu Zhu: School of Electrical Engineering and Automation, Wuhan University, Wuhan 430072, China
Sustainability, 2022, vol. 14, issue 16, 1-18
Abstract:
In recent years, the access of various distributed power sources and electric vehicles (EVs) has brought more and more randomness and uncertainty to the operation and regulation of microgrids. Therefore, an optimal scheduling strategy for microgrids with EVs based on Deep Q-learning is proposed in this paper. Firstly, a vehicle-to-grid (V2G) model considering the mobility of EVs and the randomness of user charging behavior is proposed. The charging time distribution model, charging demand model, state-of-charge (SOC) dynamic model and the model of travel location are comprehensively established, thereby realizing the construction of the mathematical model of the microgrid with EVs: it can obtain the charging/discharging situation in the EV station, so as to obtain the overall output power of the EV station. Secondly, based on Deep Q-learning, the state space and action space are set up according to the actual microgrid system, and the design of the optimal scheduling reward function is completed with the goal of economy. Finally, the calculation example results show that compared with the traditional optimization algorithm, the strategy proposed in this paper has the ability of online learning and can cope with the randomness of renewable resources better. Meanwhile, the agent with experience replay ability can be trained to complete the evolution process, so as to adapt to the nonlinear influence caused by the mobility of EVs and the periodicity of user behavior, which is feasible and superior in the field of optimal scheduling of microgrids with renewable resources and EVs.
Keywords: renewable energy; electric vehicles; deep Q-learning; microgrid scheduling; V2G (search for similar items in EconPapers)
JEL-codes: O13 Q Q0 Q2 Q3 Q5 Q56 (search for similar items in EconPapers)
Date: 2022
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (6)
Downloads: (external link)
https://www.mdpi.com/2071-1050/14/16/10351/pdf (application/pdf)
https://www.mdpi.com/2071-1050/14/16/10351/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jsusta:v:14:y:2022:i:16:p:10351-:d:892811
Access Statistics for this article
Sustainability is currently edited by Ms. Alexandra Wu
More articles in Sustainability from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().