Congestion Control in Charging Stations Allocation with Q-Learning
Abstract
:1. Introduction
2. Electric Vehicle Charging Station Allocation Problem
2.1. EV Charging Time Cost Model
2.1.1. Route Travel Time Cost
2.1.2. Queuing Time Cost
2.2. Congestion Game-Based System Model
2.2.1. Congestion Game Model
2.2.2. Existence of Nash Equilibrium
3. EV Station Allocation Based on Q-Learning Algorithm
3.1. Environment, State, and Action Set
3.2. Q-Learning Algorithm
Algorithm1: Q-learning algorithm |
Input: N—The Number Of Evs M—The Number Of Stations K—The Number Of Roads Epsilon—, Explore Factor, A Designed Constant λ, CAP, β—Designed Constant D—The Distance Matrix A0—The Roads General Congestion Matrix Output: History—The Allocation Strategy |
Initialization: ① (S, A) ② rewards with roadsaverage congestion factor’ ③ Set the terminalSet as the three stations’ positions Repeat (for each EV): Repeat for the episode:): Choose status and action (S, A) using policy ε-greedy Repeat (for each step of the episode): Take action A, observe R,S′ Choose A′ from S′ using policy ε-greedy Update status–activity with Bellman equation S←S′; A←A′. Update rewards for passing roads and the selected station Until S is in terminalSet Update the output History |
4. Experiments and Results
4.1. Data Introduction
4.2. Experiments and Results Statement
4.2.1. Experiment for One EV
4.2.2. Experiment for 20 EVs
4.3. Comparison of Q-Learning and Genetic Algorithms
5. Discussion
5.1. Parameters Sensitivity Analysis
5.1.1. Road Capacity
5.1.2. Number of EVs
6. Conclusions
Author Contributions
Funding
Conflicts of Interest
References
- Bigerna, S.; Micheli, S. Attitudes Toward Electric Vehicles: The Case of Perugia Using a Fuzzy Set Analysis. Sustainability 2018, 10, 3999. [Google Scholar] [CrossRef]
- Vazifeh, M.M.; Zhang, H.; Santi, P.; Ratti, C. Optimizing the deployment of electric vehicle charging stations using pervasive mobility data. Transp. Res. Part A Policy Pract. 2019, 121, 75–91. [Google Scholar] [CrossRef] [Green Version]
- Akbari, M.; Brenna, M.; Longo, M. Optimal Locating of Electric Vehicle Charging Stations by Application of Genetic Algorithm. Sustainability 2018, 10, 1076. [Google Scholar] [CrossRef]
- Bak, D.-B.; Bak, J.-S.; Kim, S.-Y. Strategies for Implementing Public Service Electric Bus Lines by Charging Type in Daegu Metropolitan City, South Korea. Sustainability 2018, 10, 3386. [Google Scholar] [CrossRef]
- Xu, J.; Zhong, L.; Yao, L.; Wu, Z. An interval type-2 fuzzy analysis towards electric vehicle charging station allocation from a sustainable perspective. Sustain. Cities Soc. 2018, 40, 335–351. [Google Scholar] [CrossRef]
- Luo, L.; Gu, W.; Wu, Z.; Zhou, S. Joint planning of distributed generation and electric vehicle charging stations considering real-time charging navigation. Appl. Energy 2019, 242, 1274–1284. [Google Scholar] [CrossRef]
- Liu, H.; Yin, W.; Yuan, X.; Niu, M. Reserving Charging Decision-Making Model and Route Plan for Electric Vehicles Considering Information of Traffic and Charging Station. Sustainability 2018, 10, 1324. [Google Scholar] [CrossRef]
- Cui, S.; Zhao, H.; Wen, H.; Zhang, C. Locating Multiple Size and Multiple Type of Charging Station for Battery Electricity Vehicles. Sustainability 2018, 10, 3267. [Google Scholar] [CrossRef]
- Rosenthal, R.W. A Class of Games Possessing Pure-Strategy Nash Equilibria. Int. J. Game Theory 1973, 2, 65–67. [Google Scholar] [CrossRef]
- Igal, M. Congestion Games with Player-Specific Payoff Functions. Games Econ. Behav. 1996, 13, 111–124. [Google Scholar] [Green Version]
- Johari, R.; Tsitsiklis, J.N. Network Resource Allocation and A Congestion Game: The single link case. In Proceedings of the 42nd IEEE International Conference on Decision and Control (IEEE Cat. No.03CH37475), Maui, HI, USA, 9–12 December 2003. [Google Scholar]
- Mabrouk, A.; Senhadji, M.; Kobbane, A.; Walid, A.; Sabir, E.; Koutbi, M.E. A Congestion Game-based Routing Algorithm for Communicating VANETs. In Proceedings of the 2014 International Wireless Communications and Mobile Computing Conference (IWCMC), Nicosia, Cyprus, 4–8 August 2014. [Google Scholar]
- Mejri, N.; Ayari, M.; Langar, R.; Saidane, L. Con2PaS: A Constrained Congestion Game for Parking Space Assignment. In Proceedings of the 2016 International Conference on Performance Evaluation and Modeling in Wired and Wireless Networks (PEMWN), Paris, France, 22–25 November 2016. [Google Scholar]
- Tran, T.X.; Pompili, D. Joint Task Offloading and Resource Allocation for Multi-Server Mobile-Edge Computing Networks. IEEE Trans. Veh. Technol. 2019, 68, 856–868. [Google Scholar] [CrossRef]
- Ibars, C.; Navarro, M.; Giupponi, L. Distributed Demand Management in Smart Grid with a Congestion Game. In Proceedings of the 2010 First IEEE International Conference on Smart Grid Communications, Gaithersburg, MD, USA, 4–6 Octomber 2010; pp. 495–500. [Google Scholar]
- Sutton, R.S.; Barto, A.G. Reinforcement Learning: An Introduction; The MIT Press: Cambridge, MA, USA; London, UK, 2018. [Google Scholar]
- Christopher, J.C.; Watkins, H.; Dayan, P. Q-Learning. Mach. Learn. 1992, 8, 279–292. [Google Scholar]
- Li, J.; Chai, T.; Lewis, F.L.; Fan, J.; Ding, Z.; Ding, J. Off-Policy Q-Learning: Set-Point Design for Optimizing Dual-Rate Rougher Flotation Operational Processes. IEEE Trans. Ind. Electron. 2018, 65, 4092–4102. [Google Scholar] [CrossRef]
- Vamvoudakis, K.G.; Hespanha, J.P. Cooperative Q-Learning for Rejection of Persistent Adversarial Inputs in Networked Linear Quadratic Systems. IEEE T Automat Contr. 2018, 63, 1018–1031. [Google Scholar] [CrossRef]
- Song, R.; Lewis, F.L.; Wei, Q. Off-Policy Integral Reinforcement Learning Method to Solve Nonlinear Continuous-Time Multiplayer Nonzero-Sum Games. IEEE Trans. Neural Netw. Learn. Syst. 2017, 28, 704–713. [Google Scholar] [CrossRef] [PubMed]
- Tian, D.; Zhou, J.; Wang, Y.; Sheng, Z.; Duan, X.; Leung, V.C. Channel Access Optimization with Adaptive Congestion Pricing for Cognitive Vehicular Networks: An Evolutionary Game Approach. IEEE Trans. Mob. Comput. 2019, 1, 1–18. [Google Scholar]
- Jiang, Y.; Hao, K.; Cai, X.; Ding, Y. An improved reinforcement-immune algorithm for agricultural resource allocation optimization. J. Comput. Sci. 2018, 27, 320–328. [Google Scholar] [CrossRef]
- Vamvoudakis, K.G. Q-learning for continuous-time linear systems: A model-free infinite horizon optimal control approach. Syst. Control Lett. 2017, 100, 14–20. [Google Scholar] [CrossRef]
- Xiong, Y.; Gan, J.; An, B.; Miao, C.; Bazzan, A.L.C. Optimal Electric Vehicle Fast Charging Station Placement Based on Game Theoretical Framework. IEEE Trans. Intell. Transp. Syst. 2018, 19, 2493–2504. [Google Scholar] [CrossRef]
- Lewis, F.L.; Vrabie, D.; Vamvoudakis, K.G. Reinforcement Learning and Feedback Control: Using Natural Decision Methods to Design Optimal Adaptive Controllers. IEEE Control Syst. 2012, 32, 76–105. [Google Scholar]
- Osman, M.S.; Abo-Sinna, M.A.; Mousa, A.A. An effective genetic algorithm approach to multiobjective resource allocation problems (MORAPs). Appl. Math. Comput. 2005, 163, 755–768. [Google Scholar] [CrossRef]
Iteration | Station 1 | Station 2 | Station 3 | Iteration | Station 1 | Station 2 | Station 3 |
---|---|---|---|---|---|---|---|
1 | 3 | 5 | 12 | 7 | 1 | 8 | 11 |
2 | 3 | 7 | 10 | 8 | 1 | 1 | 18 |
3 | 5 | 7 | 8 | 9 | 6 | 6 | 8 |
4 | 3 | 7 | 10 | 10 | 6 | 6 | 8 |
5 | 5 | 6 | 9 | 11 | 3 | 7 | 10 |
6 | 6 | 2 | 12 | 12 | 4 | 3 | 13 |
© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Share and Cite
Zhang, L.; Gong, K.; Xu, M. Congestion Control in Charging Stations Allocation with Q-Learning. Sustainability 2019, 11, 3900. https://doi.org/10.3390/su11143900
Zhang L, Gong K, Xu M. Congestion Control in Charging Stations Allocation with Q-Learning. Sustainability. 2019; 11(14):3900. https://doi.org/10.3390/su11143900
Chicago/Turabian StyleZhang, Li, Ke Gong, and Maozeng Xu. 2019. "Congestion Control in Charging Stations Allocation with Q-Learning" Sustainability 11, no. 14: 3900. https://doi.org/10.3390/su11143900
APA StyleZhang, L., Gong, K., & Xu, M. (2019). Congestion Control in Charging Stations Allocation with Q-Learning. Sustainability, 11(14), 3900. https://doi.org/10.3390/su11143900