Cost-Optimized Microgrid Coalitions Using Bayesian Reinforcement Learning
Abstract
:1. Introduction
2. Related Work
3. System Model
4. Bayesian Coalition Formation Game
4.1. Game Formulation
4.2. Stability Notation
- No player believes there exists a better tuple than.
- All of the coalition members accept it according to their beliefs about the expected rewards of other players.
4.3. Coalition Formation
- A proposer can stay in its current coalition and propose a new demand from the coalition.
- A proposer can decide to split from its current coalition and propose merging to other coalition with new demand .
5. Bayesian Reinforcement Learning Coalition Formation
5.1. Conventional Bayesian RL
5.2. Bayesian Reinforcement Learning Coalition Formation
5.3. Computational Approximations
6. Performance Evaluation
6.1. Benchmarks
6.1.1. Maximum a Posterior Estimation (MAPE)
Algorithm 1: Coalition formation with BCRL for distributed energy trading among microgrids |
1Initialization: 2for all microgrid m, do 3end 4 Randomly assigns the power level. 5 Randomly assign the location 6 Randomly assign to the coalition 7 initializes demand using direct power loss to macrogrid 8 Broadcast to all microgrids and set 9Main Loop: time slot all microgrid m, 10 Update coalition action according to the agreement . 11 Update current reward . 12 Update transition probabilities and beliefs. 13 Estimate (27) 14BR Coalition Formation: 15 Randomly selects a proposer microgrid m with the probability . 16 Make a proposal which maximize microgrid m beliefs about . 17 Send to all microgrid n, . 18for all microgrid j, do 19end 20ifthen 21end 22 set a response and send to microgrid m 23else 24end 25 set a response 26ifthen 27end 28 Update agreement 29 set the state 30 set the type 31 broadcast to all microgrid j, |
6.1.2. Fully Myopic Estimation (FME)
6.1.3. Q-Learning Based Method
6.1.4. Bayesian Coalitional Game Theory (BCG)
6.1.5. Coalitional Game Theory (CG)
6.2. Numerical Results and Discussions
7. Conclusions
Author Contributions
Funding
Conflicts of Interest
Abbreviations
ICT | Information and communications technologies |
p2p | peer to peer |
microgrid | Micro grid |
BCFG | Bayesian coalition formation game |
BCRL | Bayesian coalition reinforcement learning |
CG | Coalitional game theory |
EV | Electrical Vehicle |
AI | Artificial intelligence |
SBC | Strong Bayesian Core |
MDP | Markov decision process |
RL | Reinforcement learning |
POMDP | partially observable MDP |
M | Number of microgrids |
The amount of generated energy by microgrid | |
microgrid m demand | |
Surplus energy of microgrid m | |
Total cost of power transaction from m-th microgrid to n-th microgrid | |
Length of power lines between m-th and n-th microgrids | |
Scaling factor | |
The power that is being traded plus the loss that happens during trading | |
Power loss in trading energy between m-th and n-th microgrids | |
w | Weighting coefficient |
Lower bound for w | |
Distance threshold | |
Higher bound for w | |
weight factor for energy transactions with the macrogrid | |
Resistance of line | |
Power loss | |
Voltage in transformer | |
Fraction of power loss in transformer | |
C | Coalition |
Coalitional value | |
Number of members of coalition C | |
Maximum achievable coalition value | |
Set of agent types | |
Set of agent beliefs | |
Set of coalition action | |
Set of states | |
Reward function | |
m-th microgrid’s beliefs about the types of other players | |
Probability assigned to other agents about their types | |
Set of coalition actions | |
Value of Member j in the coalition | |
Coalition C value | |
Demand | |
Demand vector of coalition | |
Proposal by prosper m | |
discount factor | |
State at time t | |
Belief at time t |
References
- Amin, W.; Huang, Q.; Umer, K.; Zhang, Z.; Afzal, M.; Khan, A.A.; Ahmed, S.A. A motivational game-theoretic approach for peer-to-peer energy trading in islanded and grid-connected microgrid. Int. J. Electr. Power Energy Syst. 2020, 123, 106307. [Google Scholar] [CrossRef]
- Brolin, M.; Pihl, H. Design of a local energy market with multiple energy carriers. Int. J. Electr. Power Energy Syst. 2020, 118, 105739. [Google Scholar] [CrossRef]
- Zhang, W.; Maleki, A.; Rosen, M.A. A heuristic-based approach for optimizing a small independent solar and wind hybrid power scheme incorporating load forecasting. J. Clean. Prod. 2019, 241, 117920. [Google Scholar] [CrossRef]
- Abrishambaf, O.; Lezama, F.; Faria, P.; Vale, Z. Towards transactive energy systems: An analysis on current trends. Energy Strategy Rev. 2019, 26, 100418. [Google Scholar] [CrossRef]
- Erol-Kantarci, M.; Kantarci, B.; Mouftah, H.T. Reliable overlay topology design for the smart microgrid network. IEEE Netw. 2011, 25, 38–43. [Google Scholar] [CrossRef]
- Saad, W.; Han, Z.; Poor, H.V. Coalitional Game Theory for Cooperative Micro-Grid Distribution Networks. In Proceedings of the IEEE International Conference on Communications Workshops, Kyoto, Japan, 5–9 June 2011. [Google Scholar]
- Zhou, H.; Erol-Kantarci, M. Correlated Deep Q-learning based Microgrid Energy Management. In Proceedings of the 2020 IEEE 25th International Workshop on Computer Aided Modeling and Design of Communication Links and Networks (CAMAD), Pisa, Italy, 14–16 September; 2020; pp. 1–6. [Google Scholar] [CrossRef]
- Xiao, X.; Dai, C.; Li, Y.; Zhou, C.; Xiao, L. Energy Trading Game for Microgrids Using Reinforcement Learning. In Game Theory for Networks; Springer International Publishing: Cham, Switzerland, 2017; pp. 131–140. [Google Scholar]
- Chalkiadakis, G.; Boutilier, C. Bayesian reinforcement learning for coalition formation under uncertainty. In Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems, New York, NY, USA, 19–23 July 2004; Volume 3, pp. 1090–1097. [Google Scholar]
- Asheralieva, A. Bayesian Reinforcement Learning-Based Coalition Formation for Distributed Resource Sharing by Device-to-Device Users in Heterogeneous Cellular Networks. IEEE Trans. Wirel. Commun. 2017, 16, 5016–5032. [Google Scholar] [CrossRef]
- Sadeghi, M.; Mollahasani, S.; Erol-Kantarci, M. Power Loss-Aware Transactive Microgrid Coalitions under Uncertainty. Energies 2020, 13, 5782. [Google Scholar] [CrossRef]
- Lee, J.; Guo, J.; Choi, J.K.; Zukerman, M. Distributed Energy Trading in Microgrids: A Game-Theoretic Model and Its Equilibrium Analysis. IEEE Trans. Ind. Electron. 2015, 62, 3524–3533. [Google Scholar] [CrossRef]
- Jadhav, A.; Patne, N. Priority Based Energy Scheduling in a Smart Distributed Network with Multiple Microgrids. IEEE Trans. Ind. Inform. 2017, 13, 1. [Google Scholar] [CrossRef]
- Tushar, W.; Zhang, J.A.; Smith, D.; Poor, V.H.; Thiebaux, S. Prioritizing Consumers in Smart Grid: A Game Theoretic Approach. IEEE Trans. Smart Grid 2014, 5, 1429–1438. [Google Scholar] [CrossRef] [Green Version]
- Wang, H.; Huang, T.; Liao, X.; Abu-Rub, H.; Chen, G. Reinforcement Learning in Energy Trading Game Among Smart Microgrids. IEEE Trans. Ind. Electron. 2016, 63, 5109–5119. [Google Scholar] [CrossRef]
- Feng, C.; Wen, F.; You, S.; Li, Z.; Shahnia, F.; Shahidehpour, M. Coalitional Game Based Transactive Energy Management in Local Energy Communities. IEEE Trans. Power Syst. 2019, 35, 1729–1740. [Google Scholar] [CrossRef]
- Lahon, R.; Gupta, C.P.; Fernandez, E. Coalition formation strategies for cooperative operation of multiple microgrids. IET Gener. Transm. Distrib. 2019, 13, 3661–3672. [Google Scholar] [CrossRef]
- Mei, J.; Chen, C.; Wang, J.; Kirtley, J.L. Coalitional game theory based local power exchange algorithm for networked microgrids. Appl. Energy 2019, 239, 133–141. [Google Scholar] [CrossRef]
- Essayeh, C.; El Fenni, M.R.; Dahmouni, H. Optimization of energy exchange in microgrid networks: A coalition formation approach. Prot. Control. Mod. Power Syst. 2019, 4, 24. [Google Scholar] [CrossRef] [Green Version]
- Misra, S.; Krishna, P.V.; Saritha, V.; Obaidat, M.S. Learning automata as a utility for power management in smart grids. IEEE Commun. Mag. 2013, 51, 98–104. [Google Scholar] [CrossRef]
- Jiang, B.; Fei, Y. Dynamic Residential Demand Response and Distributed Generation Management in Smart Microgrid with Hierarchical Agents. Energy Procedia 2011, 12, 76–90. [Google Scholar] [CrossRef] [Green Version]
- Xu, Y.; Zhang, W.; Liu, W.; Ferrese, F. Multiagent-Based Reinforcement Learning for Optimal Reactive Power Dispatch. IEEE Trans. Syst. Man Cybern. Part A (Appl. Rev.) 2012, 42, 1742–1751. [Google Scholar] [CrossRef]
- Guan, C.; Wang, Y.; Lin, X.; Nazarian, S.; Pedram, M. Reinforcement learning-based control of residential energy storage systems for electric bill minimization. In Proceedings of the 2015 12th Annual IEEE Consumer Communications and Networking Conference (CCNC), Las Vegas, NV, USA, 9–12 January 2015; pp. 637–642. [Google Scholar]
- Kim, B.G.; Zhang, Y.; van der Schaar, M.; Lee, J.W. Dynamic pricing for smart grid with reinforcement learning. In Proceedings of the 2014 IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS), Toronto, ON, Canada, 27 April–2 May 2014; pp. 640–645. [Google Scholar]
- Xiao, L. Reinforcement Learning-Based Energy Trading for Microgrids. Available online: https://arxiv.org/abs/1801.06285 (accessed on 5 November 2021).
- Sadeghi, M.; Erol-Kantarci, M. Power Loss Minimization in Microgrids Using Bayesian Reinforcement Learning with Coalition Formation. In Proceedings of the 2019 IEEE 30th Annual International Symposium on Personal, Indoor and Mobile Radio Communications (PIMRC), Istanbul, Turkey, 8–11 September 2019; pp. 1–6. [Google Scholar]
- Sadeghi, M.; Mollahasani, S.; Erol-Kantarci, M. Cost-Aware Dynamic Bayesian Coalitional Game for Energy Trading among Microgrids. In Proceedings of the IEEE International Conference on Communications Workshops (ICC), Montreal, QC, Canada, 14–23 June 2021; pp. 1–6. [Google Scholar]
- Machowski, J.; Lubosny, Z.; Bialek, J.W.; Bumby, J.R. Power System Dynamics: Stability and Control; John Wiley & Sons: Hoboken, NJ, USA, 2020. [Google Scholar]
- Ieong, S.; Shoham, Y. Bayesian Coalitional Games. In Proceedings of the 23rd National Conference Artificial Intelligence, Chicago, IL, USA, 13–17 July 2008; AAAI: Palo Alto, CA, USA, 2008; pp. 95–100. [Google Scholar]
- Duff, M.O.; Barto, A. Optimal Learning: Computational Procedures for Bayes-Adaptive Markov Decision Processes; University of Massachusetts Amherst: Amherst, MA, USA, 2002. [Google Scholar]
- Kolter, J.Z.; Ng, A.Y. Near-Bayesian exploration in polynomial time. In Proceedings of the 26th Annual International Conference on Machine Learning, Montreal, QC, Canada, 14–18 June 2009; pp. 513–520. [Google Scholar]
Paper | Objective | Elements | Uncertainty | Methods |
---|---|---|---|---|
[6] | Power Loss | Macrogrid, microgrids | ✗ | CG |
[16] | Energy Management | Macrogrid, microgrids | ✗ | CG |
[17] | Energy Management | Macrogrid, microgrids | ✗ | CG |
[18] | Cost | Macrogrid, microgrids | ✗ | CG |
[19] | Cost | Macrogrid, microgrids | ✗ | CG |
[11] | Power Loss | Macrogrid, microgrids, EVs | ✓ | BCG |
[26] | Power Loss | Macrogrid, microgrids | ✓ | CG, BRL |
Parameters | Value |
---|---|
Line Resistance ( | 0.2 |
Medium Voltage ( | 50 kV |
Low voltage ( | 22 kV |
Transformer loss fraction () | 0.02 |
Threshold distance () | 5 km |
Virtual cost parameter () | 0.02 |
Virtual cost parameter () | 0.04 |
Virtual cost parameter () | 0.08 |
Scaling parameter () | 0.95 |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Sadeghi, M.; Mollahasani, S.; Erol-Kantarci, M. Cost-Optimized Microgrid Coalitions Using Bayesian Reinforcement Learning. Energies 2021, 14, 7481. https://doi.org/10.3390/en14227481
Sadeghi M, Mollahasani S, Erol-Kantarci M. Cost-Optimized Microgrid Coalitions Using Bayesian Reinforcement Learning. Energies. 2021; 14(22):7481. https://doi.org/10.3390/en14227481
Chicago/Turabian StyleSadeghi, Mohammad, Shahram Mollahasani, and Melike Erol-Kantarci. 2021. "Cost-Optimized Microgrid Coalitions Using Bayesian Reinforcement Learning" Energies 14, no. 22: 7481. https://doi.org/10.3390/en14227481
APA StyleSadeghi, M., Mollahasani, S., & Erol-Kantarci, M. (2021). Cost-Optimized Microgrid Coalitions Using Bayesian Reinforcement Learning. Energies, 14(22), 7481. https://doi.org/10.3390/en14227481