Day-Ahead Economic Dispatch Strategy for Distribution Networks with Multi-Class Distributed Resources Based on Improved MAPPO Algorithm
Abstract
:1. Introduction
- An improved MAPPO agent algorithm considering the GAE agent function is proposed to enhance the convergence effect of the algorithm in the training process. The current MAPPO algorithm usually uses the Monte Carlo method to obtain the agent function, which is prone to blind trial-and-error, as well as falling into local optima during the training process in the face of the complex distribution network scheduling strategy solving problem. In this paper, the GAE agent function is introduced to enhance the MAPPO algorithm, improving the convergence speed during the training process and further reducing the time of offline training of the algorithm;
- The designed improved MAPPO algorithm framework can balance the bias and variance in the agent’s reward value in the algorithm in the process of updating the distribution network scheduling strategy by introducing a discount factor, clarifying the direction of strategy updates while improving the algorithm’s convergence speed, reducing fluctuations in the agent’s reward value, and enhancing the stability of strategy updates during training;
- A day-ahead economic dispatch model for distribution networks, incorporating distributed resources such as new energy sources and various controllable units, is designed, and its dispatch model is optimized and solved by improving the MAPPO algorithm. This improves the economic efficiency of the scheduling strategy for distribution networks with multiple distributed resources, as well as the online decision-making efficiency.
2. Principle of Improved MAPPO Algorithm
2.1. MAPPO Algorithm Framework
2.2. Improved Method Based on Generalized Advantage Estimation Function
3. Distribution Network Economic Dispatch Models
3.1. Objective Function
3.2. Constraints
3.2.1. Current Constraints
3.2.2. Nodal Voltage Constraints
3.2.3. Interaction Constraints with Higher-Level Grids
3.2.4. Gas Turbine Constraints
3.2.5. Distributed Wind and Photovoltaic Output Constraints
3.2.6. Energy Storage Constraints
3.2.7. Adjustable Load Restraint
4. Transformation of Economic Dispatch Model for Distribution Network Based on Improved MAPPO Algorithm
4.1. Design of Action Space
4.2. State Space Design
4.3. Design of the Reward Function
4.4. Solution Flow
5. Example Analysis
5.1. Analysis of Improved MAPPO Training Results
5.2. Analysis of the Results of the Improved MAPPO Test
5.3. Economic Analysis of Algorithm Results
6. Conclusions
- Compared to traditional optimization algorithms, the MAPPO algorithm significantly reduces decision-making time. The improved MAPPO algorithm is 18.5% faster than the unimproved version, enhancing its solving efficiency.
- The improved MAPPO algorithm has fewer fluctuations in reward values during convergence, and the convergence process is more stable compared to the unimproved MAPPO algorithm, which verifies that GAE further improves the algorithm learning performance.
- The improved MAPPO algorithm achieves the lowest cost in solving the economic dispatch problem for distribution networks with multiple resource types, reducing operational costs by up to 18.1% compared to other algorithms, thus improving the economic performance of the network.
- Future research could further explore the application of the improved MAPPO algorithm in real-time dispatch problems, integrating emerging technologies such as edge computing and artificial intelligence to achieve more efficient energy management. Additionally, research could examine the scalability of the algorithm, particularly in optimizing performance when handling larger more complex distributed networks. Another potential direction includes investigating the coordinated optimization between different types of resources to enhance overall system scheduling and improve the ability to handle unexpected situations.
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Appendix A
Numbering | Grid Node | Maximum Sale of Electricity | Maximum Purchase of Electricity |
---|---|---|---|
1 | 1 | 50 | 50 |
Numbering | Access Node | Capacity (kW) | Minimum Starting Capacity (kW) | Maximum Climbing Rate | Minimum Climbing Rate | Cost Coefficient | |
---|---|---|---|---|---|---|---|
agas | cgas | ||||||
1 | 2 | 80 | 8 | 0.16 | −0.16 | 1.1 | 0.4 |
2 | 5 | 80 | 8 | 0.16 | −0.16 | 1.1 | 0.4 |
3 | 8 | 60 | 6 | 0.16 | −0.16 | 1.1 | 0.4 |
4 | 11 | 60 | 6 | 0.16 | −0.16 | 1.1 | 0.4 |
5 | 13 | 60 | 6 | 0.16 | −0.16 | 1.1 | 0.4 |
Numbering | Access Node | Capacity (kW) |
---|---|---|
1 | 17 | 60 |
2 | 22 | 60 |
Numbering | Access Node | Capacity (kW) |
---|---|---|
1 | 19 | 40 |
2 | 21 | 40 |
Numbering | Access Node | SOCmin | SOCmax | ηd | ||||
---|---|---|---|---|---|---|---|---|
1 | 6 | 200 | 40 | 40 | 0.2 | 0.95 | 0.96 | 0.2 |
2 | 10 | 200 | 40 | 40 | 0.2 | 0.95 | 0.96 | 0.2 |
Numbering | Access Node | Maximum Adjustable Rate | Maximum Climbing Rate | Minimum Climbing Rate | Call Cost |
---|---|---|---|---|---|
1 | 5 | 0.4 | 0.2 | −0.2 | −style (17) |
Parameter | MAPO | Parameter | MAPO | Parameter | MAPO | Parameter | MAPO | Parameter | MAPO |
---|---|---|---|---|---|---|---|---|---|
8 | 20,000 | 0.0001 | 24 | 64 | |||||
0.99 | 5 | 0.0001 | 0.2 | 0.95 | |||||
0.95 | 64 | 0.1 | 10 | 10 | |||||
100 | 10 | 0.1 |
Agent | Observable Region |
---|---|
Gas1 | {3,4,6,7,9,10,12,14,15,16,28} |
Gas2 | {1,2,4,6,7} |
Gas3 | {3,4,6,7,9,10,17,20,21,22,25,27,29,30} |
Gas4 | {4,7,9,10,17,20,21,22,28} |
Gas5 | {3,4,6,12,14,15,16,17,18,23} |
DL | {1,2,4,6,7} |
Ess1 | {3,4,9,15,17,18,20,21,22,27,28} |
Ess2 | {7,9,16,19,20,21,22,28} |
References
- Wang, Z.; Ma, S.; Li, G.; Bian, J. Day-ahead-intraday two-stage rolling optimal scheduling of power grid considering the access of composite energy storage plant. J. Sol. Energy 2022, 43, 400–408. [Google Scholar]
- Huang, J.; Zhang, H.; Tian, D.; Zhang, Z.; Yu, C.; Hancke, G.P. Multi-agent deep reinforcement learning with enhanced collaboration for distribution network voltage control. J. Eng. Appl. Artif. Intell. 2024, 134, 108677. [Google Scholar] [CrossRef]
- Liu, F.; Lin, C.; Chen, C.; Liu, R.; Li, G.; Bie, Z. A post-disaster time-sequence load recovery method for distribution networks considering the dynamic uncertainty of distributed new energy sources. J. Power Autom. Equip. 2022, 42, 159–167. [Google Scholar]
- Li, H.; Ren, Z.; Fan, M.; Li, W.; Xu, Y.; Jiang, Y.; Xia, W. A review of scenario analysis methods in planning and operation of modern power systems: Methodologies, applications, and A review of scenario analysis methods in planning and operation of modern power systems: Methodologies, applications, and challenges. J. Electr. Power Syst. Res. 2022, 205, 107722. [Google Scholar] [CrossRef]
- Qi, N.; Cheng, L.; Tian, L.; Guo, J.; Huang, R.; Wang, C. Review and outlook of distribution network planning research considering flexible load access. J. Power Syst. Autom. 2020, 44, 193–207. [Google Scholar]
- Cheng, L.; Wan, Y.; Qi, N.; Tian, L. Review and outlook on the operational reliability of distribution systems with multiple distributed resources. J. Power Syst. Autom. 2021, 45, 191–207. [Google Scholar]
- Liu, H.; Xu, Z.; Ge, S.; Yang, W.; Liu, M.; Zhu, G. Coordinated active-reactive operation and voltage control of active distribution networks considering energy storage regulation. J. Power Syst. Autom. 2019, 43, 51–58. [Google Scholar]
- Wang, S.; Li, Q.; Zhao, Q.; Lin, Z.; Wang, K. Improved particle swarm algorithm for multi-objective optimisation of AC/DC distribution network voltage taking into account source-load stochasticity. J. Power Syst. Autom. 2021, 33, 10–17. [Google Scholar]
- Jin, G.; Pan, D.; Chen, Q.; Shi, C.; Li, G. An energy optimisation method for multi-voltage level DC distribution networks considering adaptive real-time scheduling. J. Grid Technol. 2021, 45, 3906–3917. [Google Scholar]
- Mehrjerdi, H.; Hemmati, R. Modelling and optimal scheduling of battery energy storage systems in electric power distribution networks. J. Clean. Prod. 2019, 234, 810–821. [Google Scholar] [CrossRef]
- Li, J.; Ma, D.; Zhu, X.; Li, C.; Hou, T. Hierarchical optimal economic dispatch of active distribution networks based on ADMM algorithm. J. Electr. Power Constr. 2022, 43, 76–86. [Google Scholar]
- Hu, W.-H.; Cao, D.; Huang, Q.; Zhang, B.; Li, S.; Chen, Z. Application of deep reinforcement learning in optimal operation of distribution networks. J. Power Syst. Autom. 2023, 14, 174–191. [Google Scholar]
- Liao, Q.; Lu, L.; Liu, Y.; Zhang, Y.; Xiong, J. Voltage control model and algorithm for renewable energy-containing distribution networks considering reconfiguration. J. Power Syst. Autom. 2017, 41, 32–39. [Google Scholar]
- Duan, J.; Yi, Z.; Shi, D.; Lin, C.; Lu, X.; Wang, Z. Reinforcement-learning-based optimal control of hybrid energy storage systems in hybrid AC-DC microgrids. J. IEEE Trans. Ind. Inform. 2019, 15, 5355–5364. [Google Scholar] [CrossRef]
- Bui, V.H.; Hussain, A.; Kim, H.M. Double deep Q-learning-based distributed operation of battery energy storage system considering uncertainties. J. IEEE Trans. Smart Grid 2019, 11, 457–469. [Google Scholar] [CrossRef]
- Yu, I.; Yang, J.; Yang, M.; Gao, Y. Integrated scheduling of wind farm energy storage system prediction and decision making based on deep reinforcement learning. J. Power Syst. Autom. 2021, 45, 132–140. [Google Scholar]
- Yang, Z.; Ren, Z.; Sun, Z.; Liu, M.; Jiang, J.; Yin, Y. A security-constrained economic dispatch method for new energy power system based on proximal strategy optimisation algorithm. J. Grid Technol. 2023, 47, 988–998. [Google Scholar]
- Yang, B.; Chen, Y.; Yao, W.; Shi, T.; Shu, H. A review of power system stability assessment and decision-making based on new generation artificial intelligence technology. J. Power Syst. Autom. 2022, 46, 200–223. [Google Scholar]
- Wu, T.; Wang, J.; Lu, X.; Du, Y. AC/DC hybrid distribution network reconfiguration with microgrid formation using multi-agent soft actor-critic. J. Appl. Energy 2022, 307, 118189. [Google Scholar] [CrossRef]
- Wang, X.; Zhou, J.; Qin, B.; Guo, L. Coordinated control of wind turbine and hybrid energy storage system based on multi-agent deep reinforcement learning for wind power smoothing. J. Energy Storage 2023, 57, 106297. [Google Scholar] [CrossRef]
- Zhang, J.; Pu, T.; Li, Y.; Wang, Y.; Zhou, X. Optimal scheduling strategy for distributed power supply based on multi-intelligence deep reinforcement learning. J. Power Grid Technol. 2022, 46, 3496–3504. [Google Scholar]
- Caicedo, A.M.D.; Mejia, É.F.; Luna, E.G. Revolutionizing protection dynamics in microgrids: Local validation environment and a novel global management control through multi-agent systems. J. Comput. Electr. Eng. 2024, 120, 109748. [Google Scholar] [CrossRef]
- Dvir, E.; Shifrin, M.; Gurewitz, O. Cooperative Multi-Agent Reinforcement Learning for Data Gathering in Energy-Harvesting Wireless Sensor Networks. Mathematics 2024, 12, 2102. [Google Scholar] [CrossRef]
- Wang, G.; Sun, Y.; Li, J.; Jiang, Y.; Li, C.; Yu, H.; Wang, H.; Li, S. Dynamic Economic Scheduling with Self-Adaptive Uncertainty in Distribution Network Based on Deep Reinforcement Learning. J. Energy Eng. 2024, 121, 1671–1695. [Google Scholar] [CrossRef]
- Chen, L. Research on Value Functions in Deep Reinforcement Learning. Ph.D. Thesis, University of Mining and Technology, Xuzhou, China, 2021. [Google Scholar]
- Shen, Y. Research on Proximal Policy Optimisation Algorithms for Reinforcement Learning Problems. Ph.D. Thesis, Suzhou University, Suzhou, China, 2021. [Google Scholar]
- He, Y.; Chen, Y.; Liu, Y.; Liu, H.; Liu, D.; Sun, C. Analysis of kWh cost and mileage cost of energy storage. New Technol. J. Electr. Power 2019, 38, 1–10. [Google Scholar]
- Pan, H.; Liang, Z.; Xue, Q.; Zheng, F.; Xiao, Y. Economic dispatch of a virtual power plant with wind-photovoltaic-gas-storage based on time-of-use tariff. J. Sol. Energy 2020, 41, 115–122. [Google Scholar]
- Wang, K.; Zhang, J.; Qiu, X.; Wang, J.; Wang, C. Accurate current sharing with SOC balancing in DC microgrid. J. Electr. Power Syst. Res. 2024, 232, 110386. [Google Scholar] [CrossRef]
Algorithm Name | Cost/USD Yuan | Offline Training Time/s | Online Decision Time/s |
---|---|---|---|
PSO algorithm | 5199.548 | / | 73,508.57 |
MAPPO algorithm | 4561.03 | 50,621.41 | 13.65 |
Improvement of the MAPPO algorithm | 4258.366 | 39,906.16 | 11.13 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Zuo, J.; Ai, Q.; Wang, W.; Tao, W. Day-Ahead Economic Dispatch Strategy for Distribution Networks with Multi-Class Distributed Resources Based on Improved MAPPO Algorithm. Mathematics 2024, 12, 3993. https://doi.org/10.3390/math12243993
Zuo J, Ai Q, Wang W, Tao W. Day-Ahead Economic Dispatch Strategy for Distribution Networks with Multi-Class Distributed Resources Based on Improved MAPPO Algorithm. Mathematics. 2024; 12(24):3993. https://doi.org/10.3390/math12243993
Chicago/Turabian StyleZuo, Juan, Qian Ai, Wenbo Wang, and Weijian Tao. 2024. "Day-Ahead Economic Dispatch Strategy for Distribution Networks with Multi-Class Distributed Resources Based on Improved MAPPO Algorithm" Mathematics 12, no. 24: 3993. https://doi.org/10.3390/math12243993
APA StyleZuo, J., Ai, Q., Wang, W., & Tao, W. (2024). Day-Ahead Economic Dispatch Strategy for Distribution Networks with Multi-Class Distributed Resources Based on Improved MAPPO Algorithm. Mathematics, 12(24), 3993. https://doi.org/10.3390/math12243993