Signal-Guided Cooperative Optimization Method for Active Distribution Networks Oriented to Microgrid Clusters
Abstract
1. Introduction
- A distributed carbon-emission-flow framework grounded in STCIEM that circumvents the privacy limitations of centralized models and enables privacy-preserving, spatiotemporal allocation of emissions between ADNs and MGs.
- A carbon–electricity dual-market mechanism that supports peer-to-peer trading of carbon allowances together with ADN backstop clearing, thereby enhancing market flexibility while safeguarding allocative fairness.
- A constraint-aware multi-agent scheduling policy based on MADDPG that achieves supply–demand co-optimization under decentralized strategic interaction and improves system stability and decarbonization outcomes.
2. Spatio-Temporal Equalization–Based Method for Computing Carbon-Emission Intensity
3. Carbon–Electricity Dual-Market Optimization for ADN and MGs
3.1. Dual-Market Optimization Model for the ADN Operator
3.2. Dual-Market Optimization Model for MG Operators
- First item Micro-gas-turbine operational constraints
- 2.
- Micro-gas-turbine operational constraints
- 3.
- DR operational constraints
- 4.
- Energy-storage system operational constraints
3.3. P2P Carbon-Trading Mechanism
3.3.1. Bidding and Clearing Rules
- A sell quote (ask): the amount of allowances offered and the minimum acceptable unit price ;
- A buy quote (bid): the amount of allowances requested and the maximum willingness-to-pay .
3.3.2. ADN Participation and Clearing Functionality
4. Multi-Agent Learning Framework and Scheduling-Strategy Optimization
4.1. Multi-Agent Markov Game Formulation
4.1.1. Observation
4.1.2. Action
4.1.3. State Transition
4.1.4. Reward Function
4.2. Multi-Agent Optimization Based on MADDPG
5. Case Study
5.1. Case Setup
5.2. Result Analysis
5.3. Comparative Analysis of Low-Carbon Economic Optimal Dispatch Decisions
6. Conclusions
Supplementary Materials
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Abbreviations
| ADN | Active Distribution Network |
| MG/MGs | Microgrid(s) |
| MMG/MMGs | Multiple Microgrids/Multi-Microgrid(s) |
| DER/DERs | Distributed Energy Resource(s) |
| CEF | Carbon Emission Flow |
| PCC | Point of Common Coupling |
| IDSM | Intelligent Demand-Side Management |
| DSM | Demand-Side Management |
| DR | Demand Response |
| P2P | Peer-to-Peer |
| STCIEM | Spatiotemporal Carbon Intensity Equilibrium Model |
| MADDPG | Multi-Agent Deep Deterministic Policy Gradient |
| MADRL | Multi-Agent Deep Reinforcement Learning |
| TRPO | Trust Region Policy Optimization |
| PoMG | Partially Observable Markov Game |
| RL | Reinforcement Learning |
| DDPG | Deep Deterministic Policy Gradient |
| CTDE | Centralized Training, Decentralized Execution |
| UPG | Upstream Grid |
| SOC | State of Charge |
| BESS/BESSs | Battery Energy Storage System(s) |
| GT/GTs | Gas Turbine(s) |
| PV | Photovoltaic |
| EV/EVs | Electric Vehicle(s) |
| TCL | Thermostatically Controlled Load(s) |
| HVAC | Heating, Ventilation, and Air Conditioning |
| IEEE | Institute of Electrical and Electronics Engineers |
| CPU | Central Processing Unit |
| GPU | Graphics Processing Unit |
| GeLU | Gaussian Error Linear Unit |
References
- Zhang, Z.; Kang, C. Challenges and Prospects for Constructing the New-type Power System Towards a Carbon Neutrality Future. Proc. CSEE 2022, 42, 2806–2819. [Google Scholar] [CrossRef]
- Shu, Y.; Zhang, L.; Zhang, Y.; Wang, Y.; Lu, G.; Yuan, B.; Xia, P. Carbon Peak and Carbon Neutrality Path for China’s Power Industry. Chin. J. Eng. 2021, 23, 1–14. [Google Scholar] [CrossRef]
- Xu, Z.; Sun, H.; Guo, Q. Review and Prospect of Integrated Demand Response. Proc. CSEE 2018, 38, 7194–7205+7446. [Google Scholar] [CrossRef]
- Liu, Y.; Li, Y.; Zhou, C.; Song, J.; Deng, H.; Du, E.; Zhang, N.; Kang, C. Overview of Carbon Measurement and Analysis Methods in Power Systems. Proc. CSEE 2024, 44, 2220–2236. [Google Scholar] [CrossRef]
- Kang, C.; Zhou, T.; Chen, Q.; Wang, J.; Sun, Y.; Xia, Q.; Yan, H. Carbon Emission Flow from Generation to Demand: A Network-Based Model. IEEE Trans. Smart Grid 2015, 6, 2386–2394. [Google Scholar] [CrossRef]
- Lu, Z.; Bai, L.; Wang, J.; Wei, J.; Xiao, Y.; Chen, Y. Peer-to-Peer Joint Electricity and Carbon Trading Based on Carbon-Aware Distribution Locational Marginal Pricing. IEEE Trans. Power Syst. 2023, 38, 835–852. [Google Scholar] [CrossRef]
- Wan, T.; Tao, Y.; Qiu, J.; Lai, S. Distributed Energy and Carbon Emission Right Trading in Local Energy Systems Considering the Emission Obligation on Demand Side. IEEE Syst. J. 2023, 17, 6292–6301. [Google Scholar] [CrossRef]
- Zhang, M.; Xu, Y.; Yi, Z. Two-Stage Carbon-Oriented Scheduling of an Active Distribution Network with Thermostatically Controlled Load Aggregators. IEEE Trans. Sustain. Energy 2024, 15, 1462–1474. [Google Scholar] [CrossRef]
- Liang, Z.; Mu, L. Multi-agent low-carbon optimal dispatch of regional integrated energy system based on mixed game theory. Energy 2024, 295, 130953. [Google Scholar] [CrossRef]
- Wang, Y.; Qiu, J.; Tao, Y. Optimal Power Scheduling Using Data-Driven Carbon Emission Flow Modelling for Carbon Intensity Control. IEEE Trans. Power Syst. 2022, 37, 2894–2905. [Google Scholar] [CrossRef]
- Zhang, Q.; Dehghanpour, K.; Wang, Z.; Huang, Q. A Learning-Based Power Management Method for Networked Microgrids Under Incomplete Information. IEEE Trans. Smart Grid 2020, 11, 1193–1204. [Google Scholar] [CrossRef]
- Ibrahim, N.N.; Jamian, J.J.; Md Rasid, M. Optimal multi-objective sizing of renewable energy sources and battery energy storage systems for formation of a multi-microgrid system considering diverse load patterns. Energy 2024, 304, 131921. [Google Scholar] [CrossRef]
- Du, Y.; Li, F. Intelligent Multi-Microgrid Energy Management Based on Deep Neural Network and Model-Free Reinforcement Learning. IEEE Trans. Smart Grid 2020, 11, 1066–1076. [Google Scholar] [CrossRef]
- Ceja-Espinosa, C.; Pirnia, M.; Cañizares, C.A. An Affine Arithmetic-Based Energy Management System for Cooperative Multi-Microgrid Networks. IEEE Trans. Smart Grid 2024, 15, 1317–1329. [Google Scholar] [CrossRef]
- Liu, X.; Li, S.; Zhu, J. Optimal Coordination for Multiple Network-Constrained VPPs via Multi-Agent Deep Reinforcement Learning. IEEE Trans. Smart Grid 2023, 14, 3016–3031. [Google Scholar] [CrossRef]
- Yan, M.; Shahidehpour, M.; Paaso, A.; Zhang, L.; Alabdulwahab, A.; Abusorrah, A. Distribution Network-Constrained Optimization of Peer-to-Peer Transactive Energy Trading Among Multi-Microgrids. IEEE Trans. Smart Grid 2021, 12, 1033–1047. [Google Scholar] [CrossRef]
- Wu, Y.; Zhao, T.; Yan, H.; Liu, M.; Liu, N. Hierarchical Hybrid Multi-Agent Deep Reinforcement Learning for Peer-to-Peer Energy Trading Among Multiple Heterogeneous Microgrids. IEEE Trans. Smart Grid 2023, 14, 4649–4665. [Google Scholar] [CrossRef]
- Pinto, R.S.; Unsihuay-Vila, C. A data-driven distributionally robust expansion planning model for ADNs with multi-microgrids considering energy trading strategy based on game theory. Sustain. Energy Grids Netw. 2024, 39, 101431. [Google Scholar] [CrossRef]
- Chen, X.; Zhai, J.; Jiang, Y.; Ni, C.; Wang, S.; Nimmegeers, P. Decentralized coordination between active distribution network and multi-microgrids through a fast decentralized adjustable robust operation framework. Sustain. Energy Grids Netw. 2023, 34, 101068. [Google Scholar] [CrossRef]
- Chen, T.; Bu, S.; Liu, X.; Kang, J.; Yu, F.R.; Han, Z. Peer-to-Peer Energy Trading and Energy Conversion in Interconnected Multi-Energy Microgrids Using Multi-Agent Deep Reinforcement Learning. IEEE Trans. Smart Grid 2022, 13, 715–727. [Google Scholar] [CrossRef]
- Monfaredi, F.; Shayeghi, H.; Siano, P. Multi-agent deep reinforcement learning-based optimal energy management for grid-connected multiple energy carrier microgrids. Int. J. Electr. Power Energy Syst. 2023, 153, 109292. [Google Scholar] [CrossRef]
- Gong, J.; Liu, Y. Coordinated Optimization of Active Distribution Network Based on Deep Deterministic Policy Gradient Algorithm. Autom. Electr. Power Syst. 2020, 44, 113–120. [Google Scholar]
- Gao, Y.; Yu, N. Model-augmented safe reinforcement learning for Volt-VAR control in power distribution networks. Appl. Energy 2022, 313, 118762. [Google Scholar] [CrossRef]
- Hu, Z.; Chan, K.W.; Zhu, Z.; Wei, X.; Zheng, W.; Bu, S. Techno–Economic Modeling and Safe Operational Optimization of Multi-Network Constrained Integrated Community Energy Systems. Adv. Appl. Energy 2024, 15, 100183. [Google Scholar] [CrossRef]
- Zhang, Y.; Mei, Z.; Wu, X.; Jiang, H.; Zhang, J.; Gao, W. Two-Step Diffusion Policy Deep Reinforcement Learning Method for Low-Carbon Multi-Energy Microgrid Energy Management. IEEE Trans. Smart Grid 2024, 15, 4576–4588. [Google Scholar] [CrossRef]
- Cao, D.; Zhao, J.; Hu, W.; Yu, N.; Ding, F.; Huang, Q.; Chen, Z. Deep Reinforcement Learning Enabled Physical-Model-Free Two-Timescale Voltage Control Method for Active Distribution Systems. IEEE Trans. Smart Grid 2022, 13, 149–165. [Google Scholar] [CrossRef]
- Wang, Y.; Qiu, D.; Sun, M.; Strbac, G.; Gao, Z. Secure energy management of multi-energy microgrid: A physical-informed safe reinforcement learning approach. Appl. Energy 2023, 335, 120759. [Google Scholar] [CrossRef]
- Ye, Y.; Wang, H.; Chen, P.; Tang, Y.; Strbac, G. Safe Deep Reinforcement Learning for Microgrid Energy Management in Distribution Networks with Leveraged Spatial–Temporal Perception. IEEE Trans. Smart Grid 2023, 14, 3759–3775. [Google Scholar] [CrossRef]
- Zhang, J.; Sang, L.; Xu, Y.; Sun, H. Networked Multiagent-Based Safe Reinforcement Learning for Low-Carbon Demand Management in Distribution Networks. IEEE Trans. Sustain. Energy 2024, 15, 1528–1545. [Google Scholar] [CrossRef]
- Ye, T.; Huang, Y.; Yang, W.; Cai, G.; Yang, Y.; Pan, F. Safe multi-agent deep reinforcement learning for decentralized low-carbon operation in active distribution networks and multi-microgrids. Appl. Energy 2025, 387, 125609. [Google Scholar] [CrossRef]










| Refs | User Data Privacy | Budget Fairness | Price Bounds | Clearing Stability | Safety Violation Rate |
|---|---|---|---|---|---|
| [1,6,11,16,21,26] | ✓ | × | × | × | × |
| [2,7,12,17,22,27] | ✓ | × | × | × | ✓ |
| [3,8,13,18,23,28] | × | × | × | ✓ | ✓ |
| [4,9,14,19,24] | ✓ | × | × | ✓ | × |
| [5,10,15,20,25,29,30] | ✓ | × | × | × | ✓ |
| This work | ✓ | ✓ | ✓ | ✓ | ✓ |
| Parameter | Value | Parameter | Value |
|---|---|---|---|
| 500 ($/MWh) | 0.05 | ||
| 0.8 (t/MWh) | 0.95 | ||
| 150 ($/MWh) | 100 ($/t) |
| Hyperparameters | Value | Hyperparameters | Value |
|---|---|---|---|
| Training steps | 3 × 105 | Number of iterations per step | 24 |
| Strategy network learning rate | 4 × 10−4 | 0.01 | |
| Value network learning rate | 4 × 10−4 | Gamma | 0.96 |
| Hidden layer dimensions | 50 and 250 | Smoothing area size | 1 × 105 |
| Dynamic batch size | ✓ | Layer normalization | ✓ |
| Initialization of interactions | ✓ | Initialization of interactions | ✓ |
| Network | Structure | ||
| Strategy network | L1 → LN → GeLU → L2 → GeLU → L3 → Tanh (policy head) | ||
| Value network | L1 → LN → GeLU → L2 → GeLU → L3 (value head) | ||
| Algorithm | C-DDPG | IDDPG | RBD | MADDPG |
|---|---|---|---|---|
| Total carbon emissions of MGs | 28.55 | 27.85 | 28.9 | 27.16 |
| MGs operating cost ( | 2741.12 | 2488.07 | 2689.76 | 2568.41 |
| Carbon intensity ratio of the ADN | 0.88 | 0.86 | 0.89 | 0.85 |
| ADN operating revenue ($) | 429.33 | 458.9 | 393.23 | 466.82 |
| Case | Case 1 | Case 2 | Case 3 |
|---|---|---|---|
| Total carbon emissions of MGs | 30.24 | 17.25 | 27.16 |
| MGs operating cost | 2901.16 | 4841.01 | 2568.41 |
| Carbon intensity ratio of the ADN | 0.91 | 0.81 | 0.85 |
| ADN operating revenue ($) | 498.5 | −177.4 | 466.82 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Wang, Z.; Li, S.; Yu, K.; Wei, W.; Lin, G.; Zhou, X.; Huang, Y.; Huang, Y. Signal-Guided Cooperative Optimization Method for Active Distribution Networks Oriented to Microgrid Clusters. Energies 2025, 18, 6614. https://doi.org/10.3390/en18246614
Wang Z, Li S, Yu K, Wei W, Lin G, Zhou X, Huang Y, Huang Y. Signal-Guided Cooperative Optimization Method for Active Distribution Networks Oriented to Microgrid Clusters. Energies. 2025; 18(24):6614. https://doi.org/10.3390/en18246614
Chicago/Turabian StyleWang, Zihao, Shuoyu Li, Kai Yu, Wenjing Wei, Guo Lin, Xiqiu Zhou, Yilin Huang, and Yuping Huang. 2025. "Signal-Guided Cooperative Optimization Method for Active Distribution Networks Oriented to Microgrid Clusters" Energies 18, no. 24: 6614. https://doi.org/10.3390/en18246614
APA StyleWang, Z., Li, S., Yu, K., Wei, W., Lin, G., Zhou, X., Huang, Y., & Huang, Y. (2025). Signal-Guided Cooperative Optimization Method for Active Distribution Networks Oriented to Microgrid Clusters. Energies, 18(24), 6614. https://doi.org/10.3390/en18246614

