A Graph Deep Reinforcement Learning-Based Fault Restoration Method for Active Distribution Networks
Abstract
1. Introduction
- (1)
- The literature [1] proposed a Deep Q-Network (DQN)-based load transfer method, delivering comprehensive optimal transfer schemes during faults.
- (2)
- The literature [2] partitioned distribution networks into microgrids, implementing distributed co-operative restoration via multi-agent systems.
- (3)
- The literature [3] developed an improved Dueling Double Deep Q-Network (D3QN) approach with dual agents to reduce dimensionality in urban grid load transfer.
- (1)
- Complex graph neural network modifications to agent architectures, coupled with high-penalty constraints for illegal actions, degrade convergence and induce suboptimal actions.
- (2)
- Fixed exploration mechanisms in value-function-based DRL algorithms constrain parameter optimization and generalization capability, compromising restoration quality stability.
- (3)
- Simplified load/DG output models limit adaptability to complex time-varying operational states in active distribution networks.
2. Fault Recovery Framework for Active Distribution Networks Based on the GAT-SAC Algorithm
2.1. GAT Algorithm
- (1)
- Critical node enhancement: Power sources (e.g., substations) consistently gain high weights (αij > 0.8), prioritizing backbone power supply paths.
- (2)
- Peripheral node suppression: Terminal load nodes receive attenuated attention (αij < 0.2), enabling flexible adjustment of non-critical loads.
- (3)
- Fault response: Weights on fault-adjacent branches surge by 300% during restoration. In the line 8–9 fault case (Section 4.2, illustrated in Figure 5), GAT assigns 4.2× higher weights to source nodes, driving the agent to close the nearest tie switch (S34).
2.2. SAC Algorithm
- 1
- Exploration efficiency: SAC’s entropy regularization (Equation (4)) drives more effective global exploration in continuous action spaces, while PPO’s clipping mechanism constrains exploration, causing local optima traps [13].
- 2
- Stability guarantee: SAC’s twin Q-network design (Equations (9) and (10)) avoids value overestimation risks inherent in TD3, ensuring stable policy updates in topologically volatile distribution networks [16].
- 3
- Scenario adaptability: The max-entropy framework (Equation (11)) maintains policy robustness under time-varying source–load conditions, achieving 4.2% higher load recovery than PPO/TD3 baselines (Table 1).
2.3. GAT-SAC Algorithm
2.4. Active Distribution Network Fault Recovery Method Based on the GAT-SAC Algorithm
2.4.1. Overall Framework of the Active Distribution Network Fault Recovery Method
2.4.2. Application Process of the Active Distribution Network Fault Recovery Method
3. Graph-Based Deep Reinforcement Learning Model for Active Distribution Network Fault Restoration
3.1. Agent and Action Space
- (1)
- Static rules: Hard-coded prohibitions include operating faulted-line switches (as defined in Equation (12)) and violating radial topology constraints.
- (2)
- Dynamic detection: Real-time switch state updates via adjacency matrix X identify redundant actions (e.g., re-toggling already operated switches), with historical actions stored per episode.
3.2. State Space
- (1)
- Dynamic feature embedding: The 24-period source–load profiles (Appendix A, Figure A2) are embedded as node features, where timestamps couple temporal dimensions with topology.
- (2)
- Spatiotemporal attention: GAT’s multi-head attention (Equations (1)–(3)) dynamically adjusts neighbor weights (e.g., enhancing grid connection weights when DG output drops at 19:00.
- (3)
- Temporal strategy adaptation: SAC’s entropy regularization (Equations (4) and (11)) drives time-specific exploration, generating distinct action sequences across periods (67% variation between 09:00/19:00 in Appendix A, Table A2).
3.3. Reward Function
3.3.1. Reward Component
3.3.2. Penalty Component
4. Case Study
4.1. Model Performance Evaluation
4.2. Fault Recovery Schemes
4.2.1. Fault Recovery Scheme of GAT-SAC Algorithm
4.2.2. Comparison of Restoration Schemes Across DRL Algorithms
4.3. Fault Recovery Efficiency Comparison
4.4. Fault Recovery Performance Under Topology Variations
- (1)
- Node reduction: Sectionalizing switch S30 was opened.
- (2)
- Connection alteration: Sectionalizing switch S9 was opened while tie switch S34 was closed; sectionalizing switch S27 was opened while tie switch S36 was closed.
4.5. Large-Scale System Validation
4.6. Sensitivity Analysis of DG Penetration Levels
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Appendix A
DG Type | Node Number | Rated Capacity/kW |
---|---|---|
Photovoltaic power | 6 | 500 |
12 | 500 | |
23 | 500 | |
Wind power | 20 | 200 |
30 | 300 |
Fault Line | The Moment When the Fault Occurs | Fault Recovery Plan | Load Power Supply Rate Prate (%) | Network Loss Rate (%) |
---|---|---|---|---|
8–9 | 09:00 | Combined S34 → combined S32 → disconnect S6 | 100 | 2.48 |
14:00 | Combined S34 → combined S36 → disconnect S27 | 100 | 2.08 | |
19:00 | Combined S34 → combined S32 → disconnect S6 → combined S36 → disconnect S27 | 100 | 3.56 | |
13–14 | 09:00 | Combined S33 → combined S36 → disconnect S25 | 100 | 2.94 (3.25) |
14:00 | Combined S33 → combined S36 → disconnect S25 → disconnect S31 | 98.38 | 2.28 (2.59) | |
19:00 | Combined S33 → combined S32 → disconnect S6 → combined S36 → disconnect S27 | 100 | 3.72 (4.70) | |
26–27 | 09:00 | Combined S36 | 100 | 2.95 |
14:00 | Combined S36 | 100 | 2.34 | |
19:00 | Combined S36 → combined S34 → disconnect S10 → combined S35 → disconnect S31 | 100 | 3.59 |
DRL Algorithm | The Moment When the Fault Occurs | Fault Recovery Plan | Load Power Supply Rate Prate (%) | Network Loss Rate (%) | Number of Switch Actions | Voltage/Current Exceeds the Limit | Ring Network |
---|---|---|---|---|---|---|---|
DQN | 09:00 | Combined S33 → combined S36 → disconnect S24 → combined S34 → disconnect S12 → disconnect S31 | 98.38 | 2.82 | 6 | No | No |
14:00 | Combined S33 → combined S34 → disconnect S11 → combined S36 → disconnect S26 | 100 | 2.19 | 5 | No | No | |
19:00 | Combined S34 → combined S32 → disconnect S6 → combined S33 → disconnect S7 → disconnect S9 → disconnect S31 | 96.77 | 3.42 | 7 | No | No | |
D3QN | 09:00 | Combined S34 → combined S33 → disconnect S7 → disconnect S31 | 98.38 | 2.50 | 4 | No | No |
14:00 | Combined S34 → combined S35 → disconnect S31 | 100 | 2.12 | 3 | No | No | |
19:00 | Combined S34 → combined S36 → disconnect S26 → combined S33 → disconnect S7 → disconnect S27 | 98.38 | 3.62 | 6 | The voltage exceeds the lower limit. | No | |
SAC | 09:00 | Combined S34 → combined S33 → disconnect S13 → combined S36 → disconnect S25 | 100 | 2.72 | 5 | No | No |
14:00 | Combined S34 → combined S36 → disconnect S27 | 100 | 2.08 | 3 | No | No | |
19:00 | Combined S34 → combined S32 → disconnect S6 → combined S35 → disconnect S31 → disconnect S9 | 98.38 | 3.47 | 6 | No | No | |
GAT-SAC | 09:00 | Combined S34 → combined S32 → disconnect S6 | 100 | 2.48 | 3 | No | No |
14:00 | Combined S34 → combined S36 → disconnect S27 | 100 | 2.08 | 3 | No | No | |
19:00 | Combined S34 → combined S32 → disconnect S6 → combined S36 → disconnect S27 | 100 | 3.56 | 5 | No | No |
References
- Xu, Z.; Tang, J.; Jiang, Y.; Qin, R.; Ma, H.; Yang, Y.; Zhao, C. Analysis of Fault Characteristics of Distribution Network with PST Loop Closing Device under Small Current Grounding System. Energies 2022, 15, 2307. [Google Scholar] [CrossRef]
- Tang, A.; Song, X.; Shang, Y.; Mengqi, Y.; Ximei, Z. Harmonic Mitigation Method and Control Strategy of Offshore Wind Power System Based on Distributed Power Flow Controller. Autom. Electr. Power Syst. 2024, 48, 20–28. [Google Scholar]
- Tang, A.; Yang, Y.; Yang, H.; Song, J.; Qiu, P.; Chen, Q.; Song, X.; Jia, T. Research on Topology and Control Method of Uninterrupted Ice Melting Device Based on Non-contact Coupling Power Flow Controller. Proc. CSEE 2023, 43, 8666–8674. [Google Scholar]
- Tang, A.; Ma, L.; Qiu, P.; Song, J.; Chen, Q.; Guan, M.; Song, X.; Xiong, B. Research on the harmonic currents rates for the exchanged energy of unified distributed power flow controller. IET Gener. Transm. Distrib. 2022, 17, 530–538. [Google Scholar] [CrossRef]
- Zhang, R.; Chen, X. Reliability evaluation of active distribution networks with multiple renewable energy sources. IEEE Trans. Power Syst. 2022, 37, 4892–4904. [Google Scholar]
- Hingorani, N.G.; Gyugyi, L. Understanding FACTS: Concepts and Technology of Flexible AC Transmission Systems, 2nd ed.; Wiley-IEEE Press: New York, NY, USA, 2023. [Google Scholar]
- Liu, Y.; Zhao, C. Advanced control strategies for MMC-STATCOM under unbalanced grid conditions. IEEE Trans. Power Electron. 2024, 39, 789–801. [Google Scholar]
- Zhao, H.; Wu, Q. Hierarchical control of cascaded STATCOM systems for power quality improvement. IEEE Trans. Smart Grid 2023, 14, 2890–2902. [Google Scholar]
- Zhao, Z.; Yang, P. Distributed coordination control of multiple STATCOMs in distribution networks. IEEE Trans. Power Deliv. 2023, 38, 3567–3579. [Google Scholar]
- Wang, W.; Tang, A.; Wei, F.; Yang, H.; Xinran, L.; Peng, J. Electric vehicle charging load forecasting considering weather impact. Appl. Energy 2025, 383, 125337. [Google Scholar]
- Yang, L.; Wang, W. Hybrid energy storage based dynamic voltage restorer with reduced cost and enhanced performance. IEEE Trans. Power Electron. 2023, 38, 8123–8135. [Google Scholar]
- Wang, Y.; Chen, C.; Wang, J.; Baldick, R. Research on resilience of power systems under natural disasters—A review. IEEE Trans. Power Syst. 2016, 31, 1604–1613. [Google Scholar] [CrossRef]
- Ding, T.; Lin, Y.; Li, G.; Bie, Z. A new model for resilient distribution systems by microgrids formation. IEEE Trans. Power Syst. 2017, 32, 4145–4147. [Google Scholar] [CrossRef]
- Wang, C.; Lei, S.; Ju, P.; Chen, C.; Peng, C.; Hou, Y. MDP-based distribution network reconfiguration with renewable distributed generation: An approximate dynamic programming approach. IEEE Trans. Smart Grid 2020, 11, 3620–3631. [Google Scholar] [CrossRef]
- Lei, S.; Chen, C.; Li, Y.; Hou, Y. Resilient disaster recovery logistics of distribution systems: Co-optimize service restoration with repair crew and mobile power source dispatch. IEEE Trans. Smart Grid 2019, 10, 6187–6202. [Google Scholar] [CrossRef]
- Butler, K.L.; Sarma, N.D.R.; Prasad, V.R. Network reconfiguration for service restoration in shipboard power distribution Systems. IEEE Trans. Power Syst. 2001, 16, 653–661. [Google Scholar] [CrossRef]
- Schmitz, M.; Bernardon, D.P.; Garcia, V.J.; Schmitz, W.I.; Wolter, M.; Pfitscher, L.L. Price-based dynamic optimal power flow with emergency repair. IEEE Trans. Smart Grid 2021, 12, 324–337. [Google Scholar] [CrossRef]
- Chen, C.; Wang, J.; Qiu, F. Resilient distribution system by microgrids formation after natural disasters. IEEE Trans. Smart Grid 2016, 7, 958–966. [Google Scholar] [CrossRef]
- Liu, C.C.; Lee, S.J.; Venkata, S.S. An expert system operational aid for restoration and loss reduction of distribution systems. IEEE Trans. Power Syst. 1988, 3, 619–626. [Google Scholar] [CrossRef]
- Carvalho, P.M.S.; Ferreira, L.A.F.M.; Barruncho, L.M.F. Optimization approach to dynamic restoration of distribution systems. Int. J. Electr. Power Energy Syst. 2007, 29, 222–229. [Google Scholar] [CrossRef]
- Li, J.; Ma, X.; Liu, C.; Schneider, K.P. Distribution system restoration with microgrids using spanning tree search. IEEE Trans. Power Syst. 2014, 29, 3021–3029. [Google Scholar] [CrossRef]
- Lopez, J.C.; Franco, J.F.; Rider, M.J.; Romero, R. Optimal restoration/maintenance switching sequence of unbalanced three-phase distribution systems. IEEE Trans. Smart Grid 2018, 9, 6058–6068. [Google Scholar] [CrossRef]
- Chen, B.; Chen, C.; Wang, J.; Butler-Purry, K.L. Multi-time step service restoration for advanced distribution systems and microgrids. IEEE Trans. Smart Grid 2018, 9, 6793–6805. [Google Scholar] [CrossRef]
- Zolfaghari, M.; Gharehpetian, G.; Shafie-khah, M.; Catalão, J.P.S. Comprehensive review on the strategies for controlling the interconnection of AC and DC microgrids. Int. J. Electr. Power Energy Syst. 2022, 136, 107742. [Google Scholar] [CrossRef]
Usage Method | Load Power Supply Rate/% | Training Cost/min | Decision-Making Time/s |
---|---|---|---|
Grey wolf algorithm | 91.83 | - | 17 |
SAC | 95.67 | 15 | 0.13 |
Penetration | Load Recovery Rate | Avg. Decision Time | Topology Adaptation Gain |
---|---|---|---|
30% | 94.7% | 0.14 s | +3.2% |
53.84% | 98.5% | 0.13 s | +4.5% |
70% | 96.8% | 0.15 s | +5.2% |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Dan, Y.; Zhong, H.; Wang, C.; Wang, J.; Fei, Y.; Yu, L. A Graph Deep Reinforcement Learning-Based Fault Restoration Method for Active Distribution Networks. Energies 2025, 18, 4420. https://doi.org/10.3390/en18164420
Dan Y, Zhong H, Wang C, Wang J, Fei Y, Yu L. A Graph Deep Reinforcement Learning-Based Fault Restoration Method for Active Distribution Networks. Energies. 2025; 18(16):4420. https://doi.org/10.3390/en18164420
Chicago/Turabian StyleDan, Yangqing, Hui Zhong, Chenxuan Wang, Jun Wang, Yanan Fei, and Le Yu. 2025. "A Graph Deep Reinforcement Learning-Based Fault Restoration Method for Active Distribution Networks" Energies 18, no. 16: 4420. https://doi.org/10.3390/en18164420
APA StyleDan, Y., Zhong, H., Wang, C., Wang, J., Fei, Y., & Yu, L. (2025). A Graph Deep Reinforcement Learning-Based Fault Restoration Method for Active Distribution Networks. Energies, 18(16), 4420. https://doi.org/10.3390/en18164420