Faulty Links’ Fast Recovery Method Based on Deep Reinforcement Learning
Abstract
:1. Introduction
- (1)
- Control Plane-Based Approach for Fast Recovery of Faulty Links
- (2)
- Data Plane-Based Approach for Fast Recovery of Faulty Links
- We summarize the transmission characteristics of the wide-area measurement system (WAMS) communication network, analyze the reasons why conventional faulty-link recovery algorithms are unsuitable for WAMSs, and emphasize the importance of minimizing fault recovery time and ensuring network link load balancing during the fault recovery process.
- We propose a rapid fault recovery method based on deep reinforcement learning (DDPG-LBBP) and design a suitable algorithmic framework for WAMS networks. During training, the Gated Recurrent Unit (GRU) is employed to mitigate gradient issues and enhance efficiency. After convergence, the model outputs optimal link weights to achieve link load balancing.
- We design the maximum disjoint backup path and introduce the backup label mechanism. The former prevents the overlap of nodes and links between the primary path and the backup path, eliminating secondary network link failures. The latter guides the phasor data packets to be transmitted through the backup path, enabling the rapid recovery of the faulty link and enhancing the overall reliability of the WAMS communication network.
- The IEEE 30 and IEEE 57 benchmark power system communication networks are adopted as experimental network topologies. Under different data traffic intensities, the DDPG-LBBP algorithm is compared with the (1+2ε)-BPCA, FFRLI, and LIR methods, respectively. The experimental results show that the DDPG-LBBP fault recovery method has the advantages of low faulty-link recovery delay, low packet loss rate, and high recovery success rate. Compared with the (1+2ε)-BPCA algorithm, the recovery delay is reduced by about 12.26%, and the faulty-link recovery success rate is increased by about 6.91%. Compared with the FFRLI method, the packet loss rate in the network topology after faulty-link recovery is reduced by about 15.31%.
2. Wide-Area Measurement System Communication Network SDN Architecture
2.1. System Architecture Design
- (1)
- Application plane: The DDPG-LBBP algorithm resides in the application plane, where it is responsible for formulating path optimization strategies within the network topology. Firstly, the DDPG-LBBP algorithm obtains network status information, such as link bandwidth, utilization, delay, and packet loss rate, through the northbound interface. Based on this information, the algorithm leverages a neural network to produce training outputs, which are used to calculate action values—specifically, link weights. The controller then computes the traffic transmission path from the source node to the destination node using these link weights. Next, the controller gathers network topology status information and forwards it to the DDPG-LBBP algorithm for reward value computation. The algorithm updates the parameters within the neural network based on feedback from the reward value. After multiple iterations, the training process is completed. Finally, upon the convergence of the DDPG-LBBP algorithm’s training, the weights of all links in the network topology are determined and output. These weights are then utilized via the northbound interface to guide the controller in formulating the optimal path strategy between the source and destination nodes.
- (2)
- Control plane: The control plane is composed of SDN controller entities, which are responsible for tasks such as acquiring link information and generating flow table entries. The controller periodically acquires network topology information via the southbound interface and provides real-time network status updates to the DDPG-LBBP algorithm through the northbound interface. Based on the link weights generated by the application plane, the controller calculates the data transmission paths between nodes, issues flow table entries to each switch, and directs data forwarding to the data plane.
- (3)
- Data plane: The data plane mainly consists of network forwarding devices, such as OpenFlow switches. It features flexible flow tables and dynamic packet processing capabilities. It is responsible for dynamically configuring and managing the behavior of the data plane based on instructions from the control plane. It can handle packet forwarding, routing, and processing according to the control plane’s commands.
2.2. System Modeling
3. Faulty-Path Recovery Scheme Based on DDPG-LBBP
3.1. DDPG-LBBP Data Transmission Path Optimization Algorithm
- (1)
- Actor network update
- (2)
- Critic network updates
3.2. DDPG-LBBP Agent Interacts with the Environment
- (1)
- State mapping
- (2)
- Action mapping
- (3)
- Reward value mapping
Algorithm 1 Improved DDPG algorithm process based on GRU |
Random initialization: parameters , , , , and Input: Link status information in network topology = [,,] |
Output: Link weight (1) |
(1) For episode = 1, do:
(2) Initialize Initialize noise strategy (3) For = 1, do (4) Select action according to the current policy (5) Execute to make the SDN controller build phasor data transmission paths (6) Obtain and (7) Store transition in D (8) Sample a random minibatch of from D (9) Calculate target return value , (10) Update the critic GRU network and update , the network parameters, by minimizing the error through (11) Update the parameters of the actor GRU network using the critic GRU network (12) Update the target network using and (13) End for (14) End for |
3.3. Method for Implementing Maximum Disjoint Backup Paths
Algorithm 2 Maximum disjoint backup path algorithm process |
Input: Network topology , link weight in the topology |
Output: The maximum disjoint backup path from start node to destination node |
(1) Initialize shortest path tree (2) For each neighbor of (3) If is not visited (4) (5) End if (6) Add to collection (7) While there are unvisited nodes (8) Update neighbor weights of (9) Find maximum disjoint backup path (10) End while (11) End for |
3.4. Backup Path Installation Method Implementation
4. Experiment and Result Analysis
4.1. Experimental Environment and Parameter Configuration
4.2. Comparative Evaluation of DDPG-LBBP
4.3. Faulty-Link Recovery Delay
4.4. Packet Loss Rate After a Faulty-Link Is Restored
4.5. Faulty-Link Recovery Success Rate
4.6. Ablation Experiments
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Appendix A
Glossary of Equation Terminology | |
---|---|
Switch | |
Link connecting switches and in the network topology | |
Path from source node to destination node | |
Quantity of bytes transmitted by the port of switch at time | |
Utilization rate of link at time | |
Packet loss rate | |
State | |
Action | |
Gradient | |
Discount factor | |
D | Experience replay pool |
WAMS | Wide-area measurement system |
SDN | Software-Defined Networking |
DDPG-LBBP | Load Balancing Backup Path Based on Deep Deterministic Policy Gradient |
PMU | Phase Measurement Unit |
PDC | Phasor Data Concentrator |
CNN | Convolutional Neural Network |
GRU | Gated Recurrent Unit |
References
- Hassan, M.A.M.; Abdalla, O.H.; Fayek, H.H.; Toha, S.F. Optimal WAMS Configuration in Nordic Power System. IJCSNS Int. J. Comput. Sci. Netw. Secur. 2023, 23, 130. [Google Scholar]
- Arpanahi, M.K.; Torkzadeh, R.; Safavizadeh, A.; Ashrafzadeh, A.; Eghtedarnia, F. A novel comprehensive optimal PMU placement considering practical issues in design and implementation of a wide-area measurement system. Electr. Power Syst. Res. 2023, 214, 108940. [Google Scholar] [CrossRef]
- Fayek, H.H.; Abdalla, O.H. Operation of the Egyptian Power Grid with Maximum Penetration Level of Renewable Energies Using Corona Virus Optimization Algorithm. Smart Cities 2022, 5, 34–53. [Google Scholar] [CrossRef]
- Vahidi, S.; Ghafouri, M.; Au, M.; Kassouf, M.; Mohammadi, A.; Debbabi, M. Security of wide-area monitoring, protection, and control (WAMPAC) systems of the smart grid: A survey on challenges and opportunities. IEEE Commun. Surv. Tutor. 2023, 25, 1294–1335. [Google Scholar] [CrossRef]
- Maleh, Y.; Qasmaoui, Y.; El Gholami, K.; Sadqi, Y.; Mounir, S. A comprehensive survey on SDN security: Threats, mitigations, and future directions. J. Reliab. Intell. Environ. 2023, 9, 201–239. [Google Scholar] [CrossRef]
- Jia, H.; Hou, W.; Wan, S.; Wang, T.; Xiang, H. Fast Communication Path Restoration for Power System Observability Recovery in a Sdn-Enabled Wide-Area Monitoring System. Available online: https://ssrn.com/abstract=4625443 (accessed on 22 February 2025).
- Qu, Y.; Chen, G.; Liu, X.; Yan, J.; Chen, B.; Jin, D. Cyber-resilience enhancement of PMU networks using software-defined networking. In Proceedings of the 2020 IEEE International Conference on Communications, Control, and Computing Technologies for Smart Grids (SmartGridComm), Tempe, AZ, USA, 11–13 November 2020. [Google Scholar]
- Kelian, V.H.; Mohd Warip, M.N.; Ahmad, R.B.; Phaklen, E.; Faiz, Z.F.; Zaizu, I.M. Traffic Engineering Provisioning of Multipath Link Failure Recovery in Distributed SDN Controller Environment, Proceedings of the AIP Conference Proceedings; AIP Publishing: Tokyo, Japan, 2024; p. 2898. [Google Scholar]
- Muthumanikandan, V.; Valliyammai, C. Link failure recovery using shortest path fast rerouting technique in SDN. Wire-Less Pers. Commun. 2017, 97, 2475–2495. [Google Scholar] [CrossRef]
- Zheng, L.; Xu, H.; Chen, S.; Huang, L. Performance guaranteed single link failure recovery in SDN overlay networks. In Proceedings of the 2020 IEEE 26th International Conference on Parallel and Distributed Systems (ICPADS), Hong Kong, China, 2–4 December 2020; IEEE: Piscataway, NJ, USA, 2020; pp. 703–708. [Google Scholar]
- Li, J.; Qi, X.; Ma, W.; Liu, L. Path selection for link failure protection in hybrid SDNs. Future Gener. Comput. Syst. 2022, 137, 201–215. [Google Scholar] [CrossRef]
- Astaneh, S.A.; Shah Heydari, S.; Taghavi Motlagh, S.; Izaddoost, A. Trade-offs between Risk and Operational Cost in SDN Failure Recovery Plan. Future Internet 2022, 14, 263. [Google Scholar] [CrossRef]
- Duan, T.; Dinavahi, V. Fast path recovery for single link failure in SDN-enabled wide area measurement system. IEEE Trans. Smart Grid 2021, 13, 1645–1653. [Google Scholar] [CrossRef]
- Wang, Y.; Feng, S.; Guo, H.; Qiu, X.; An, H. A single-link failure recovery approach based on resource sharing and performance pre-diction in SDN. IEEE Access 2019, 7, 174750–174763. [Google Scholar] [CrossRef]
- Nurwarsito, H.; Prasetyo, G. Implementation Failure Recovery Mechanism using VLAN ID in Software Defined Networks. Int. J. Adv. Comput. Sci. Appl. 2023, 14, 178. [Google Scholar] [CrossRef]
- Liang, D.; Liu, Q.; Yan, B.; Hu, Y.; Zhao, B.; Hu, T. Low interruption ratio link fault recovery scheme for data plane in software-defined net-works. Peer-to-Peer Netw. Appl. 2021, 14, 3806–3819. [Google Scholar] [CrossRef]
- Zhu, Z.; Yu, H.; Liu, Q.; Liu, D.; Mei, B. FFRLI: Fast fault recovery scheme based on link importance for data plane in SDN. Comput. Netw. 2023, 237, 110062. [Google Scholar] [CrossRef]
- Duan, T.; Dinavahi, V. Dataplane Based Fast Failover in SDN-Enabled Wide Area Measurement System of Smart Grid. IEEE Trans. Ind. Inform. 2022, 19, 8148–8158. [Google Scholar] [CrossRef]
- Tools, I.T.F.O.I.T.F.S.; Müller, S.C.; Georg, H.; Nutaro, J.J.; Widl, E.; Deng, Y.; Palensky, P.; Awais, M.U.; Chenine, M.; Küch, M.; et al. Interfacing power system and ICT simulators: Challenges, state-of-the-art, and case studies. IEEE Trans. Smart Grid 2016, 9, 14–24. [Google Scholar]
- Chakrabarti, S.; Kyriakides, E. Optimal placement of phasor measurement units for power system observability. IEEE Trans. Power Syst. 2008, 23, 1433–1440. [Google Scholar] [CrossRef]
- Zhu, X.; Wen, M.H.F.; Li, V.O.K.; Leung, K.-C. Optimal PMU-communication link placement for smart grid wide-area measurement systems. IEEE Trans. Smart Grid 2018, 10, 4446–4456. [Google Scholar] [CrossRef]
- Islam, M.T.; Islam, N.; Refat, M.A. Node to node performance evaluation through RYU SDN controller. Wirel. Pers. Commun. 2020, 112, 555–570. [Google Scholar] [CrossRef]
Software Name | Function in the Experiment | Interaction with Other Components |
---|---|---|
Ubuntu 18.04 | Offered stable runtime, managed resources, eased software installation | Supported Python, enabled SDN component comms, resource-supply Mininet |
Python 3.6.9 | Allowed development of algorithms and processed data | Invoked other libraries and interacted with Mininet and Ryu |
TensorFlow 1.8.0 | Supported the training of deep learning models | Built and optimized models within Python |
Gym 0.26.2 | Provided a simulation environment for reinforcement learning | Facilitated interaction during algorithm training |
OpenFlow protocol V1.3 | Enabled communication between the controller and switches and configuration | Managed flow tables |
Mininet simulation tool V2.3.0 | Simulated the network environment | Served as a test scenario for the algorithm |
Ryu controller V4.34 | Collected network information and guided data forwarding | Communicated with switches and provided data for the algorithm |
Training Time (s) | Average Recovery Delay (ms) | Average Packet Loss Rate (%) | Average Recovery Success Rate (%) | |
---|---|---|---|---|
DDPG-RNN | 356.8 ± 23.5 | 10.5 ± 1.2 | 6.5 ± 0.8 | 85.2 ± 3.5 |
DDPG-GRU | 214.6 ± 15.8 | 8.3 ± 0.9 | 4.2 ± 0.6 | 91.5 ± 2.8 |
DDPG-LBBP | 187.2 ± 12.4 | 6.8 ± 0.7 | 3.5 ± 0.5 | 94.5 ± 2.2 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Huang, W.; Gui, W.; Li, Y.; Lv, Q.; Zhang, J.; He, X. Faulty Links’ Fast Recovery Method Based on Deep Reinforcement Learning. Algorithms 2025, 18, 241. https://doi.org/10.3390/a18050241
Huang W, Gui W, Li Y, Lv Q, Zhang J, He X. Faulty Links’ Fast Recovery Method Based on Deep Reinforcement Learning. Algorithms. 2025; 18(5):241. https://doi.org/10.3390/a18050241
Chicago/Turabian StyleHuang, Wanwei, Wenqiang Gui, Yingying Li, Qingsong Lv, Jia Zhang, and Xi He. 2025. "Faulty Links’ Fast Recovery Method Based on Deep Reinforcement Learning" Algorithms 18, no. 5: 241. https://doi.org/10.3390/a18050241
APA StyleHuang, W., Gui, W., Li, Y., Lv, Q., Zhang, J., & He, X. (2025). Faulty Links’ Fast Recovery Method Based on Deep Reinforcement Learning. Algorithms, 18(5), 241. https://doi.org/10.3390/a18050241