Intelligent Power Management and Autonomous Fault Diagnosis for Enhanced Reliability in Secondary Power Distribution Systems

Li, Yongxiao; Hassan, Zaheer Ul; Sootahar, Haresh Kumar; Hussain, Touseef; Soothar, Kamlesh Kumar; Bhutto, Zulfiqar Ali

doi:10.3390/su17136009

Open AccessArticle

Intelligent Power Management and Autonomous Fault Diagnosis for Enhanced Reliability in Secondary Power Distribution Systems

by

Yongxiao Li

,

Zaheer Ul Hassan

^*

,

Haresh Kumar Sootahar

,

Touseef Hussain

,

Kamlesh Kumar Soothar

and

Zulfiqar Ali Bhutto

School of Electronics Engineering, Beijing University of Posts and Telecommunications, Beijing 100876, China

^*

Author to whom correspondence should be addressed.

Sustainability 2025, 17(13), 6009; https://doi.org/10.3390/su17136009

Submission received: 16 May 2025 / Revised: 26 June 2025 / Accepted: 28 June 2025 / Published: 30 June 2025

(This article belongs to the Special Issue The Electric Power Technologies: Today and Tomorrow)

Download

Browse Figures

Versions Notes

Abstract

Efficient decentralized power management is crucial for enhancing the reliability, resilience, responsiveness, and sustainability of secondary power distribution systems, thereby preventing major power outages and providing rapid responses. However, existing secondary power distribution networks are prone to failures, thus compromising their operational trustworthiness and efficiency. This work proposes an intelligent, decentralized control system with distributed processing capabilities. The proposed system is designed to automate fault detection and rectification along with optimized power management at secondary distribution nodes. The system enables rapid fault detection (line-to-line, line-to-ground, and overload) and initiates a fault-based response to isolate the load through controlled relays. Additionally, an intelligent power management system automatically rectifies surge faults (short-lived faults) and reports non-surge faults (persistent faults) to the control center. It continuously updates the status of real-time power parameters to the database using a Global System for Mobile Communications (GSM)-based communication system with a frequency of 60 s per sample for power management. The Proteus-based simulation and a scaled-down model validate the efficiency and supremacy of the proposed system over the existing control system for power distribution nodes. The results demonstrate that our model detects critical faults and initiates the response within 100 and 200 milliseconds, respectively. Surge faults are automatically rectified within 90 s, while non-surge faults are reported to the database after 90 s. This approach significantly reduces downtime, enables energy accountability, and supports sustainable energy management through a decentralized and distributed control system.

Keywords:

secondary power distribution; decentralized control; surge and non-surge faults; fault detection; real-time power monitoring; modern electrical systems; sustainable energy management

1. Introduction

The increasing demand for power at the consumer end has led to the automation of power distribution systems and to the prediction of power demand that changes continuously over time [1]. A continuous and reliable power supply is of central importance for modern infrastructure. Despite that, secondary power distribution networks in developing countries are affected by unexplained blackouts, ignored power outages, excessive power losses, and delayed fault response [2]. Socio-economic progress is hindered by this problem, and sustainable and adoptable decentralized solutions are being sought. The need for more efficient and reliable power distribution has also made researchers find ways to detect and remove faults more efficiently [3]. Previously, centralized systems were used for the automation of power transmission and distribution networks. Centralized systems are efficient in handling large networks, but they are not effective in secondary power distribution due to its complexity and diversity [4]. This problem is more prominent in developing countries, where the distribution system is not well structured or well planned and lacks a proper repair and maintenance system [5]. The secondary power distribution system is directly linked to the consumer end is most vulnerable to faults; this part of the network experiences surge and non-surge faults, resulting in power downtime [6].

Recently, significant progress has been made in developing more efficient and reliable techniques to detect, isolate, and restore faults in power distribution systems. Faults in the secondary power distribution system can cause problems at the whole distribution level if they are not resolved in time. Ref. [7] provides an extensive overview of fault detection, isolation, and restoration techniques. Additionally, it highlights the inability of present techniques to cater to real-time power demand fluctuations. While these techniques have greatly enhanced the efficiency and reliability of primary power distribution systems, none of them focus on the secondary power distribution system, which is more susceptible to faults, power failure, and power theft. Moreover, research is being conducted on the integration of modern communication systems and Artificial Intelligence (AI) algorithms to improve the prediction of load demand and the system’s reaction time to faults [8,9,10]. While this integration has improved the fault detection response, it relies on a centralized data processing model, which reduces the efficiency of the real-time fault management system and produces some unwanted delays. Therefore, an intelligent system is required at secondary distribution nodes to cater all these challenges.

Figure 1 is a pictorial representation of the designed system. The designed intelligent system has processing and control ability, and it is placed at the output of the secondary distribution node. It measures the required power parameters, controls the output relays, and communicates the information using GSM via Hypertext Transfer Protocol (HTTP) to the server. The communication is two-way, as the system receives instructions from the control center in the event of non-surge faults.

By providing a scalable solution, this work proposes a system that overcomes the drawbacks of existing techniques and is particularly well-suited for handling the challenges of secondary power distribution networks. Our system ensures a more dependable supply for end users by enhancing the resilience and efficiency of power distribution networks through distributed processing and real-time power parameter monitoring. The system’s design, implementation, and validation indicate that the decentralized secondary-level power management system is highly effective, as evidenced by simulation and scaled-down model testing. Our research is summarized as follows:

We proposed a decentralized system with distributed processing capabilities for real-time monitoring and processing of power parameters. The proposed system automates the power supply to consumers in secondary power distribution networks.
In our system, sensors and relay-based control switches are connected to a microcontroller to process, communicate, and efficiently perform actions to rectify the surge faults and communicate the non-surge faults with their location to the control center after 90 s for manual fault rectification.
The system continuously updates the database with the real-time power parameters and status of each secondary power distribution node, using GSM and an online server with a sampling rate of 60 s. This high-resolution of power parameter data is used for advanced power management.
Additionally, the proposed system monitors the energy consumption of each transformer, which is crucial for power loss detection and enhancing energy accountability across the distribution network.

The rest of this paper is organized as follows: in Section 2, we discuss the related work, while Section 3 describes the overall methods and a prototype of the system model. In Section 4, we demonstrate the extensive simulation results, and finally Section 5 concludes this study and describes possible future directions.

2. Literature Review

Baseline schemes for power distribution systems have predominantly relied on centralized automation and data acquisition systems. In these systems, control and data processing of power are centralized, allowing for systematic control from the center. However, such structured control systems introduce single points of failure and latency in fault response, making them less resilient [11]. In contrast, decentralized systems make decisions at local nodes, enhancing the robustness of the system [12,13]. Moreover, centralized systems are vulnerable to failure at the central node, which would cause the collapse of communication in the entire network. However, using a decentralized approach decreases the chances of system-wide failure because problems in one node do not affect the rest of the system [14]. In addition to enhancing reliability, decentralizing automation allows for a greater degree of scalability. An IoT-based monitoring and management system for distribution substations was developed using low-cost microcontrollers and cloud integration, offering real-time remote data access [15]. The transformers can be made nodal controllers, capable of scanning, detecting, and managing faults without going through a central node [16].

Moreover, the implementation of decentralized control systems for power distribution has demonstrated significant advantages, particularly for use in developing countries that face the challenges of unplanned power distribution network expansion. Decentralized systems are more reliable and cost-efficient to maintain because they enable localized fault identification and faster response times [17]. Additionally, the system cost is reduced by 25% in decentralized and distributed systems when compared to centralized systems for distribution networks of microgrids. Furthermore, decentralized energy resources (DERs) also enhance energy access and security within regions with poor infrastructure, including developing economies [18,19]. Conversely, fault detection and identification in every field are very important for reliable services [20]. Similarly, fault detection in decentralized power distribution systems, single-phase to ground with resonant grounding, can be detected in less than three cycles, enhancing system reliability and reducing downtime [13]. Similarly, a GSM-based fault detection and localization system in a three-phase power distribution network enables automatic monitoring, fault reporting, and prompt response to system failures [21]. The fault automation approach using GSM communication improves low-voltage power reliability [22]. This approach demonstrates that it is possible to provide automated fault rectification with the combination of GSM communication and decentralized decision-making. It is more efficient than traditional centralized monitoring systems, mainly focusing on fault identification rather than fault rectification techniques.

Combining automation with communication networks greatly improves response and fault handling in a decentralized system. Additionally, decentralized systems are easy and more flexible to control, are resilient to failures, and can properly pinpoint the faults [12]. Decentralized systems offer a robust and affordable alternative to centralized control systems in terms of reliability and economic performance. Additionally, decentralized systems have enhanced scalability and reduced potential failure points, and they allocate decision-making power to the transformers [17]. Table 1 compares and indicates the superiority of the proposed system over existing centralized systems.

A comprehensive review of AI-based fault localization techniques in power distribution systems is presented in [28], comparing methods such as neural networks, fuzzy logic, and reinforcement learning. The study also highlights the role of data sources, fault types, and distributed generation in influencing method applicability. AI classification in ring-type distribution is highly promising but offers central processing [29]. Moreover, while techniques like blockchain and IoT-based models focus on security and detection, they fail to address overhead power losses or bidirectional communication, and need complex load profiling based on AI/ML algorithms [30]. However, prior decentralized systems have shown limited contribution to secondary power distribution networks, and power converter faults (such as IGBT failures in inverters) are often overlooked. Industrial approaches like fault-tolerant H-bridge topologies [31] are of great importance for future decentralized systems to avoid component-level failures.

To exploit these gaps, our system detects faults with a mean value of 90 ms and responds with a mean value of 186 ms, respectively. Our system takes advantage of the systematic signature of distribution faults, i.e., current abnormalities that occur uniformly in line-to-line (L-L), line-to-ground (L-G), and overload faults. Because the proposed system measures current continuously, it provides comprehensive protection without requiring complex fault profiling. The proposed system rectifies surge faults within 90 s and communicates persistent faults to the server using GSM/HTTP-based communication. The system provides bidirectional connectivity between the server and the node. The proposed model has been tested only in a lab-controlled environment, and it needs real-world power distribution system validation.

3. Methods and Materials

This research developed a decentralized and distributed control strategy for the secondary power distribution network. The system detects and mitigates power surges and flags non-surge faults, with real-time power parameter monitoring and communication to the server using GSM. The methodology combines validation by simulation on Proteus (8.13) and a scaled-down hardware prototype. This section includes a description of the system architecture, simulation model and hardware prototype, and the working algorithm, as well as a stability analysis.

3.1. System Architecture

The decentralized and distributed intelligent autonomous system for secondary power distribution comprises an Arduino Mega 2560 for processing, input current sensors, voltage sensors, and control relays to regulate the electric supply to the load side. The system measures the current and voltage and formulates the power and energy using its local processing unit. It is programmed to control the output load within safe limits using relay switches according to the designed algorithm. Figure 2 shows the system architecture and hierarchy of connectivity for the proposed system. The distribution node supplies power to the load at the required voltage, while a 12-volt DC power supply is also connected to the distribution side to power up the microcontroller. A buck converter further reduces the 12-volt DC supply to 4.2 volts to power up the GSM module.

3.2. Simulated Model Design

Proteus simulation software allows for accurate emulation of IoT systems and their integrated microcontrollers, replicating the operational behavior of actual hardware. This capability enables us to simulate the proposed model within Proteus for precise and accurate observations. The simulation model includes two virtual terminals. Virtual terminal “A” visualizes power parameters in real time, while virtual terminal “B” shows the server interface. Figure 3 shows the schematic simulation of the designed model. The simulated model features a single phase of a distribution transformer with a capacity of 33 kVA

(P_s a f e

= 33.33 kVA). The simulation model’s load line is designed with the following parameters: 500 m distribution line with 50 mm² cross-sectional area, and the wire material is aluminum with

ρ = 2.82 \times 10^{- 8}

ohm·m. The transformer’s supply is connected to three units of load, i.e., L1, L2, and L3 (each approximately 15 kW), through control switches to turn the load on or off. Current and voltage measurement sensors on the secondary side of the distribution node feed power parameters to the microcontroller.

3.3. Hardware Prototype Development

A scaled-down hardware model of the proposed system has been designed and implemented to verify the simulation results and test the communication components. The model communicates the required power parameters to the server. This scaled-down model has been built for a 1000-watt safe load with 5% tolerance on the upper limit. Figure 4 indicates the hardware scaled-down model of the system.

The designed model working algorithm mirrors that of the simulated model, with the addition of communication components, i.e., GSM and server, to validate the bidirectional communication in the real model. A buck converter is added to power the GSM module as it operates on 3.8–4.2 volts. The load board is used for connecting the load. A relay controlled by a microcontroller enables automatic switching of the load, and a current sensor is clamped onto the wire carrying the load current. The model also has indication LEDs for SIM card, fault current, and network check. The scaled-down model is capable of detecting faults, migrating them, and communicating the power parameters and fault status to the server in real time.

3.4. Working Algorithm

The system carries out real-time online processing on the Arduino Mega 2560 microcontroller. It continuously processes live sensor data streams, performing 50 ms control cycles during normal operation. L-L, L-G, and overload faults at the power distribution node can be manifested as abnormalities in current, so the system efficiently addresses these faults. As these faults share common characteristics of high current values, the system’s current threshold-based protection system can handle these faults without the need for comprehensive fault profiling.

Figure 5 illustrates the working flow of the proposed model and the response of the system under different conditions. The system starts by turning ON the timer and power supply. Initially, the fault status is set to clear. A counter to count the number of times the system tried to clear a surge is set to zero, and finally, the system starts measuring power parameters to keep the system protected. During normal operation, after every 60 s, power parameters are sent to the server using GSM communication.

In the case of overload, the current is higher than the set threshold. As soon as that fault current is detected, the system waits for 10 control cycles, i.e., 500 ms, before triggering the fault mechanism to prevent false flags. (If the overload is more than 3 times the upper threshold, it triggers immediately.) The system then switches off the load using a relay-based control switch, waits for 30 s per IEC 60909, which specifies 30–60 s for temporary fault recovery (https://kupdf.net/download/iec60909_5b05ba45e2b6f50b4dcf2ffb_pdf, (accessed on 15 May 2025)), and increases the counter by 1. It then checks if the counter is less than or equal to 3; if yes, it turns ON the load again and checks if the load is within the safe limit. The system repeats this loop 3 times, and if the load falls back within the safe limit at any time between loops, the system resets the counter to zero and resumes working smoothly per IEEE Std 1547-2018, 2018 (https://standards.ieee.org/products-programs/standards-related/interactive-standards/, (accessed on 15 May 2025)). The fault persistence is modeled as

P_{load} (t) = P_{steady} + (P_{peak} - P_{steady}) \cdot e^{- t / τ}

(1)

Here,

P_{load} (t)

is the power taken by the load at a given time t,

P_{steady}

is the power a load takes in its steady state,

P_{peak}

is the peak power taken initially, t is the relay OFF time, i.e., 3 intervals of 30 s, and

τ

is the time for which the load takes extra current. For surge faults,

τ < t

. But if the load remains higher than the threshold and the counter goes above 3, this event is flagged as a non-surge fault. The system’s overload stays persistent,

τ > t

, even after 3 tries, so the system waits for manual fault rectification. When the system receives a fault status cleared from the server, it resumes its normal operation.

3.5. State-Space Stability Analysis

To guarantee system stability and reliability under both surge and non-surge fault conditions, a state-space model was developed which applies Lyapunov stability. The analysis considers a single node of a secondary power distribution network with line impedance

R + j ω L

and a resistive load

R_{L}

. The state vector is defined as

x = [\begin{matrix} i & v s . \end{matrix}]

, where i represents the line current and

v s .

is the load voltage. The governing equations are formulated using Kirchhoff’s voltage law (KVL) and are expressed in the standard state-space form.

L \frac{d i}{d t} + R i + v s . = u \cdot V_{supply}

(2)

Here,

v = R_{L} \cdot i

(load equation) and

u \in {0, 1}

(relay control). Substituting and rearranging the equation,

\frac{d i}{d t} = - \frac{(R + R_{L})}{L} i + \frac{V_{supply}}{L} u

(3)

For stability analysis, we use a Lyapunov stability analysis as

V (x) = x^{T} P x

, where

P = P^{T} > 0

, and P is the positive definite matrix.

P = [\begin{matrix} p_{11} & 0 \\ 0 & p_{22} \end{matrix}]

(4)

The stability is defined as

\dot{V} < 0

(the derivative of V should be negative):

\dot{V} = - 2 \frac{(R + R_{L})}{L} p_{11} i^{2} + 2 \frac{V_{supply}}{L} p_{11} i u

(5)

a. Relay OFF (

u = 0

):

\dot{V} = - 2 \frac{(R + R_{L})}{L} p_{11} i^{2} < 0 (always stable)

(6)

b. Relay ON (

u = 1

):

\dot{V} = - 2 \frac{(R + R_{L})}{L} p_{11} i^{2} + 2 \frac{V_{supply}}{L} p_{11} i (\dot{V} < 0, if i > \frac{V_{supply}}{(R + R_{L})})

(7)

This aligns with our system implementation, i.e., that system sets

I_{threshold} = \frac{V_{supply}}{(R + R_{L})}

. The system stability for the load current is defined in Table 2. The stability analysis provides a strong foundation to ensure the system’s stability under the specific experimental conditions of the designed system. Our analysis ensures stability for critical cases by relay actions, i.e., turning it OFF (fault isolation when overloaded) and ON (stable operation below the threshold current). The system’s design directly supports stability analysis by the control mechanism of relays and threshold current.

4. Results and Discussion

The effectiveness of the proposed decentralized control system is tested through simulation and hardware experiments. This section presents results under three scenarios: normal or safe load, surge fault, and non-surge fault conditions. The aim was to evaluate the system’s real-time monitoring, fault recovery, and inter-system communication functions. In this section, simulation results illustrate the system reacting to different loading and unloading scenarios. Moreover, experimental results from hardware implementation examine the system reliability in the real model, critical response times evaluations, communication in workflows, and fault escalation logic.

4.1. Simulation Results

The proposed system was simulated using Proteus, and the system’s response was observed on virtual terminals “A” and “B” to verify the implications of our methodology. Virtual terminal “A” displays the real-time power parameters being measured with a display sampling rate of 10 samples per minute. Virtual terminal “B” simulates the server with a sampling rate of 1 sample per minute during normal operation and immediately when a non-surge fault is detected. As discussed in the Simulated Model Design subsection, the system is simulated using three units of load, such that two units of load remain within the safe limit and the third behaves as an overload. Here, it is important to note that watts are measured with the formula

P = V I

, so there is no difference between watts and volt-amperes (power factor is considered as 1).

4.1.1. Smooth Working

The smooth working section simulates the system within a safe load. Under safe load conditions, the system operates smoothly and communicates power parameters at regular, defined intervals to the server. In Figure 6, sample numbers 8 to 19 on virtual terminal “A” demonstrate how the load fluctuates yet remains within the safe operational limit, while virtual terminal “B” (server) receives a sample after every 60 s (10 samples per minute on virtual terminal “A”). The relay stays ON (i.e.,

u = 1

) and the system satisfies the criterion

I_{load} < I_{threshold}

needed for the system’s stability.

The distributed power satisfies the condition

P (t) \leq P_{threshold}

, and the total energy at time t is given by

E (t) = \int_{0}^{t} P (t) d t

(8)

where

P (t)

is the distributed power,

E (t)

is the energy at time t, and

P_{threshold}

is the threshold distributed power.

4.1.2. Surge Fault

Surge faults are categorized as short-lived faults, caused by L-L, L-G, or overload events in the distribution network, which disappear after a defined interval. Figure 7 shows that “Sample No. 19” in the system experienced an overload and responded by turning OFF the relay (i.e.,

u = 0

) and achieving system stability. The system waited 30 s to make the first try to retain power, but as can be seen at “Sample No. 20”, the system was still in overload, so it attempted a second try and successfully restored the power supply; then, the overload status disappeared (Sample No. 21). Referring to Equation (1), the system waited for t time, as explained in the working algorithm. Here,

τ

(duration of overload) was less than t as the system’s load returned to the safe limit within t, allowing the system to regain stability with the load still ON.

4.1.3. Non-Surge Fault

Non-surge faults are categorized as persistent faults caused by L-L, L-G, or overload events in the distribution network. When the designed system encounters this kind of overload, it reports the fault status to the server to keep the system safe. It turns OFF the load side and waits for manual instruction to restore the power. Figure 8 shows that at “Sample No. 22” on virtual terminal “A”, the system experienced an overload, causing instability and triggering the relay to switch OFF to stabilize the system. The system tried to rectify the fault, but as shown from “Sample Nos. 23–26”, it remained overloaded for all three tries. As defined in the working algorithm, the system declared it as a non-surge fault (virtual terminal A), and the server terminal received the fault status.

The demand and response curve plotted against time for simulation results shows how the system responds to different demands of loads. Figure 9a shows the demand curve in kW, and Figure 9b shows the response curve of the system response to fluctuating load demand. The simulation duration of 510 s was specifically chosen to capture the complete system response for all workflows: steady state (0–120 s), surge fault recovery (120–180 s), non-surge fault escalation (300–390 s), and post-recovery stability (420–510 s). This allows for complete analysis of fault-handling cycles, stresses on the algorithms, and resets of the system. The response curve clearly shows that the system successfully achieves the stability conditions mentioned in Table 2. The system supplies the load within safe defined limits and turns OFF the supply to the load when its overload causes the system’s instability, which completely justifies our working algorithm.

4.2. Experimental Results

A scaled-down model of the proposed system is implemented to test the real-time response in a physical environment. The experimental setup is tested under different conditions of load (under a safe limit, surge fault, and non-surge fault) to verify its designed operations. The safe threshold of the load is set to 1050 watts, representing a nominal load value of 1000 watts plus 5% tolerance, considering the intrinsic errors of sensors. Moreover, communication components like GSM and the server database are tested using this physically scaled-down model of the system. The system uses an Arduino as a processing board; the Arduino IDE provides the user with a virtual display on a PC to observe the response of hardware models in real time. The hardware model of our system is tested using the same algorithm and conditions but at a scaled-down level with the best available resources. The scaled-down model shows promising results to verify the validity of the methodology in the real model, and it also satisfies the stability analysis.

4.2.1. Smooth Working

During smooth working experimentation, the load is kept within safe, defined load limits, i.e., 1050 kW. The system continues to measure the power parameters and remains in a stable state. To test the transient inrush immunity of the system, inductive loads are also tested to verify the system’s smooth operation. Figure 10a shows the response on the virtual display of the Arduino IDE, and Figure 10b shows the server’s response in real time. The virtual display verifies that the system continues to measure power parameters, and after 60 s, it sends the information to the server. The server sends an acknowledgment signal upon receiving the collected data. “Command 1” on the server refers to data received under normal operation, and “Status” is the node’s current status, which is “ON” in this condition.

To further validate the performance of the system in smooth operation, six load values were generated and programmed to switch ON and OFF randomly. It was tested for 1000 samples, with each sample having an interval of 6 s. Through this extended experimentation, load values remained within the safe limit of the system. Figure 11a plots the “Sample Number” vs. “ Load (w)”, showing that two false flags were triggered in 1000 samples of time, and all the flags were triggered at the upper threshold. Experimental results confirm 99.8% accuracy of the proposed model in real-world testing during smooth operation. Furthermore, Figure 11b shows the time taken by the system to send each sample to the server. The experimentation confirms that it sends a sample to the server with an average interval of 60.4 s, verifying that the frequency of data received at the server during experimentation is closely aligned with the designed 60 s interval.

4.2.2. Surge Fault

To test the system under surge fault conditions, an overload is generated for a short interval of time and then removed. Figure 12 shows that the virtual terminal of the Arduino experienced an overload (marked with a red arrow) and it turned OFF the load, waited for 30 s, and again turned it ON. It still showed an overload, so the system gave a second try. This time the load successfully returned to the safe limit, and virtual terminal displayed the message “Surge Fault Cleared”, allowing the system to resume smooth operation. So the system detected, responded to, and cleared a surge fault automatically, validating the designed algorithm for the system.

To empirically test the surge fault detection and response of the system, 100 surge faults are generated, and the detection and response times are observed. Our system has a 20 Hz (making a 50 ms loop cycle) refresh rate for the sensors’ values. The detection and response time of each surge sample is plotted to visualize the experimental results of the system during the surge fault conditions. Figure 13 illustrates the system’s time of detection and response to a fault in milliseconds. The experiment confirms that the mean detection time is 90 ms and the mean response time is 186 ms, confirming the system’s real-time detection and on-node resolution action in case of surge faults.

4.2.3. Non-Surge Fault

Persistent faults are generated to verify the working system under non-surge fault conditions. Persistent faults that are not resolved by the system’s auto recovery (three attempts) are communicated to the control center for operator-supervised intervention. This ensures the safety of the system, as critical faults are handled by trained professionals. The system is kept OFF until the fault is manually resolved and the system receives a command from the server to start normal operation again.

Figure 14a shows that the system experienced an overload (marked with red arrows) and made three attempts with a 30 s interval to restore power, but the system remained overloaded. After three attempts and 90 s, the system sent the issue to the server and got a response of “2” (marked in purple), which means the fault status was successfully updated to the server and the system was currently waiting for manual fault rectification. Figure 14b shows the real-time server response on a web page. This demonstrates that the server received the critical information send by hardware model, i.e., information on the overload and the status of the system (OFF) with the unique IMEI of that system. Once the overload was removed, remote instruction was sent using an HTTP link to the server. Figure 15 shows the system waiting for manual fault rectification. As soon as it received a response from the server (a response of 3, marked with a red arrow, means fault removed), the system first checked if it was still overloaded and then turned ON the power supply.

To check the robustness of the system, an experiment is designed to generate 100 non-surge faults, and their detection and response time are observed. As the system tries to restore power within three attempts, the detection and response time for non-surge faults is measured after 90 s. Figure 16 is the plot of the non-surge sample vs. its detection and response time (triggering server response). The figure validates that the average detection time is 92 ms and the average response time is 186 ms.

4.3. Simulation vs. Experimental Results

Table 3 summarizes the results from both the simulated model and the scaled-down hardware model. It provides a comprehensive and comparative analysis of results. The key performance metrics and operational characteristics are presented in the table.

5. Conclusions and Future Work

This research work focuses on the development of an intelligent decentralized system with distributed processing capability for secondary power distribution nodes. The research aims to develop a novel strategy that can detect and resolve surge faults in a node and communicate non-surge faults to the server. It monitors the critical power parameters in real time and uploads the data to the server. This work focuses on developing a decentralized IoT-based system. It uses a simple yet effective method to profile the faults in terms of overload current (short-lived, persistent). The superiority of the proposed system model over traditional controlled distribution systems is demonstrated by both simulation and hardware results. We also verified its ability to continuously monitor and communicate power parameters at specified intervals, achieving fault detection in 90 ms (92 ms for non-surge faults) and a response in 186 ms. Results also verified the mitigation of the surges in 90 s and communicated the non-surge fault after 90 seconds.

The proposed scheme makes the system a reliable solution for microgrids, as dynamic data-driven AI models can be a great solution for decentralized energy production with renewable sources for a greater environmental impact, even in dynamic energy trends. Future work will deal with overcoming limitations, like the implementation of the system in a real-world setting, detailed fault profiling, addressing the stability of the system in the real-world environment, and handling various nodes’ communication simultaneously. Future research will also address sustainable maintenance protocols and component failure resilience.

Author Contributions

Conceptualization, Y.L. and Z.U.H.; methodology, Y.L. and Z.U.H.; software, Z.U.H.; validation, Y.L., Z.U.H., H.K.S., and K.K.S.; formal analysis, Y.L.; investigation, T.H., K.K.S., and Z.A.B.; resources, Y.L. and Z.U.H.; data curation, Z.U.H., T.H., and K.K.S.; writing—original draft preparation, Y.L., Z.U.H., and H.K.S.; writing—review and editing, H.K.S., T.H., and K.K.S.; visualization, H.K.S., K.K.S., and Z.A.B.; supervision, Y.L. and Z.U.H.; project administration, Z.U.H.; funding acquisition, Y.L. All authors have read and agreed to the published version of the manuscript.

Funding

National Science Foundation for Young Scholars of China (62205031), Beijing University of Posts and Telecommunications, Beijing, China.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in the study are included in the article.

Acknowledgments

We acknowledge setup funding for associate researchers from Beijing University of Posts and Telecommunications, Beijing, China.

Conflicts of Interest

The authors declare no conflicts of interest.

References

International Energy Agency. Electricity Market Report 2023. Available online: www.iea.org (accessed on 13 May 2025).
Salman, H.M.; Pasupuleti, J.; Sabry, A.H. Review on Causes of Power Outages and Their Occurrence: Mitigation Strategies. Sustainability 2023, 15, 15001. [Google Scholar] [CrossRef]
Yao, Y.; Zhao, Y.; Li, Y.; Liu, L.; Zhou, H.; Tang, Y. A Real Data-Driven Fault Diagnosing Method for Distribution Networks Based on ResBlock-CBAM-CNN. Electricity 2025, 6, 19. [Google Scholar] [CrossRef]
Liu, S.; Lin, Z.; Dong, Y.; Zhao, J. Editorial: Power system operation and optimization considering high penetration of renewable energy. Front. Energy Res. 2024, 12, 1483215. [Google Scholar] [CrossRef]
Davison, E.J.; Aghdam, A.G.; Miller, D.E. Centralized Control Systems. In Decentralized Control of Large-Scale Systems; Springer: New York, NY, USA, 2020; pp. 1–21. [Google Scholar] [CrossRef]
Xia, C.; Yao, T.; Wang, W.; Hu, W. Effect of Climate on Residential Electricity Consumption: A Data-Driven Approach. Energies 2022, 15, 3355. [Google Scholar] [CrossRef]
Srivastava, I.; Bhat, S.; Vardhan, B.V.S.; Bokde, N.D. Fault Detection, Isolation and Service Restoration in Modern Power Distribution Systems: A Review. Energies 2022, 15, 7264. [Google Scholar] [CrossRef]
Anwar, T.; Khan, M.A.; Anwar, M.W.; Khattak, A.M.; Qamar, S.; Akhtar, S.; Hassan, R. Robust fault detection and classification in power transmission lines via ensemble machine learning models. Sci. Rep. 2025, 15, 2549. [Google Scholar] [CrossRef]
Sisinni, E.; Saifullah, A.; Han, S.; Jennehag, U.; Gidlund, M. Industrial Internet of Things: Challenges, Opportunities, and Directions. IEEE Trans. Ind. Inform. 2018, 14, 4724–4734. [Google Scholar] [CrossRef]
Cao, Y.; Tang, J.; Shi, S.; Cai, D.; Zhang, L.; Xiong, P. Fault Diagnosis Techniques for Electrical Distribution Network Based on Artificial Intelligence and Signal Processing: A Review. Processes 2024, 13, 48. [Google Scholar] [CrossRef]
Haydaroğlu, C.; Kılıç, H.; Gümüş, B.; Özdemir, M.T. Advancing Fault Detection in Distribution Networks with a Real-Time Approach Using Robust RVFLN. Appl. Sci. 2025, 15, 1908. [Google Scholar] [CrossRef]
Ibekwe, K.I.; Adeniran, A.O.; Okonkwo, E.C.; Nnaji, S.; Egenti, R. Microgrid systems in U.S. energy infrastructure: A comprehensive review: Exploring decentralized energy solutions, their benefits, and challenges in regional implementation. World J. Adv. Res. Rev. 2024, 21, 973–987. [Google Scholar] [CrossRef]
Fiorucci, E.; Fioravanti, A.; Mari, S.; Luiso, M.; Ciancetta, F. Driving Sustainable Development with PMU Systems in Distribution Grids. Sustainability 2025, 17, 5280. [Google Scholar] [CrossRef]
Barik, M.A.; Gargoom, A.; Mahmud, M.A.; Haque, E.; Al-Khalidi, H.; Than Oo, A.M. A Decentralized Fault Detection Technique for Detecting Single Phase to Ground Faults in Power Distribution Systems with Resonant Grounding. IEEE Trans. Power Deliv. 2018, 33, 2462–2473. [Google Scholar] [CrossRef]
Alam, M.T.; Ashiquzzaman, M.; Alam, S.M.T.; Mostak, M.S.; Rahman, H.; Das, P. IoT-Based Power Monitoring and Management System of a Distribution Substation. In Proceedings of the 2023 10th IEEE International Conference on Power Systems (ICPS), Cox’s Bazar, Bangladesh, 13–15 December 2023; pp. 1–6. [Google Scholar] [CrossRef]
Veselý, V.; Körösi, L. Decentralized Control of Complex Systems: Lyapunov Function Approach. Electronics 2024, 13, 5024. [Google Scholar] [CrossRef]
Aoun, A.; Adda, M.; Ilinca, A.; Ghandour, M.; Ibrahim, H. Centralized vs. Decentralized Electric Grid Resilience Analysis Using Leontief’s Input–Output Model. Energies 2024, 17, 1321. [Google Scholar] [CrossRef]
Elizondo, G. Harnessing the Power of Distributed Energy Resources in Developing Countries: What Can Be Learned from the Experiences of Global Leaders? The Oxford Institute for Energy Studies: Oxford, UK, 2023. [Google Scholar]
Zhuang, Y.; Fang, X. The Real-Time Distributed Control of Shared Energy Storage for Frequency Regulation and Renewable Energy Balancing. Sustainability 2025, 17, 4780. [Google Scholar] [CrossRef]
Soothar, K.K.; Chen, Y.; Magsi, A.H.; Hu, C.; Shah, H. Optimizing Optical Fiber Faults Detection: A Comparative Analysis of Advanced Machine Learning Approaches. Comput. Mater. Contin. 2024, 79, 2697–2721. [Google Scholar] [CrossRef]
Gunasekar, T.; Aarthi, V.; Bharanidharan, K.L.; Ashwin, S.; Venkat, V.; Mohanasundaram, T. GSM Based Fault Detection in Three Phase Power Distribution System. In Proceedings of the 2021 7th International Conference on Advanced Computing and Communication Systems (ICACCS), Coimbatore, India, 19–20 March 2021; pp. 384–388. [Google Scholar] [CrossRef]
Sreejith, P.R.; Fathima, M.M.; Krishnaprasad, V.N.; Raphy, J.; George, A.; James, J. Powerline Fault Detection and Location Tracking with GSM. In Proceedings of the 2024 7th International Conference on Circuit Power and Computing Technologies (ICCPCT), Nagercoil, India, 21–22 August 2024; pp. 1331–1336. [Google Scholar] [CrossRef]
Sezgin, M.E.; Gol, M. Distributed energy management and communication strategy for network of microgrids. Electr. Power Syst. Res. 2025, 238, 111079. [Google Scholar] [CrossRef]
Cao, Y.; Zhang, Y.; Wu, Y. Distribution Automation: Enhancing Efficiency and Reliability in Power Distribution Systems. Acad. J. Sci. Technol. 2023, 6, 6–8. [Google Scholar] [CrossRef]
Rondinelli, D.; Nellis, N.J.; Cheema, G.S. Decentralization in Developing Countries: A Review of Recent Experience (English); World Bank Policy Research Working Paper; SWP 581; The World Bank: Washington, DC, USA, 2010. [Google Scholar]
Hosseinzadeh, J.; Masoodzadeh, F.; Roshandel, E. Fault detection and classification in smart grids using augmented K-NN algorithm. SN Appl. Sci. 2019, 1, 1627. [Google Scholar] [CrossRef]
Noor, R. Decentralized Renewable Energy for Improving Energy Access in the LDCs. Available online: https://climate.mit.edu/posts/decentralized-renewable-energy-improving-energy-access-ldcs (accessed on 13 May 2025).
Rezapour, H.; Jamali, S.; Bahmanyar, A. Review on Artificial Intelligence-Based Fault Location Methods in Power Distribution Networks. Energies 2023, 16, 4636. [Google Scholar] [CrossRef]
Alhanaf, A.S.; Farsadi, M.; Balik, H.H. Fault Detection and Classification in Ring Power System With DG Penetration Using Hybrid CNN-LSTM. IEEE Access 2024, 12, 59953–59975. [Google Scholar] [CrossRef]
Jaikumar, R.; Rajasekaran, A.S.; Rao, M.V.N.; Nayyar, A. FEMT-FL: A Novel Flexible Energy Management Technique Using Federated Learning for Energy Management in IoT-Based Distributed Green Computing Systems. Comput. Stand. Interfaces 2025, 94, 104017. [Google Scholar] [CrossRef]
Djaghloul, C.; Khan, N.; El Khamlichi Drissi, K.; Tehrani, K. A Fault-Tolerant Topology for Single-Phase H-Bridge Inverters Addressing Open- and Short-Circuit Failures for Industrial Applications. Frankl. Open 2025, 10, 100189. [Google Scholar] [CrossRef]

Figure 1. Designed system model.

Figure 2. System architecture.

Figure 3. Simulation model.

Figure 4. Hardware model.

Figure 5. Working algorithm.

Figure 6. Smooth working virtual display.

Figure 7. Surge detection and rectification.

Figure 8. Non-surge fault detection and response.

Figure 9. Graphical representation of the simulation model.

Figure 10. Smooth working of hardware model.

Figure 11. Load samples for smooth working.

Figure 12. Surge fault detection and rectification.

Figure 13. Surge fault detection and response time.

Figure 14. Non-surge fault detection.

Figure 15. Fault rectification.

Figure 16. Non-surge fault detection and response time.

Table 1. Comparison between centralized and proposed decentralized control systems.

Features	Centralized Control System	Proposed Decentralized Control System	Advantages of the Proposed System
Fault detection and response time	Generally slower due to centralized communication and processing (focus mainly on accuracy, response time) [23].	Rapid fault detection in an average time of 90 ms and response in 186 ms.	Improves service reliability and reduces downtime, aligning with SDG 7.1 for universal reliable energy access.
System reliability	Exposed to single points of failure, disruption in the primary node results in the failure of the whole network [24].	Self-contained regional control, failure of one system does not disturb the rest of the network.	Enhances system resilience and service continuity, aligning with SDG 9.1 (resilient infrastructure).
Real-time monitoring and data processing	Centralized systems rely on aggregate data monitoring with a delay [17].	Collect and process power parameters continuously and send them to the server every 60 s.	Improved data collection quality as nodes have processing power, supporting SDG 12 (sustainable resource management 12.2).
Operational costs	More expensive to operate due to the need for elaborate communication facilities and centralized processing units [18].	Less expensive to operate since control is decentralized and localized, thus eliminating the need for costly facilities.	A reasonable solution for regions that are resource-constrained.
Scalability	Systems have limited scalability. Complexity and cost increase when new components are integrated into the system [25].	Unlike other systems, it can be expanded easily, as no significant changes need to be made to it.	Flexibility for novel solutions suitable for each region depending on its conditions.
Maintenance requirements	Maintenance needs to be central, which is complex and expensive [26].	Each node is self-sufficient in terms of maintenance, which makes it more practical.	Shrinking the cost of maintenance with more focus on simplifying repair tasks.
Energy efficiency	Ineffective performance can be a consequence of central load control [27].	Improves node energy efficiency by locally managing loads and reducing losses at each node.	Enhance energy efficiency. Waste is reduced while productivity is boosted.
Implementation in developing countries	Investment in infrastructure and the induction of skilled labor pose barriers [23].	Constructive work is easier to execute in places where there is not much infrastructure or many technical resources available.	These areas perform better economically and socially with fewer resources available.

Table 2. System stability vs. load current.

Condition	Relay State	Stability Outcome
$I_{load} > I_{threshold}$	OFF	Stability through fault isolation
$I_{load} < I_{threshold}$	ON	Stable operation

Table 3. Comparison between simulated and scaled-down model results.

Metric	Simulated Model (Proteus)	Scaled-Down Hardware Model
System scale	33 kVA single-phase transformer	1 kW single-phase supply
Fault detection time (mean)	50 ms (single cycle)	90 ms (surge), 92 ms (non-surge)
Fault response time (mean)	100 ms (two cycles)	186 ms (both surge and non-surge)
Surge fault auto-rectification	Successful within 90 s (3 tries)	Successful within 90 s (3 tries)
Non-surge fault reporting	After 90 s	After 90 s
Data update interval (normal)	60 s	60.4 s (average)
Accuracy (normal operation)	100%	99.8% (1000 samples)
Communication validation	Virtual terminal	Real GSM/server

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Li, Y.; Hassan, Z.U.; Sootahar, H.K.; Hussain, T.; Soothar, K.K.; Bhutto, Z.A. Intelligent Power Management and Autonomous Fault Diagnosis for Enhanced Reliability in Secondary Power Distribution Systems. Sustainability 2025, 17, 6009. https://doi.org/10.3390/su17136009

AMA Style

Li Y, Hassan ZU, Sootahar HK, Hussain T, Soothar KK, Bhutto ZA. Intelligent Power Management and Autonomous Fault Diagnosis for Enhanced Reliability in Secondary Power Distribution Systems. Sustainability. 2025; 17(13):6009. https://doi.org/10.3390/su17136009

Chicago/Turabian Style

Li, Yongxiao, Zaheer Ul Hassan, Haresh Kumar Sootahar, Touseef Hussain, Kamlesh Kumar Soothar, and Zulfiqar Ali Bhutto. 2025. "Intelligent Power Management and Autonomous Fault Diagnosis for Enhanced Reliability in Secondary Power Distribution Systems" Sustainability 17, no. 13: 6009. https://doi.org/10.3390/su17136009

APA Style

Li, Y., Hassan, Z. U., Sootahar, H. K., Hussain, T., Soothar, K. K., & Bhutto, Z. A. (2025). Intelligent Power Management and Autonomous Fault Diagnosis for Enhanced Reliability in Secondary Power Distribution Systems. Sustainability, 17(13), 6009. https://doi.org/10.3390/su17136009

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Intelligent Power Management and Autonomous Fault Diagnosis for Enhanced Reliability in Secondary Power Distribution Systems

Abstract

1. Introduction

2. Literature Review

3. Methods and Materials

3.1. System Architecture

3.2. Simulated Model Design

3.3. Hardware Prototype Development

3.4. Working Algorithm

3.5. State-Space Stability Analysis

4. Results and Discussion

4.1. Simulation Results

4.1.1. Smooth Working

4.1.2. Surge Fault

4.1.3. Non-Surge Fault

4.2. Experimental Results

4.2.1. Smooth Working

4.2.2. Surge Fault

4.2.3. Non-Surge Fault

4.3. Simulation vs. Experimental Results

5. Conclusions and Future Work

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI