1. Introduction
As power systems become increasingly intelligent and complex, voltage transformer monitoring is critical to ensuring their safe operation [
1,
2]. Therefore, reliability analysis of monitoring devices has become a key component in maintaining system safety [
3,
4,
5,
6,
7].
Traditional reliability analysis methods, mostly based on physical models [
8,
9,
10], struggle to effectively handle fault reasoning under multi-factor coupling, dynamic environments, and uncertain conditions.
In recent years, Bayesian networks (BNs) [
11,
12,
13] have been widely applied in fields involving uncertainty due to their strong probabilistic reasoning capabilities and intuitive graphical modeling advantages, which use probabilistic algorithms as the basis for reasoning and handling uncertainty [
14,
15]. Internationally, significant progress has been made in applying Bayesian networks to reliability analysis of power equipment. Huo et al. [
16] proposed an application method of Bayesian networks (BN) in power system reliability assessment; Qiu et al. [
17] developed an evaluation framework integrating Bayesian networks for failure assessment of electric energy metering equipment; and Ramadan H. et al. [
18] optimized the health index calculation model of power transformers by combining neural networks with Bayesian reasoning, providing new ideas for dynamic reliability assessment. Domestic scholars, such as Zhang Zhaochuang [
19], Liao Caibo [
20], and Fan Huifang [
21], applied Bayesian networks to transformer fault analysis; Chen Yunhao et al. [
22] realized dynamic prediction of fault probabilities for current transformers based on Bayesian networks, verifying their applicability in complex equipment; and Tong Weiyan et al. [
23] proposed a hybrid method for reliability analysis of complex systems by combining fuzzy theory with Bayesian networks, further enhancing the model’s ability to handle uncertain information. Additionally, Zhao Miao [
24] revealed the dynamic characteristics of insulation monitoring parameters for voltage transformers through simulation studies, laying a foundation for the application of Bayesian networks in multi-physics field coupling analysis. Based on the logical relationship between component and system reliability, combined with test fault data, Wang Qing et al. [
25] proposed a Bayesian evaluation method for electric energy meter reliability integrating reliability prediction. Zhao Qicheng et al. [
26] developed a Bayesian state assessment method for dry-type air-core reactors.
Therefore, this paper proposes a reliability analysis method based on Bayesian networks to address the limitations of traditional physical models in handling multi-factor coupling and uncertain fault reasoning. By integrating the MATLAB simulation platform and through a dynamic probability reasoning mechanism, the model’s adaptability to complex environmental changes is enhanced. A multi-level causal dependency network is constructed to identify key risk factors (such as specific component defects and weak links in the process), overcoming the limitations of traditional methods in risk traceability. Compared with assessment systems relying on physical equations, this method demonstrates higher efficiency in dealing with uncertainties and decision optimization, providing quantifiable support for engineering practice and reducing empirical costs.
2. Bayesian Networks
A Bayesian network is a graphical model composed of nodes and directed edges, where each node represents a random variable, and edges represent conditional dependencies between variables. The goal of a Bayesian network is to infer the probability of unknown variables using known evidence, thereby evaluating system reliability.
Its core idea is to express dependencies between variables through conditional probability distributions and perform reasoning and updating using Bayes’ theorem. The mathematical basis of Bayesian networks is Bayes’ theorem [
27], using the following Equation (1):
where
is the posterior probability of hypothesis
H given evidence
E,
is the likelihood of observing evidence
E under hypothesis H,
is the prior probability of hypothesis
H, and
is the marginal probability of evidence
E.
As shown in
Figure 1,The probability distribution of each node in a Bayesian network depends on its set of parent nodes, i.e.,
, where
is a node in the network, and
is the set of parent nodes of
. This formula indicates that the joint probability distribution in the network can be expressed as the product of the conditional probabilities of each node.
By modeling the conditional probability distributions of each node, Bayesian networks can establish dependencies between multiple variables. During reasoning, the probability distribution of other unknown nodes can be derived from observed data of known nodes.
3. Reliability Analysis Based on Bayesian Networks
Reliability analysis of online monitoring devices using Bayesian networks can evaluate the health status of equipment operation. The health status of equipment is typically influenced by multiple factors. In a Bayesian network, each factor is represented as a node, and directed edges between nodes denote conditional dependencies among these factors. System reliability analysis involves inferring relationships between these variables to assess equipment reliability. The evaluation process diagram is shown in
Figure 2.
- (1)
Construction of equipment health status model
When constructing a health status model for online monitoring devices, it is first necessary to define various operating parameters of the device as nodes in the network and determine dependencies between these nodes using historical data. The Bayesian network model for equipment health status can be expressed as Equation (2):
where
represents temperature,
represents humidity,
represents voltage, and
represents the equipment health status. Temperature, humidity, and voltage, as crucial environmental variables, directly affect the dielectric properties and insulation performance of equipment, and also have an impact on the health status of the equipment. The conditional probability distributions, such as
, describe the relationship between temperature and pressure;
represents the dependency between humidity and voltage; and finally,
indicates the influence of voltage on the health status of the equipment.
- (2)
Fault modes and fault probabilities
Conditional probabilities under different fault modes are established using historical data and equipment design information. The occurrence probabilities of these fault modes are inferred through Bayesian networks.
Let
be the fault state of the equipment,
be temperature, and
be voltage. The probability of equipment failure can be expressed by Equation (3):
where
is the conditional probability of temperature and voltage under the fault state,
is the prior probability of failure occurrence, and
is the marginal probability of temperature and voltage.
- (3)
Reasoning and decision support
Reasoning through Bayesian networks infers equipment health status based on real-time sensor data. For example, if the temperature and voltage of the equipment reach certain preset thresholds, Bayesian reasoning can calculate the posterior probability of equipment failure. Assuming the temperature and current of the equipment are observed, the posterior probability of equipment failure can be expressed as Equation (4):
In this way, Bayesian networks can provide health status assessments of online monitoring devices based on current observed data.
4. Experimental Verification
Bayesian networks are based on probabilistic graphical models, consisting of nodes and directed edges, representing conditional dependencies between variables. Therefore, it is first necessary to identify key factors affecting the reliability of CVT online monitoring devices. Each factor is then treated as a node in the network, and causal relationships between them are established. This paper will use the CVT online monitoring device from Ningxia Company of State Grid at Yinchuan China to conduct the following experiments.
- (1)
Environmental factors
First, environmental factors such as temperature and humidity are used as nodes to establish relationships. The parent nodes are environmental factors, and the child node is reliability. Each parent node has different states, and the reliability node depends on the conditional probabilities of these two factors. Next, it is necessary to determine the conditional probability table (CPT) for each node. The experimental steps are as follows:
First, the network structure and probability parameters are defined, and a Bayesian network structure with three nodes (temperature, humidity, and reliability) is constructed. Then, the joint probability is calculated to derive the marginal probability of reliability, and results are visualized via histograms and tables.
The reliability analysis results of capacitive voltage transformers are shown in
Figure 3 and
Figure 4.
Analysis of
Table 1 results: Overall fault probability: 32.70%; overall normal probability: 67.30%. The most dangerous combination is high temperature + high humidity, with a 95% fault probability. In a high-temperature and high-humidity environment, the failure probability of the CVT online monitoring device reaches 95%, mainly due to the combined effect of high temperature and high humidity. High temperature accelerates the aging of electronic components and reduces insulation performance, while high humidity causes water vapor to penetrate into the sensors and circuit boards, leading to corrosion, short circuits, or signal drift.
- (2)
Component quality
Three core components of the CVT online monitoring device, namely the sensor unit, communication and transmission module, and data acquisition and processing unit, are selected as nodes to establish relationships. These three factors serve as parent nodes, and reliability is the child node. The probabilities of each state are calculated through joint probabilities, and then the conditional probability table for each node is determined. The experimental steps are as follows:
First, define the prior probabilities of each node (each node is independent, and the joint probability is the product of their respective probabilities). Define the conditional probability table (CPT) as the conditional probability of the reliability node R. Enumerate all possible combinations of parent node states, calculate the probability of each combination, and visualize the results, as shown in
Figure 5.
Analysis of
Table 2 results: The overall normal reliability probability is 95%. The most dangerous combination is all faults, with a 90% fault probability. When the data acquisition and processing Fault (D Fault) fails, the system reliability significantly decreases. Its impact is greater than that of the sensor Fault (C Fault) and the communication module (T Fault). The failure of this Fault will directly cause the entire monitoring system to lose its “decision-making center”. Moreover, when all three components fail simultaneously, the system reliability drops sharply to 10%.
- (3)
Installation process
First, two important processes in the installation of CVT online monitoring devices, wiring process and mechanical fixing, are used as parent nodes, and reliability is the child node. The wiring process has two states: good and poor; mechanical fixing has two states: normal and faulty. The reliability state of the entire system is determined by these factors. The experimental steps are as follows:
Construct a Bayesian network containing three nodes: wiring process, mechanical fixing, and reliability, where the reliability node is affected by the first two. Then calculate the conditional probabilities when each factor fails. Next, enumerate all possible combinations, calculate the joint probabilities, and sum them to obtain the marginal probability of the reliability node. Finally, visualize the results as shown in
Figure 6 and
Figure 7.
Analysis of
Table 3 results: The overall normal probability is 92.72%. The wiring process has a greater impact on reliability than mechanical fixing.
- (4)
Maintenance management
Regular maintenance and no maintenance are set as parent nodes, and reliability is the child node. The reliability state of the entire system is determined by maintenance management. It is necessary to determine the influence weight of each parent node on system reliability or the specific values of joint probabilities. The experimental steps are as follows:
First, maintenance management, as a parent node, directly affects reliability. Then, determine the states of each node. Maintenance management has two states: “regular maintenance” and “no maintenance”; reliability has two states: “good” and “poor”. Next, set the conditional probability table. Reliability under different maintenance states is shown in
Figure 8.
Table 4 shows that when regular maintenance is implemented, the probability of the device being in a “good” state is as high as 95%, and the probability of being in a “poor” state is only 5%. Without maintenance, the probability of the device being in a “good” state drops sharply to 30%, while the probability of being in a “poor” state significantly increases to 70%. These results strongly demonstrate the extreme importance of regular maintenance in ensuring the reliability of CVT online monitoring devices. Compared with the on-site maintenance records of the State Grid in Ningxia region, the actual proportion of “good” is 94.9%, which is only 0.1% lower than the predicted value of 95%. In the un-maintained areas, the proportion of “poor” reaches 71.1%, which is 1.1% higher than the predicted value of 70%. This confirms that the Bayesian model has high accuracy and practical effectiveness.
- (5)
Transformer’s own error
The error magnitude of the transformer itself is set as the parent node, and reliability is the child node. The reliability state of the entire system is determined by maintenance management. It is necessary to determine the influence weight of each parent node on system reliability or the specific values of joint probabilities. The experimental steps are as follows:
First, the key variables here are the error characteristics and reliability state of the transformer. The error characteristic is a parent node, and reliability is a child node, as error characteristics affect reliability. Next, determine the states of the variables. Error characteristics may have different states: “low error”, “medium error”, and “high error”. Reliability is divided into “good” and “bad”. Then, define the conditional probability table (CPT) as shown in
Figure 9.
Table 5 shows that when in the “low error” state, the probability that the monitoring device outputs a “reliable” result is the highest. This indicates that the monitoring device has high credibility in its output results when measuring healthy or well-functioning CVTs. As the error increases, the probability that the monitoring device outputs a “reliable” result decreases to 75%. This means that when the CVT itself begins to have minor issues, the accuracy of the monitoring device’s diagnostic results may decrease due to the influence of abnormal signals from the device itself, with a certain risk of misjudgment or missed judgment. When in the “high error” state, the probability that the monitoring device outputs a “reliable” result drops sharply to 40%. This indicates that severe faults or performance degradation of the CVT itself will greatly interfere with the judgment ability of its online monitoring device.
5. Conclusions
With the intelligent development of power systems, traditional reliability analysis methods struggle to effectively solve the problem of fault reasoning for CVT online monitoring devices under multi-factor coupling, dynamic environments, and uncertain conditions.
This paper proposes a reliability analysis method based on Bayesian networks to dynamically evaluate the health status of online monitoring devices, identify key risk factors, and optimize maintenance strategies. A Bayesian network model for CVT online monitoring devices is constructed, where key variables such as environmental factors, component quality, installation process, and maintenance management are defined as network nodes, and conditional probability dependencies are established. Multi-scenario experiments are conducted using the MATLAB platform to simulate system behavior under different environmental combinations (temperature and humidity), component states (sensors/communication/data processing units), process levels (wiring/mechanical fixing), and maintenance strategies. Dynamic updates and predictions of fault probabilities are realized through Bayesian reasoning. The results show that the combination of high-temperature and high-humidity environments has the most significant impact on reliability; among component factors, failures of the data acquisition and processing unit have the greatest impact on system reliability. The proposed Bayesian network model can not only identify the key reliability threats but also quantify their impact; regular maintenance can increase the system reliability by 65 percentage points (from 30 to 95%), while high-temperature and high-humidity environments will increase the failure probability to 95%. By supporting data-driven decision-making in the complex power grid environment, this method can improve operational efficiency and reduce unplanned downtime. Wiring process issues pose greater risks to reliability than mechanical fixing problems; and regular maintenance can significantly improve system reliability. This study verifies the effectiveness of Bayesian networks in reliability analysis, proposes a Bayesian network modeling method for multi-factor coupling, solves the limitations of traditional models in uncertain reasoning, and provides a new paradigm for complex system reliability theory.