Next Article in Journal
A Contemporary Survey on Deepfake Detection: Datasets, Algorithms, and Challenges
Next Article in Special Issue
Method for Fault Diagnosis of Track Circuits Based on a Time–Frequency Intelligent Network
Previous Article in Journal
A Linear Multi-Band Voltage-Controlled Oscillator with Process Compensation for SerDes Applications
Previous Article in Special Issue
Inverter Fault Diagnosis for a Three-Phase Permanent-Magnet Synchronous Motor Drive System Based on SDAE-GAN-LSTM
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Fault Tracing Method for Relay Protection System–Circuit Breaker Based on Improved Random Forest

Key Laboratory of Power System Intelligent Dispatch and Control of Ministry of Education, Shandong University, Jinan 250061, China
*
Author to whom correspondence should be addressed.
Electronics 2024, 13(3), 582; https://doi.org/10.3390/electronics13030582
Submission received: 5 January 2024 / Revised: 28 January 2024 / Accepted: 29 January 2024 / Published: 31 January 2024

Abstract

:
The incorrect operation of protective relays and circuit breakers will significantly compromise the safety and stability of power systems. To promptly detect the faults of the relay protection system and the circuit breakers in time and to ensure the operational reliability of these protective devices, this paper proposes a fault tracing method for a relay protection system–circuit breaker based on improved Random Forest. Firstly, an analysis is conducted to identify the causes of incorrect operation of the protective relay and the circuit breaker. The fault types and corresponding alarm messages for the relay protection system and the circuit breaker are categorized, and the alarm feature set is constructed. Then, the Random Forest is improved and trained to develop the fault tracking model. Finally, the operation evaluation process is developed to determine the incorrect operations of the protective relay and the circuit breaker, and the fault tracking model and fault tracking process are then employed to locate the faults of the relay protection system and the circuit breaker. The experimental results demonstrate the method’s capability to accurately track faults in the relay protection system and the circuit breaker, thereby assisting operation and maintenance personnel in troubleshooting and highlighting its promising practical potential.

1. Introduction

The continuous development of power systems and the incremental improvement of infrastructure have significantly increased the requirements for reliability and maintenance in order to guarantee the uninterrupted and dependable operation of power systems [1]. Protective relays (PRs) and circuit breakers (CBs) serve as control and protection equipment in power systems. They detect faults or exceptions in the power grid and promptly isolate faulty equipment to minimize the impact of the fault, thereby reducing the power grid losses [2]. Thus, the reliable operation of the relay protection system and the CB is crucial for maintaining a safe and stable power system. When the PR or CB fails to operate or operates incorrectly, it is an important research topic to accurately diagnose the faults in the relay protection system and the CB.
The increasing prevalence and integration of the Internet of Things (IoT), blockchain, and other computer and communication technologies have accelerated the growth of smart grids and smart substations within the power system [3,4]. The secondary system has realized intelligence, informatization, networking, and communication standardization, which provides sufficient data support for fault diagnosis and location in the intelligent substation [5]. However, informatization and networking also lead to the diversity and complexity of alarm messages. Massive alarm messages reduce the efficiency of diagnosing the incorrect operation of PRs or CBs and locating faulty secondary devices [6]. Data mining techniques such as artificial neural networks and machine learning are employed in power system fault diagnosis and localization to effectively leverage alarm information and enhance fault detection efficiency and accuracy [7]. A fault tracking architecture based on big data technology is proposed for smart substations [6]. The framework includes a big data platform that enables the data mining of various alarm messages, including device alarms, generic object-oriented substation event (GOOSE) alarms, sampled value (SV) alarms, and device self-checking alarms in the smart substation. This platform eliminates the message barriers between the safety isolation zone and multiple supervisory systems. In addition, fault diagnosis methods and handling methods have been proposed for merger units, intelligent terminals, and fiber optic links in the process layer of smart substations [8]. The FP-growth algorithm is applied using the Hadoop framework and MapReduce model to discover frequent items and strong correlations among anomaly signals, aiming to identify potential defects [9]. Previous studies [10,11,12] have conducted a correlation analysis between secondary device faults and alarm messages, enabling the matching of faulty devices and fault causes through the discovery of correlation rules between fault messages and faulty devices. However, the current matching approach faces challenges in handling uncertainty and incomplete messages, and it also lacks generalization. The long short-term memory (LSTM) network has been employed to automatically diagnosis faults during the relay protection test [13]. However, deep learning methods are less explanatory and have high requirements for training samples, which makes it challenging to obtain perfect samples.
Communication failures may occur in the secondary circuit due to transmission delays, network attacks, and other interferences, subsequently affecting the normal operation of the secondary equipment or potentially causing maloperation [3,14]. Moreover, communication among secondary devices occurs through the network, leading to a loss of one-to-one correspondence in signal transmission between equipment. Consequently, issues such as hidden logic circuits and challenging communication troubleshooting arise. In [15], a communication network model is proposed to establish a correlation between communication links and physical aspects among secondary devices. This model then identifies communication faults by analyzing message transmission paths. Moreover, a communication fault diagnosis model based on a deep confidence network is developed in [16], demonstrating high accuracy and fault tolerance when dealing with untrustworthy messages. In [17], a fault localization method for communication networks is proposed. This method utilizes deep neural networks and effectively performs localization, even when multiple faults and untrustworthy alarms are present. Matrix modeling is used in [18] to construct the connectivity state matrix and logical node model of the secondary system. The matrix algorithm is also integrated with a back-propagation neural network to propose a fault localization method for the secondary system. The secondary circuit topology is converted into fault map data, and the fault location portability of the secondary circuit is improved by training a graph neural network as a fault location model [19].
Contrary to the fault diagnosis and fault localization mentioned earlier, fault tracing refers to the process of finding the cause of incorrect operations in PRs and CBs using alarm messages in the substation [6,20]. A fault tracing method is proposed utilizing the information difference graph model [21]. By marking the components, this technique enables the tracing of interaction relationships within the graph model. Thus, it effectively overcomes the limitations of conventional fault tracing methods that depend heavily on expert experience and logical topological relations. The Bayesian suspected degree, as proposed in [22], addresses the issue of missed judgments caused by lost alarms by calculating the probability of faults in PRs and CBs. The method proposed in [23] offers comprehensive and intuitive state information, the timely identification of hidden system hazards, and accurate fault causality tracing. It enhances the operational state monitoring and risk control capabilities of communication networks. The Recurrent Neural Network (RNN) has been employed to localize secondary device faults by extracting the temporal features of the alarm messages [24]. This method does not consider the effect of timing disorder in alarm messages on their accuracy. The Decision Tree and Gradient Boosting Decision Tree have been proposed in [25,26], respectively, as fault tracing models to trace the causes of device faults of the relay protection system.
The CB and relay protection system (composed of a merging unit, protective relay, and intelligent terminal) belong to the primary side and secondary side of the power system, respectively. However, their communication connection is established through optical fiber. During the fault isolation, the relay protection system detects faults and issues trip commands, while the CB acts as an actuator and trips upon receiving the tripping command. Therefore, the causes of PR and CB rejections or maloperations include device faults in the PR and CB, device faults in other secondary devices in the relay protection system, and communication faults between these devices. However, most existing methods study the fault diagnosis of CBs or relay protection systems separately, without considering the interactions between the relay protection system and the corresponding CB. Moreover, they fail to comprehensively diagnose the device and communication faults of the relay protection system and CB and explore the reasons for the incorrect operation of the PR and CB.
To ensure the operational reliability of the PR and CB, and considering the correlation between the relay protection system and the CB in terms of topology and action logic, this paper proposes a fault tracking method for the relay protection system and CB (denoted as the RPS-CB). This paper first analyzes the causes of incorrect operations in PRs and CBs. It then establishes a correspondence between fault types and alarm messages in the relay protection system and the CB to create an alarm feature set. Subsequently, an improved Random Forest model is developed by integrating the Re-Relief F algorithm and a weighted voting strategy. The improved Random Forest model is trained for fault tracking. Finally, the operation evaluation process is utilized to identify PRs and CBs that were operated incorrectly. A fault tracing process based on the improved Random Forests is proposed to determine fault types in relay protection systems and CBs.
This paper is structured as follows: Section 2 describes the fault types and alarm messages of relay protection systems and CBs; Section 3 introduces the fault tracing model and fault tracing process based on improved Random Forest; Section 4 presents the algorithm validation and case study; and Section 5 concludes.

2. Fault Types and Alarm Feature Sets for the Relay Protection System–Circuit Breaker

To facilitate fault tracking for the RPS-CB, a comprehensive analysis of the root causes behind the erroneous functioning of both the PR and CB components is imperative. Subsequently, based on the causes above, a systematic categorization of the corresponding relationships between fault types and alarm messages about the RPS-CB can be established. Lastly, a specialized representation method for effectively capturing and managing alarm messages is devised to facilitate fault tracking procedures.

2.1. Reasons for Incorrect Operation of Protective Relays and Circuit Breakers

The PR receives the SV message and compares it with the protection setting value to determine the fault and issue a trip signal. The transmission path for the SV message is as follows: electronic transformer → merging unit → SV message → protective relay. Therefore, besides the faults of the PR, the causes of PR rejection also include exceptions in SV messages, errors in protection setting value configuration, network errors, and faults in other related secondary devices. In contrast to the PR rejection, PR maloperation refers to the abnormal PR function, which is mainly caused by abnormal SV sampling messages, incorrect protection setting values, and algorithm logic errors. The main reasons for PR rejection and malfunction are illustrated in Figure 1.
The CB trips upon receiving the GOOSE message sent by the PR; the transmission path of the GOOSE message is as follows: protective relay → GOOSE message → intelligent terminal → circuit breaker. The operation state of the CB can be affected not only by the rejection and maloperation of the PR but also by CB faults, abnormal GOOSE messages, and communication faults. These factors contribute to the CB rejection and maloperation, as illustrated in Figure 2.
The analysis above reveals that the incorrect operation of the PR and CB can be attributed to faults within the relay protection system and the CB. Specifically, CB rejection and maloperation are directly linked to PR rejection and maloperation. This paper focuses on developing a fault tracking model and process for the RPS-CB (relay protection system and corresponding CB), aiming to investigate the relationship between system faults and the incorrect operation of the PR and CB.

2.2. Fault Types and Corresponding Alarm Messages of the RPS-CB

The fault tracing object mainly contains merging units, PRs, intelligent terminals, CBs, and the communication links between these devices. When the PR and CB fail to operate normally, corresponding alarm messages are issued by these devices to facilitate maintenance and repair. To achieve the fault tracking of the PR’s and CB’s incorrect operation, the faults of the protection relay and circuit breaker system (PR and CB) have been classified as device faults and communication faults, after consulting the literature [27,28] and relevant industry technical specifications. The fault types and their corresponding alarm messages are presented in Table 1 and Table 2.

2.3. Alarm Feature Set

To make the expression of the alarm messages and the training of the fault tracking model more convenient, the alarm feature set is constructed according to the fault types and alarm messages in Table 1 and Table 2, as shown in Equation (1).
X i = X SV i , X GOOSE i , X SC i , X COM i
where Xi denotes the ith alarm feature set, which contains the SV alarm subset XSVi, the GOOSE alarm subset XGOOSEi, the device exception alarm subset XSCi, and the communication alarm subset XCOMi.
The SV alarm subset XSVi reflects the sampling state of the merging unit and PR, as shown in Equation (2).
X SV i = { X SV _ MU i , X SV _ PR i } X SV _ MU i = { S MA , S SA , S SE , S CE , } X SV _ PR i = { S MA , S SA , S EA , S CE , }
where XSV_MUi and XSV_PRi are the sampling alarm subsets of the merging unit and protective relay, including SV total alarm SMA, abnormal sampling alarm SSA, synchronization exception alarm SEA, sampling configuration error SCE, etc.
The GOOSE alarm subset XGOOSEi reflects the switching quantity status of the merging unit, PR, and intelligent terminal, as shown in Equation (3).
X GOOSE i = { X GOOSE _ MU i , X GOOSE _ PR i , X GOOSE _ IT i } X GOOSE _ MU i = { G MA , G DE , G BC , } X GOOSE _ PR i = { G MA , G DE , G BC , } X GOOSE _ IT i = { G MA , G DE , G BC , }
where XGOOSE_MUi, XGOOSE_PRi, and XGOOSE_ITi are the subsets of the switching alarm features of the merging unit, PR, and intelligent terminal, respectively, which include GOOSE total alarm GMA, GOOSE data exception GDE, GOOSE interruption GBC, etc.
The device exception alarm subset XSCi reflects the operation status of the merging unit, PR, intelligent terminal, and CB, as shown in Equation (4).
X SC i = { X SC _ MU i , X SC _ PR i , X SC _ IT i , X SC _ CB i , X SC _ EX i } X SC _ MU i = { S CAN , S PF , S DL , } X SC _ PR i = { S CAN , S PF , S DL , } X SC _ IT i = { S CAN , S PF , S DL , } X SC _ CB i = { S CL , S SC , S SL , } X SC _ EX i = { S CAN , S PF , S BC , }
where XSC_MUi, XSC_PRi, XSC_ITi, XSC_CBi, and XSC_EXi are the subsets of the device self-test alarm features of the merging unit, PR, intelligent terminal, CB, and switch, respectively, which include device self-test SCAN, power fault alarm SPF, device lockout SDL, etc. and CB control loop disconnection SCL, closing coil voltage exception SSC, closing coil loop blocking SSL, and switch communication interruption SBC, etc.
The communication exception alarm subset XCOMi reflects the communication status of the merging unit, protective relay, intelligent terminal, and circuit breaker, as shown in Equation (5).
X COM i = { X COM _ MU i , X COM _ PR i , X COM _ IT i , X COM _ EX i } X COM _ MU i = { C SV , C SOI , C SOI , C SIE , } X COM _ PR i = { C SV , C GOOSE , C SOI , C SIE , } X COM _ IT i = { C GOOSE , C SOI , C SIE , } X COM _ EX i = { C BC , C REF , C IED , }
where XCOM_MUi, XCOMPRi, XCOM_ITi, and XCOM_EXi are the subsets of the communication alarm features of the merging unit, PR, intelligent terminal, and switch, respectively, which include SV interruption CSV, GOOSE interruption CGOOSE, input/output communication interruption CSOI, input circuit self-check error CSIE, etc. and switch self-check fault CBC, data non-renovation CREF, port IED communication fault CIED, etc.
When incorrect operation occurs in the PR or CB, the relevant devices in the corresponding spacing send out alarm messages. When the alarm message is received, the corresponding position element of the alarm feature set is set to 1; otherwise, the corresponding position element is 0.

3. Fault Tracking Based on Improved Random Forest

Fault tracing is determining the cause of incorrect operations in the PR and CB through alarm messages. Its essence is a multi-classification problem with alarm messages as classification features and fault types as classification results. However, fault types and alarm features in relay protection systems and CBs are diverse and complex. The alarm features of some fault types overlap, which may lead to inaccurate classification. In addition, certain faults may occur less frequently, causing data imbalance problems.
Random Forests are widely used as a multi-classification algorithm in the field of classification and prediction. The Random Forest is an integrated learning classification model consisting of multiple decision trees. High-dimensional datasets with complex features, such as alarm messages of the RPS-CB, can be effectively handled by random sampling. Training with partial samples and features can reduce the impact of noise on the overall performance of Random Forest and improve robustness. During the training process, the Random Forest will come to evaluate the contribution of each feature to the classification, in which the feature with the largest contribution is used as the node attribute of the decision tree. Therefore, the classification process of the Random Forest has good interpretability, which is conducive to adjusting and improving the algorithm for fault tracing problems. In addition, the results of Random Forest are voted or averaged by multiple decision trees, which can reduce the variance of the model and improve the generalization ability. In summary, Random Forest is suitable for dealing with multi-classification problems in the RPS-CB.
This paper improves the Random Forest, establishes the mapping relationship between alarm messages and fault types through model training (as shown in Equation (6)), constructs the fault tracking model based on the improved Random Forest, and proposes the fault tracking process based on the improved Random Forest to achieve fault tracking of the causes of incorrect PR and CB operations.
Y = T ( X )
where T( ) is the fault tracking model with improved Random Forest, X is the alarm feature set, and Y is the set of classification labels, which consists of the fault numbers in Table 1 and Table 2. To facilitate the input and output of the fault tracking model, Y is represented in the form of a vector. For example, the fault number 000001 can be represented as the vector {0, 0, 0, 0, 0, 0, 1}.

3.1. Improved Random Forest and Model Training

The Random Forest combines the Bootstrap method to form an ensemble learning model of multiple decision trees. Through the voting strategy, the robustness of the Random Forest is improved without feature scaling. The Random Forest is suitable for the fault tracing of the RPS-CB, which involves complex fault features with faults. This paper proposes an improved Random Forest that combines a feature selection algorithm and a weighted voting strategy. The feature selection algorithm eliminates the useless features and retains the features that impact model training and classification. The weighted voting strategy determines the weights of each decision tree according to the classification performance, which strengthens the impact of decision trees with good classification performance and further improves the accuracy of the fault tracking.

3.1.1. Feature Selection Algorithm

In constructing decision trees for Random Forests, each node randomly selects some features of the training samples to form a feature candidate set and then selects the optimal features from the feature candidate set as the node features. In the Random Forest, the samples are classified from the root node to the leaf nodes of each decision tree. After passing through each node, they will be classified according to the node feature and then go to the next child node to continue classification until they reach a leaf node. The samples are classified as the category of the leaf node. However, due to the wide variety of alarm messages of the RPS-CB and the high dimensionality of the alarm features, there is a likelihood that the feature candidate set will contain features that are not relevant to the samples assigned to the node. Supposing the node features are selected unreasonably, in such a case, the samples may be classified into the wrong leaf nodes along the wrong paths, leading to classification errors and reducing the classification performance of the Random Forest.
This paper addresses the problems by employing the Re-Relief F algorithm to evaluate alarm features within the training set. Weights are assigned based on their contribution to the classification, enhancing the selection probability of features with significant contributions and high weights. Furthermore, it reduces irrelevant and redundant features in the candidate set, improving the generalization ability and reducing overfitting. The Re-Relief F algorithm [29] is a feature selection algorithm applied to multi-classification problems. This algorithm first calculates the distances from the sample to the nearest neighbor samples of different classes and the distances to the nearest neighbor samples of the same class. Then, it takes the ratio of the two distances as the weights of the features. By emphasizing features that exhibit strong proximity within classes and significant gaps between classes, the Re-Relief F algorithm improves the effectiveness and quality of the resulting feature subspace. The corresponding weight of the sample feature A of the Re-Relief F algorithm, w[A], is calculated as follows:
w [ A ] = i = 1 n y class ( R i ) P ( y ) 1 P ( class ( R i ) ) j = 1 k diff ( A , R i , M j ( y ) ) i = 1 n j = 1 k diff ( A , R i , H j )
where n denotes the number of sample sampling, Ri is a random sample in the training set, k is the number of samples that are close neighbors to R, Mj(y) denotes the jth nearest-neighbor sample of different category c, Hj denotes the jth nearest-neighbor sample in the same category as R, P(y) denotes the ratio of the number of target samples of category y to the total number of samples, class(Ri) denotes the category that Ri belongs to, and the function diff(A, Ri, Rj) is to compute the distance of the sample instances Ri and Rj concerning feature A:
diff ( A , R i , R j ) = 0 , R i ( A ) = R j ( A ) 1 , R i ( A ) R j ( A )
where Ri(A) and Rj(A) are the feature values of the samples Ri and Rj corresponding to feature A.
The calculation process for the feature weights of the alarm features is as follows, based on the definition of the Re-Relief F algorithm:
  • Input training set Dtrain. The sample sampling number m is 10, and the number of feature dimensions is N. Initialize n = 1.
  • Initialize i = 1.
  • Select a random sample Ri from Dtrain.
  • Randomly select the k nearest neighbor sample Hj of Ri with different class samples and nearest neighbor samples Mj with different class samples.
  • If i < m, i = i + 1, return to step 3; otherwise, proceed to the next step.
  • According to Equation (7), the feature weight of the nth feature is calculated.
  • If n < N, n = n + 1, return to step 2; otherwise, output the feature weights of all alarm features.
The higher the weight of an alarm feature, the better the alarm feature is for distinguishing between similar and different types of faults in the immediate neighborhood. According to the feature weights, this paper improves the feature selection method in the training process of Random Forest. First, the Bootstrap method is used to obtain the training subsets required for decision tree training from the training set. Then, the features with a weight of 0 are excluded from the training subsets. The remaining features are sorted in descending order of weight and are evenly divided into high, medium, and low feature subsets (Wh, Wm, and Wl). Finally, the same number of alarm features from the feature subsets Wh, Wm, and Wl are randomly selected as the feature candidate set for the node of the decision trees.
The improved feature selection method increases the probability that features favorable for classification are selected, thus improving the classification performance of the decision tree. Simultaneously, this method maintains the randomness of feature selection and ensures robustness. In addition, the Re-Relief F algorithm compares the distances of the nearest neighbor samples of the same class and different classes, considering the correlation between alarm features, which is beneficial for distinguishing the types of faults with similar alarm features (e.g., communication-related device faults and communication faults), and thus reduces the probability of misclassification.

3.1.2. Weighted Voting Strategy

The average voting strategy ignores the differences in the classification performance of different decision trees in the Random Forest. It fails to give full play to the advantages of an ensemble learning model. Since fault tracking is a multi-classification problem, existing Random Forest-based methods primarily address binary classification problems and regression problems using weighted voting strategies. Therefore, this paper proposes a weighted voting strategy based on the Kappa coefficient to improve the overall performance of the Random Forest.
The Kappa coefficient is a metric for evaluating the performance of multi-classification models, which considers the classification accuracy and the consistency between classifications. It evaluates the multi-classification model performance more comprehensively by correcting for the expected accuracy of the classifications. The Kappa coefficient is computed as follows:
K a p p a = N i = 1 Y N i i i = 1 Y N i + N + i N 2 i = 1 Y N i + N + i
where N denotes the total number of test samples, Y is the total number of categories, Nii is the number of correctly classified samples, and N+i and Ni+ are the number of true samples in category i and the number of samples classified as category i, respectively.
The process of calculating the voting weight of the decision tree in the improved random forest is as follows:
  • In addition to the training and validation sets needed for model training and validation, prepare a test set.
  • Train the Random Forest using the training set to obtain a fault tracking model consisting of T decision trees. Initialize t = 1.
  • Select the tth decision tree in the fault diagnosis model, and input the test set into the decision tree for classification. Calculate the weight Kappat of the tth decision tree based on Equation (9) and the classification results of the test set.
  • If t < T, then t = t + 1; return to step 3; otherwise, proceed to the next step.
The classification result of the improved Random Forest depends on the weighted voting result of each decision tree classification result, as shown in Equation (10):
T ( X i ) = arg max i = 1 K K a p p a i I ( t i ( X i ) ) = Y )
where ti(Xi) is the classification result of the ith decision tree; Kappai is the Kappa coefficient corresponding to ti; I( ) is a schematic function, and the function value is 1 when the classification result of the decision tree is related to a certain label in the classification label set Y, and 0 otherwise; arg( ) is the function of taking the autocorrelation, which serves the purpose of taking the classification with the highest number of votes.

3.1.3. Fault Tracing Model Training

Based on Random Forest training, combined with the Re-Relief F algorithm and weighted voting strategy, the training process of the fault tracking model based on the improved Random Forest is proposed as follows:
  • Sample the training set Dtrain T times using the Bootstrap method to obtain T sample subsets. Compute the feature weights for the training set Dtrain. Initialize t = 1.
  • Select the tth training subset. Refer to Section 3.1.1 to construct the high, medium, and low feature subsets (Wht, Wmt, and Wlt) corresponding to the training subset.
  • Input the tth training subset into the root node of the tth decision tree, and start building the decision tree from the root node.
  • If the sample set in the current node is non-empty and all samples belong to multiple classes, this indicates that the current node is an internal node; proceed to step 4. Otherwise, the current node is a leaf node that is not further split. Other nodes are selected for further splitting. If all remaining nodes are leaf nodes, this indicates that all the samples have been trained and the construction of the decision tree is complete; proceed to step 8.
  • The current node contains samples with a total of M features. From Wht, Wmt, and Wlt, m ( m = M / 3 , rounded down) features are randomly selected to form the candidate feature set Dt of the current node.
  • Calculate the Gini coefficient of each feature in the candidate feature set D according to Equation (11). The feature with the smallest Gini coefficient is removed from the candidate feature set D as the optimal feature of the current node.
    G i n i ( p ) = 1 k = 1 m p k 2
    where Gini(p) denotes the Gini coefficient of node p, k is the number of categories, and pk denotes the classification probability.
  • Split the current node into two sub-nodes, and divide the samples contained in the current node into two parts according to the optimal features and input into the sub-nodes.
  • Repeat steps 4–8 for the sub-nodes.
  • If t = T, the Random Forest decision tree construction is completed; otherwise, t = t + 1, so return to step 3 and continue to construct the decision tree.
  • According to Section 3.1.2, the voting weights of the decision trees in the Random Forest are calculated, and the weighted voting result of this fault tracking model is shown in Equation (10).
  • The training of the fault tracking model based on the improved Random Forest is completed.

3.2. Fault Tracing Process Based on Improved Random Forest

3.2.1. Evaluation Process for PR and CB Operations

The classification object of the fault tracking model is the relay protection system and its corresponding CB. Therefore, before invoking the fault tracking model, it is necessary to identify incorrectly operated PRs and CBs. To this end, this paper develops the evaluation process for PR and CB operations. Firstly, the PRs and CBs are assigned confidence degrees according to alarm messages and fuzzy theory. Then, the confidence degrees of the PRs and CBs are fused and compared to determine the appropriate protective measure to isolate the faulty component. Finally, the incorrect operations of the main, near backup, and remote backup PRs and CBs are determined based on the configuration rules for relay protection. The specific steps of the evaluation process are as follows:
  • Suppose the fault diagnosis identifies R faulty components. SAM represents the set of PRs and CBs that receive the alarm messages. Initialize r = 1.
  • Select the rth faulty component. Let C represent the number of CBs connected to the rth faulty component. Initialize c = 1.
  • Search for the main and near backup PR corresponding to the cth CB. Assume the rth faulty component has L neighboring lines, and search for the CB and the remote backup PR on the neighboring lines. Query the alarm message set SAM and assign confidence degrees to the operation states of the PR and CB. The assignment rules for the confidence degrees of PRs and CBs [30] are shown in Table 3.
  • Calculate the fusion confidence degrees for the cth CB with the corresponding main and the near backup PR. Calculate the fusion confidence degrees for the remote CB and remote backup PR of L adjacent lines. The calculation process of the fusion confidence degrees is shown in Equation (12).
    P MPCB = 1 2 ( P MPR + P MCB ) P NPCB = 1 2 ( P NPR + P NCB ) P RPCB = 1 2 l 1 L ( P RPR l + P RCB l )
    where PMPCB, PNPCB, and PRPCB represent the fusion confidence degrees of the main, near backup, and remote backup PRs with the corresponding CBs, respectively. PMPR, PNPR, and PRPRl represent the confidence degrees of the main, near backup, and remote backup PRs for the far end of the lth adjacent line, respectively. PMCB, PNCB, and PRCBl represent the confidence degrees of CBs corresponding to the main, near backup, and remote backup PR in the far end of the lth adjacent line, respectively.
  • Compare the fusion confidence degrees of the main, near backup, and remote backup PRs. The protection that isolates the rth faulty component is the one with the highest fusion confidence degree. Based on the alarm messages and the action logic between different types of protection, evaluate the PR and CB operations. The operation evaluation rules for PRs and CBs are shown in Table 4.
  • According to the evaluation rules, the PRs and CBs configured for the rth faulty component are classified and placed into the set SNORM of the normal operation PRs and CBs, the set SFP (SFN) of the PRs and CBs with no alarms (incorrect alarms), and the set SRO (SFO) of the rejected (incorrect operated) PRs and CBs.
  • If c = C, proceed to the next step after searching all connected CBs of the faulty components; otherwise, c = c + 1 and return to step 3.
  • If r = R, proceed to the next step after traversing all faulty components; otherwise, r = r + 1 and return to step 2.
  • If SNORMSFPSFNSROSFO = SAM, the PRs and CBs with alarm messages have completed the operation evaluation; otherwise, there are PRs and CBs without corresponding faulty components. Considering that the PRs and corresponding CBs may be false alarms or maloperation, the PRs and CBs without corresponding faulty components are classified into SFN and SFO simultaneously. Since both the remote backup PR and the corresponding CB are backup protections for adjacent components, it is necessary to avoid the normally operated remote backup PRs and CBs as maloperation or rejection during the operation evaluation of the adjacent component. Therefore, the SFN and SFO require the following operations:
    S FN = S FN ( S FN S NORM ) S FO = S FO ( S FO S NORM )
  • Since maloperation and false alarms have alarm messages, while rejection and missing alarms do not, some of the PRs and CBs cannot be directly distinguished between the cases of maloperation or false alarms and the cases of rejections or missing alarms. Therefore, these PRs and CBs are divided into the pending set SUN and evaluated after the fault tracking of the RPS-CB. The SUN is as follows:
    S UN = S FN S FO + S FP S RO S FN = S FN S FN S FO S FO = S FO S FN S FO S FP = S FP S FP S RO S RO = S RO S FP S RO
  • Conclude the operation evaluation and output the evaluation results SNORM, SFP, SFN, SRO, SFO, and SUN.
  • Either SRO or SFO is not an empty set; there are incorrectly operated PRs and CBs, and it is necessary to obtain the alarm messages of the corresponding spacings of the intelligent substation for fault tracing to determine the fault types of the relay protection system and circuit breakers. Either SFP or SFN is not an empty set; there are missing and false alarms of the PRs and CBs, and it is necessary to check the communication network between the substation and the control center or request the re-uploading of alarm messages. SUN is not an empty set; there are PRs and CBs that are indistinguishable from those of the evaluation rules, and it is necessary to carry out secondary evaluation based on the fault tracking results.

3.2.2. Fault Tracing Process

Since the improved Random Forest-based fault tracking model is a single-label and multiclassification model, this model is unable to track multiple faults simultaneously in the RPS-CB. In addition, faults in communication and neighboring devices may lead to alarms from multiple devices, so fault tracking must differentiate between communication faults and faults in adjacent neighboring devices. Furthermore, once faults are detected in the RPS-CB, it is crucial to ascertain whether the CB rejection (maloperation) results from PR rejection (maloperation) by examining the correlation between the operations of PR and CB.
To solve the above problems, this paper divides complex faults into multiple simple faults for fault tracking by dividing the alarm messages. Since communication faults simultaneously trigger alarms in adjacent devices, and the alarm messages for communication faults and device faults differ, it is prioritized to determine whether a communication fault is present. In the absence of a communication fault, the fault lies with adjacent devices and fault tracking is conducted based on individual device faults. If a communication fault exists, the alarm features related to the communication fault are removed from the alarm feature set. If the alarm feature set is not empty, then fault tracking is conducted according to multiple simple faults. Regarding multiple faults, the alarm messages are divided into individual device faults, which are further subdivided into multiple single faults for fault tracking.
According to the above problems and solutions, this paper proposes a fault tracking process based on an improved Random Forest, as shown in Figure 3.
The specific steps of the fault tracking process are as follows:
  • Once a fault occurs in the power grid, the fault diagnosis algorithm diagnoses the faulty component [30]. Subsequently, PRs and CBs are evaluated for their operation according to Section 2.1, and incorrectly operated PRs and CBs are identified.
  • Drawing from previous engineering research [6], the alarm messages of the PRs, CBs, and their associated secondary devices (merging units, protective relays, intelligent terminals, and switches) located within the device spacings are collected within 6 s following the fault. According to Section 2.3, these collected alarm messages form the corresponding alarm feature set, denoted as X.
  • If alarm messages are not issued from adjacent devices, this situation may indicate a single device fault or multiple faults of non-adjacent devices. The alarm feature set is divided according to the device to obtain the alarm feature subsets of single devices. If alarm messages are issued from adjacent devices, this situation may indicate communication faults or multiple faults. In such cases, the alarm feature set is divided based on the adjacent devices, resulting in subsets of alarm features for both the adjacent devices and individual devices.
  • The subsets of alarm features from adjacent devices are input to the fault tracking model. If a communication fault is found in the output result, in such cases, the subset of the alarm features corresponding to the communication fault is removed from the alarm feature set X to avoid repeated diagnosis.
  • If the alarm feature set X is empty, all alarm features have completed fault tracking and output the communication fault type of the RPS-CB. If not, there are other non-communication faults present, and it is then divided based on the individual device to obtain subsets for alarm features of individual devices.
  • Alarm feature subsets of individual devices are input into a fault tracking model, outputting device fault types of the RPS-CB.
  • For the PRs and CBs in the set SUN, a secondary evaluation is necessary to assess whether they are rejected or missing (missing alarm or false alarm). The secondary evaluation rules are shown in Table 5.
  • After combining the operation evaluation and the secondary evaluation results, whether the CB rejection (maloperation) is caused by the PR rejection (maloperation) or the faults in the relay protection system based on the fault tracking model results is determined. If both the PR and the CB are incorrectly operated, and the CB is not faulty, CB maloperation (CB rejection) is caused by the PR maloperation (PR rejection). If the PR operates and the corresponding CB fails to trip, it is considered that the CB rejection is caused by a device fault or a communication fault in the relay protection system.
  • The relay protection system and CB faults that caused the PRs and CBs to operate incorrectly are determined based on Figure 1 and Figure 2.

4. Case Verification

4.1. Verifications of Operation Evaluation Process

4.1.1. Fault Cases in the IEEE 39-Bus System

In this paper, the accuracy of the operation evaluation is verified by the fault cases of the IEEE 39-bus system (as shown in Figure 4). The fault cases and operation evaluation results are shown in Table 6. In Figure 4, B represents the bus, L represents the line, T represents the transformer, CB represents the circuit breaker, Lp represents the line protection, Bp represents the bus protection, Tp represents the transformer protection, and the suffixes m, p, and s represent the main, near backup, and remote backup protection, respectively. For example, the line connecting B03 and B18 is L0318, and the transformer connecting B02 and B30 is T0230; the CB near the B03 of L0318 is CB0318, and the circuit breaker near the B18 side is CB1803. Lp0318m, Lp0318p, and Lp0318s represent the corresponding main, near backup, and remote backup protection of CB0318, respectively. Bp03 is the bus protection of B03, Tp0230m, and Tp0230p are the main and near backup protection of the transformer.
Table 6 displays that Cases 1–10 simulate different single faults, including scenarios with incorrect PR and CB operations, as well as the absence of alarms and rejections. The proposed operation evaluation rules in this paper assess the operations of the PRs and CBs configured for the fault component based on the operational logic among the main, near backup, and remote backup protections, as well as between the PR and its corresponding CB. For example, in Cases 1 and 2, there are no alarm messages of CB1803. Lp0318s operates instead of CB1803 to isolate the fault in Case 2, while Case 1 lacks a backup protection PR. Thus, Case 1 represents the absence of an alarm message for CB0318, whereas Case 2 indicates CB0318 rejection.
Similarly, in Cases 3 and 4, it is possible to distinguish between the rejection or missing alarm of the PR by the operation state of the CB corresponding to the bus protection and the backup protections. In Cases 5–7, the PR and CB fail to operate, and the missing alarm occurs simultaneously. It can be seen from the evaluation results that the evaluation rules can correctly identify the simultaneous rejections and missing alarms in a single fault. In Cases 8–10, false alarms and the maloperations of PRs and CBs occur. It should be noted that since the main and near backup PRs correspond to the same CB, the maloperation and false alarm of the near backup PR cannot be directly judged by the corresponding CB operation state. Therefore, Lp0203p in Case 9 will be divided into the pending set SUN and will wait for the fault tracking results for secondary evaluation, as per step 10 of the evaluation process in Section 3.2.1.
Unlike the single fault, the double faults in Cases 13–16 need to consider that the adjacent fault components are configured with the same backup PRs and CBs. When the PRs and CBs of a faulty component are evaluated, the remote backup PR and CB operation due to another faulty component will be judged as maloperation. For example, when a fault occurs in L0318 and L1718 in Case 13, and when evaluating the PRs and CBs operation of L0318, CB1718 is a remote backup PR of L0318 and is operated due to the isolation of the L1718 fault. This situation is a false alarm or malfunction in the evaluation rules. When evaluating the PRs and CBs operation of L1718, CB1718 functions as a normal CB. Therefore, during step 9 of the operation evaluation process, the evaluation results of PRs and CBs between different faulty components are compared, and the above double fault situation is corrected to avoid incorrect evaluation results.
The operation evaluation process proposed in this study accurately evaluates the operating states of PRs and CBs, reducing the scope of fault tracking for subsequent relay protection systems and CBs. Furthermore, this process can be applied not only to single faults but also to false alarms or missing alarms, maloperation or rejection, adjacent components configured with the same PRs and CBs, and other complex situations. The proposed method makes the operation evaluation more comprehensive and practical, and it is applicable to complex real-world scenarios in power systems.

4.1.2. Fault Cases in the IEEE 118-Bus System

For the fault tracing method to be applied to larger and more complex power systems, the evaluation rules must be capable of evaluating the operations of the PRs and CBs in this power system to identify failed relay protection systems and CBs. This paper verifies the applicability of the evaluation rules by testing them on fault cases in the IEEE 118-bus power system (as shown in Figure 5). The circles in Figure 5 represent busbars, and the lines connecting them are transmission lines. The numbering rules for lines, buses, CBs, and PRs match those in Figure 4.
Table 7 presents the results of the operation evaluation for fault cases within the IEEE 118-bus power system. Cases 17–19 represent line faults characterized by CB rejections, the missing alarm message, and the false alarm from the PR. Cases 20–22 involve bus faults with CB rejections, missing alarm messages from CBs, and the false alarm of the PR. Cases 23 and 24 encompass double faults, including the rejection and missing alarm messages from the bus PR. The evaluation results demonstrate the accurate identification of improperly functioning PRs and CBs for both single faults (Cases 17–22) and complex faults (Cases 22 and 24).
False alarms of CBs and PRs occur in Cases 19, 21, and 22, respectively. The operation evaluation rule determines the protection type that isolates the faulty components by comparing the status of the main, near backup, and far backup PRs. Whether it is classified as a maloperation or a false alarm depends on whether the corresponding CB (PR) of the falsely alarmed PR (CB) operates or not. Since the corresponding PRs and CBs do not operate in these cases, they are judged as false alarms.
In power systems of varying sizes and topologies, components are configured with main, near backup, and remote backup PRs and corresponding CBs. Based on the action logic of different types of protection, the complex PR and CB configurations are divided into several mutually independent combinations of main, near backup, and remote backup PRs and corresponding CBs. Operation evaluation rules can be applied to evaluate each combination. Thus, the operation evaluation rules can evaluate the operational status of the PRs and CBs, even in larger and more complex power systems.
The above cases prove that the operation evaluation rules can be applied in larger and more complex power systems, which lays the foundation for the application of the fault tracking model and fault tracking process in large-scale and complex power systems.

4.2. Improved Random Forest Verification

4.2.1. Data Preprocessing and Sample Set Construction

In this paper, a representative line interval from a 220 kV intelligent substation (illustrated in Figure 6) is selected as a case study. Alarm messages are gathered from historical records and simulation experiments to create sample datasets of relay protection systems and CBs. The effectiveness and reliability of the improved Random Forest proposed in this paper are assessed.
The steps for preprocessing and sample preparation for alarm messages are as follows:
  • Collect the alarm messages of the relay protection systems or CBs from the secondary system of the substation, and extract the names of stations, intervals, and devices. Then, group the alarm messages according to the plant name and the interval name.
  • According to the engineering experience [6], take 6 s as a time window and calculate the frequency of the alarm message within the time window. If an alarm appears only 1–2 times in a time window, regard it as a false alarm; regard an alarm that appears three times or more as a real alarm.
  • According to the definition of the alarm feature set, map alarm messages to the corresponding alarm feature set Xi. If the alarm message exists, set the alarm feature at the corresponding position of the alarm feature set Xi set to 1; otherwise, set it to 0.
  • Label the alarm feature set Xi with fault type Yi; the numbering of the fault types is detailed in Table 1 and Table 2. Combine the alarm features Xi and the corresponding fault type Yi as (Xi, Yi) and input into the sample set of fault tracking.
Based on the above steps, this paper constructs a fault sample set of relay protection systems and circuit breakers with a total number of 16,107 samples, of which 11,183 are training samples, 1604 are test samples, and 3320 are validation samples. The details of the samples are shown in Table 8 and Table 9.

4.2.2. Evaluation Indicators

Considering that the fault tracking of the RPS-CB is a multiclassification problem, this paper chooses producer accuracy, user accuracy, overall accuracy, and the Kappa coefficient to evaluate the classification performance. User and producer accuracy are used to evaluate the classification performance of a certain category, and the overall accuracy and Kappa coefficient are used to evaluate the overall classification performance of the algorithm. The Kappa coefficient is defined as shown in Equation (9), and the producer accuracy (PA), user accuracy (UA), and overall accuracy (OA) are defined as follows:
P A = N i i / N + i
U A = N i i / N i +
O A = i = 1 Y N i i / N

4.2.3. Parameter Optimization of the Improved Random Forest

The number and maximum depth of decision trees are the main parameters affecting the classification performance of the improved Random Forest. Reasonable parameter settings can ensure good accuracy and generalization ability and prevent the fault tracking model from being too complex, and therefore being large, which affects the training and operation efficiency. For this reason, this paper adopts the hierarchical cross-validation method. The overall accuracy OA is the evaluation index to tune the number of decision trees (n_estimators) and the maximum depth (max_depth) of the improved Random Forest.
The process of 10-fold hierarchical cross-validation is as follows:
  • The training set is divided into ten groups, ensuring that the proportion of each fault type in each group is consistent with the training set.
  • Nine groups are selected as the cross-validated training set to train the improved Radom Forest, and the remaining group as the cross-validated validation set to verify the training effect. This is repeated ten times to ensure that each group is selected as the validation set for cross-validation.
  • The average of the overall accuracies of the ten cross-validations is counted as the result of the cross-validation.
The maximum number of decision trees is determined through the optimization process using 10-fold hierarchical cross-validation. The number of decision trees for the improved Random Forest starts from one, and a hierarchical cross-validation is conducted for each additional decision tree. The relationship between the number of decision trees and the cross-validation results is visualized in Figure 7. As the number of decision trees increases, the average overall accuracy shows an upward trend until it stabilizes at 50 trees.
Following the selection of the optimal number of decision trees, the maximum depth of the decision trees is optimized using 10-fold hierarchical cross-validation. The improved Random Forest initializes its decision tree with a root node and performs hierarchical cross-validation for each subsequent node added incrementally. The relationship between the maximum depth of the decision tree and the cross-validation results is visualized in Figure 8. As the maximum depth of the decision tree increases, there is a tendency for the average overall accuracy to rise until it stabilizes at a maximum depth of approximately 20.
Decreasing the size of the Random Forest can reduce the computational burden and improve the efficiency of operation while maintaining a high overall accuracy. Moreover, an improved Random Forest-based model with large scale and deep depth can lead to overfitting. Therefore, this paper sets the number of decision trees for the improved Random Forest-based fault tracking model to 50 and sets the maximum depth to 20.

4.2.4. Comparisons of Fault Tracing Methods

In order to evaluate the classification effectiveness of the improved Random Forest, this paper compares it with the existing fault tracking methods, which include the Decision Tree [6,25], Gradient Boosting Decision Tree (GDBT) [26], Recurrent Neural Network (RNN) [24], and Random Forest.
The comparison experiment of fault tracking methods is conducted as follows:
Firstly, the improved Random Forest and other existing methods are trained ten times using the training set. Subsequently, the validation set is employed to validate each method after every training iteration. The running time Ttest, the overall accuracy (OA), and the Kappa coefficient (kappa) of the classification results of the validation set are recorded for each validation result, as shown in Table 10. Finally, the optimal validation results of each method are selected and compared with the user accuracy (UA) and producer accuracy (PA) for each type of fault, as shown in Table 11 and Table 12. All comparison experiments are performed on a computer with a 2.2 GHz main frequency, a six-core processor (Intel Core i7-8750H), and 32 G of RAM, utilizing the PyCharm programming tool.
The average computing times of the validation process for the Decision Tree, Random Forest, GDBT, RNN, and the proposed method are 7.5 ms, 23.4 ms, 24.1 ms, 72.6 ms, and 22.7 ms, respectively. Their highest overall accuracies are 91.60%, 94.58%, 95.84%, 96.54%, and 97.83%; the corresponding Kappa coefficients are 91.32%, 94.40%, 95.70%, 96.43%, and 97.79%, respectively. Based on the overall performance, the improved Random Forest has high accuracy and computational efficiency, making it suitable for real-time systems. Moreover, the improved Random Forest demonstrates superior user accuracy and producer accuracy in classifying different fault types. Producer accuracy is the ratio of correctly classified samples to all samples belonging to a category. In contrast, user accuracy is the ratio of correctly classified samples to all samples classified as that category. Therefore, the enhanced scheme presented in this paper effectively reduces the probability of the misclassification of fault types.
Some faults occur infrequently, resulting in an imbalance in sample sets, with certain fault types classified as minority or majority classes. For example, the sampling out of step of the merging unit, the CPU module error/exception and the configuration error of the PR, the control loop fault and the contact fault of the CB, and the communication optical fiber fault between the protection device–intelligent terminal are categorized as minority class fault types. The comparison demonstrates that the proposed method achieves high UA and PA in classifying minority classes. It supports the claim that the proposed method exhibits excellent classification accuracy when dealing with fault tracking in unbalanced data. While the other methods achieve 100% UA in the minority class, this merely indicates that the other faults are not misclassified as the minority class. Furthermore, the lower PA of the other methods in the minority class suggests that more minority class faults are misclassified as other majority classes, resulting in a decrease in the UA of those majority classes and ultimately lowering the overall classification performance.
When dealing with imbalanced classification problems, relying solely on accuracy and recall metrics is insufficient to evaluate the performance of the classifier due to the influence of the majority class. Therefore, this paper incorporates Kappa coefficients as voting weights to assess the consistency between predicted and actual classes in the classification results and to evaluate the classification bias of the decision trees. By employing a weighted average strategy based on the Kappa coefficient, the results of decision trees with more balanced classification performance carry greater significance, ensuring that the final voting outcomes of the improved Random Forest exhibit high classification consistency and low classification bias. Additionally, the Re-Relief F-based feature selection algorithm measures the distance ratio between each sample and its nearest neighboring samples to determine the importance of each feature for classification. Even in the presence of class imbalance within the training set, the Re-Relief F algorithm selects features that effectively differentiate between minority and majority classes, assigning them higher feature weights. Since features with higher weights are more likely to be selected as node features, the ability of the Random Forest to distinguish between minority and majority classes is enhanced.
Based on the above analysis, the approach proposed in this paper demonstrates strong performance in terms of classification results both overall and for each fault type. Additionally, it ensures a high accuracy rate for the classification of minority classes within unbalanced data.

4.2.5. Comparison of Feature Selection Methods

In this paper, the Re-Relief F algorithm is utilized to evaluate feature importance in classification, with the objective of eliminating redundant features and enhancing classification accuracy. To assess the impact of different feature selection methods on the classification performance of fault tracking models, the Spearman correlation coefficient [31], Kendall correlation coefficient [32], information gain (InfoGain) [33], and maximum correlation minimum redundancy (mRMR) [34] are selected for conducting comparative experiments. Among them, Spearman and Kendall correlation coefficients measure the importance of features for classification by calculating the correlation between categories and features; InfoGain uses information entropy to describe the degree of influence of features on the classification results; and mRMR searches for the combination of features that have the maximum correlation with the categories and the minimum redundancy among them.
The experimental procedure for comparing feature selection methods is as follows:
First, the feature weights of the alarm features are calculated according to the definitions of Spearman, Kendall, InfoGain, and mRMR, respectively. Then, these weights are incorporated into the training process of the Random Forest. The features in the training subset are divided into three corresponding high, medium, and low feature subsets (Wh, Wm, and Wl) based on their corresponding feature weights. The same number of features from the three subsets are randomly selected as the candidate features of nodes. Finally, each feature selection method is incorporated into the Random Forest algorithm and repeated five times for training. The validation set is used to verify and compare the effects of different feature selection algorithms when combined with Random Forest. The experimental results are presented in Table 13.
The Spearman and Kendall correlation coefficients measure the extent to which alarm features contribute to classification by assessing the correlation between alarm features and fault types. A comparison reveals that feature selection algorithms based on Spearman and Kendall coefficients enhance both classification stability and accuracy to a certain degree. Notably, the Kendall-based feature selection method demonstrates the best performance, achieving an OA of 95.30% and a Kappa of 95.47% when combined with the Random Forest.
There is a correlation between fault types and alarm features in relay protection systems and CBs. Moreover, duplication exists among features of different fault types. InfoGain calculates the information gain for each alarm feature that corresponds to a specific fault type. The higher the gain, the greater the impact on classification. Additionally, mRMR comprehensively considers the correlation between alarm features and fault types, as well as the redundancy within alarm features. In comparison with the correlation coefficient, InfoGain and mRMR have a more prominent effect on the classification results.
Both the feature selection algorithm mentioned earlier and the proposed method enhance the stability and accuracy of classification compared with the original Random Forest. Additionally, the proposed method outperforms the other methods in terms of OA and Kappa, demonstrating higher classification accuracy and consistency. The Re-Relief F-based feature selection algorithm accentuates features with strong intra-class proximity and significant inter-class differentials. This refined feature subset enables a more accurate representation of the variations in fault types and their distances. In contrast to other feature selection methods, the Re-Relief F-based approach proves to be more effective in improving fault tracking outcomes for relay protection systems and CBs.
In addition, the feature selection method is a sample preprocessing method employed before model training and does not participate in the process of training and running the fault diagnostic model, so it does not affect the complexity and computational efficiency of the proposed method.

4.2.6. Fault Tolerance in Case of Unreliable Alarms

The false and missing alarm message can result in errors and the loss of features in the alarm feature set, thereby reducing the accuracy of the fault tracking model. To assess the fault tolerance of the proposed method in handling unreliable alarms, a specific number of alarm features in the validation set are chosen to undergo an inverse operation. This operation involves setting the original “0” feature to “1” or the original “1” feature to “0”, simulating false and missed alarms in the alarm messages.
The verification process is as follows:
  • An alarm subset with non-zero feature values is selected from the alarm feature set of each validation set sample. Then, a certain number (u) of alarm features with a value of “0” are randomly chosen, and their values are changed to “1”. This process simulates the occurrence of false alarms.
  • A certain number (v) of alarm features with a value of “1” are randomly chosen from the alarm feature set of each validation set sample, and their values are changed to “0”. This process simulates the occurrence of missed alarms.
  • The above methods are also utilized in the training and test sets to simulate false alarms and missed alarms.
  • In this paper, the values of u and v are set in the range of [1, 2, 3] to simulate the occurrence of false alarms and missed alarms for 1 to 3 alarm messages. Each value of u and v is simulated five times to simulate the occurrence of false alarms and missed alarms for different alarm features.
  • The processed training and test sets are used to train the fault tracking model, and the processed validation set is fed into the fault tracking model to compare the classification results in the case of false alarms and missed alarms. The OA and Kappa of the classification results are counted, as shown in Table 14.
According to Table 14, despite the false and missing alarm messages, the proposed method achieves a high overall accuracy and Kappa coefficient. This suggests that the method has strong anti-interference and fault-tolerant capabilities. By comparing the classification results with the same number of false and missing alarm messages, it is found that missing alarms have a greater impact on the classification results. This phenomenon is caused by the following two reasons: firstly, missing alarm messages may result in crucial fault features being missed, preventing the fault tracking model from accurately distinguishing between fault types; secondly, some fault types have fewer alarm features, which further increases the missing probability of important alarm features.
Due to the unreliability of alarm messages, there may be some noises or missing alarm features. In this paper, the Re-Relief F-based feature selection method is employed to filter out the features with stronger relevance to the classification results and reduce the interference of noise and redundant features. Randomly selecting features for training can reduce the impact of missing or wrong features on a single decision tree. Even if a feature is missing or wrong, it will only affect part of the decision tree, not all of the decision tree.
This paper employs weighted voting based on Kappa coefficients to assign higher weights to decision trees that demonstrate greater reliability, thereby reducing the impact of missing and erroneous alarm features on the final classification results. Moreover, the result of the improved Random Forest is determined through weighted voting based on the results of multiple decision trees, which can make up for the incorrect classification of certain decision trees, thus improving the robustness of the model.

4.3. Case Analysis of Complex Faults

This section provides a detailed analysis of the fault tracing process based on the improved Random Forest in various fault cases. The fault cases contain communication faults in protection devices and intelligent terminals and double faults in neighboring and non-neighboring devices. Through this case study, it can be proved that the proposed method can accurately trace the complex faults of relay protection systems and CBs and identify the causes of incorrect operation of PRs and CBs.
The fault case takes a 220 kV substation in Shandong as an example, and its main wiring is shown in Figure 9. Among them, 220 kV double buses (B11 and B12) connect two outgoing lines (L11 and L12), two 110 kV double buses (B21 and B22, B31 and B32) connect four outgoing lines (L21 and L22, L31 and L32), and two transformers (T1 and T2) are connected between the buses.

4.3.1. Case 1

Case 1 is a real fault case of a 220 kV substation with a line fault and CB rejection. The fault scenario is as follows: a two-phase ground fault occurs on the 110 kV line L22 in Figure 9, and the main protection Lp22m detects the fault and sends out the tripping signal. However, because of the output port fault of the protection device, the CB22 does not receive the tripping signal and fails to trip. Therefore, CB failure protection Lp22f is used as a backup protection to trip CB21, CBT12, CBB231, and CBB232 to isolate the faulty line L22.
The alarm messages of the PRs and CBs are shown in Table 15. The alarm messages of the secondary system of the substation are shown in Table 16.
According to Table 15, L22 is diagnosed as a faulty component. The corresponding main and near backup protections for CB22 are Lp22m and Lp22p, and the CB failure protection and corresponding CBs are Lp22f, CBT12, CBB231, and CBB232. The operation states of the main, near backup, and remote backup PRs and their corresponding CBs are (1, 0), (0, 0), and (1, 1). Their corresponding fusion confidence degrees are 0.5957, 0.2, and 0.72. According to the operation evaluation rules, it is concluded that Lp22m, Lp22f, CB21, CBT12, CBB231, and CBB232 are normal, while the CB22 fails to trip.
The alarm features of the relay protection system and CB are extracted from the secondary system alarm information in Table 16, as shown in Table 17. Alarm messages are issued by adjacent devices (PRs and intelligent terminals) at the same time. Based on the fault tracking process in Figure 3, the alarm feature subset X1= {XGOOSE1, XSC1, XCOM1} of the PRs and intelligent terminals is divided from alarm feature set X according to the adjacent devices. The corresponding element in the alarm feature subset X1 of the alarm messages in Table 9 is one, and the rest are zero, as shown in Equations (18)–(20) (due to the large dimension of the alarm feature set, only the alarm feature set is shown with non-zero elements).
X GOOSE 1 = { X GOOSE _ IT 1 } X GOOSE _ IT 1 = { 1 , 0 , 1 , 0 , , 0 }
X SC 1 = { X SC _ PR 1 } X SC _ PR 1 = { 1 , 0 , 0 , , 1 , 0 , , 0 }
X COM 1 = { X COM _ PR 1 , X COM _ IT 1 } X COM _ PR 1 = { 0 , 0 , 1 , 0 , , 0 } X COM _ IT 1 = { 1 , 0 , 1 , 0 , , 1 , 1 , 0 , , 0 }
The subset of alarm features X1 is input into the fault tracking model based on the improved Random Forest, and the output is:
Y 1 = { 0 , 1 , 1 , 1 , 0 , 1 }
In looking at Table 2, the fault type is the output port fault of the PR, and there is a communication fault in the fault tracing result. The alarm feature set X is empty after removing the alarm feature subset X1, and the fault tracing result is the output port fault of the PR. According to the fault tracing process in Figure 3, the CB refuses to trip without the CB fault, and a communication fault occurs in the relay protection system. Therefore, the CB rejection is caused by the output port fault of the PR.
In Case 1, the output port fault of the PR will cause the PR and the intelligent terminal to issue alarm messages. The fault tracking process collects alarm messages from adjacent devices that may have communication faults for fault tracking, which is conducive to distinguishing device faults and communication faults.

4.3.2. Case 2

The adjacent device faults or communication faults in relay protection systems and CBs may cause adjacent devices to issue alarm messages. Based on Case 1, this paper simulates the simultaneous occurrence of an adjacent device fault and a communication fault. It proves that the proposed method can distinguish between adjacent device faults and communication faults. The details of Case 2 are as follows: a fault occurs at L22, and the DSP module of the merging unit and output port fail at the same time, resulting in Lp22m rejection. By the time the near backup protection (Lp22p) operates, CB22 starts tripping and isolates the fault line L22.
The alarm messages of PRs and CBs in Case 2, based on the typical monitoring information table of the 220 kV substation, are presented in Table 18. The alarm messages of the DSP module fault are fused with Case 1 to obtain the alarm messages of the secondary system in Case 2, as shown in Table 19.
According to Table 18, the operation states of the main, near backup, and remote backup PRs and their corresponding CBs are (0, 1), (1, 1), and (0, 0). Their corresponding fusion confidence degrees are 0.5917, 0.825, and 0.2. According to the operation evaluation rules, it is concluded that Lp22p and CB22 are normal, while Lp22m fails to operate.
The alarm features of the relay protection system and the CB are extracted from the secondary system alarm messages in Table 19, as shown in Table 20. Alarm messages are issued by adjacent devices (merging unit with PR and PR with intelligent terminal) simultaneously. Based on the fault tracking process of Figure 3, the alarm feature subset X1= {XGOOSE1, XSC1, XCOM1} (Equations (18)–(20)) of the PR and the intelligent terminal, and the alarm feature subset X2= {XSV2, XSC2, XCOM2} of the merging unit and the PR are divided according to the adjacent device, as shown in Equations (22)–(24).
X SV 2 = { X SV _ MU 2 } X SV _ MU 2 = { 1 , 1 , 1 , 0 , , 0 }
X SC 2 = { X SC _ MU 2 , X SC _ PR 2 } X SC _ MU 2 = { 1 , 0 , 0 , , 1 , , 0 } X SC _ PR 2 = { 1 , 0 , 1 , , 1 , , 0 }
X COM 2 = { X COM _ MU 2 , X COM _ PR 2 } X COM _ MU 2 = { 1 , 1 , 0 , , 0 } X COM _ PR 2 = { 0 , 0 , 1 , 0 , , 0 }
The alarm feature subsets X1 and X2 of adjacent devices are input into the fault tracking model based on the improved Random Forest, and the output is as follows:
Y 1 = { 0 , 1 , 1 , 1 , 0 , 1 } Y 2 = { 0 , 0 , 0 , 0 , 0 , 1 }
In looking at Table 1 and Table 2, the fault types are the DSP module fault of the merging unit and the output port fault of the protective relay.
There is a communication fault in the fault tracking results. The alarm feature set X is not empty after excluding the alarm feature subset X1. Firstly, the communication fault is the output port fault of the PR, and then the alarm feature subset X3= {XSV3, XSC3, XCOM3} of the merging unit is divided according to a single device.
X SV 3 = { X SV _ MU 3 } X SV _ MU 3 = { 1 , 1 , 1 , 0 , , 0 }
X SC 3 = { X SC _ MU 3 , X SC _ PR 3 } X SC _ MU 3 = { 1 , 0 , 0 , , 1 , , 0 }
X COM 3 = { X COM _ MU 3 } X COM _ MU 3 = { 1 , 1 , 0 , , 0 }
The alarm feature subset X3 is input into the fault tracking model based on the improved Random Forest, and the output is as follows:
Y 3 = { 0 , 0 , 0 , 0 , 0 , 1 }
In looking at Table 1, the fault type is the DSP module fault of the merging unit, consistent with the results of the subset X2. The result of the fault tracking is the output port fault of the PR and the DSP module fault of the merging unit. According to the fault tracking process in Figure 3, the PR rejection is caused by the DSP module fault of the merging unit.
During the fault tracking process, the alarm feature subset of adjacent devices is input into the fault tracking model to determine if there are any communication faults. The alarm feature set is re-divided according to a single device for fault tracking after eliminating the alarm feature subset of the communication fault. After two divisions of the alarm feature set and fault tracking, the device faults and communication faults are distinguished. Through the verification of Case 2, the fault tracking process does not incorrectly divide the alarm features of the device and communication faults. It can accurately track the complex scenario of device and communication faults.

4.3.3. Case 3

Case 3 simulates a complex fault scenario in which both the PR and the CB refuse to operate, and faults in non-adjacent devices occur in the relay protection system. The details of Case 3 are as follows: a fault occurs in L22, and the DSP module of the merging unit fails, resulting in the Lp22m rejection. The Lp22p operates, but due to the configuration error of the intelligent terminal, the CB22 does not receive the trip signal to refuse to trip. Therefore, CB failure protection Lp22f is operated, and CB21, CBT12, CBB231, and CBB232 trip and isolate the faulty line L22.
The alarm messages of PRs and CBs in Case 3, based on the typical monitoring information table of the 220 kV substation, are presented in Table 21. By fusing the alarm messages of the DSP module fault of the merging unit and the configuration error of the intelligent terminal, the alarm messages of the secondary system of the Case 3 substation are shown in Table 22.
According to Table 21, the operation states of the main, near backup, and remote backup PRs and their corresponding CBs are (0, 0), (1, 0), and (1, 1). Their corresponding fusion confidence degrees are 0.2, 0.5, and 0.725. According to the operation evaluation rules, it is concluded that Lp22p, Lp22f, CB21, CBT12, CBB231, and CBB232 are all normal, while Lp22m and CB22 fail to operate.
The alarm features of the relay protection system and CB are extracted from the secondary system alarm messages in Table 22, as shown in Table 23. There are no alarm messages issued by neighboring devices at the same time in the alarm messages. According to the fault tracking process of Figure 3, the alarm feature subset X3 = {XSV3, XSC3, XCOM3} of the merging unit (such as Equations (26)–(28)) and the alarm feature subset X4= {XGOOSE4, XCOM4} of the intelligent terminal are divided according to a single device, as shown in Equations (30) and (31).
X GOOSE 4 = { X GOOSE _ IT 4 } X GOOSE _ IT 4 = { 0 , 0 , 0 , , 1 , , 0 }
X COM 4 = { X COM _ IT 4 } X COM _ IT 4 = { 0 , 0 , 0 , , 1 , , 1 , , 0 }
The alarm feature subsets X3 and X4 of a single device are input into the fault tracking model based on improved Random Forest, and the output is as follows:
Y 3 = { 0 , 0 , 0 , 0 , 0 , 1 } Y 4 = { 0 , 1 , 0 , 0 , 0 , 0 }
In Table 1, the fault types are the faulty DSP module of the merging unit and the misconfiguration of the intelligent terminal, which are identified as fault tracing results. According to the fault tracing process in Figure 3, the protection rejection is caused by the fault of the DSP module of the merging unit, and the protection rejection causes the CB rejection.
Since the fault tracing model is designed as a single-input and single-output multiclassification model, directly using alarm messages of complex faults as input does not enable simultaneous identification of all fault types. Developing separate fault tracking models for individual devices and communication faults results in multiple outputs and inaccuracy results. In Case 3, the fault tracking process divides the alarm feature subset based on individual devices, breaks down complex faults into multiple simpler faults for tracking, and simultaneously determines the fault types of multiple faults occurring at the same time. This study utilizes single-input and single-output fault tracking models to achieve multi-label classification results, effectively addressing the challenge of achieving complex fault tracking with a single model and mitigating the inaccuracies associated with multiple models.

4.3.4. Scalability Analysis

Based on the fault tracing process in the above case study, it is evident that the fault tracing in this paper consists of two parts: firstly, the identification of improperly operated PRs and CBs, and secondly, the determination of fault types within the relay protection systems and CBs associated with them.
The operation valuation rules are established based on the fundamental configurations of main, near backup, and remote backup PRs. In Section 4.1, it is demonstrated that these rules are applicable to both the IEEE 39-bus and 118-bus power systems. They effectively identify incorrectly operated PRs and CBs in fault cases, thereby narrowing down the fault tracing process to the corresponding substation intervals. This paper proposes a fault tracing model and process for complex faults occurring in relay protection systems and CBs. The alarm feature set is divided based on individual devices and neighboring devices, allowing for the separation of complex faults into simple device faults and communication faults. The above case study validates the feasibility of this division approach. Other types of relay protection systems, such as busbar protection and transformer protection, also incorporate essential components like merging units, protection devices, and intelligent terminals within an intelligent substation. Consequently, these relay protection systems can be further broken down into individual or adjacent devices for fault tracing.
In summary, this paper utilizes operation evaluation rules and a fault tracing process to gradually convert the fault tracing problem into a fault categorization problem for basic devices. It demonstrates that the proposed method is not only applicable to larger and more complex power systems but also has good scalability for the fault classification of different types of relay protection systems and CBs.

5. Conclusions

This paper presents a fault tracking method for the RPS-CB based on an improved Random Forest algorithm. Firstly, the reasons for the incorrect operation of PRs and CBs are analyzed, the fault types of a relay protection system–circuit breaker and corresponding alarm messages are demonstrated, and the method of characterizing the alarm messages is provided; secondly, the feature selection and voting strategy of the Random Forest algorithm is improved, and the fault tracking model based on the improved Random Forest is trained; then, the incorrectly operated PRs and CBs are identified through the operation evaluation; finally, the fault tracking model and the process are utilized to derive the reasons for the faults of the relay protection system and the CB and determine the case of incorrect operation. The main contributions of this paper contain the following points:
  • A fault tracking model based on an improved Random Forest is constructed. This model combines the Re-Relief F algorithm and the weighted voting strategy. The feature selection is optimized to ensure the stable classification performance of the Random Forest. Higher weights are assigned to the decision trees with strong classification performance, improving overall classification performance and accuracy.
  • An operation evaluation process is proposed for identifying PR and CB that are incorrectly operated during complex faults, providing a target for fault tracking. The complex faults verify the evaluation results, and it is confirmed that the evaluation results are still reliable in the case of missing or false alarms.
  • A fault tracking process based on improved Random Forest is developed to realize accurate tracking of complex faults such as device faults, communication faults and multiple faults of the RPS-CB by dividing the alarm feature set and complex faults into multiple simple faults.
After case validation and analysis, it is proved that the proposed method achieves the fault tracing of the incorrect operations of PRs and CBs. This work not only provides an important supplement to the fault diagnosis but also provides accurate and effective references and suggestions, which helps to realize accurate operation and maintenance.
While fault tracing can accurately pinpoint faults in relay protection systems and circuit breakers, it cannot prevent or anticipate them. Thus, our future research aims to employ device operational data to evaluate the performance of the relay protection system and CBs, predicting probable device failures and promptly alerting maintenance personnel to perform necessary repairs or replacements. This approach aims to enhance the reliability of both PRs and CBs.

Author Contributions

Conceptualization, N.S. and Q.C.; data curation, N.S. and C.Y.; formal analysis, N.S., D.X. and Y.S.; funding acquisition, Q.C.; methodology, N.S., C.Y. and D.X.; project administration, Q.C.; software, N.S.; validation, N.S. and Q.C.; visualization, N.S. and Y.S.; writing original draft, N.S.; writing review and editing, N.S., C.Y., D.X. and Y.S. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Natural Science Foundation of China under Grant 51877123.

Data Availability Statement

The data presented in this study are available in the article.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Alvarez-Alvarado, M.S.; Donaldson, D.L.; Recalde, A.A.; Noriega, H.H.; Khan, Z.A.; Velasquez, W.; Rodriguez-Gallegos, C.D. Power System Reliability and Maintenance Evolution: A Critical Review and Future Perspectives. IEEE Access 2022, 10, 51922–51950. [Google Scholar] [CrossRef]
  2. Wang, S.-P.; Zhao, D.-M. A Hierarchical Power Grid Fault Diagnosis Method Using Multi-Source Information. IEEE Trans. Smart Grid 2020, 11, 2067–2079. [Google Scholar] [CrossRef]
  3. Goudarzi, A.; Ghayoor, F.; Waseem, M.; Fahad, S.; Traore, I. A Survey on IoT-Enabled Smart Grids: Emerging, Applications, Challenges, and Outlook. Energies 2022, 15, 6984. [Google Scholar] [CrossRef]
  4. Waseem, M.; Adnan Khan, M.; Goudarzi, A.; Fahad, S.; Sajjad, I.A.; Siano, P. Incorporation of Blockchain Technology for Different Smart Grid Applications: Architecture, Prospects, and Challenges. Energies 2023, 16, 820. [Google Scholar] [CrossRef]
  5. Dong, X.Z.; Wang, D.L.; Zhao, M.Y.; Wang, B.; Shi, S.X.; Apostolov, A. Smart Power Substation Development in China. Csee J. Power Energy 2016, 2, 1–5. [Google Scholar] [CrossRef]
  6. Wang, L.; Chen, Q.; Gao, H.; Ma, Z.; Zhang, Y.; He, D. Framework of Fault Trace for Smart Substation Based on Big Data Mining Technology. Autom. Electr. Power Syst. 2018, 42, 84–91. [Google Scholar]
  7. Moradzadeh, A.; Mohammadi-Ivatloo, B.; Pourhossein, K.; Anvari-Moghaddam, A. Data Mining Applications to Fault Diagnosis in Power Electronic Systems: A Systematic Review. IEEE Trans. Power Electron. 2022, 37, 6026–6050. [Google Scholar] [CrossRef]
  8. Zhang, Y.; Li, D.; Wen, F.; Zhang, Y.; Dong, Y.; Zheng, J. Process Level Fault Diagnosis in IEC 61850 Based Smart Substations. Electr. Power Constr. 2018, 39, 42–48. [Google Scholar]
  9. Fang, X.; Huang, W.; Ye, D.; Huang, Y. Application of a distributed parallel FP-growth algorithm in secondary device defects monitoring. Power Syst. Prot. Control 2021, 49, 160–167. [Google Scholar]
  10. Wang, M.; Li, T.; Ren, J.; Xu, Y. Realization of Fault Diagnosis of Substation Secondary System Using Apriori Algorithm. Proc. CSU-EPSA 2021, 33, 145–150. [Google Scholar]
  11. Wang, M.; Xu, Y.; Fan, W.; Li, T.; Ren, J. Fault Correlation Analysis of Substation Secondary System Based on H-mine Algo-rithm. J. N. China Electr. Power Univ. 2021, 48, 36–44. [Google Scholar]
  12. Xu, Y.; Wang, M.; Fan, W. Defect Data Association Analysis of the Secondary System Based on AFWA-H-Mine. Energies 2021, 14, 4228. [Google Scholar] [CrossRef]
  13. Chen, G.; Dong, X.; Zheng, Y.; Xu, H. Fault diagnosis of a relay protection test based on a long short-term memory network. Power Syst. Prot. Control 2022, 50, 65–73. [Google Scholar]
  14. Higgins, M. Operational Moving Target Defences for Improved Power System Cyber-Physical Security; University of Oxford: Oxford, UK, 2022. [Google Scholar]
  15. Zhang, Y.; Cai, Z.; Long, P.; Li, X.; Su, Z. Real-Time Fault Diagnosing Models and Method for Communication Network in Smart Substation. Power Syst. Technol. 2016, 40, 1851–1857. [Google Scholar]
  16. Sun, Y.; Cai, Z.; Guo, C.; Ma, G.; Dai, G. Fault Diagnosis and Positioning for Communication Network in Intelligent Substation Based on Deep Learning. Power Syst. Technol. 2019, 43, 4306–4314. [Google Scholar]
  17. Ren, B.; Li, J.; Zheng, Y.; Chen, X.; Zhao, Y.; Zhang, H.; Zheng, C. Research on Fault Location of Process-Level Communication Networks in Smart Substation Based on Deep Neural Networks. IEEE Access 2020, 8, 109707–109718. [Google Scholar] [CrossRef]
  18. Dai, Z.; Geng, H.; Han, J.; Li, J.; Fang, W. Fault Location Method of Secondary System in Smart Substation Based on Matrix Algorithm and BP Neural Network. J. N. China Electr. Power Univ. 2022, 49, 1–10. [Google Scholar]
  19. Zhang, C.; Zheng, Y.; Lu, J.; Zhang, H.; Ren, H.; Yang, Z. Fault location of secondary circuits in a smart substation based on a graph neural network. Power Syst. Prot. Control 2022, 50, 81–90. [Google Scholar]
  20. Zhang, X.; Chen, Q.; Sun, M.; Huang, W.; Wang, L.; Liu, B. Fault tracking of high-voltage circuit breakers in case of secondary circuit faults in intelligent substations. Electr. Power Autom. Equip. 2020, 40, 212–217+224. [Google Scholar]
  21. Gao, X.; Ren, B.; Zhang, H.; Liu, M.; Li, J.; Xu, J. Component Fault Tracing of Power Dispatching Automation System Based on Information Difference Graph Model. Power Syst. Technol. 2021, 45, 4808–4817. [Google Scholar]
  22. Ji, J.J.; Chen, Q.; Jin, L.; Zhou, X.T.; Ding, W. Fault Diagnosis System of Power Grid Based on Multi-Data Sources. Appl. Sci. 2021, 11, 7649. [Google Scholar] [CrossRef]
  23. Wang, H.; Ma, Y.; Wu, Y.; Zhao, L.; Liu, T.; Li, X.; Wang, Y.; Zhang, B. Call chain monitoring and distributed tracking method for the communication bus of a power grid control system. Power Syst. Prot. Control 2021, 49, 29–37. [Google Scholar] [CrossRef]
  24. Ren, B.; Zheng, Y.; Wang, Y.; Sheng, S.; Li, J.; Zhang, H.; Zheng, C. Fault Location of Secondary Equipment in Smart Substation Based on Deep Learning. Power Syst. Technol. 2021, 45, 713–721. [Google Scholar]
  25. Cui, C.; Ren, W.; Feng, S.; Qu, L.; Liu, D.; Wang, Y. Remote Power Failure Tracing Based on Classification Decision Tree. In Proceedings of the 2023 IEEE 3rd International Conference on Information Technology, Big Data and Artificial Intelligence (ICIBA), Chongqing, China, 26–28 May 2023; pp. 873–877. [Google Scholar]
  26. Ding, W.; Chen, Q.; Dong, Y.; Shao, N. Fault Diagnosis Method of Intelligent Substation Protection System Based on Gradient Boosting Decision Tree. Appl. Sci. 2022, 12, 8989. [Google Scholar] [CrossRef]
  27. Jiao, K. Research on Intelligent Substation Fault Diagnosis Technology Based on Relay Protection Test Technology. Master’s Thesis, North China Electric Power University, Baoding, China, 2020. [Google Scholar]
  28. Ren, B. Research on Fault Location of Secondary System in Smart Substation Based on Deep Learning. Master’s Thesis, North China Electric Power University, Baoding, China, 2021. [Google Scholar]
  29. Yang, Z.; Nurbol; Jia, X.; Hu, L. Intrusion Feature Selection Methods Based on ReliefF. J. Jilin Univ. Sci. Ed. 2015, 53, 505–510. [Google Scholar] [CrossRef]
  30. Wang, T.; Wei, X.; Wang, J.; Huang, T.; Peng, H.; Song, X.; Valencia Cabrera, L.; Perez-Jimenez, M.J. A weighted corrective fuzzy reasoning spiking neural P system for fault diagnosis in power systems with variable topologies. Eng. Appl. Artif. Intel. 2020, 92, 103680. [Google Scholar] [CrossRef]
  31. Liu, S.X.; Li, Y.K.; Gao, S.Y.; Xing, C.J.; Li, J.; Cao, Y.D. Prediction of Residual Electrical Life in Railway Relays Based on Convolutional Neural Network Bidirectional Long Short-Term Memory. Energies 2023, 16, 6357. [Google Scholar] [CrossRef]
  32. Iqbal, T.; Elahi, A.; Wijns, W.; Amin, B.; Shahzad, A. Improved Stress Classification Using Automatic Feature Selection from Heart Rate and Respiratory Rate Time Signals. Appl. Sci. 2023, 13, 2950. [Google Scholar] [CrossRef]
  33. Chen, Y.; Ma, L.X.; Yu, D.S.; Zhang, H.D.; Feng, K.Y.; Wang, X.; Song, J. Comparison of feature selection methods for mapping soil organic matter in subtropical restored forests. Ecol. Indic. 2022, 135, 108545. [Google Scholar] [CrossRef]
  34. Shaffie, A.; Soliman, A.; Eledkawy, A.; Fu, X.A.; Nantz, M.H.; Giridharan, G.; van Berkel, V.; El-Baz, A. Lung Cancer Diagnosis System Based on Volatile Organic Compounds (VOCs) Profile Measured in Exhaled Breath. Appl. Sci. 2022, 12, 7165. [Google Scholar] [CrossRef]
Figure 1. Reasons for PR rejection and malfunction.
Figure 1. Reasons for PR rejection and malfunction.
Electronics 13 00582 g001
Figure 2. Reasons for CB rejection and malfunction.
Figure 2. Reasons for CB rejection and malfunction.
Electronics 13 00582 g002
Figure 3. Fault Tracing Process Based on Improved Random Forest.
Figure 3. Fault Tracing Process Based on Improved Random Forest.
Electronics 13 00582 g003
Figure 4. IEEE 39-bus power system.
Figure 4. IEEE 39-bus power system.
Electronics 13 00582 g004
Figure 5. IEEE 118-bus power system.
Figure 5. IEEE 118-bus power system.
Electronics 13 00582 g005
Figure 6. Topological structure of line spacing.
Figure 6. Topological structure of line spacing.
Electronics 13 00582 g006
Figure 7. Verification curve of the number of decision trees.
Figure 7. Verification curve of the number of decision trees.
Electronics 13 00582 g007
Figure 8. Verification curve of the max depth of decision trees.
Figure 8. Verification curve of the max depth of decision trees.
Electronics 13 00582 g008
Figure 9. Main wiring diagram of a 220 kV substation in Shandong, China.
Figure 9. Main wiring diagram of a 220 kV substation in Shandong, China.
Electronics 13 00582 g009
Table 1. Device fault types and corresponding alarm messages.
Table 1. Device fault types and corresponding alarm messages.
DevicesAlarm MessagesFault TypesNumber
Merging unitMerging unit self-check alarm, sampling anomaly, merging unit synchronization anomaly, SV total alarm of the merging unit/PR, PR blocking, etc.DSP module fault000001
Merging unit self-check alarm, protection SV total alarm, invalid protection SV data, protection blocking, etc.AC input module fault000010
Merging unit self-check exception/alarm, input/output self-check circuit error, GOOSE/SV total alarm, GOOSE/SV communication interruption, chain break occurs when the merging unit receives GOOSE from measurement and control device, etc.Input/output module fault000011
Power fault alarm of the merging unitPower module fault000100
Abnormal synchronization of the merging unit, synchronization signal interruption of the merging unit, protection blocking, etc.Sampling out-of-step000101
Merging unit GOOSE/SV configuration error, input/output configuration error, GOOSE/SV receiving plate error, etc.Configuration error000110
Protective relayPR self-check alarm, CPU anomaly, fixed value verification error, device parameter sequence error, FLASH self-check anomaly, etc.CPU module error/exception000111
PR self-check alarm, SV total alarm, protection SV sampling exception, protection blocking, etc.SV module fault001000
PR self-check alarm, GOOSE total alarm, intelligent terminal GOOSE total alarm, reclosing lockout, etc.GOOSE module fault001001
The power fault alarm of the protective relay Power fault of the power module 001010
Input/output communication interruption of the PR, the transmission state has not returned, input circuit exception, input circuit self-check error, device locking, etc.Longitudinal channel fault001011
Inconsistent protection setting value, self-checking error of setting value, inconsistent panel configuration, inconsistent configuration file, SV/GOOSE configuration error, etc.Configuration error001100
Intelligent terminalIntelligent terminal self-check alarm, memory error, check error, GOOSE double receiving inconsistency, etc.CPU module fault001101
Intelligent terminal self-check alarm, GOOSE total alarm, GOOSE/SV communication interruption, chain break occurs when the intelligent terminal receives GOOSE from merging unit, etc.I/O module fault001110
The power fault alarm of the intelligent terminalPower fault of the power module001111
Intelligent terminal self-check alarm, GOOSE configuration error, input configuration error, GOOSE receiving platen error, etc.Configuration error010000
Circuit breakerControl circuit disconnection, control power faultControl loop fault010001
Abnormal voltage of the closing coil, too little current of the closing coil, too short a current time of the closing coil, excessive current of the closing coils, excessive current time of the opening and closing coilsClosing coil fault010010
Loop locking of the switching circuit, abnormal total travel, the frequent start of the energy storage motor, long pressing time of the energy storage motor, wellhead pressure exception, energy storage anomaly of the springOperating mechanism fault010011
Long opening time of the contact, non-simultaneous movement of the contact, excessive shell temperatureTransmission mechanism fault010100
Excessive moisture content, SF6 gas leakageContact fault010101
SwitchSwitch communication interruption, data non-renovation, IED port communication fault, port indicator lights off, etc.Communication link fault010110
Switch link interruption, port forwarding message inconsistency, broadcast storm, etc.Port fault010111
StNum/SqNum jumping of GOOSE message, SV message counter jumping, etc.Communication packet loss011000
The power fault alarm of the switchPower fault011001
Table 2. Communication fault types and corresponding alarm messages.
Table 2. Communication fault types and corresponding alarm messages.
Communication LinkAlarm MessagesFault TypesNumber
Merging unit–
Protective relay
SV total alarm, SV sampling exception, input circuit exception, etc.Merging unit device exception, self-test exception/alarms, etc.Merging unit output port fault011010
PR device exception, self-check alarm, input circuit exception, input circuit self-check error, etc.Protective relay input port fault011011
-Communication optical fiber fault011100
Protective relay–
Intelligent terminal
Intelligent terminal GOOSE communication interruption, no input message, input circuit exception, input circuit self-check error, etc.PR output communication interruption, GOOSE communication interruption, self-check exception/alarm, etc.Protective relay output port fault011101
Intelligent terminal device exception, self-check exception/alarm, etc.Intelligent terminal input port fault011110
-Communication optical fiber fault011111
Bus merging unit–
Line merging unit
Merging unit input circuit self-check error, input exception, self-check alarm, SV total alarm, Invalid/abnormal SV data, SV communication interruption, etc.Bus merging unit device exception, self-check exception/alarm, etc.Bus merging unit output port fault100000
Line merging unit device exception, self-check exception/alarm, etc.Line merging unit input port fault100001
-Communication optical fiber fault100010
Intelligent terminal–
Switch
Intelligent terminal exception, output test error, circuit breaker no action, intelligent terminal feedbackIntelligent terminal output port fault100011
Table 3. Assignment rules for confidence degrees of PRs and CBs.
Table 3. Assignment rules for confidence degrees of PRs and CBs.
PRs and CBsLinesBusesTransformers
MainPRs0.9913 (0.2)0.8564 (0.4)0.7756 (0.4)
CBs0.9833 (0.2)0.9833 (0.2)0.9833 (0.2)
Near backupPRs0.80 (0.2) 0.75 (0.4)
CBs0.85 (0.2) 0.8 (0.2)
Remote backupPRs0.70 (0.2)0.70 (0.4)0.7 (0.4)
CBs0.75 (0.2)0.75 (0.2)0.75 (0.2)
Note: The values outside the parentheses represent the confidence degrees of PRs and CBs with alarm messages. The values inside parentheses indicate the confidence degrees of PRs and CBs without alarm messages.
Table 4. Operation evaluation rules for PRs and CBs.
Table 4. Operation evaluation rules for PRs and CBs.
Highest Fusion Confidence DegreesAlarms of Main PR and CBOperation EvaluationAlarms of Near Backup PR and CBOperation
Evaluation
Alarms of Remote Backup PR and CBOperation Evaluation
The main protection and corresponding CB(1, 1)Normal(1, 1)False alarm or maloperation of the near backup PR(1, 1)Remote backup PR and CB maloperation
(1, 0)False alarm of the remote backup PR
(0, 1)False alarm or maloperation of the CB
(1, 0)Missing alarm of the CB(1, 0)False alarm of the near backup PR(1, 0)False alarm of the remote backup PR
(0, 0)Missing alarm of the CB(0, 1)False alarm or maloperation of the CB
(0, 1)Missing alarm of the main protection (0, 1)Normal(1, 0)False alarm of the remote backup PR
(0, 1)False alarm or maloperation of the CB
The near backup protection and corresponding CB(0, 1)Main protection rejection(1, 1)Normal(1, 1)Remote backup PR and CB maloperation
(1, 0)False alarm of the remote backup PR
(0, 1)False alarm or maloperation of the CB
(0, 0)Main PR rejection(1, 0)Missing alarm of the CB(1, 0)False alarm of the remote backup PR
(0, 1)False alarm or maloperation of the CB
The remote backup protection and corresponding CB.(0, 0)Main PR rejection(0, 0)Near backup PR rejection(1, 1)Normal
(1, 0)CB rejection
(1, 0)CB rejection(1, 0)
(0, 0)
CB rejection
(0, 1)False alarm of the CB(0, 1)False alarm of the CB
(0, 0)Main PR rejection(0, 0)Near backup PR rejection(1, 0)Missing alarm or rejection of the CB
(0, 1)Missing alarm of the remote backup PR
Note: The binary group (p, q) indicates whether the alarm message exists or not; p = 1 (q = 1) indicates that the PR (CB) alarm message exists; p = 0 (q = 0) indicates that the PR (CB) alarm does not exist.
Table 5. Secondary evaluation rules.
Table 5. Secondary evaluation rules.
Alarm Status of PRs and CBs in SUNStatus of the Corresponding RPS-CBsOperation Evaluation
Alarm messages are availableFaultMaloperation
No faultsFalse alarm
No alarm messagesFaultRejection
No faultsMissing alarm
Table 6. Evaluation results for fault cases in the IEEE 39-bus power system.
Table 6. Evaluation results for fault cases in the IEEE 39-bus power system.
No.Fault DetailsBrief Description of the Alarm MessagesFaulty
Components
Operation Evaluation
1A fault occurs at B18, and the alarm message of CB1803 is lost.Bp18, CB1817B18Missing alarm message of CB1803
2A fault occurs at B18, CB1803 fails to trip, Lp0318s operates, and CB0318 trips.Bp18, CB1817, Lp0318s, CB0318B18CB1803 rejection
3A fault occurs at B03, and the alarm message of Bp03 is lost.CB0302, CB0304, CB0318B03Missing alarm message of Bp03
4A fault occurs at B03, Bp03 fails to operate, and remote backup PRs operate.Lp1803s, Lp0403s, Lp0203s, CB1803, CB0403, CB0203B03Bp03 rejection
5A fault occurs at L0318, Lp1803m fails to operate, and Lp1803p operates.Lp0318m, CB0318, Lp1803p, CB1803L0318Lp1803m rejection
6A fault occurs at L0318, CB1803 fails to trip, and the alarm message of Lp0318m is lost.Lp1803m, CB0318, Lp1718s, CB1718L0318CB1803 rejection and missing alarm message of Lp0318m
7A fault occurs at L0318, and the alarm message of CB0318 is lost.Lp0318m, Lp1803m, Lp1718s, CB1718L0318CB1803 rejection and missing alarm message of CB0318
8A fault occurs at L0318, CB1803 fails to trip, and the alarm message of Lp0203s is wrong.Lp1803m, Lp0318m, CB1817s, CB1718, Lp0203sL0318CB1803 rejection and false alarm message of Lp0203s
9A fault occurs at B03, CB0318 fails to trip, and Lp0203p operates incorrectly.Bp03, CB0302, CB0304, Lp0203p, CB0203, Lp1803s, CB1803B03malfunction or false alarm of Lp0203p and CB0318 rejection
10A fault occurs at L0318, Lp1718s operates incorrectly, and CB1718 trips.Lp0318m, Lp1803m, CB0318, CB1803, Lp1817s, CB1817L0318Lp1718s and CB1718 malfunction
11A fault occurs at T1213, and the alarm message of CB1312 is lost.Tp1213m, CB1213T1213Missing alarm message of CB1312
12A fault occurs at T1213, CB1312 fails to trip, Lp1413s and Lp1013s operate, and CB1413 and CB1013 trip.Tp1213m, CB1213, Lp1413s, Lp1013s, CB1413, CB1013T1213CB1312 rejection
13A fault occurs at L0318, and the alarm message of CB0318 is lost; a fault occurs at L1718.Lp0318m, Lp1803m, CB1803, Lp1817m, Lp1718m, CB1817, CB1718L0318, L1718Missing alarm message of CB0318
14Faults occur at L0318 and B17, CB1803 fails to trip, and the alarm message of Lp1817s is lost.Lp1803m, Lp0318m, CB0318, CB1718, Bp17, CB1727, CB1716L0318, B17CB1803 rejection and missing alarm message of
Lp1817s
15A fault occurs at B03, and the alarm message of CB0318 is lost; a fault occurs at B14.Bp03, CB0302, CB0304, Bp14, CB1404, CB1415, CB1413B03, B14Missing alarm message of CB0318
16A fault occurs at B18, Bp18 fails to operate, and remote backup PRs operate. A fault occurs at T1213, and the alarm message of Tp1213m is lost.Lp0318s, Lp1718s, CB0318, CB1718, CB1312, CB1213B18, T1213Bp18 rejection and missing alarm message of Tp1213m
Table 7. Evaluation results for fault cases in the IEEE 118-bus power system.
Table 7. Evaluation results for fault cases in the IEEE 118-bus power system.
No.Fault DetailsBrief Description of the Alarm MessagesFaulty
Components
Operation Evaluation
17A fault occurs at L020021, CB021020 fails to trip, and the alarm message of Lp020021m is lost.Lp021020m, CB020021, Lp022021s, CB022021L020021CB021020 rejection and
missing alarm message of Lp020021m
18A fault occurs at L020021, and Lp021020m fails to operate.Lp020021m, CB020021, Lp022021s, CB022021L020021Lp021020m rejection
19A fault occurs at L020021, and the alarm message of Lp021020p is wrongLp021020m, Lp020021m, Lp022021p, CB020021, CB022021L020021False alarm of Lp021020p
20A fault occurs at B038, CB038030 fails to trip, and the alarm message of CB038037 is lost.Bp038, Lp030038s, CB038065, CB030038B038CB038030 rejection and missing alarm message of CB038037l
21A fault occurs at B045, CB045044 fails to trip, and the alarm message of CB046045 is wrong.Bp045, Lp044045s, CB044045, CB045046, CB045049, CB046045B045CB045044 rejection and false alarm or maloperation of CB046045
22A fault occurs at B045, CB045044 fails to trip, and the alarm message of Lp046045s is wrong.Bp045, Lp044045s, CB044045, CB045046, CB045049, Lp046045sB045CB045044 rejection and false alarm of Lp046045s
23A fault occurs at L020021, and the alarm message of Lp020021s is wrong.
A fault occurs at B072, and Bp072 fails to trip.
Lp020021m, Lp021020m, CB020021, CB021020, Lp020021s; Lp024072s, Lp070072s, Lp071072s, CB022021, CB022021, CB022021L020021
B072
False alarm of Lp020021s and Bp072 rejection
24A fault occurs at B039, and the alarm message of Bp039 is lost.
A fault occurs at L043044, and the alarm message of Lp044043m is lost.
CB039037, CB039040, CB039041, CB039042, Lp043044m, CB043044, CB044043B039
L043044
Missing alarm message of Bp039 and Lp044043m
Table 8. Samples of device faults.
Table 8. Samples of device faults.
DevicesFault TypesTotal Number of SamplesNumber of Training SamplesNumber of Test SamplesNumber of Validation Samples
Merging unitDSP module fault4773354498
AC input module fault3712573777
Input/output module fault53036654110
Power module fault3102143165
Sampling out-of-step1971371941
Configuration error3662593176
Protective relayCPU module error/exception2471752151
SV module fault4062863585
GOOSE module fault57240649117
Power fault of the power module59140565121
Longitudinal channel fault66245078134
Configuration error3201818554
Intelligent terminalCPU module fault2561782454
I/O module fault65145759135
Power fault of the power module64144857136
Configuration error54638748111
Circuit breakerControl loop fault3032132664
Closing coil fault67646472140
Operating mechanism fault70248677139
Transmission mechanism fault63544362130
Contact fault3062132964
SwitchCommunication link fault53337748108
Port fault66644290134
Communication packet loss50035144105
Power fault2992132561
Table 9. Samples of communication faults.
Table 9. Samples of communication faults.
Communication LinksFault TypesTotal Number of SamplesNumber of Training SamplesNumber of Test SamplesNumber of Validation Samples
Merging unit–
Protective relay
Merging unit output port fault4543184096
Protective relay input port fault3622533376
Communication optical fiber fault52436748109
Protective relay–
Intelligent terminal
Protective relay output port fault3102172766
Intelligent terminal input port fault4122884084
Communication optical fiber fault2621832554
Bus merging unit–
Line merging unit
Bus merging unit output port fault50035046104
Line merging unit input port fault4663264496
Communication optical fiber fault51135844109
Intelligent terminal–
Circuit breaker
Intelligent terminal output port fault54338047116
Table 10. Statistical results.
Table 10. Statistical results.
No[6,25]Random Forest[26][24]This Paper
Ttest (ms)OA (%)Kappa (%)Ttest (ms)OA (%)Kappa (%)Ttest (ms)OA (%)Kappa (%)Ttest (ms)OA (%)Kappa (%)Ttest (ms)OA (%)Kappa (%)
1791.6091.322393.7793.572795.8495.707396.4296.302096.8196.71
2691.2090.933093.3193.122395.7895.657196.4596.332297.2097.11
3691.1490.872293.3793.182395.6695.527496.2096.082197.6597.57
4791.5491.262394.0193.812494.6194.436796.3696.232397.8397.79
5991.5191.262294.5894.402095.1294.967596.5496.422597.0896.98
61089.9189.642393.4993.012495.3995.247796.1195.982397.2097.11
71091.0890.812493.3793.182395.3095.156996.3096.172097.4497.36
8790.7590.482293.6793.482695.5195.377396.2396.112697.0896.98
9690.5790.302294.4994.222594.4194.217296.6696.542497.3597.23
10791.2790.992392.9592.762695.0094.847595.9995862397.1197.01
Table 11. Verification results of device faults.
Table 11. Verification results of device faults.
DevicesFault Types[6,25]Random Forest[26][24]This Paper
UA (%)PA (%)UA (%)PA (%)UA (%)PA (%)UA (%)PA (%)UA (%)PA (%)
Merging unitDSP module fault84.6289.8091.0993.8892.1695.9292.4195.9295.0096.94
AC input module fault85.1989.6187.8093.5190.0093.5194.5094.8197.3796.10
Input/output module fault88.6091.8290.2792.7392.0494.55100.0093.6496.3696.36
Power module fault100.0095.38100.00100.00100.00100.00100.00100.00100.00100.00
Sampling out-of-step94.5985.37100.0087.8097.3790.24100.0095.1297.5697.56
Configuration error100.0094.74100.0096.05100.0094.7491.8498.68100.0098.68
Protective relayCPU module error/exception96.0896.0898.0096.0898.0096.08100.0098.04100.0098.04
SV module fault83.7090.5991.8692.9494.0592.9498.2395.29100.0096.47
GOOSE module fault92.5985.4794.8394.0294.7492.31100.0094.8797.4497.44
Power fault of the power module100.0098.37100.00100.00100.00100.0097.76100.00100.00100.00
Longitudinal channel fault98.4091.7998.4796.2798.4897.0198.1897.7697.7697.76
Configuration error96.2394.44100.0094.4498.1196.3088.57100.00100.00100.00
Intelligent terminalCPU module fault92.5992.5996.3698.1598.18100.0096.97100.0098.1598.15
I/O module fault85.9290.3789.5194.8193.5797.04100.0094.8196.3297.04
Power fault of the power module95.0498.5399.27100.00100.00100.00100.00100.00100.00100.00
Configuration error99.0694.59100.0097.30100.0098.20100.0097.30100.0098.20
Circuit breakerControl loop fault100.00100.00100.00100.00100.00100.0097.79100.00100.00100.00
Closing coil fault93.3890.7197.0895.0097.1296.4392.9695.0097.8697.86
Operating mechanism fault81.1789.2987.8493.5391.7295.6896.9594.9697.1297.12
Transmission mechanism fault89.7687.6992.9791.5496.8394.57100.0097.6999.2399.23
Contact fault100.0089.06100.0093.7598.4196.8891.9696.88100.00100.00
SwitchCommunication link fault82.3590.7485.2290.7489.2992.5996.2495.3795.5098.15
Port fault92.0686.5792.3189.5593.9893.28100.0095.5298.4897.01
Communication packet loss100.0097.14100.0097.17100.0097.14100.0099.05100.0099.05
Power fault100.00100.00100.00100.00100.00100.00100.00100.00100.00100.00
Table 12. Verification results of communication faults.
Table 12. Verification results of communication faults.
Communication LinksFault Types[6,25]Random Forest[26][24]This Paper
UA (%)PA (%)UA (%)PA (%)UA (%)UA (%)PA (%)UA (%)PA (%)UA (%)
Merging unit–Protective relayMerging unit output port fault87.1391.6787.8890.6391.9294.7991.1493.7592.0896.88
Protective relay input port fault87.8485.5390.7990.7991.0393.4297.1794.7496.0596.05
Communication optical fiber fault95.0588.0796.2392.7399.0494.5094.2994.5099.0696.33
Protective relay–
Intelligent terminal
Protective relay output port fault77.6389.3988.4192.4289.8693.9491.8693.9495.5296.97
Intelligent terminal input port fault86.0588.1091.7692.8691.8694.0591.2394.0594.1996.43
Communication optical fiber fault80.3683.3394.1288.8994.3492.5998.1896.3098.1196.30
Bus merging unit–
Line merging unit
Bus merging unit output port fault88.5789.4293.3394.2391.9294.2394.8595.1995.2495.24
Line merging unit input port fault89.6990.6393.8194.7991.0395.8395.3795.8395.8895.88
Communication optical fiber fault92.5991.7495.3794.5099.0496.33100.0094.5097.2297.22
Intelligent terminal–
Circuit breaker
Intelligent terminal output port fault10094.8310096.5510097.41100.00100.00100100
Table 13. Results of different feature selection algorithms.
Table 13. Results of different feature selection algorithms.
NoRandom ForestSpearmanKendallInfoGainmRMRThis Paper
OA (%)Kappa (%)OA (%)Kappa (%)OA (%)Kappa (%)OA (%)Kappa (%)OA (%)Kappa (%)OA (%)Kappa (%)
193.3793.1894.3494.1595.1294.9695.4295.2696.0895.9596.8196.70
294.0193.8194.6794.4994.7994.6195.6695.5396.0295.9197.8397.66
394.5894.4094.5294.3395.3095.4795.6095.4596.2396.1097.0896.98
493.4993.0194.4094.2194.9194.7495.8195.7796.1195.9797.2097.10
593.3793.1894.4694.2795.00 94.8495.8795.7395.8795.7597.3597.23
Table 14. Results in case of unreliable alarms.
Table 14. Results in case of unreliable alarms.
Nou = 1 (v = 0)u = 2 (v = 0)u = 3 (v = 0)v = 1 (u = 0)v = 2 (u = 0)v = 3 (u = 0)
OA (%)Kappa (%)OA (%)Kappa (%)OA (%)Kappa (%)OA (%)Kappa (%)OA (%)Kappa (%)OA (%)Kappa (%)
197.2097.1196.1496.0295.8795.7496.8196.7195.3695.2194.1093.91
297.3597.2696.6396.5295.7895.6596.9096.8095.2495.1094.5294.34
396.9096.8096.4296.3096.0895.9696.6696.5595.3995.2494.0193.81
497.0596.9596.2096.0895.8495.7196.6396.5295.4595.3193.8993.69
596.9996.8996.4896.3695.5195.3796.7296.6195.6395.4994.2294.03
Table 15. Alarm messages of PRs and CBs in Case 1.
Table 15. Alarm messages of PRs and CBs in Case 1.
SubstationDevicesStatuses
HZ StationLine L22 protection Lp22mOperated
HZ StationLine L21 failure protection Lp22fOperated
HZ StationLine L21 Switch CB21Tripped
HZ StationTransformer T1 switch CBT12Tripped
HZ StationBusbar B21 Switch CBB231Tripped
HZ StationBusbar B22 Switch CBB232Tripped
Table 16. Alarm messages of the secondary system in Case 1.
Table 16. Alarm messages of the secondary system in Case 1.
SubstationIntervalDevicesAlarms
HZ StationInterval 22Line L22 Protection Lp22mSelf-check alarm
HZ StationInterval 22Line L22 Protection Lp22mException operation
HZ StationInterval 22Line L22 Protection Lp22mOutput communication interruption
HZ StationInterval 22Intelligent terminalGOOSE total alarm
HZ StationInterval 22Intelligent terminalGOOSE interruption
HZ StationInterval 22Intelligent terminalNo GOOSE input messages
HZ StationInterval 22Intelligent terminalGOOSE communication interruption
HZ StationInterval 22Intelligent terminalInput circuit self-check error
HZ StationInterval 22Intelligent terminalInput exception
Table 17. Alarm features of Case 1.
Table 17. Alarm features of Case 1.
No.Feature CategoriesAlarm Features
1GOOSE AlarmGOOSE total alarm of the intelligent terminal
2GOOSE interruption of the intelligent terminal
3Device self-check alarmThe self-check alarm of the protective relay
4Exception operation of the protective relay
5Communication alarmOutput communication interruption of the protective relay
6No GOOSE input messages of the intelligent terminal
7GOOSE communication interruption of the intelligent terminal
8Input circuit self-check error of the intelligent terminal
9Input exception of the intelligent terminal
Table 18. Alarm messages of PRs and CBs in Case 2.
Table 18. Alarm messages of PRs and CBs in Case 2.
SubstationDevicesStatuses
HZ StationLine L22 Protection Lp22pOperated
HZ StationLine L21 Switch CB22Tripped
Table 19. Alarm messages of the secondary system in Case 2.
Table 19. Alarm messages of the secondary system in Case 2.
SubstationIntervalDevicesAlarms
HZ StationInterval 22Merging unitTotal SV alarm
HZ StationInterval 22Merging unitSampling exception
HZ StationInterval 22Merging unitSampling exception
HZ StationInterval 22Merging unitDevice exception
HZ StationInterval 22Line L22 Protection Lp22mSelf-check alarm
HZ StationInterval 22Line L22 Protection Lp22mException operation
HZ StationInterval 22Line L22 Protection Lp22mOutput communication interruption
HZ StationInterval 22Intelligent terminalGOOSE total alarm
HZ StationInterval 22Intelligent terminalGOOSE interruption
HZ StationInterval 22Intelligent terminalNo GOOSE input messages
HZ StationInterval 22Intelligent terminalGOOSE communication interruption
HZ StationInterval 22Intelligent terminalInput circuit self-check error
HZ StationInterval 22Intelligent terminalInput exception
Table 20. Alarm features of Case 2.
Table 20. Alarm features of Case 2.
No.Feature CategoriesAlarm Features
1SV alarmTotal SV alarm of the merging unit
2Sampling exception of the merging unit
3Synchronization exception of the merging unit
4GOOSE alarmIntelligent terminal GOOSE total alarm
5Intelligent terminal GOOSE interruption
6Device self-check alarmDevice exception of the merging unit
7The self-check alarm of the merging unit
8The self-check alarm of the protective relay
9Exception operation of the protective device
10Output communication interruption of the merging unit
11Communication alarmSV communication interruption of the merging unit
12Output communication interruption of the protective relay
13No input GOOSE messages of the intelligent terminal
14Communication interruption of the intelligent terminal GOOSE
15Input circuit self-check error of the intelligent terminal
16Input exception of the intelligent terminal
Table 21. Alarm messages of PRs and CBs in Case 3.
Table 21. Alarm messages of PRs and CBs in Case 3.
SubstationDevicesStatuses
HZ StationLine L22 Protection Lp22pOperated
HZ StationLine L21 failure protection Lp22fOperated
HZ StationLine L21 Switch CB21Tripped
HZ StationTransformer T1 switch CBT12Tripped
HZ StationBusbar B21 Switch CBB231Tripped
HZ StationBusbar B22 Switch CBB232Tripped
Table 22. Alarm messages of the secondary system in Case 3.
Table 22. Alarm messages of the secondary system in Case 3.
SubstationIntervalDevicesAlarms
HZ StationInterval 22Merging unitTotal SV alarm
HZ StationInterval 22Merging unitSampling exception
HZ StationInterval 22Merging unitSampling exception
HZ StationInterval 22Merging unitDevice exception
HZ StationInterval 22Intelligent terminalGOOSE total alarm
HZ StationInterval 22Intelligent terminalGOOSE interruption
HZ StationInterval 22Intelligent terminalNo GOOSE input messages
HZ StationInterval 22Intelligent terminalGOOSE communication interruption
HZ StationInterval 22Intelligent terminalInput circuit self-check error
HZ StationInterval 22Intelligent terminalInput exception
Table 23. Alarm features of Case 3.
Table 23. Alarm features of Case 3.
No.Alarm Messages
1SV alarmTotal alarm of the merging unit
2Sampling exception of the merging unit
3Synchronization exception of the merging unit
4GOOSE alarmGOOSE configuration error of the intelligent terminal
5Device self-check alarmDevice exception of the merging unit
6The self-check alarm of the merging unit
7Communication alarmOutput communication interruption of the merging unit
8SV communication interruption of the merging unit
9GOOSE panel configuration error of the intelligent terminal
10Input configuration error of the intelligent terminal
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Shao, N.; Chen, Q.; Yu, C.; Xie, D.; Sun, Y. Fault Tracing Method for Relay Protection System–Circuit Breaker Based on Improved Random Forest. Electronics 2024, 13, 582. https://doi.org/10.3390/electronics13030582

AMA Style

Shao N, Chen Q, Yu C, Xie D, Sun Y. Fault Tracing Method for Relay Protection System–Circuit Breaker Based on Improved Random Forest. Electronics. 2024; 13(3):582. https://doi.org/10.3390/electronics13030582

Chicago/Turabian Style

Shao, Ning, Qing Chen, Chengao Yu, Dan Xie, and Ye Sun. 2024. "Fault Tracing Method for Relay Protection System–Circuit Breaker Based on Improved Random Forest" Electronics 13, no. 3: 582. https://doi.org/10.3390/electronics13030582

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop