A Novel Fault Diagnosis System on Polymer Insulation of Power Transformers Based on 3-stage GA–SA–SVM OFC Selection and ABC–SVM Classifier

Dissolved gas analysis (DGA) has been widely used in various scenarios of power transformers’ online monitoring and diagnoses. However, the diagnostic accuracy of traditional DGA methods still leaves much room for improvement. In this context, numerous new DGA diagnostic models that combine artificial intelligence with traditional methods have emerged. In this paper, a new DGA artificial intelligent diagnostic system is proposed. There are two modules that make up the diagnosis system. The two modules are the optimal feature combination (OFC) selection module based on 3-stage GA–SA–SVM and the ABC–SVM fault diagnosis module. The diagnosis system has been completely realized and embodied in its outstanding performances in diagnostic accuracy, reliability, and efficiency. Comparing the result with other artificial intelligence diagnostic methods, the new diagnostic system proposed in this paper performed superiorly.


Motivation
Transformers are distributed in almost all domains of the entire electrical network, changing the values of AC voltage (current) at given points to another or several values without altering the frequency. They not only guarantee the normal operation of the power grid, but also affect people's living environment [1]. However, the operating conditions of the transformer (including temperature and electromagnetic conditions) are harsh and not conducive to its long-term health [2][3][4]. In the meantime, the failure of power transformers is often attended by disastrous consequences, which include equipment burning and large-scale blackouts. Undoubtedly, the operational safety of power transformers deserves serious concern.
Fault diagnosis is regarded as one of the most important considerations in maintaining the safe operation of the power transformer. Diagnostic and fault prognosis techniques have been widely and successfully applied in numerous engineering dynamic systems [5], and are also of extreme importance to researchers of electrical energy systems [6]. At present, fault diagnosis systems play a critical role in maintaining the operational safety of power transformers, and their principles and designs are constantly updated and strengthened [7,8].
We argue that a sound transformer diagnostic method should be strengthened in the following aspects: (1) Economic efficiency and (2) solving the allowable-time problem. Economic efficiency is related to diagnostic costs. The average annual failure rate of transformers is not very high; it usually does not exceed 5% [9]. In other words, the expense of the transformer fault diagnosis is of minor significance in most cases. When the transformer is operating properly, the diagnosis provides less valuable guidance to maintenance staff. However, the traditional diagnosis costs for the transformer are relatively prominent, due to the lack of online diagnostic methods. Transformers need to be shut down periodically for maintenance and, during such shutdowns, the outage cost of the transformer is huge. Therefore, it is necessary to control the diagnostic costs and improve the economic efficiency of the transformer's diagnostic method. On the other hand, the allowable-time problem is a noticeable challenge to traditional transformer diagnosis. The fault of the transformer has no obvious abnormality at the beginning, making the allowable-time for maintenance actions relatively short. Under these circumstances, this paper aims to propose an online-diagnosis method that is economical and capable of overcoming the allowable-time problem.

Related Work
Mainstream transformers' fault diagnostic methods include chemical quantity based methods and electrical quantity based methods [10]. Chemical based methods typically include dissolved gas analysis (DGA) [11], degree of polymerization (DP) measurements [12], moisture analysis (MA) [13], and Furan analysis by high performance liquid chromatography (HPLC) [14], among others. The electrical based methods involve the time domain method [15] and frequency domain polarization measurement [16]. Among these, DGA is the most widely exploited [17]. Since DGA was proposed in 1973, this online method has been widely accepted and exploited all around the world, owing to its outstanding economic efficiency and capability to detect failure in advance, which effectively alleviates the pressure brought on by the allowable-time problem [18]. The DGA works via detecting hydrogen (H 2 ), methane (CH 4 ), acetylene (C 2 H 2 ), ethylene (C 2 H 4 ), ethane (C 2 H 6 ), carbon monoxide (CO), and carbon dioxide (CO 2 ) gases dissolved in the transformer oil, which is produced by pyrolysis of insulation paper (board) cellulose. In this proposal, we divided DGA methods into traditional methods and intelligence methods. Traditional methods include: The Doernerburg Ratio Method [19], Rogers Ratio [20], IEC 60,599 Method [21,22], Duval Triangles Method [23], and Pentagon Method [24]. Despite having a long history, most of these methods are unstable in a complex operating environment. On the other hand, while research of intelligent DGA diagnostic methods has appeared, these are implemented less frequently compared with the traditional methods [25]. Therefore, this paper hopes to improve this situation as much as possible.
The development of an intelligent diagnosis of the power transformer is promising. In general, intelligent diagnosis designs are built on the ideas of traditional methods. They combine most of the advantages of both traditional ideas and intelligent algorithms. Recently, intelligent methods, such as fuzzy logic inference systems [26], artificial neural networks [27], support vector machines (SVM) [28,29], and some other machine learning algorithms have been applied to transformer fault diagnosis and have had impressive performances [10,30,31]. However, limitations also exist with intelligent diagnostic methods. For example, fuzzy inference depends excessively on the experience of researchers [32]. In addition, "local minima" and "overfit" are two of the marked weaknesses of Artificial Neural Network (ANN) [33]. Compared to these methods, the application of SVM in abnormal detection and fault diagnosis has marked advantages [34]. It overcomes the local minimum, dimension, and over-fitting problems, and requires less in the scale of the training sample.

Contribution and Paper Orgnization
In this paper, we combined SVM with traditional DGA and proposed a two-stage SVM diagnostic system, and the main structure of the system was depicted in Figure 1. The system contains a feature selection module, which would extract the optimal feature combination (OFC) from DGA information, and a multiclassifier, judging the type of fault in power transformers based on the OFC. Accounting for the relatively low request of the speed and the highlighted accuracy and reliability, a 3-stage GA-SA-SVM selection model which combined a genetic algorithm (GA) [35] and simulated annealing (SA) algorithm [36] with SVM was utilized to complete the selection of OFC. In addition to these, the artificial bee colony (ABC) algorithm [37], which has the fastest iteration speed and the highest search efficiency, was exploited in the diagnostic stage. The entire system has been fully realized, with the accuracy of its result reaching 92%.
The remaining sections of this paper are structured as follows: In Section 2, a 3-stage GA-SA-SVM method, which was used to determine the optimal feature combination of DGA feature sets, is proposed. The ABC-SVM based diagnostic model is constructed in Section 3. A case study of the entire system is illustrated in Section 4. Finally, we conclude the research and identify the direction of future research in Section 5.

The Candidate DGA Feature Sets
In recent years, the DGA gas ratio is used as the characteristic parameter, such as in the Doernerburg Ratio Method [19] and the Rogers Ratio Method [20]. Inspired by [38], eight categories of gas: H 2 , CO, CO 2 , CH 4 , C 2 H 2 , C 2 H 4 , C 2 H 6 , and the total hydrocarbon (TH, TH = CH 4 + C 2 H 2 + C 2 H 4 + C 2 H 6 ) were taken into account in this paper. Therefore, there were 28 DGA candidate ratios in total, which are shown in Table 1 below.

DGA Feature Selection Model
The DGA feature selection is necessary due to the ambiguous relationship between the DGA features and the types of transformer fault. According to [39], parts of DGA gas ratios are unrelated to the fault diagnosis, which means that blindly selecting DGA features or even focusing on all features is unwise. To address this problem, it is necessary to select the key DGA features carefully, which was mostly suitable in the following diagnosis. Accounting for the low request of the speed and the highlighted accuracy and reliability, a 3-stage GA-SA-SVM selection model was constructed to complete the selection. The flowchart of the selection process is illustrated in Figure 2, and a brief description of the 3-stage GA-SA-SVM model constructing process is as follows:

Multiclass Nonlinear SVM Model
The principle of SVM is to find the optimal hyperplane which satisfies the classification requirements and extends the distance of the two data sets on the hyperplane as much as possible. The model of the SVM is depicted in Figure 3.
Assuming that x i ∈ R n and y i ∈ R n are the input and output of the training set, respectively, the training set {(x 1 , y 1 ), . . . , (x i , y i ), . . . , (x l , y l )} was obtained. At the same time, a class table y i ∈ {−1, +1} was introduced, which was determined through the given x i . The constraints of the training set data can be written as (1): where the ϕ(x i ) is a nonlinear mapping. Both the ϕ(x i ) and ω contain infinite dimensions. They form the optimal hyperplane together. When the data are linearly inseparable, a non-negative slack variable ξ i is introduced to transform the SVM into (2): where parameter C is the penalty factor. C was determined through optimization, which depends on the GA and SA algorithms. Build a Lagrangian function to solve the QP problem of (2): Among them, α i > 0 and β i > 0 are Lagrange multipliers; then, the original problem has been transformed into a quadratic programming problem: where K(x i , x j ) is called the kernel function that satisfies (6). σ was a given parameter, which was determined through the GA and SA optimization.
Using the One-Against-One (OAO) method to extend the two-class SVM to a multiclass SVM, the optimization problem translated into (7): Therefore, the expression of the decision function is written as (8):

Application of Genetic Algorithm
Genetic algorithms contributed to the OFC selection for determining C and σ. They contained three main units: Coding, fitness calculation, and genetic operation.

•
Chromosome coding The chromosome in GA was abstracted as the solution of the objective function. The abstract process is known as coding. As shown in Figure 4, three sets of parameters: C, σ, and DGA ratio sets need to be optimized. They match three segments of binary codes which were represented by L 1 , L 2 , and L 3 , respectively. In the first two codes, namely L 1 and L 2 , the value of binary encoding in the decimal form is equivalent to the value of its corresponding C and σ. The binary code on the L 3 segment reflects the combination of selected DGA ratios. The "1" on each bit in L 3 means the corresponding DGA ratio is selected, and "0" reflects the opposite meaning. • Genetic fitness calculation Genetic fitness is calculated as follows in (9), which is the standard to evaluate the performance of a single chromosome: where, l i is the number of samples in the ith verification set; l i T is the correct classified number in the verification set; and k is the number of cross validation. The concept of K-fold cross-classification will be illustrated in the following description.

• Genetic operations
The old solution generates new solutions through genetic operations. Genetic operation refers to the fact that in each generation, individual chromosomes are chosen according to their selection probability. After that, chromosomes still need to experience crossover and mutation in order to generate a new population. This process ensures that the new population is more adaptable to the environment than the previous generation. The selection probability of each individual is calculated as follows in (10): f i is the ith genetic fitness. Crossover operations followed (11) with a given certain probability: where a is a random number in the interval [0, 1]. The mutation operation (12) refers to randomly selecting a mutation bit j in the mutated chromosome and setting it as a normalized random number U(a i , b i ). a i , b i are the upper and lower constraints of the corresponding mutation.

Combination of SA Algorithm and GA
The SA operation and the inverse SA operation were combined with the GA to acquire a more impressive performance in optimization. The flowcharts of the SA operation and the inverse SA operation are presented in Figure 5. •

Simulated annealing operation
The SA operation is the most noticeable difference between the SA algorithm and other greedy algorithms. The SA operation refers to a reservation principle for the new solution set up in iterations. In the SA operation, the solution of a new generation is retained based on a probability, which is calculated following the Metropolis criterion.

Metropolis criterion
The SA algorithm draws on the relationship between the temperature T and the internal energy E in the solid annealing principle. The Metropolis criterion describes the relationship between the probability of accepting a solution of the new generation and T and E. Assuming that, at temperature T, the number of current iterations is i and the number of new iterations is j, if E j < E i , save j as the current generation; otherwise, follow the probability P to determine whether j would be saved, where P followed (13). In the SA algorithm, T is positively related to the number of iterations, and E i is numerically equal to the fitness of the ith generation.
• Inverse simulated annealing operation Inverse simulated annealing operation is the opposite of simulated annealing operation: If E j > E i , accept j as the current generation; otherwise, follow the probability P to accept j as the current generation.

•
Multi stage of GA-SA-combination In order to obtain the result of selection with high accuracy, sound stability, and a relatively short time-consumption, we utilize the GA algorithm, the GA hybrid SA (GA-SA) algorithm, and the Inverse SA hybrid GA (Inverse SA-GA) algorithm in multiple stages of generation intervals. Only one of these algorithms runs in each interval to optimize the SVM parameter and OFC.
Through combining GA and SA in multiple stages, we obtained a multistage-GA-SA-SVM selection model.

K-Fold Cross-Validation
K-fold cross-validation (CV) was hired to verify the accuracy of selection. To imply K-fold CV, the initial sample is divided into K subsamples, a separate subsample is retained as the data for the validation model, and the other K − 1 samples are employed for training. The cross-validation will repeat for K times. Each subsample will act as a verified sample once. The average result of validations in K times is viewed as the estimation result [29]. Here we set K = 5.

Fault Diagnosis Model Based on ABC-SVM
ABC is a mature algorithm which has been widely applied in solving numerous optimization problems due to its prominent convergence characteristics [37,40]. In order to obtain the highest operating efficiency and the highest diagnostic accuracy, we utilized the ABC algorithm to optimize the SVM parameters, and constructed the ABC-SVM based transformer fault diagnosis model.

The Mechanism of ABC
There are four key components in ABC: Honey sources which abstract into points in the solution space, lead bees, follow bees, and reconnaissance bees. The bees represent the potential solution to the problem. The tasks for each bee are different. As shown in Figure 6, they extend the known honey source to search for the global optimal solution in predetermined manners. Search manners consist of three steps: (1) Lead bees discover a source of the honey and share the source information; (2) each follow bee selects the source to collect the honey according to the information and evaluates the quality of the source; (3) the lead bee converts into a reconnaissance bee and continues to search for new sources near the hive when a source is found repeatedly, but the quality is not improved. When a high-quality source is found, he turns his role back into that of a lead bee. These three steps will be replicated until the best honey source is found.

ABC Optimization Model
In the ABC model, the quality of the honey source i (i = 1, 2, . . . , NP) corresponds to the fitness value fit i of the solution, and NP is the number of honey sources. The numbers of lead bees and follow bees equal to half of the bee colony, respectively, which also equal the number of honey sources. A honey source accommodates only one bee at any one time.
Let the dimension of the solution problem be D. The position of the honey source at the tth generation is denoted as , where x id ∈ (L d , U d ), L d and U d denote the lower and upper constraints of the search space, respectively, and d is a random integer in [1, D]. The initial position of the honey source i is randomly generated in the search space according to (14).
To start the search, the lead bee searches around the honey source i according to (15) to generate a new honey source: where j ∈ {1, 2, . . . , NP}, j = i, this indicates randomly selecting a honey source that is not equal to i among the NP honey sources; ϕ is a random number of [−1, 1], which is uniformly distributed and determines the magnitude of perturbation. Then, the follow bee calculates the fitness of the new honey source (16) and decides whether or not to replace X i or keep X i by using the greedy choice method.
f i represents the objective function whose functional value is numerically equal to the mean square error (MSE) of the accuracy of the SVM prediction model.
After that, follow bees use the Roulette Wheel Selection to determine the lead bees they follow. The probability in the Roulette Wheel Selection was calculated through (17): During the search process, if a source X i reaches the limit through trial iterations without finding a better source, the source X i will be abandoned. The lead bee turns into the role of reconnaissance bee and generates a new source of honey in the search space followed randomly (18). ABC algorithm flowcharts were depicted in Figure 7.

Leave-P-Out Cross Validation
In the diagnostic phase, the LpO CV was exploited for verification. The LpO CV refers to using the p elements in the full set X as the testing set, and the remaining n-p elements as the training set. As a result, p verification results will be obtained in the end. The final result is numerically equal to the percentage of correct results in p results.

Process of Classification Based on ABC-SVM
The established ABC model is applicable for selecting the optimal parameters of a nonlinear multiclass SVM (C and σ). The flowchart of ABC-SVM is given in Figure 8, which contains four steps: Step 1. Utilize LpO CV to generate a training set and a testing set. The training set was sent to the ABC model. Step 2. Use training set and nonlinear multiclassification support vector machine to construct unknown parameters and to form the optimal objective function.
Step 3. Apply ABC to find the best solution to determine the best parameters of SVM. The best parameters are obtained when the training accuracy meets the threshold of checking; otherwise, step 3 is replayed.
Step 4. Input testing is set to SVM, then the output will be obtained.

Data Preprocessin
In this research, 118 sets of transformer fault data which originated from International Electrotechnical Commission Technical Committees (IEC TC) 10 databases [20] were engaged to carry out the test. We labeled the 118 datasets with five states of transformers, which contained: 23 sets of low-energy discharge (LED, represented by "1"), 45 sets of high-energy discharge (HED, represented by "2"), 10 sets of low and middle-temperature overheat (LMT, using "3". Representative), 14 sets of high temperature overheating (HT, represented by "4"), and 26 sets of normal operation (N, represented by "5"). The states arrangement of samples is given in Table 2. The data need to be preprocessed: Normalizing the data follows (19) to eliminate differences caused by ratio magnitude differences: (19) where x i.result is the result of normalization, x i is the ratio which needs to be normalized, x i.max and x i.min are the maximum and the minimum members among entire samples.

Parameter Setting in Three Stage-GA-SA-SVM
In 3-stage GA-SA-SVM optimization, several parameters were specified in Tables 3 and 4. The maximum iteration number was set at 200. The population scale was determined at 20. The number of chromosome segments was 3. Both L 1 and L 2 took 18, which guarantees that the upper bound of both C and σ is 255 and they can be accurate to 10 −4 . L 3 was 28. In the optimization process, the first 40 generations were fully optimized using the GA-SVM method, 40-180 generations utilized the GA-SA-SVM algorithm, and the Inverse SA-GA-SVM algorithm was applied in the last 20 generations. Table 3. Parameters preset in 3-stage GA-SA-SVM.

Max Iteration
Population Scale  Table 4. Generation interval of three stages.

Comparison with Other Methods
In order to embody the advantages of 3-stage GA-SA-SVM, we used GA-SVM, GA-SA-SVM, 2-stage GA-SA-SVM, and 3-stage GA-SA-SVM to select feature combinations and obtain the results of four methods. Each result includes CV accuracy, fitness curve, and optimal feature combination. CV accuracy and the optimal combination of features for each method are listed in Table 5. The fitness curve for each method is shown in Figure 9, which also included optimal C and σ. In the following figures, g is used to represent σ and c is used to represent C. In the meantime, we would like to emphasize that, in order to overcome the possible accuracy problems caused by insufficient public sample data of transformers, these results are carefully selected by the author after 50 times of repeated experiments, and are the closest to the average results.   In Table 5, 3-stage GA-SA-SVM has the highest CV accuracy among all algorithms. In Figure 9a, GA-SVM's fitness reaches the highest value within 20 generations and takes the shortest time, which is only about 200 s, to end the iteration. However, due to the fitness curve no longer climbing after reaching a platform, the GA algorithm is more likely to be trapped in the local optimal solution, making the result unstable. Also, the accuracy of the GA-SVM is slightly lower than that of other algorithms. The GA-SA-SVM algorithm made some improvements based on the GA algorithm. In Figure 9b, the fitness of the GA-SA-SVM changed after arriving at a local optimization platform, which means that it is easier for the GA-SA-SVM to jump out of the local optimal solution. Therefore, the result of the GA-SA-SVM looks more stable and accurate. However, the weakness of the GA-SA-SVM is marked. The GA-SA-SVM's fitness reaches the platform period at around the 40th generation and requires a long running time of more than 600 s. The adoption of the 2-stage GA-SA-SVM has already made some improvements to this problem. In Figure 9c, the 2-stage GA-SA-SVM was able to jump out of the local optimal solution and merely required 474 s to complete the optimization, and 20 generations to reach the local optimal platform. The shortcoming that remains in the 2-stage GA-SA-SVM is that the fitness may jump out of the global optimal solution at the end of the optimization. This situation is due to the temperature T in the SA algorithm is already very low within the last ten generations, and the probability of accepting a positive-direction-mutation (which makes the fitness grow) is quite low. In contrast, the probability of receiving a negative-direction-mutation is 100%. For example, maximum fitness in the 194th generation of Figure 9c decreased during accepting a negative-direction-mutation. This generation was very close to the maximum number of iterations. At this moment, it was a risk that the result may return to the local optimal solution and never grow again until the end of the iteration. This is a typical defect of the SA algorithm when the maximum number of iterations is set in the first place. A similar situation also occurs in the GA-SA-SVM hybrid algorithm: At the 190th generation in Figure 9b, fitness returns to the local optimal solution until the end of the iteration. The 3-stage GA-SA-SVM was intended to overcome this shortage. In Figure 9d, the 3-stage GA-SA-SVM retains all of the benefits of the 2-stage GA-SA-SVM, except that the solution takes a little longer-up to 506 s. After the 180th generation, the inverse SA algorithm not only eliminated the decrease in fitness that might occur in the SA algorithm, but also provided two opportunities for the fitness to raise. The 3-stage GA-SA-SVM is therefore more stable and accurate than the 2-stage GA-SA-SVM method.
Based on the high accuracy and stability of the 3-stage GA-SA-SVM, the selection results of the 3-stage GA-SA-SVM are considered to be the most reasonable OFC. DGA ratio components of the OFC are set out in Table 6.

Parameter Setting in Three ABC-SVM
Based on the LpO CV, 118 sets of IEC TC 10 samples were divided into two groups. Among these, 93 sets were for training and 25 sets for testing. The states arrangement of testing samples is listed in Table 7. In the ABC algorithm, we set the scale of the bee colony to 20. The number of honey sources (solutions) is set to half of the scale, that is, 10. In each generation, the maximum number of extra honey sources that can be found are 100. That is, if the reconnaissance bees discover more than 100 fresh honey sources and the quality of the honey sources does not increase, reinitialize the honey sources. This setting is to prevent ABC from being trapped in the local optimal solution. The maximum number of loops is 10 and the dimension of the vector to be optimized is 2. The parameters are arranged as shown in Table 8.

Diagnostic Result
The final results of the 25 testing sample diagnoses and the accuracy of the diagnosis are given in Figure 10. The blue circle in the upper half of the diagram represents the correct label status of the testing data, and the red dot is the diagnostic result from the diagnostic system. When the red dot coincides with the blue circle, it means that the result of diagnosis is accurate. The lower half figure depicts the distribution of the diagnostic deviation of the diagnostic system. Results showed that diagnostic accuracy amounts to 92% (23/25); errors only occurred at points 16 and 17.

Result of Comparison
We divided the comparison into two parts: Self-comparing and comparison with standard algorithms and other wrapper algorithms.

•
Self-comparing Since that the task of ABC algorithm is to search the optimal value of C and the σ, we simulated the diagnostic accuracy of each point of C and the σ to demonstrate the superiority of ABC. The σ and C are both on a [0, 200] × [0, 200] square plane. Their values correspond to the X axis and the Y axis on the plane, respectively. The Z axis perpendicular to the plane represents the accuracy of diagnosis. The detailed results of the simulation are presented in Figure 11, which depicted that the highest precision obtained by the SVM classifier was 91.96%. The diagnostic result of the ABC-SVM has reached the global highest point. The self-comparison verified that ABC has excellent performance under given data and operating conditions.

• Comparison with Standard Algorithms and other Wrapper Algorithms
We compared the results obtained by ABC-SVM with the results originating from standard algorithms based methods: SVM, and back propagation neural networks (BPNN) and wrapper algorithms: GA-SVM and PSO-SVM. These methods shared the same sample, based on the optimal feature combination in Table 6. The diagnostic result of each method is listed in Figure 12. To compare with other wrapper algorithms, fitness curves of wrapper algorithms are shown in Figure 13. In the meantime, we would like to emphasize that, in order to overcome the possible accuracy problems caused by insufficient public sample data of transformers, these diagnostic results are carefully selected by the author after 50 times of repeated experiments, and are the closest to the average results.
The advantages of the ABC-SVM in terms of accuracy can be clearly seen from the comparison between Figures 10 and 12. The ABC-SVM is the only one of all algorithms with a precision of over 90%. The diagnostic accuracy of ABC-SVM is obviously improved compared to that of standard SVM, and it is better than that of BPNN. The accuracy of ABC-SVM is also higher than that of other wrapper algorithms.
In addition, it can be seen that the ABC algorithm has better convergence characteristics when compared to other optimization algorithms, which guarantees that ABC-SVM performs better than other wrapper algorithms. It runs steadily, has fewer iterations, and has a rapid convergence time and a high termination fitness. Unlike GA or PSO, which require hundreds of generations of calculations, ABC reached the optimal platform within five iterations. Besides, as seen in Figure 13, termination fitness of ABC exceeded 95%, which is almost 10% higher than that of the GA and the PSO. This shows the ABC's outstanding preferment in convergence.

Conclusions and Future Directions
This paper combines the traditional DGA method and intelligent algorithms and proposes a complete online monitoring and diagnostic system for power transformers. Inheriting the advantages of traditional DGA online technology, the novel diagnostic method has sound economic characteristics and alleviates the pressure brought by the allowable-time problem effectively. The diagnostic process includes: (1) An extracted DGA feature combination based on 3-stage GA-SA-SVM and (2) using the ABC-SVM classification model to diagnose transformer faults based on the optimal feature combination. The results are shown to be highly accurate and reliable. The system has strong anti-noise ability, so it requires less attention in the working environment condition.
In subsequent studies, we will concentrate on two research directions: (1) Developing an improved ABC algorithm to link SVM mode, and (2) designing an algorithm (ABC-SVR) that combines support vector regression (SVR) with ABC.
Author Contributions: In this research activity, all the authors were involved in the data collection and preprocessing phase, model constructing, empirical research, results analysis and discussion, and manuscript preparation. All authors have approved the submitted manuscript.