A Novel Approach to Arcing Faults Characterization Using Multivariable Analysis and Support Vector Machine

: Based on the Institute of Electrical and Electronics Engineers (IEEE) Standard C37.104-2012 Power Systems Relaying Committee report, topics related to auto-reclosing in transmission lines have been considered as an imperative beneﬁt for electric power systems. An important issue in reclosing, when performed correctly, is identifying the fault type, i.e., permanent or temporary, which keeps the faulted transmission line in service as long as possible. In this paper, a multivariable analysis was used to classify signals as permanent and temporary faults. Thus, by using a simple convolution process among the mother functions called eigenvectors and the fault signals from a single end, a dimensionality reduction was determined. In this manner, the feature classiﬁer based on the support vector machine was used for acceptably classifying fault types. The algorithm was tested in di ﬀ erent fault scenarios that considered several distances along the transmission line and representation of ﬁrst and second arcs simulated in the alternative transients program ATP software. Therefore, the main contribution of the analysis performed in this paper is to propose a novel algorithm to discriminate permanent and temporary faults based on the behavior of the faulted phase voltage after single-phase opening of the circuit breakers. Several simulations let the authors conclude that the proposed algorithm is e ﬀ ective and reliable.


Introduction
It is clear that electric power systems (EPS) have been considered as one of the most important developments of humanity, and its principal objective is to supply electric power maintaining a very high level of continuity of service. Nevertheless, each of the steps of generation, transmission and distribution has its own difficulties, where abnormal conditions called short circuits can be especially present on transmission lines (TLs). Thus, TLs become extremely critical [1]. Based on statistics, 80% of faults in overhead TLs are single-phase and of temporary type. When a temporary fault on TLs is presented, which is mostly accompanied by an electric arc, the main goal of protection relays is to isolate the faulted line temporarily until the fault arc path de-ionizes and re-energize it, which is called reclosing [2,3]. Hence, in many cases, the fault on TLs is not solid and is caused by different conditions such as flying birds, branches of trees and others, where the electrical arc plays a major role and permanent and temporary fault have different behavior; see Section 4.2.
During arcing faults, the appearance of a long arc in free air must be modeled, i.e., the nonlinear nature of the arc should be considered and clearly represented. As it is mentioned in References [4,5], the unconstrained long fault arc in the air could be studied and modeled by applying the theory of switching arc, and based on that information, it is possible to develop a more precise model of arcing faults. In fact, the latest research only analyzes this phenomenon in the time domain, based on voltage and current waveforms [6,7].
On the other hand, the fastest manner to extinguish an arc associated with a temporary fault is through a three-phase reclosing scheme, where, after opening the TL, a time known as the dead time (DT) will elapse and the breakers of each phase will be closed at each extreme of the TL. In this situation, there would be no voltage source to feed the arc. However, rotors of generation units can accelerate too much, and cause the loss of synchronism during the DT, which would be catastrophic for the stability of the EPS [8,9].
Considering not only the possible problems of three-phase reclosing, but also if the temporary fault is usually single-phase, an effective method is the single-phase reclosing [10,11]. In this case, the two healthy phases are kept energized during the fault period, while the faulted phase becomes deenergized temporarily until the arc is cleared. However, the arc cannot be extinguished, immediately. Hence, in single-phase reclosing it is imperative to avoid reclosing when the arc is not cleared yet. Therefore, the main problem with the traditional reclosing method is that there is no guarantee that the fault type is temporary. In fact, the traditional relay algorithm recloses the faulted phase regardless of the fault type [1].
The criterion of equal areas shows the behavior of three-phase opening and reclosing, where the area A1 is greater than the area A2, Figure 1, which means that the synchronous machine loses synchronism [8].  Figure 2 shows the single-phase opening and reclosing effect. Thus, area A1 is smaller than area A2. The machine will not lose synchronism [8].   Figure 2 shows the single-phase opening and reclosing effect. Thus, area A1 is smaller than area A2. The machine will not lose synchronism [8].
2 During arcing faults, the appearance of a long arc in free air must be modeled, i.e., the nonlinear nature of the arc should be considered and clearly represented. As it is mentioned in References [4,5], the unconstrained long fault arc in the air could be studied and modeled by applying the theory of switching arc, and based on that information, it is possible to develop a more precise model of arcing faults. In fact, the latest research only analyzes this phenomenon in the time domain, based on voltage and current waveforms [6,7].
On the other hand, the fastest manner to extinguish an arc associated with a temporary fault is through a three-phase reclosing scheme, where, after opening the TL, a time known as the dead time (DT) will elapse and the breakers of each phase will be closed at each extreme of the TL. In this situation, there would be no voltage source to feed the arc. However, rotors of generation units can accelerate too much, and cause the loss of synchronism during the DT, which would be catastrophic for the stability of the EPS [8,9].
Considering not only the possible problems of three-phase reclosing, but also if the temporary fault is usually single-phase, an effective method is the single-phase reclosing [10,11]. In this case, the two healthy phases are kept energized during the fault period, while the faulted phase becomes deenergized temporarily until the arc is cleared. However, the arc cannot be extinguished, immediately. Hence, in single-phase reclosing it is imperative to avoid reclosing when the arc is not cleared yet. Therefore, the main problem with the traditional reclosing method is that there is no guarantee that the fault type is temporary. In fact, the traditional relay algorithm recloses the faulted phase regardless of the fault type [1].
The criterion of equal areas shows the behavior of three-phase opening and reclosing, where the area A1 is greater than the area A2, Figure 1, which means that the synchronous machine loses synchronism [8].  Figure 2 shows the single-phase opening and reclosing effect. Thus, area A1 is smaller than area A2. The machine will not lose synchronism [8].  When the single-phase fault occurs, the primary arc current occurs and switches at the ends of the failed phase will open, the other phases will remain closed [8].
When the phase-in fault is opened, it is coupled with the healthy phases in an electrostatic way or capacitive coupling (of greater influence) and electromagnetic or inductive coupling. Capacitive coupling generates the secondary arc current that will self-extinguish in few cycles [11].
A successful reclosing will occur when the secondary arc is self-extinguished before the DT. According to Reference [7], for a 500 kV system, this time varies between 0.4 to 0.5 s and can be obtained from Equation (1), which is a function of the TL voltage level VL [5].
During this time, reclosing will occur with or without the clear fault, so the extinction of the secondary arc must be ensured in this period. According to Reference [5], the secondary arc times measured in single-phase reclosing tests have shown a considerable dispersion around the average value, a high standard deviation, due to variation of physical factors. In this context, it is usually used as a safe DT of three times the average, about 1.5 to 2 s. The transient stability of the extra high voltage (EHV) TL is threatened by safe-long single-phase reclosing DTs, only because in unfavorable circumstances, the secondary arcs can last between 1.5 and 2 s [5].
Besides that, currently, there are various automatic reclosing systems, especially high-speed and time-delayed. However, most of reclosing algorithms are based on time-delay. In this case, the breaker is being closed after an intentional time-delay, which is longer than what is considered for deionization of the arc [12]. In this case, it is performed regardless of the fault type (permanent or temporary). However, in this case, there is no guarantee if the arc is extinguished at the moment of reclosing for temporary fault. Thus, there could be a risk of reclosing for permanent fault cases, which is considered as unsuccessful reclosing [13][14][15]. It is necessary to identify the fault type. On the contrary, high-speed algorithms reclose the breaker with a delay which is the minimum time for deionization of the arc. Based on the IEEE Standard C.37. 104-2012 guide [16], in order to have successful operations at auto-reclosing attempts identifying permanent or temporary faults, it is suggested to consider high-speed reclosure algorithms.
In addition, it is necessary to indicate that the single-phase reclosing is widely used and disabled in different continents due to the danger of the unsuccessful operation of traditional reclosing algorithms [17]. The main reason for the application of single-phase reclosing is related to increasing the power transfers capacity of the TL and the stability of EPS [18][19][20][21][22][23].
Based on what has been mentioned above, it is imperative to do research about the automatic reclosing in order to identify permanent and temporary faults, thus, in this research, a new automatic reclosure algorithm based on multivariable analysis and support vector machine (SVM) is proposed. The main contribution of the proposed single-phase reclosing relay algorithm is to acceptably identify the fault type, i.e., permanent or temporary, by using a simple mathematical procedure compared to the wavelet transform, which has been traditionally used to develop protection algorithms. This work then is divided into different sections as follows: a bibliographic review about automatic reclosure is given in the second section. The mathematical bases and the proposed algorithm are included in the third section. The simulation of the EPS and arc faults is presented in the fourth section. Main results are included in the fifth section. The autoreclosure flowchart is shown in the sixth section, and finally, the main conclusions of this research are presented in the last section.

Bibliographic Review about Arcing Faults on TLs
A primary arc is formed when an arcing fault occurs on TLs. It remains during the fault condition until the breakers open. After the breaker's tripping, the secondary arc appears being sustained by the mutual coupling between the faulted phase and the healthy phases or phase. In this context, different algorithms useful to identify permanent and temporary faults have been proposed using several mathematical tools. Those works are summarized as follows: In Reference [24], a technique based on the analysis of harmonics spectrum of the faulted phase voltage is proposed. Using the amplitude of the second, third and fifth harmonic of the faulted phase voltage, permanent and temporary faults are identified. To detect the extinction of the secondary arc, the DC component is used due to the asymmetry of the recovery voltage and, the Wavelet Transform (WT) method is used to analyze those signals by means of the db6 as a mother wavelet. However, the determination of proper threshold value for distinguishing both types of faults are not clearly explained or well defined.
It is also possible to find the discrete wavelet transform (DWT) and the wavelet singularity considering several mother wavelets to classify the fault type [25][26][27][28][29][30][31][32][33][34]. Although those methods have acceptable behavior, their function and reliability are influenced by different conditions and parameters that must be chosen appropriately. For example, to determine the fault type, an index based on the normalized energy from decomposition levels 1 to 14 must be tested empirically. As an improvement, the DWT in combination with other techniques have been used, where a threshold adjustment is usually needed which depends directly on a trial and error procedure [35,36].
Regarding autoreclosure schemes, in References [37,38], the conventional single-pole automatic reclosure (SPAR) algorithms are introduced; which are based on the RMS value of the faulted phase and a DT. However, in some cases, that value may not be enough before the secondary arc is extinguished. As a new technique, in Reference [39], the derivate of the voltage amplitude and the angle of the faulted phase is used for classifying fault type. In that method, the least error square (LES) method is used to calculate the derivate. However, its behavior depends on the sampling rate and it can sensitivity to outliers, which could increase the processing and calculation times. It is possible to find in References [40][41][42][43][44][45][46][47][48], different methods based on the frequency analysis, especially harmonic values of the faulted phase. In those methods, by using a discrete Fourier transform (DFT), several indexes are calculated. However, it is clear that the DFT has to be used with sampled signals minimum of one cycle. Besides that, these techniques can erroneously analyze the harmonics when conditions of fault with different oscillations are present. Moreover, in Reference [49] the evaluation of transient energy of the system is used by the proposed technique. Thus, spectral energy (SE) is used to predict the reclosure time.
The use of artificial intelligent had also been probed, fuzzy logic (FL) is used in Reference [50] to create a comparator that has as inputs the ratio between the voltage of the faulted phase and mutual inductance voltage and between capacitance coupling voltage and mutual inductance voltage. For the inputs of various intelligent classifiers, the signal features are extracted using FT, FFT and WT to decompose the voltage waveforms. In this context, in References [51,52], two processes are explained, one to obtain features of faults using FFT, and the other one, to identify faults prior to reclosing employing artificial neural networks (ANN). In addition, it is clear that these previous methods usually provide relevant information of stationary signals. Hence, the behavior of permanent and temporary faults may lead to inaccurate results when using the FT, FFT and WT. Therefore, in order to analyze arcing fault signals signal processing techniques (SPT) more advanced than FT and WT are needed. In Reference [53], the ANN was employed again, using as inputs those coefficients obtained with the Gabor transform (GT) as a technique of signal processing. In this case, this technique is a special case of the WT, which uses a Gaussian shape as a mother function. Therefore, the fault signals must be as similar as possible to the Gaussian shape.
According to the above mentioned, different algorithms to identify permanent and temporary faults on TLs are proposed. All of them using different SPT such as WT and FT. However, in the automatic reclosure environment, the algorithm must be accurate, i.e., it cannot depend on a trial and error technique. It is well known that the WT tool is very complex due to the wavelet mother selection and, has several inconveniences which are studied and presented in Reference [36]. The goal of this paper was to propose a new automatic reclosure algorithm based on multivariable analysis, which was used to optimally determine the permanent and temporary faults relevant information and, to classify the fault types using SVM. Unlike the proposed different autoreclosure schemes, the novelty of this scheme was that only a simple convolution process between permanent or temporary fault and the eigenvectors was used, without having to determine threshold levels or energy spectrums like other techniques i.e., avoiding trial and error results. In addition, the proposed algorithm in this work was tested with a signals database in different locations along the TL, including the accurate representation of the first and second arcs.

Mathematical Bases of the Proposed Algorithm
Principal component analysis (PCA) is an important and very useful tool to extract the most relevant information from a group of signals obtaining newer and simpler ones called principal components (PCs), as a linear combination of the original variables [54]. In this way, it is possible to generate a database smaller than the original. Thus the calculus of variance-covariance matrix R is fundamental to apply this technique, and with these values, the eigenvalues and eigenvectors are calculated.
With the original fault signals denoted in this research by Xtrai, the matrix X is formed, which is normalized, the process consists in scaling the matrix to have a maximum (+1) and minimum (−1) value. In this paper, the permanent and temporary fault signals are represented by a data-row vector X trai of length p = 4000. Later on, each value of the matrix X is divided to its maximum absolute data denoted by Xmax [54].
where X traip , represents the training fault signal number that forms the matrix X and m corresponds to the signal and p is the variable number.
Since this new matrix, its mean value Q was calculated, and with all those data, the R matrix was calculated.
After that, by using the matrix R, their eigenvalues, G, and eigenvectors, υ, were calculated. Both had the same size: Energies 2019, 12, 2126 where p is the number of variables and T the transposed.
Finally, based on these eigenvectors, the principal components PC were calculated as follows: Based on the previous equation, it is clear that the new matrix PC is calculated by multiplying the original matrix X by the eigenvectors' matrix U, obtaining a 4000 × 4000-sized matrix PC. However, because matrix U is considered as a transformation matrix, it has the advantage that the data is reorganized based on its variability. Hence, in order to determine which data of permanent and temporary faults registered by the protection relay had major information quantity, an index constituted by the sum of the eigenvalues was used. The first eigenvalue was related to the first eigenvector, the second eigenvalue was related to the second eigenvector, and so on. In this paper, the index value denoted by Ivar must be longer than the threshold value denoted by thv, which in this research was 90%, approximately. Therefore, after a detailed analysis of eigenvalues, by using the first twenty-four PCs, a variance value longer than thv was obtained. The next equations were used for this purpose: In this context, the first twenty-four PCs were calculated as follows: In this research, it is possible to see that the proposed methodology transforms the original row vector 1 × 4000 to 1 × 24, approximately. Therefore, a factor of reduction of 4000/24≈166 in the original signals was obtained.
Besides that, protection relay algorithms are related to an analysis-classification problem of fault signals. In the first instance of this research, the permanent and temporary faults were analyzed by using some PCA, those new signals were used as input vectors in the classifier based on SVM (see Section 5.2).

Arc
When an arcing fault occurs in the system, a primary arc is formed and is presented during fault duration until the moment when the breakers open. After the tripping of the breaker, the secondary arc appears and it is sustained by the mutual coupling between the faulted phase and the healthy phases. Considering the nature of arcing faults, the arcs (primary and secondary) have to be represented as accurately as possible [16,55]. In fact, in Reference [55] it has been proven that the switching arc theory can be applied to model an unconstrained long fault arc in the air.
The primary arc model could be implemented by using the one developed by Kizilcay. This model, developed by Mustafa Kizilcay and others, is time-dependent, it describes the dynamic behavior of the arc through the air when a fault occurs. It also takes into consideration the dynamic interaction of the arc with the remaining electrical circuit, and it is based on the balance of energy of the arc column [55,56]. This model describes the arc in the air by means of the differential equation of the conductance of arc g, written as: where g is the conductance variable with time, τ is the time constant and G is the stationary conductance of the arc, physically interpreted as the value of constant conductance. If the conditions of the primary arc were sustained for a long period of time from an external source and can be represented by the following function: where: µ 0 is a constant voltage and is defined in per unit length, r 0 is the resistive component of the arc in per unit length, i is the arc current, and l is the length of the arc which may vary with time.
As regards the secondary arc, its behavior has been studied since the last decade, without having an exact model to date that gathers all the factors that contribute to it because generating a secondary arc in the laboratory to measure its parameters is a very difficult task since it is a very fast phenomenon that depends mainly on the primary arc. Considering the secondary arc, all the references found simulate it by using the technique proposed by Goldberg [57]. However, this model is based on the one developed by Jones [58,59], which is based on an empirical approach of two features, described as follows: Features of the arc's conductivity: This characteristic is used when the arc conducts current, which occurs when the magnitude of the voltage across the arc exceeds the resisting voltage before starting to drive. The volt-ampere characteristics of the model for a range of currents between 1 and 55 A. The constant value of the voltage parameter V p is given by: where I p is the peak value of the secondary arc current. It was assumed that the arc resistance was zero (0), taking into account that the equivalent resistance of the ascending and descending parts is a function of the length of the arc and the variation of time.
Feature of the re-ignition voltage of the arc: This characteristic was applied to obtain the voltage that resists the arc before starting to conduct electricity, that is, the voltage of the dielectric recovery (in this case air) when it was not conducting. The expression that defines this voltage can be approximated by the following equation: In practice, the time of extinction of the secondary arc depends on the variation of the arc length. For relatively low wind speed values between 0-1 m/s [60], this variation as a function of time can be approximated by: where l s (t r ) is the length of the arc and so is the initial length of the arc, both in cm. According to Equations (18) and (19), the reignition voltage of the arc is obtained by the following equation: If the voltage imposed by the supply system was greater than the restarting voltage when the secondary arc crosses zero, the arc was kept in a conduction state, otherwise, it remained in a partial or final condition.

Signals and Simulated EPS
In this research, ATP [61] was used for modeling a typical 500 kV EPS, which consisted of two areas connected by a 350 km TL and breakers that are activated when the fault occurs; see Figure 3. In this case, voltage sources are considered. For the purpose of modeling the TL, a model proposed by J. Marti was employed [62]. In order to perform an acceptable behavior analysis, models of primary and secondary arcs were used for simulation together with various permanent and temporary faults, which are simulated along different sections of the TL. In this context, for the cases of study, the TL was divided into thirty-four parts as 10 km in each section. Therefore, the permanent or temporary fault occurrence at 10 km, 20 km, 30 km, 40 km, and so on up to 340 km distances from the location of the protection relay can be simulated. The single-phase-to-ground faults were applied to the TL using a single-phase breaker. Therefore, it was possible to perform appropriate operations for permanent or temporary faults. The sampling rate for recording the ATP signals was 10 kHz. These signals were loaded to Matlab to make different analysis and algorithms. An original data matrix comprised by 68 fault signals evaluated by relay R1 at bus M was used in the training process, which is represented in this paper by a matrix of size 68 × 4000, where each permanent or temporary fault was composed by 4000 points or variables.
the location of the protection relay can be simulated. The single-phase-to-ground faults were applied to the TL using a single-phase breaker. Therefore, it was possible to perform appropriate operations for permanent or temporary faults. The sampling rate for recording the ATP signals was 10 kHz. These signals were loaded to Matlab to make different analysis and algorithms. An original data matrix comprised by 68 fault signals evaluated by relay R1 at bus M was used in the training process, which is represented in this paper by a matrix of size 68 × 4000, where each permanent or temporary fault was composed by 4000 points or variables. As regards the waveforms of temporary fault, the voltage waveform is shown as time function for the case of a temporary fault measured on the faulted phase in Figure 4, when after a time the fault is disconnected and the waveform is restored. For this particular case, the fault inception time was t = 0.1 s (A) which was the moment at which the primary arc appears. Then, when the fault appeared, and the detection and clearing relay operated to open the faulted phase, the secondary arc was formed at t = 0.2 s (B); this fault, the secondary arc, continued during a period of time, and then, it was finally extinguished and re-striked at 0.55s (C) due to the characteristic high-frequency components in the waveform. As regards the waveforms of temporary fault, the voltage waveform is shown as time function for the case of a temporary fault measured on the faulted phase in Figure 4, when after a time the fault is disconnected and the waveform is restored. For this particular case, the fault inception time was t = 0.1 s (A) which was the moment at which the primary arc appears. Then, when the fault appeared, and the detection and clearing relay operated to open the faulted phase, the secondary arc was formed at t = 0.2 s (B); this fault, the secondary arc, continued during a period of time, and then, it was finally extinguished and re-striked at 0.55 s (C) due to the characteristic high-frequency components in the waveform. fault is disconnected and the waveform is restored. For this particular case, the fault inception time was t = 0.1 s (A) which was the moment at which the primary arc appears. Then, when the fault appeared, and the detection and clearing relay operated to open the faulted phase, the secondary arc was formed at t = 0.2 s (B); this fault, the secondary arc, continued during a period of time, and then, it was finally extinguished and re-striked at 0.55s (C) due to the characteristic high-frequency components in the waveform. During this last stage, from point B to point C, it was noticeable in the line a small and normal system frequency voltage sinusoidal component, which was caused by the electrostatic coupling between the two healthy phases and the faulted phase. During this last stage, from point B to point C, it was noticeable in the line a small and normal system frequency voltage sinusoidal component, which was caused by the electrostatic coupling between the two healthy phases and the faulted phase.
On the contrary, the fault voltage of the permanent fault was similar to the previous one, but in this case, the secondary arc disappeared, as can be appreciated in Figure 5. It represented a voltage waveform due to a permanent single-phase fault.
In the first part, the transients obtained in the primary arc were present, but the characteristics of the secondary arc were absent. Thus, after the breakers of the circuit have acted due the presence of the fault, a small system frequency voltage induced onto the tripped line is presented, whose magnitude depended directly not only on the accuracy and on how well the two healthy phases of the TL were coupled to the faulted phase but also on the fault impedance. For that reason, in this process of reconnection, the size of the voltage mentioned was determined.
On the contrary, the fault voltage of the permanent fault was similar to the previous one, but in this case, the secondary arc disappeared, as can be appreciated in Figure 5. It represented a voltage waveform due to a permanent single-phase fault.
In the first part, the transients obtained in the primary arc were present, but the characteristics of the secondary arc were absent. Thus, after the breakers of the circuit have acted due the presence of the fault, a small system frequency voltage induced onto the tripped line is presented, whose magnitude depended directly not only on the accuracy and on how well the two healthy phases of the TL were coupled to the faulted phase but also on the fault impedance. For that reason, in this process of reconnection, the size of the voltage mentioned was determined.

Results
By using the mathematical formulation presented in Section 2 to the permanent and temporary faults of training, coefficients of PC were computed by correlating these fault signals with the calculated eigenvectors. The study shows that the analysis of PCs generates 24 PCs, shown in Table 1, which were analyzed as follows:

Analyzing PCA
In this section, by using the data of the new matrix, the PCs were projected in different spaces in order to make an analysis of distinctive identification of permanent and temporary faults. On this context, Figures 6-10 show the representation of faults through the PCA. Figure 6 shows the distribution of permanent and temporary faults on the first two PCs. Thus, in Figure 6 the red colored points correspond to temporary faults. On the contrary, the blue colored points represent permanent faults. Besides that, based on Table 1, it is clear that for permanent faults, the first PC was not significant. Its value was very similar to zero. Unlike permanent faults, for temporary faults, the first PC was very significant. Based on what was previously said, it was clear that in this 2D space projection, some of the faults were clearly identifiable. However, some temporary faults were superimposed on the space of the permanent faults.

Analyzing PCA
In this section, by using the data of the new matrix, the PCs were projected in different spaces in order to make an analysis of distinctive identification of permanent and temporary faults. On this context, Figures 6-10 show the representation of faults through the PCA. Figure 6 shows the distribution of permanent and temporary faults on the first two PCs. Thus, in Figure 6 the red colored points correspond to temporary faults. On the contrary, the blue colored points represent permanent faults. Besides that, based on Table 1, it is clear that for permanent faults, the first PC was not significant. Its value was very similar to zero. Unlike permanent faults, for temporary faults, the first PC was very significant. Based on what was previously said, it was clear that in this 2D space projection, some of the faults were clearly identifiable. However, some temporary faults were superimposed on the space of the permanent faults. A similar analysis was developed considering the first and third PC. Figure 7 shows the representation of permanent and temporary faults on these axes. Similar to the previous case, that projection shows important information of the disposition of these faults. A similar analysis was developed considering the first and third PC. Figure 7 shows the representation of permanent and temporary faults on these axes. Similar to the previous case, that projection shows important information of the disposition of these faults.

Analyzing PCA
In this section, by using the data of the new matrix, the PCs were projected in different spaces in order to make an analysis of distinctive identification of permanent and temporary faults. On this context, Figures 6-10 show the representation of faults through the PCA. Figure 6 shows the distribution of permanent and temporary faults on the first two PCs. Thus, in Figure 6 the red colored points correspond to temporary faults. On the contrary, the blue colored points represent permanent faults. Besides that, based on Table 1, it is clear that for permanent faults, the first PC was not significant. Its value was very similar to zero. Unlike permanent faults, for temporary faults, the first PC was very significant. Based on what was previously said, it was clear that in this 2D space projection, some of the faults were clearly identifiable. However, some temporary faults were superimposed on the space of the permanent faults. A similar analysis was developed considering the first and third PC. Figure 7 shows the representation of permanent and temporary faults on these axes. Similar to the previous case, that projection shows important information of the disposition of these faults. From Figures 8-10, the study cases where the permanent and temporary faults projected on the second/fifth, sixth/ninth, and twenty-third/twenty-fourth PCs are presented. It is important to see that as the fault signals were projected onto the subsequent axes of the PCs, the dispersion or identification of the fault types was not very significant or clear. Thus, the first PC had a major variability of original fault signals. From Figures 8-10, the study cases where the permanent and temporary faults projected on the second/fifth, sixth/ninth, and twenty-third/twenty-fourth PCs are presented. It is important to see that as the fault signals were projected onto the subsequent axes of the PCs, the dispersion or identification of the fault types was not very significant or clear. Thus, the first PC had a major variability of original fault signals.  From Figures 8-10, the study cases where the permanent and temporary faults projected on the second/fifth, sixth/ninth, and twenty-third/twenty-fourth PCs are presented. It is important to see that as the fault signals were projected onto the subsequent axes of the PCs, the dispersion or identification of the fault types was not very significant or clear. Thus, the first PC had a major variability of original fault signals.  Besides that, as stated previously, the most important issue in order to propose novel protection relay algorithms was related to the mathematical tool used to analyze the original permanent and temporary fault signals. Thus, the multiresolution analysis (MRA) has widely been used for reclosing relay algorithms, which is processed through a mother function called wavelet mother and different levels of details of the discrete voltage samples recovered by the relay. However, in order to choose the best performance of MRA, a hard work of testing and error must be developed. For instance, Figure 11 shows the first two variables of the original fault signals. By analyzing this figure, it is clear that a distribution of the fault type is not possible. On the other hand, Figure 12 shows the coefficients of MRA projected to a 2D subspace. Similar to the previous case, it is possible to see that permanent and temporary faults are not well distinguished. Unlike the PCA, which can distinguish the fault type depending on the number of eigenvectors chosen (see Figure 6). Besides that, as stated previously, the most important issue in order to propose novel protection relay algorithms was related to the mathematical tool used to analyze the original permanent and temporary fault signals. Thus, the multiresolution analysis (MRA) has widely been used for reclosing relay algorithms, which is processed through a mother function called wavelet mother and different levels of details of the discrete voltage samples recovered by the relay. However, in order to choose the best performance of MRA, a hard work of testing and error must be developed. For instance, Figure 11 shows the first two variables of the original fault signals. By analyzing this figure, it is clear that a distribution of the fault type is not possible. On the other hand, Figure 12 shows the coefficients of MRA projected to a 2D subspace. Similar to the previous case, it is possible to see that permanent and temporary faults are not well distinguished. Unlike the PCA, which can distinguish the fault type depending on the number of eigenvectors chosen (see Figure 6). Besides that, as stated previously, the most important issue in order to propose novel protection relay algorithms was related to the mathematical tool used to analyze the original permanent and temporary fault signals. Thus, the multiresolution analysis (MRA) has widely been used for reclosing relay algorithms, which is processed through a mother function called wavelet mother and different levels of details of the discrete voltage samples recovered by the relay. However, in order to choose the best performance of MRA, a hard work of testing and error must be developed. For instance, Figure 11 shows the first two variables of the original fault signals. By analyzing this figure, it is clear that a distribution of the fault type is not possible. On the other hand, Figure 12 shows the coefficients of MRA projected to a 2D subspace. Similar to the previous case, it is possible to see that permanent and temporary faults are not well distinguished. Unlike the PCA, which can distinguish the fault type depending on the number of eigenvectors chosen (see Figure 6).  In addition, it is necessary to indicate that in the proposal of new protection algorithms, two scenarios must be clearly defined, off-line and on-line, respectively. The first one refers to the stage where the algorithm is trained, and the second stage refers to testing the algorithm with new fault signals different than those used in the training stage. In this sense, the proposed algorithm in this paper based on PCA had special features in order to test a new permanent or temporary fault signal in real time or on-line denoted in this paper by Xtest. Thus, this new test signal only processed through the first twenty-four eigenvectors selected in the previous analysis, omitting to do all the mathematical processes presented in Section 3, which were used in the training process. Therefore, 72 new test signals measured by the relay R1 at bus M were used; 36 for permanent faults and 36 for temporary faults, where each permanent or temporary testing fault was composed by 4000 points or variables, that is 1 × 4000 row vector, which was processed by using those eigenvectors as follows:

12
The new test signal was divided into the maximum value of the original matrix X denoted in this research by Xmax, giving a new vector Xnorm as follows: The vector Xnorm was subtracted from the mean vector of the original matrix X Finally, Rtest was processed by using the first twenty-four eigenvectors that were determined in the training process (see Section 3). Thus, the PCs of the test signal were calculated as follows: <!--MathType@Translator@5@5@MathML2 (no namespace).tdl@MathML 2.0 (no namespace)@ --> <math> <semantics> <mrow> <mi>P</mi><msub> <mi>C</mi> <mrow> <mi>t</mi><mi>e</mi><mi>s</mi><mi>t</mi></mrow> </msub> <mo>=</mo><mrow><mo>[</mo> <mrow> In addition, it is necessary to indicate that in the proposal of new protection algorithms, two scenarios must be clearly defined, off-line and on-line, respectively. The first one refers to the stage where the algorithm is trained, and the second stage refers to testing the algorithm with new fault signals different than those used in the training stage. In this sense, the proposed algorithm in this paper based on PCA had special features in order to test a new permanent or temporary fault signal in real time or on-line denoted in this paper by X test . Thus, this new test signal only processed through the first twenty-four eigenvectors selected in the previous analysis, omitting to do all the mathematical processes presented in Section 3, which were used in the training process. Therefore, 72 new test signals measured by the relay R1 at bus M were used; 36 for permanent faults and 36 for temporary faults, where each permanent or temporary testing fault was composed by 4000 points or variables, that is 1 × 4000 row vector, which was processed by using those eigenvectors as follows: The new test signal was divided into the maximum value of the original matrix X denoted in this research by X max , giving a new vector X norm as follows: The vector X norm was subtracted from the mean vector Q of the original matrix X Finally, R test was processed by using the first twenty-four eigenvectors that were determined in the training process (see Section 3). Thus, the PCs of the test signal were calculated as follows:

Classification Based on a Support Vector Machine
The information obtained in the PCA analysis was applied for studying any registered fault signal. The resultant matrix of this process was applied to SVM for classifying fault types based on the data set transformation from a specific dimension to a higher dimension by using a function called the kernel. It allowed working on a data set such as a linear problem.
An easier case occurred when a data set was separated using a hyper-plane represented as follows: where w is a normal vector of plane and b is a term called bias. However, each data set class was separated for a different hyper-plane, in such a way these allowed us to find an optimal hyperplane based on the margin between these two. In this context, the outputs of the PCs were used as inputs to the classifier based on SVM. Here, it was important to assign the necessary kernel parameters used in this method. The kernel type employed was the RBF which needed a gamma parameter. In this case, due to the number of variables and features of the signals, a gamma of 1.009 was established. This process gave an acceptable result, with a percentage of correct classification of 94.286%. All the processes including obtaining PCA up to the classification were developed with other databases. In this other case, 72 test signals were used; 36 for permanent faults and 36 for temporary faults, respectively. Finally, by varying the gamma parameter, 97% of correct classification was obtained. Table 2 presents the results of some test signals.   Figure 13 shows the flow chart to classify the permanent or temporary fault on the TL. In this context, the discrete voltage values were registered by the protection relay at one end of the TL. Then, the PCs were calculated. Accordingly, if the I vark value did not exceed the t hv value, the eigenvalues' magnitude was continuously updated and analyzed by the protection relay algorithm. However, if the I vark value was higher than the t hv set, the number of eigenvalues and eigenvector were determined and chosen. The PC numbers chosen were assigned as inputs to the classifier based on SVM, where the permanent and temporary faults can be identified.  Figure 13 shows the flow chart to classify the permanent or temporary fault on the TL. In this context, the discrete voltage values were registered by the protection relay at one end of the TL. Then, the PCs were calculated. Accordingly, if the Ivark value did not exceed the thv value, the eigenvalues' magnitude was continuously updated and analyzed by the protection relay algorithm. However, if the Ivark value was higher than the thv set, the number of eigenvalues and eigenvector were determined and chosen. The PC numbers chosen were assigned as inputs to the classifier based on SVM, where the permanent and temporary faults can be identified.

Conclusions
In this proposed research, the main goal was to develop a novel algorithm for single-phase reclosing on TLs, which must be able to classify the fault type (permanent or temporary). The discrete voltage samples from one end of the TL and the PC patterns, as input in the classifier based on SVM, were used for identification of permanent and temporary faults.
Besides that, in this work, a comprehensive bibliographic review of various techniques, methodologies and algorithms useful to single-pole autoreclosure on TLs is presented. Unlike previous algorithms, the proposed algorithm did not need to extract harmonic components using any mathematical tools.
In the results section, it was shown that the proposed algorithm had a successful behavior for all 140 cases the study simulated. Thus, it was able to differentiate or recognize the fault type depending on the eigenvectors and eigenvalues, which allowed TL protection relay to initiate single-phase tripping.
Besides that, as a comparison of the performance of the proposed algorithm based on PCA in order to analyze permanent or temporary fault signal, these signals were also processed by using the MRA. However, the PCA had a better performance than MRA to differentiate those fault spectrums.
In order to analyze on-line new test fault signals, the proposed algorithm used only the eigenvectors chosen, which reduced the processing time. Therefore, it can be used as an option to the traditional reclosing relay algorithms.