Fault Diagnosis Model of Photovoltaic Array Based on Least Squares Support Vector Machine in Bayesian Framework

Abstract: With the rapid development of the photovoltaic industry, fault monitoring is becoming an important issue in maintaining the safe and stable operation of a solar power station. In order to diagnose the fault types of a photovoltaic array, a fault diagnosis method based on the Least Squares Support Vector Machine (LSSVM) in the Bayesian framework is put forward. First, based on an analysis of the change rules of the output electrical parameters and the equivalent circuit internal parameters of the photovoltaic array in different fault states, the input variables of the photovoltaic array fault diagnosis model are determined. Second, through the LSSVM algorithm in the Bayesian framework, the fault diagnosis model based on these parameters is built, which can effectively detect the photovoltaic array faults of short circuit, open circuit, and abnormal aging. Then, a simulation model is built to verify the validity of the LSSVM algorithm in the Bayesian framework by comparing it with the LSSVM and Support Vector Machine (SVM) models. Moreover, a 5 × 3 photovoltaic array and a reference photovoltaic string are established and experimentally tested to validate the performance of the proposed method.


Introduction
With the aggravation of the global energy crisis and regional environmental pollution, Chinese photovoltaic power generation still faces key problems of sustainable development [1], among which keeping solar power stations safe and stable in operation is an important issue. At present, research on solar power station fault monitoring is mainly focused on the photovoltaic strings, modules, and inverters, but rarely on the photovoltaic arrays. The diagnosis of the photovoltaic arrays is important because the performance of photovoltaic modules directly affects the output characteristics of photovoltaic arrays and, in turn, the stability of the photovoltaic generation system. Fault monitoring of the solar power station can therefore diagnose the photovoltaic arrays first, to narrow the faulty photovoltaic modules down to a certain area, and then position the faulty photovoltaic modules precisely. This diagnostic method can greatly reduce the number of sensors, thus reducing costs while ensuring that the solar power station operates safely and stably.
In recent years, many fault detection and diagnosis methods for photovoltaic systems have been proposed. Artificial neural network algorithms were presented to diagnose faults [2][3][4][5]. In [6], the fault type is identified by analyzing and comparing the error deviations of both simulated and measured current and voltage with respect to a set of evaluated error thresholds. In [7], a simple method to detect and diagnose short-circuit and open-circuit faults in photovoltaic systems based on the evaluation of three coefficients is presented. By analyzing the light and dark current-voltage (I-V) characteristics of the photovoltaic module, a fault identification method was used to distinguish the faulty photovoltaic modules [8]. The above-mentioned methods are for the photovoltaic modules, of which each photovoltaic module needs to be monitored, leading to high costs. In [9][10][11][12][13], fault diagnosis methods for photovoltaic arrays were introduced. In [9], fault detection is based on the comparison between the measured and model-predicted power production. Hu et al. analyze the terminal characteristics of faulty photovoltaic arrays and reduce the number of sensors by optimizing their locations [10]. In [11], the authors diagnose the fault by detecting the change of PV internal resistance using the signals that are available in Extremum-Seeking Control (ESC)-based Maximum Power Point Tracking (MPPT). The methods in these works can only identify whether there is a fault or not, but the fault type is unknown.
Appl. Sci. 2017, 7, 1199
In other research [12], a method is proposed to detect faults and partial shading under all of the irradiation conditions using the measured values of array voltage, array current, and irradiance, but the diagnosis of fault type is not comprehensive enough.
In order to diagnose whether there are faulty photovoltaic modules in a photovoltaic array or not and further judge its fault type, this paper presents a fault diagnosis method that is based on Least Squares Support Vector Machine (LSSVM) in the Bayesian framework. The algorithm of LSSVM in the Bayesian framework has been used in the field of fault diagnosis, in domains such as aeronautics, power transformation, biology, and so on [13][14][15][16]; in this work, we introduce the algorithm into the field of photovoltaic fault diagnosis.

The Selection of the Fault Feature
This paper mainly focuses on the fault types of photovoltaic array: short circuit, open circuit, and photovoltaic modules' abnormal aging. By analyzing the different fault types of photovoltaic array, we can obtain the change laws of the output characteristics and equivalent circuit internal parameters of photovoltaic array in different fault conditions, providing a theoretical basis and fault feature information for the diagnosis and location of photovoltaic array.

Analysis of Internal Parameters of Photovoltaic Array Equivalent Circuit in Fault State
If the photovoltaic array malfunctions, the internal parameters of the photovoltaic array equivalent circuit will change, and these differences contain the most abundant fault feature information, directly reflecting the state that the photovoltaic array is in. Figure 1 presents the equivalent circuit model of the photovoltaic module [17], where $I_{ph}$ is the photovoltaic current generated by the photovoltaic module, $I_D$ is the current through the diode, $I_{sh}$ is the current flowing through the shunt resistance $R_{sh}$, $R_s$ is the series resistance, and $I$ is the output current of the photovoltaic module. According to the equivalent circuit model, the photovoltaic module output current is:
$$I = I_{ph} - I_D - I_{sh} = I_{ph} - I_0\left[\exp\left(\frac{q(U + I R_s)}{nkT}\right) - 1\right] - \frac{U + I R_s}{R_{sh}}$$
In the formula, $I_0$ is the reverse saturation current of the diode, $U$ is the output voltage of the photovoltaic module, $q$ is the electron charge ($1.60218 \times 10^{-19}$ C), $n$ is the ideality factor of the diode, $k$ is Boltzmann's constant ($1.38066 \times 10^{-23}$ J/K), and $T$ is the absolute temperature of the photovoltaic module.
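Because the output current appears on both sides of the single-diode relation described above, it has to be solved numerically. The following is a minimal sketch, not the authors' procedure: it uses bisection and assumes U ≥ 0 and R_s > 0. The function name `module_current` and the parameter values in the usage note are hypothetical, and `n` is treated as an effective ideality factor for the whole module.

```python
import math

Q = 1.60218e-19    # electron charge (C)
K_B = 1.38066e-23  # Boltzmann constant (J/K)


def module_current(u, i_ph, i_0, r_s, r_sh, n, t, iterations=200):
    """Solve the implicit single-diode equation for the current I at voltage U.

    Assumes u >= 0 and r_s > 0 (illustrative restriction, not from the paper).
    """
    v_t = n * K_B * t / Q  # n*k*T/q in volts

    def residual(i):
        # Right-hand side minus left-hand side; strictly decreasing in i.
        drop = u + i * r_s
        return i_ph - i_0 * (math.exp(drop / v_t) - 1.0) - drop / r_sh - i

    # residual(lo) > 0 because u + lo*r_s = 0; residual(hi) < 0 because the
    # diode and shunt terms are positive at i = i_ph.
    lo, hi = -u / r_s, i_ph
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if residual(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)
```

For example, `module_current(0.0, 8.0, 1e-9, 0.3, 300.0, 90.0, 298.15)` evaluates the current at short circuit for made-up module parameters; sweeping `u` upward traces an I-V curve point by point.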
In an actual solar power station, there are five major connection types of photovoltaic array: series structure, parallel structure, serial-parallel (SP) structure, total-cross-tied (TCT) structure, and bridge-linked (BL) structure, among which the SP structure is the most widely used; therefore, this paper studies the photovoltaic array of the SP structure. Figure 2 is an equivalent circuit model of an M × N photovoltaic array, where M is the number of photovoltaic modules in a photovoltaic string and N is the number of photovoltaic strings in the photovoltaic array. Furthermore, Figure 3 is the equivalent circuit model of Figure 2. $I'$ and $U'$ are the output current and voltage of the photovoltaic array, respectively; $I_{ph}'$ is the photovoltaic current generated by the photovoltaic array; and $R_s'$ and $R_{sh}'$ are the series resistance and shunt resistance of the photovoltaic array, respectively.
In the literature [18], the output current of the photovoltaic array is:
$$I' = N I_{ph} - N I_0\left[\exp\left(\frac{q\left(U' + I' \frac{M R_s}{N}\right)}{M n k T}\right) - 1\right] - \frac{U' + I' \frac{M R_s}{N}}{\frac{M R_{sh}}{N}}$$
The above analysis shows that $I_{ph}' = N I_{ph}$, $R_s' = M R_s / N$, and $R_{sh}' = M R_{sh} / N$. For the model, the internal equivalent parameters $I_{ph}'$, $R_s'$, and $R_{sh}'$ are proportional to the $I_{ph}$, $R_s$, and $R_{sh}$ of the photovoltaic module.
From Figures 1-3 and the above analysis, under the same test conditions, the $I_{ph}$, $R_s$, and $R_{sh}$ of the faulty photovoltaic module are zero when there is a short-circuit fault in the photovoltaic array; thus, the $I_{ph}$ of its string remains constant while its $R_s$ and $R_{sh}$ decrease, so that $I_{ph}'$ is basically unchanged, while $R_s'$ and $R_{sh}'$ decrease.
When there is an open-circuit photovoltaic module in the photovoltaic array, the $I_{ph}$, $R_s$, and $R_{sh}$ of its string are zero, and the internal parameters of the photovoltaic array equivalent circuit are:
$$I_{ph}' = (N - z) I_{ph}, \quad R_s' = \frac{M R_s}{N - z}, \quad R_{sh}' = \frac{M R_{sh}}{N - z}$$
where $z$ is the number of photovoltaic strings that have open circuits. In this case, $I_{ph}'$ decreases, while $R_s'$ and $R_{sh}'$ increase.
From the above analysis and the literature [19], when a photovoltaic module in the photovoltaic array ages abnormally, its $I_{ph}$ and $R_{sh}$ decrease and its $R_s$ increases; consequently, the $I_{ph}'$ and $R_{sh}'$ of the photovoltaic array equivalent circuit decrease, and $R_s'$ increases.
Therefore, the $I_{ph}'$, $R_s'$, and $R_{sh}'$ of the photovoltaic array equivalent circuit can be used as the parameters of the photovoltaic array fault diagnosis model, which can effectively identify the faults of short circuit, open circuit, and abnormal aging.
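The scaling from module parameters to array parameters in the normal and open-circuit states can be illustrated with a short sketch. It assumes the relations I_ph' = N·I_ph, R_s' = M·R_s/N, and R_sh' = M·R_sh/N, with N replaced by N − z when z strings are open. The function name `array_parameters` and the numbers in the usage note are hypothetical.

```python
def array_parameters(i_ph, r_s, r_sh, m, n, open_strings=0):
    """Equivalent-circuit parameters of an M x N serial-parallel array.

    m: modules per string, n: strings in parallel,
    open_strings: number z of strings that are open-circuited (assumed < n).
    Returns (I_ph', R_s', R_sh').
    """
    active = n - open_strings  # strings still delivering current
    if active <= 0:
        raise ValueError("at least one string must remain connected")
    return (active * i_ph, m * r_s / active, m * r_sh / active)
```

With made-up module values, `array_parameters(8.0, 0.3, 300.0, 3, 2)` describes a healthy 3 × 2 array, and `array_parameters(8.0, 0.3, 300.0, 3, 2, open_strings=1)` shows the photovoltaic current halving while both resistances double, which matches the direction of change stated for the open-circuit fault.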

Analysis of the Output Characteristics of Photovoltaic Array in Fault State
When the photovoltaic array fails, different fault types have different influences on the output of the photovoltaic array. Figure 4 shows the I-V curves of the 3 × 2 photovoltaic array in normal and different fault conditions, in which the photovoltaic modules are under the same test conditions (1000 W/m², 25 °C); the model of the photovoltaic module is CHN310-72P.
Figure 4 shows that when a short-circuit fault occurs in the photovoltaic array, the short-circuit current $I_{SC}$ and the maximum power current $I_m$ are basically unchanged, while the open-circuit voltage $U_{OC}$ and the maximum power voltage $U_m$ decrease significantly. In this case, the current of the faulty photovoltaic module is zero; the current and voltage of its string then decrease, resulting in a decrease in the output voltage of the photovoltaic array. From the I-V characteristic curve of a photovoltaic cell, when the output voltage of a normal string reduces, its output current increases, resulting in little change in the total output current of the photovoltaic array.
When an open-circuit fault occurs in the photovoltaic array, the open-circuit voltage $U_{OC}$ and the maximum power voltage $U_m$ are basically unchanged, while the short-circuit current $I_{SC}$ and the maximum power current $I_m$ decrease significantly. In this case, the current of the faulty string is zero, which causes the output current of the photovoltaic array to decrease dramatically, while the output voltages of the normal strings and of the photovoltaic array are basically unchanged.
When the photovoltaic modules in the photovoltaic array are abnormally aged, the open-circuit voltage $U_{OC}$ and the short-circuit current $I_{SC}$ are basically unchanged, while the maximum power voltage $U_m$ and the maximum power current $I_m$ decrease significantly. The literature [20] points out that the open-circuit voltage $U_{OC}$ and the short-circuit current $I_{SC}$ contain the information of temperature and light intensity. The above analysis and the literature [20] show that the open-circuit voltage $U_{OC}$, the short-circuit current $I_{SC}$, the maximum power voltage $U_m$, and the maximum power current $I_m$ can be regarded as the external characteristic parameters of the photovoltaic array fault diagnosis model.
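The four external parameters can be read off a sampled I-V sweep. The sketch below is one simple way to do so and is not taken from the paper: it assumes the sweep starts at U = 0, crosses I = 0, and is dense enough that the sample with the largest power is a fair estimate of the maximum power point. The function name `iv_features` is hypothetical.

```python
def iv_features(voltages, currents):
    """Extract U_oc, I_sc, U_m, I_m from a sampled I-V curve."""
    points = sorted(zip(voltages, currents))
    i_sc = points[0][1]  # current at the lowest voltage, assumed to be U = 0

    # Open-circuit voltage: linear interpolation of the zero crossing of I.
    u_oc = None
    for (u0, i0), (u1, i1) in zip(points, points[1:]):
        if i0 > 0.0 >= i1:
            u_oc = u0 + (u1 - u0) * i0 / (i0 - i1)
            break
    if u_oc is None:
        raise ValueError("the sweep never reaches I = 0")

    # Maximum power point: sample with the largest product U * I.
    u_m, i_m = max(points, key=lambda p: p[0] * p[1])
    return {"U_oc": u_oc, "I_sc": i_sc, "U_m": u_m, "I_m": i_m}
```

Comparing the dictionary returned for a healthy sweep with the one for a suspect sweep shows directly which of the four parameters moved, which is the pattern the three fault signatures above rely on.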

The Establishment of the Photovoltaic Array Fault Diagnosis Model
The key to establishing the photovoltaic array fault diagnosis model is the optimal LSSVM multi-classifiers model: the output electrical parameters and equivalent circuit internal parameters of the photovoltaic array are input into the optimal multi-classifiers model, which yields the posteriori probabilities of the photovoltaic array states and thereby detects the fault types of short circuit, open circuit, and abnormal aging.
To build the optimal LSSVM multi-classifiers model, we established an initial model first, and then used Bayesian theory to optimize the parameters of the initial model. Empirical results obtained from 10 public domain data sets show that the LSSVM classifier designed within the Bayesian evidence framework consistently yields good generalization performance and prediction precision [21].

The Method of LSSVM Classifier
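In [22], J.A.K. Suykens and J. Vandewalle present the LSSVM, which uses the sum of squared errors as the empirical loss on the training sample set and changes the constraint conditions from inequalities to equalities; the convex quadratic programming problem is thereby transformed into the problem of solving a set of linear equations, which improves the calculation speed of the model.
The objective function and constraint condition of LSSVM are:
$$\min_{\omega, b, \varepsilon} J(\omega, \varepsilon) = \frac{1}{2}\omega^{T}\omega + \frac{C}{2}\sum_{i=1}^{g}\varepsilon_i^{2}, \quad \text{s.t.}\ y_i\left[\omega^{T}\varphi(x_i) + b\right] = 1 - \varepsilon_i,\ i = 1, \ldots, g \tag{6}$$
In Formula (6), $\omega$ is the weight vector, $C$ is the penalty factor, $g$ is the number of training samples, $\varepsilon$ is the slack variable, $\varepsilon_i$ is the ith component of the slack variable $\varepsilon$, $\varphi(\cdot)$ is the nonlinear mapping into the feature space, $x_i \in R^m$ is the training sample, and $m$ is the dimension of the training sample's feature vectors; in this paper, $x_i$ represents the output characteristics $U_{OC}$, $I_{SC}$, $U_m$, $I_m$ and the equivalent circuit internal parameters $I_{ph}'$, $R_s'$, $R_{sh}'$ of the photovoltaic array; $y_i \in y = \{1, -1\}$ is the output; $y_i$ represents the states of short circuit, open circuit, abnormal aging, and normal; $b$ is the classification threshold.
The above optimization problem is a convex quadratic program in $\omega$ and $b$, so we introduce the Lagrangian function, which can be constructed as follows:
$$L(\omega, b, \varepsilon, \alpha) = \frac{1}{2}\omega^{T}\omega + \frac{C}{2}\sum_{i=1}^{g}\varepsilon_i^{2} - \sum_{i=1}^{g}\alpha_i\left\{y_i\left[\omega^{T}\varphi(x_i) + b\right] - 1 + \varepsilon_i\right\} \tag{7}$$
In Formula (7), $\alpha$ is the Lagrangian multiplier, and $K(x_i, x_j) = \varphi(x_i)^{T}\varphi(x_j)$ is the kernel function. The Lagrangian function converts the primal problem (6) into the dual problem (8):
$$\frac{\partial L}{\partial \omega} = 0 \Rightarrow \omega = \sum_{i=1}^{g}\alpha_i y_i \varphi(x_i), \quad \frac{\partial L}{\partial b} = 0 \Rightarrow \sum_{i=1}^{g}\alpha_i y_i = 0, \quad \frac{\partial L}{\partial \varepsilon_i} = 0 \Rightarrow \alpha_i = C\varepsilon_i, \quad \frac{\partial L}{\partial \alpha_i} = 0 \Rightarrow y_i\left[\omega^{T}\varphi(x_i) + b\right] - 1 + \varepsilon_i = 0 \tag{8}$$
Through Equation (8), eliminating $\omega$ and $\varepsilon$, we can get:
$$\begin{bmatrix} 0 & Y^{T} \\ Y & \Omega + C^{-1}I \end{bmatrix}\begin{bmatrix} b \\ \alpha \end{bmatrix} = \begin{bmatrix} 0 \\ \mathbf{1} \end{bmatrix}, \quad Y = (y_1, \ldots, y_g)^{T}, \quad \Omega_{ij} = y_i y_j K(x_i, x_j)$$
The optimal hyperplane is constructed, and the decision function is then obtained:
$$y(x) = \operatorname{sign}\left[\sum_{i=1}^{g}\alpha_i y_i K(x, x_i) + b\right]$$
Through the above problem-solving procedure, the LSSVM multi-classifiers model is built. In order to obtain the optimal LSSVM multi-classifiers, we can optimize the parameters $C$ and $K(x_i, x_j)$.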

The Initial LSSVM Multi-Classifiers Model
The basic idea of the kernel function $K(x_i, x_j)$ is to map the input vectors from the m-dimensional input space into a high-dimensional feature space by a nonlinear function, while evaluating inner products in that space directly from the input vectors, which avoids computing in the high-dimensional space explicitly.
In practice, the three most common kernel functions are the polynomial kernel function, the Radial Basis Function (RBF), and the Sigmoid kernel function. Keerthi et al. showed that the polynomial kernel function is a special form of the RBF kernel function [23]. According to [24], the Sigmoid kernel function behaves similarly to the RBF kernel function in some cases; therefore, in this work, the RBF kernel function is used as the kernel function of the photovoltaic array fault diagnosis model.
$$K(x_i, x_j) = \exp\left(-\frac{\|x_i - x_j\|^{2}}{2\sigma^{2}}\right)$$
where $\sigma$ is the kernel parameter of $K(x_i, x_j)$.
For the sake of processing, the objective function of the LSSVM optimization problem is divided by C, and 1/C is replaced by θ, which is defined as regularization parameter.
First, we set the initial values of $\theta$ and $\sigma^2$ arbitrarily and establish the initial model $H_0$, which is trained on the training set.
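Training one binary LSSVM with fixed hyperparameters amounts to solving a single linear system. The sketch below follows the standard LSSVM formulation of Suykens and Vandewalle rather than the toolbox code used in the paper. The Gaussian RBF form exp(−‖a − b‖²/(2σ²)), the helper names, and the toy data in the usage note are assumptions; `c` is the penalty factor C, so θ = 1/c.

```python
import math


def rbf(a, b, sigma2):
    """Gaussian RBF kernel; the 2*sigma^2 scaling is an assumed convention."""
    return math.exp(-sum((p - q) ** 2 for p, q in zip(a, b)) / (2.0 * sigma2))


def solve(matrix, rhs):
    """Gaussian elimination with partial pivoting for a small dense system."""
    size = len(rhs)
    aug = [row[:] + [r] for row, r in zip(matrix, rhs)]
    for col in range(size):
        pivot = max(range(col, size), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, size):
            factor = aug[r][col] / aug[col][col]
            for k in range(col, size + 1):
                aug[r][k] -= factor * aug[col][k]
    x = [0.0] * size
    for r in range(size - 1, -1, -1):
        tail = sum(aug[r][k] * x[k] for k in range(r + 1, size))
        x[r] = (aug[r][size] - tail) / aug[r][r]
    return x


def train_lssvm(xs, ys, c, sigma2):
    """Solve [[0, y^T], [y, Omega + I/c]] [b; alpha] = [0; 1] for b and alpha."""
    g = len(xs)
    matrix = [[0.0] + [float(y) for y in ys]]
    for i in range(g):
        row = [float(ys[i])]
        for j in range(g):
            omega = ys[i] * ys[j] * rbf(xs[i], xs[j], sigma2)
            row.append(omega + (1.0 / c if i == j else 0.0))
        matrix.append(row)
    solution = solve(matrix, [0.0] + [1.0] * g)
    return solution[0], solution[1:]


def predict(x, xs, ys, b, alpha, sigma2):
    """Hard decision: sign of sum_i alpha_i * y_i * K(x, x_i) + b."""
    score = sum(a * y * rbf(x, xi, sigma2) for a, y, xi in zip(alpha, ys, xs)) + b
    return 1 if score >= 0.0 else -1
```

For instance, with the toy samples `xs = [(0, 0), (0, 1), (3, 3), (3, 4)]` and labels `ys = [1, 1, -1, -1]`, calling `train_lssvm(xs, ys, 10.0, 1.0)` returns the threshold and multipliers that `predict` then uses. Every training sample receives a nonzero multiplier, which is the usual price LSSVM pays for replacing the quadratic program with a linear solve.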
The SVM and LSSVM were initially presented for binary classification, but photovoltaic array fault diagnosis is a multi-classification problem; thus, a decomposition algorithm is used to convert the multi-classifiers into several two-classifiers. The common algorithms are "One vs. One", "One vs. Rest", and the "Divide-and-Conquer approach". "One vs. One" is better than "One vs. Rest" in diagnosis efficiency, and the "Divide-and-Conquer approach" can lead to accumulations of errors, that is, if the classification at the root is wrong, the error propagates to all later decisions [25]. Therefore, we use the "One vs. One" classification algorithm in the LSSVM multi-classifiers.
In this paper, the LSSVM multi-classifiers were converted into six two-classifiers by the classification algorithm of "One vs. One", which are "the normal vs. the short-circuits", "the normal vs. the open-circuits", "the normal vs. the abnormal aging", "the short-circuits vs. the open-circuits", "the short-circuits vs. the abnormal aging", and "the open-circuits vs. the abnormal aging".
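The six two-classifiers are simply all unordered pairs of the four states, L(L − 1)/2 = 6 for L = 4. A minimal sketch of the enumeration follows; the state labels and the function name `one_vs_one_pairs` are illustrative.

```python
from itertools import combinations

STATES = ["normal", "short circuit", "open circuit", "abnormal aging"]


def one_vs_one_pairs(states):
    """List every unordered pair of states; one binary classifier is trained per pair."""
    return list(combinations(states, 2))
```

Each pair is trained only on the samples of its two states, so adding a fifth state would raise the number of two-classifiers from six to ten.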

The Posteriori Probability of the Optimal LSSVM Multi-Classifiers
Bayesian inference draws on the posteriori distribution, which combines the sample information with the a priori information, and therefore gives good statistical inference results. In this paper, Bayesian theory is used to optimize the parameters of the LSSVM classifier, namely the regularization parameter $\theta$ and the kernel parameter $\sigma$, and thereby to obtain the optimal classifier.
The LSSVM achieves pattern recognition by hard decision, that is, it classifies the sample by outputting 1 or −1 directly; however, pattern recognition is an uncertain problem, so this paper diagnoses the fault by the output probability of LSSVM.
For a given sample set $T = \{(x_1, y_1), \ldots, (x_g, y_g)\} \in (R^m \times y)^g$, in this paper, $x_i$ represents the parameters $U_{OC}$, $I_{SC}$, $U_m$, $I_m$, $I_{ph}'$, $R_s'$, $R_{sh}'$ of the photovoltaic array, and $y_i$ represents its real state of short circuit, open circuit, abnormal aging, or normal.
The posterior probability of the given sample set can be described as:

$$P(y|x) = \frac{P(y)\,P(x|y)}{P(x)} \tag{11}$$
In the Formula (11), P(y) is the a priori probability of the given sample set, which analyzes the probability of a sample according to the a priori knowledge; P(y|x) is the posteriori probability of the given sample set according to the training set, and the posteriori probability reflects the influence of the training sample data on the test sample.
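A small numeric illustration of Bayes' rule for a two-class case may help. The priors and likelihoods below are made-up numbers, and the function name `posterior` is hypothetical.

```python
def posterior(priors, likelihoods):
    """Bayes' rule: P(y|x) = P(y) * P(x|y) / P(x), with P(x) = sum over classes."""
    joint = {label: priors[label] * likelihoods[label] for label in priors}
    evidence = sum(joint.values())  # P(x)
    return {label: value / evidence for label, value in joint.items()}
```

With equal priors of 0.5 and likelihoods of 0.8 and 0.2, the posteriors are 0.8 and 0.2; an unequal prior shifts the posterior toward the class that is more common a priori.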
In this paper, the posteriori probability of the given sample set is:
$$P(y|x, D_1, H) = \frac{P(y|D_1, H)\,P(x|y, D_1, H)}{P(x|D_1, H)} \tag{12}$$
In Formula (12), $D_1$ is the given sample data space, and $H$ is the fault diagnosis model space. One posteriori probability is derived from each two-classifier. Since the photovoltaic array fault diagnosis model constructs six two-classifiers, six posteriori probabilities are generated for each sample. Combining these posteriori probabilities by Formula (13) then gives four combined probabilities, one per state.
The final posteriori probability that the test sample $x$ belongs to class $i$ is given by Formula (13), in which $P_{ij}(i, j|x)$ is the posteriori probability that $x$ belongs to class $i$ in the two-classifier that consists of class $i$ and class $j$, and $L$ is the number of states of the photovoltaic array. In this paper, $L$ is 4; the states are short circuit, open circuit, abnormal aging, and normal.
By comparing the four combined probabilities and selecting the largest, we can judge the fault type of the test sample.
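The combine-then-compare step can be sketched as follows. The combination rule here is an assumption chosen for illustration, a normalized sum of the pairwise probabilities, P_i = 2/(L(L − 1)) · Σ_{j≠i} P_ij, and may differ from the paper's Formula (13). The function names and the probabilities in the usage note are made up.

```python
def combine_pairwise(pairwise, classes):
    """Combine pairwise posteriors into one probability per class.

    pairwise[(i, j)] is the probability that the sample belongs to class i
    according to the i-vs-j classifier; the j-vs-i value is taken as 1 - that.
    Assumed rule: P_i = 2 / (L * (L - 1)) * sum over j != i of P_ij.
    """
    count = len(classes)
    combined = {}
    for i in classes:
        total = 0.0
        for j in classes:
            if i == j:
                continue
            if (i, j) in pairwise:
                total += pairwise[(i, j)]
            else:
                total += 1.0 - pairwise[(j, i)]
        combined[i] = 2.0 * total / (count * (count - 1))
    return combined


def diagnose(pairwise, classes):
    """Return the class with the largest combined probability."""
    combined = combine_pairwise(pairwise, classes)
    return max(combined, key=combined.get)
```

For example, with classes `[1, 2, 3, 4]` and pairwise values `{(1, 2): 0.9, (1, 3): 0.8, (1, 4): 0.95, (2, 3): 0.5, (2, 4): 0.6, (3, 4): 0.4}`, the combined probabilities sum to one and class 1 has the largest value, so the sample would be diagnosed as class 1.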

Simulation and Results
In this section, several data sets are constructed to study the performance of the fault diagnosis method. We first introduce the construction of a simulation system, then the test data under different states are simulated and briefly described. Finally, the simulation results are presented.

The Simulation of Photovoltaic System
In order to verify the validity of LSSVM algorithm based on Bayesian framework in photovoltaic array fault diagnosis model, this paper sets up a general simulation model of photovoltaic array by Matlab/Simulink; the photovoltaic array consists of two photovoltaic strings in parallel, and each string has three modules in series.

The Simulation Data in Different States
In the photovoltaic array simulation model, the fault of abnormal aging is simulated by connecting a series resistor. The model of the photovoltaic modules in the simulation model is CHN310-72P, and the main parameters of the photovoltaic module at standard test conditions (STC) are shown in Table 1. The outputs of the simulation model are the operating voltage $U$ and the output current $I$ of the photovoltaic array; thus, we can obtain the I-V curve of the photovoltaic array under the STC, from which the external parameters $U_{OC}$, $I_{SC}$, $U_m$, $I_m$ and the equivalent circuit internal parameters $I_{ph}'$, $R_s'$, $R_{sh}'$ are obtained [26]. From the simulation results, the external parameters of the photovoltaic array in the normal state are $U_{OC}$ = 121.00 V, $I_{SC}$ = 15.85 A, $U_m$ = 107.00 V, and $I_m$ = 15.90 A, and the equivalent circuit internal parameters are $I_{ph}'$ = 16.68 A, $R_s'$ = 0.21 Ω, and $R_{sh}'$ = 8.12 kΩ. The comparison shows that the parameters obtained by the simulation are basically consistent with the parameters that are provided by the manufacturer, which indicates that the model can simulate the actual photovoltaic array well.
The photovoltaic array is run in the states of normal, short circuit, open circuit, and abnormal aging; 40 sample groups were obtained in each state (160 sample groups altogether), of which 30 sample groups per state were used as the training set and the remaining 10 per state as the testing set.

The Standardization Analysis of the Data
First, we imported the fault features into the data editor and standardized the sample data, saving the standardized scores as variables. The results of the standardization are shown in Table 2, which lists the standardized fault features of the 10 test sample groups in the short-circuit state.
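Standardization of one feature column can be sketched as a z-score transform. This assumes the standardized scores are (value − mean) / standard deviation with the sample standard deviation, which is a common convention but is not confirmed by the paper; the function name `standardize` is hypothetical.

```python
import statistics


def standardize(column):
    """Return z-scores of a feature column using the sample standard deviation."""
    mean = statistics.fmean(column)
    std = statistics.stdev(column)  # requires at least two distinct values
    return [(value - mean) / std for value in column]
```

Applying this to each of the seven features puts volts, amperes, and ohms on a common scale, so that no single feature dominates the distance inside the RBF kernel.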

Results Analysis
This paper classifies the sample data with the LS-SVMlab v1.8 toolbox in the Matlab R2014a environment, with the Gaussian RBF as the kernel function and the "One vs. One" classification algorithm used to build the LSSVM multi-classifiers model. The initial model $H_0$ is established first, with the regularization parameter $\theta$ set to 10 and the kernel parameter $\sigma^2$ set to 0.2. Bayesian inference is then used to optimize the parameters of the model, giving the optimal parameters $(\theta_{MP}, \sigma^2_{MP}) = (175.214, 6.17)$; the optimal model is obtained by re-training on the training set with the optimal parameters $\theta_{MP}$ and $\sigma^2_{MP}$. Finally, the testing set is input into the optimal classifier model to obtain the posteriori probabilities and the classification.
The LSSVM multi-classifiers were converted into six two-classifiers by the "One vs. One" classification algorithm, namely "1&2", "1&3", "1&4", "2&3", "2&4", and "3&4", where 1 represents the short-circuit state, 2 represents the open-circuit state, 3 represents the abnormal aging state, and 4 represents the normal state. The output posteriori probability values of the 10 test sample groups in the short-circuit state are shown in Table 3. The posteriori probabilities in Table 3 were combined by Formula (13) to get the final posteriori probabilities of the 10 test sample groups. The diagnosed fault type was determined by the largest final posteriori probability, as shown in Table 4, which lists the actual fault type and the diagnosed fault type of the 10 test sample groups.
Figure 5 is the multi-classification diagram based on Bayesian theory: Figure 5a is the multi-classification diagram of the training set, and Figure 5b is the multi-classification diagram of the testing set. $x_1$ and $x_2$ are the feature vectors whose dimension is reduced by the RBF kernel function.
The LSSVM algorithm in Bayesian theory is compared with the LSSVM algorithm and the standard SVM algorithm, and the results are shown in Table 5. For the LSSVM algorithm and the standard SVM algorithm, the RBF is the kernel function and the classification algorithm is "One vs. One"; the difference is that the regularization parameter $\theta$ and the kernel parameter $\sigma^2$ are optimized by 10-fold cross-validation. O is the total number of test samples, and Percent is the proportion of correctly classified test samples among the total test samples. The results show that the LSSVM in Bayesian theory has higher generalization ability and a good modeling effect.

Experimental Results
In this section, the presented fault diagnosis model is tested with an experimental photovoltaic system, and the experimental platform, as well as the experimental results, are presented.

Experimental Platform
A 35 kW experimental substation is used to validate the performance of the proposed fault diagnosis model. As Figure 6 shows, we take a photovoltaic string consisting of sixteen modules in series as the experimental subject. Fifteen of the modules were arranged into a 5 × 3 photovoltaic array, which consists of three photovoltaic strings in parallel, each with five modules in series. A reference photovoltaic string of five modules, placed behind the experimental array, is used for comparison. The solar irradiance (G) is collected by the illumination intensity detector TBQ-2 (Beijing Huatron Technology Co., Ltd., Beijing, China), and the ambient temperature (T) is collected by the temperature sensor PT100 (Haodu Sensors Technology Co., Ltd., Shenzhen, China). The current is provided by the DC (Direct Current) power source DH1718-A (Dahua Technology Co., Ltd., Beijing, China). On the experimental platform, the fault of abnormal aging is simulated by connecting a series resistor, the fault of short circuit is simulated by connecting a conductor in parallel with a photovoltaic module, and the fault of open circuit is simulated by disconnecting the conductor between two photovoltaic modules. Four instances are implemented and studied: the states of normal, short circuit, open circuit, and abnormal aging. In each state, 80 sample groups were obtained (320 sample groups altogether), of which 60 sample groups per state were used as the training set and the remaining 20 per state as the testing set.

Experimental Results
In this section, Table 6 and Figure 7 illustrate the experimental results. Table 6 lists the final posteriori probabilities of six test samples in the short-circuit state, while Figure 7 illustrates the experimental results of the aforementioned four instances, which show that the accuracy of the proposed method is 97.5%.
Figure 7. The classification of test samples.
The experimental data can thus be classified by the fault diagnosis model in Bayesian theory, with results similar to the simulated ones.
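The reported accuracy can be related to sample counts with a short calculation. With 20 test groups per state (80 in total), 97.5% would correspond to 78 correct diagnoses, assuming all 80 test groups are counted. The function name `accuracy_percent` is hypothetical, and the labels in the usage note are made up for illustration rather than taken from the experiment.

```python
def accuracy_percent(actual, diagnosed):
    """Percentage of test samples whose diagnosed state equals the actual state."""
    if not actual or len(actual) != len(diagnosed):
        raise ValueError("label lists must be non-empty and of equal length")
    correct = sum(1 for a, d in zip(actual, diagnosed) if a == d)
    return 100.0 * correct / len(actual)
```

For example, 80 actual labels with two of them diagnosed incorrectly give `accuracy_percent(...) == 97.5`.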

Conclusions
According to the change rules of the output electrical parameters and the equivalent circuit internal parameters of the photovoltaic array in different fault states, an LSSVM multi-classifiers model in Bayesian theory has been built to diagnose the fault types of the photovoltaic array.
The proposed method constructs an optimal multi-classifiers model and obtains the posteriori probabilities of the samples, from which the state of the photovoltaic array can be identified. Four kinds of working conditions are simulated to validate the effectiveness of the approach, that is, the normal condition, the short-circuit condition, the open-circuit condition, and the abnormal aging condition. The simulated results indicate that the method can classify the fault types accurately, with a higher generalization ability and a good modeling effect. Furthermore, an experimental platform is built to test the performance of the developed approach, and the experimental results also demonstrate the effectiveness of the fault diagnosis model in a practical system. This paper analyzes in depth the change rules of the output electrical parameters and the equivalent circuit internal parameters of the photovoltaic array in different fault states. It also introduces the Bayesian framework for LSSVM into the field of photovoltaic fault diagnosis, so that the faulty photovoltaic modules can be located within a certain photovoltaic array and their fault types diagnosed, thus greatly reducing the number of sensors and the costs while ensuring that the solar power station operates safely and stably.