The Short-Term Forecasting of Asymmetry Photovoltaic Power Based on the Feature Extraction of PV Power and SVM Algorithm

: To improve forecasting accuracy for photovoltaic (PV) power output, this paper proposes a hybrid method for forecasting the short-term PV power output. First, by introducing the noise level, an improved complementary ensemble empirical mode decomposition (EEMD) with adaptive noise (ICEEMDAN) is developed to determine the ensemble size and amplitude of the added white noise adaptively. ICEEMDAN can change PV power output with non-symmetry into intrinsic mode functions (IMFs) with symmetry. ICEEMDAN can enhance the forecasting accuracy for PV power by IMFs with physical meaning (not including spurious modes). Second, the selection method of relative modes (IF), which is determined by the comprehensive factor, including the shape factor, crest factor and Kurtosis, is introduced to adaptively classify the IMFs into groups including similar ﬂuctuating components. The IF can avoid the drawbacks of threshold determination by an empirical method. Third, the modiﬁed particle swarm optimization (PSO) (MPSO) is proposed to optimize the hyper-parameters in the support vector machine (SVM) by introducing the piecewise inertial weight. MPSO can improve the global and local search ability to make the particles traverse the global space and strengthen the performance of local convergence. Finally, the proposed method (ICEEMDAN-IF-MPSO-SVM) is used to forecast the PV power output of each group individually, and then, the single forecasting result is reconstructed to obtain the desired forecasting result for PV power output. By comparison with the other typical methods, the proposed method is more suitable for forecasting PV power output.


Introduction
Recently, with the rapid development of science and technology, energy demand has become more and more important [1,2]. Many countries have increasing requirements for energy. Conventional energy such as coal, petroleum and gas do not meet the requirements. In comparison with these energy resources, presenting the advantages of cleanliness, good potential, extendibility and universality, solar power is the one of most desired types of energy among energy resources. In many countries, some large-scale photovoltaic (PV) plants have been developed to reduce environmental pollution. However, PV generation has the drawbacks of intermittence and fluctuation, and being subject to ungovernable solar irradiance and metrological factors, such as wind speed, temperature, humidity and dew retting [3,4]. It also brings new challenges for guaranteeing the stable operation of the power system. Furthermore, the accurate prediction of PV power generation can improve the reliability of the power system, increase the penetration of PV power and decrease the uncertainty of PV power in characteristic of the IMFs. Each group stands for the feature fluctuation components of the original PV power output. The key task of the determined specified group is to look for the reasonability threshold. Generally, the determination of the threshold often depends heavily on experience. This may bring a potential risk of the wrong determination of frequency components for making the different fluctuational components mixed into a group. Finally, the prediction accuracy when each group is considered as the "original" PV power output is important. Thus, it also necessary to develop an adaptive method to distinguish the effective frequency components to enhance the prediction accuracy.
To solve the above-mentioned problem, this paper develops a hybrid method based on an improved CEEDMAN (ICEEMDAN), the identification of specified IMFs and the SVM. This method applies the ICEEMDAN to decompose the PV power output into a set of IMFs, the IMFs are divided into different groups, each group is considered as the "original" PV power output, and each "original" PV power output is forecast. Finally, the predicted result is reconstructed to obtain the final PV forecasting. Thus, the main contributions with regard to the previous work are as follows: (1) The effective decomposition of the original PV power output.
In this paper, the two critical parameters of CEEMDAN, which include the amplitude of the added white noise and ensemble size, are optimized by the defined concept of the noise level. The improved CEEMDAN (ICEEMDAN) can suppress the spurious mode in traditional CEEMDNA decomposition and further enhance the decomposition effectiveness.
The identified method based on the correlation between the PV power output data and each IMF is proposed to divide the IMFs into the corresponding groups adaptively. This is an adaptive method. This method can avoid the mixing of the different fluctuation components in same-group IMFs.
(3) The parameter optimization of the SVM.
The modified PSO (MPSO) based on the piecewise inertia weight is proposed to optimize the parameters in the SVM. The MPSO can realize that a large inertia weight is used to enhance the global search ability in the early stage, and a small inertia weight is determined to enhance the local search ability and convergence speed.
(4) The reconstruction of the forecasting result.
Each group's IMFs are considered as the "original" PV power and forecast with the SVM. The forecasting result of each group is reconstructed to obtain the final PV power forecasting output.
The proposed method (ICEEMDAN-IF-MPSO-SVM) is used to forecast the PV power output, and the result indicates that the proposed method has a better forecasting performance compared with the traditional methods.
The rest of the paper is organized as follows. Section 2 describes the related work including the signal decomposition method and the forecasting method for PV power output; Section 3 presents the proposed method; Section 4 discusses the comparison of the forecasting results; Section 5 is the conclusion.

CEEMDAN Algorithm
Although EEMD [34] can overcome the drawback of mode mixing in EMD, it also contains two drawbacks: the residue noise and the spurious mode. Afterwards, the CEEMDAN [35] is introduced to alleviate the above-mentioned problems. The following is the principle of CEEMDAN: (1) Obtain the noisy signal x i ; x i = x + ω 0 ξ i (i = 1, · · · , N), where ξ i is the added white noise with unit variance, ω 0 is the corresponding coefficient, and x is the original signal; (2) Extract the first IMF (c i 1 ) from each noisy signal x i with EMD; (3) Obtain the first IMF (c 1 ) by taking the average of each c i 1 ; (4) Obtain the first residue: r 1 = x − c 1 ; (5) Decompose r 1 + w 1 E 1 ε i with the EMD algorithm, and extract the first IMF to obtain the second IMF (c 2 ), where w 1 means the coefficient of the added white noise, and the operator E m (·) indicates the m-th IMF with EMD; (6) Compute the m-th residue mode (m = 2, . . . , K), and extract the first IMF to generate the (m + 1)-th IMF with Equation (3); (7) Repeat the above steps until the residue R contains fewer than two extrema; Finally, the signal x can be written as: Here, K indicates the decomposing number of the IMF.

SVM Algorithm
The support vector machine (SVM) is an effective tool for tackling the regression and class problem in machine learning. The SVM derives from an optimization problem for the non-linear dataset: subject to: . . , l where l refers to the sample size, (x 1 , y 1 ) . . . (x l , y l ) is the group of input and output vectors, w refers to the weight factor, b refers to the threshold value, C indicates the penalty parameter, the kernel function Φ maps the input samples to higher space, ε means the epsilon parameter, and ξ i and ξ * i are the upper and lower training errors, respectively. At present, the typical kernel functions include the radial basis function (RBF), polynomial function and linear function [25][26][27]. They are used to map non-linear Symmetry 2020, 12, 1777 5 of 20 features between the input and feature space. This paper employs the RBF as the kernel function in the SVM, and the RBF is expressed as follows:

PSO Algorithm
PSO is developed through the simulation of the natural flocking and swarming behavior of birds and insects. Here, the flying space of the birds refers to the search space of the resolution problem. Then, each bird is considered as a particle with no mass and volume. Furthermore, each particle refers to a possible solution. During the search process, each particle adjusts its flying velocity and position according to flying experience and the experiences of the other companion particles. Thus, the optimization problem can be solved by the optimization solution. It is assumed that there is a swarm with size N, and each particle has the own position and velocity. The velocity v ij and position x ij of the i-th particle in the j-th space can be updated: where w refers to the inertia weight, t indicates the iteration times i = 1, . . . , N and j = 1, . . . , N, and D refers to the space dimension. The c 1 and c 2 are constants and indicate the influence level for the particle from social and individual cognition. The r 1 j (t) and r 2j (t) refer to random numbers, which are within [0, 1] and obey a uniform distribution. The x i = (x i1 , x i2 , · · · , x iD ) vector is the potential solution. The position p i = (p i1 , p i2 , · · · , p iD ) and P g = p g1 , p g2 , · · · , p gD indicate that the i-th particle experiences the best local position and global position during the flight, respectively.

Effective Decomposition for PV Power
Subjected to ungovernable metrological factors, PV power output has the drawbacks of intermittence and fluctuation. When traditional EMD is used to decompose PV power output, the related modes cannot be identified effectively due to mode mixing. CEEMDAN can solve the problem of mode mixing. However, the determination of two critical parameters, which are the amplitude of the added white noise (NA) and ensemble size (EN), is not specifically guided. It follows that if the NA is too small, then mode mixing in EMD is difficult to eliminate, and if the NA is too large, a sufficient EN is needed to eliminate the residue noise. It also increases the computational cost. In addition, if the two critical parameters are not suitable to determine, the decomposed result contains a spurious mode because of the interaction between the PV power output and the added noise. The spurious mode, which has no physical meaning, cannot reflect the component information of PV power. To make the decomposition IMFs with CEEMDAN be physically meaningful, this paper introduces the idea of the noise level to determine the two critical parameters (NA and EN).
Assume that the CEEMDAN can decompose the PV power into a set of IMFs as shown in Equation (10): where x(i) refers to the PV power output, N refers to the number of IMFs, c k (i) refers to the k-th IMF, and R N refers to the residue. To decrease the computational cost, the EN is set as the smallest trail (two trails). For the NA determination, the noise level L k is developed to obtain the optimal decomposition with CEEMDAN. The noise level is developed to decompose the PV power with CEEMDAN under each noise level (as shown in Equation (11)): where σ refers to the standard deviation of the PV power, and L k refers to the noise level. The correlation coefficient is calculated between each IMF and the PV power under each noise level decomposition as shown in Equation (12): where C M(N M −1) refers to the correlation coefficient between the (N M − 1)-th IMF and PV power under the M-th noise level. If the decomposition effect under a noise level is better than that under the other noise level, the correlation coefficient between each IMF and PV power is also bigger than that under the other noise level. Here, the decomposition effectiveness is reflected by averaging the correlation coefficient under each noise level, as shown in Equation (13).
where me k refers to the mean of the correlation coefficients under the k-th noise level. In addition, the IMF number can also reflect the decomposition effectiveness. Theoretically, when the signal includes the specified frequency components, CEEMDAN decomposes the signal into a determined number of IMFs. The number is consistent with the number of the components. Each IMF includes a kind of frequency information. However, if there is a spurious mode in the decomposition result, the IMF number should be more than the number of the theoretical IMFs. It indicates that the smaller the IMF number is, the better the decomposed effectiveness is. Thus, the comprehensive evaluation criterion for decomposition effectiveness can be obtained by combing the me k with the decomposition number (as shown in Equation (14)).
If the cr k under the k-th noise level is the smallest: The amplitude of the added white noise (NA) can be also determined.
Finally, the PV power can be decomposed with CEEMDAN by the following expression.

Selection of Related Modes
To enhance the forecasting accuracy for PV power, the identified method of the related mode is developed to combine the IMFs into a given group to obtain the consistent components of the PV power.
If the IMFs include similar fluctuations of PV power output, the corresponding waveforms are also similar. Based on this principle, this paper develops an adaptive method to identify the IMFs. Firstly, Equation (18) is used to obtain the modified IMF (MIMF).
Assume that the first N number of the IMFs are similar. It means that the MIMF N contains similar fluctuation components. If another fluctuation component is introduced into MIMF (N+1) , the waveform between the MIMF N and MIMF (N+1) is different, obviously. It is illustrated that if a kind of fluctuation component is mixed into that of another component, the waveform of the adjacent MIMF must be changed.
Here, the comprehensive factor (CF) is developed to identify the difference in adjacent MIMFs (as shown in Equation (19)). The CF includes three evaluated indexes: the shape factor, crest factor and kurtosis (as shown in Table 1). These factors can reflect the waveform fluctuation. The shape factor mainly reflects the root mean square of the waveform to evaluate the fluctuation. The crest factor mainly focuses on the waveform peak. The kurtosis can reflect the smooth level of the waveform by forth central moment. Thus, the three indexes are combined to identify the similarity of the waveform reasonably. Table 1. Evaluated index of waveform fluctuation.

Number Statistical Parameter Expression
The findings are that the CF mainly concentrates nearby a numerical value for a similar waveform fluctuation; when the other component disturbs the fluctuation, the CF will change with the disturbance level. It indicates that there is a local extremum among the differences in the comprehensive factor (CF) between adjacent MIMFs. Thus, the IMFs can be classified by the feature of the comprehensive factor (CF). where C i refers to the difference between adjacent comprehensive factors. The demarcation point of IMF fluctuation can be determined: Finally, the IMFs can be adaptively classified:

Establishment of the Forecasting Sub-Model for PV Power Output
In this section, the SVM is used to forecast each group's IMFs in Section 3.2. To enhance the forecasting ability of the SVM, the modified PSO is used to optimize the parameters of the SVM. The following is the principle of the modified PSO.
Here, the modified inertia weight (piecewise inertia weight, as shown in Equation (23)) is introduced to realize that a large weight is used to enhance the global search ability and make the particle traverse all the space in the early search stage, and in the late stage, a small inertia weight is applied to enhance the local search performance so as to increase the speed of convergence. This method can overcome the drawbacks of low convergence speed and locally optimal solutions in comparison with the traditional PSO.
where t refers to the iteration number; a 1 , a 2 , a 3 refer to the slope of each piecewise inertia weight; and b 1 , b 2 , b 3 are the corresponding intercepts. Furthermore, to make the inertia weight have continuity, the following expression can be obtained: where the start weight (w start ), end weight (w end ), interval iteration numbers (t 1 and t 2 ) and maximum iteration number (t max ) need to be pre-specified. The findings are that the weight is only related to the slope (a 1 and a 3 ). The slope is used to control the change rate for the weight w.
The following are the established steps of the forecasting model: (1) Determine the training sample for each Group k (k = 1, 2, . . .) and the corresponding input variables: where x refers to the input vector of the training set, y refers to the set of output variable vectors (Group k (k = 1, 2, . . .)), x i is the input variable vector of the i-th sample, and y i is the corresponding output value. (4) Train the SVM with the training sample and calculate the fitness value of each particle. The fitness value is compared with the global best position p g . If the fitness value is superior to p i , the fitness value is considered as p i . Here, the mean square error (MSE) is used to perform the fitness function: where n means the length of the training sample, y i is the real value of the i-th sample, and y i is the corresponding forecasting power output. If the fitness of each particle is smaller than that of all the particles, the global best position p g is placed with the local best position p i .
(5) Update the position and velocity with Equations (8) and (9). (6) Look for the optimal solution until the end condition is satisfied; otherwise, go back to Step 3.

Final Forecasting Model
Assume that SVM Group 1 , SVM Group 2 , . . . indicates the forecasting model of the corresponding Group 1 , Group 2 , . . .. The final model SVM model can be obtained:

Evaluation of Forecasting Result
The forecasting accuracy is evaluated by the determination coefficient (R 2 ), mean absolute error (MRE) and root-mean-square error (RMSE). The expressions are shown in Equations (28), (31) and (32).
The determination coefficient is a statistical measurement and indicates how well a forecasting result approximates real data points. Its range is from 0 to 1. When the R 2 is equal to 1, the forecasting effectiveness is the best, and vice versa.
Here, x i is the i th tested value, andx i is the i th forecasting value. The smaller the MRE value is, the better the forecasting effectiveness is.
The smaller the RMSE value is, the better the forecasting effectiveness is. Thus, the proposed method in this paper consists of the following steps: (1) Decompose the PV power output into the IMFs with physical meaning with the improved CEEMDAN (ICEEMDAN). Here, the two critical parameters (the ensemble size and amplitude of the added white noise) are determined by introducing the noise level. (2) Classify IMFs into the individual feature groups including the most relevant fluctuation components by introducing the comprehensive factor adaptively. It may avoid the drawbacks of threshold determination depending on the empirical method and is an adaptive way.  The smaller the RMSE value is, the better the forecasting effectiveness is. Thus, the proposed method in this paper consists of the following steps: (1) Decompose the PV power output into the IMFs with physical meaning with the improved CEEMDAN (ICEEMDAN). Here, the two critical parameters (the ensemble size and amplitude of the added white noise) are determined by introducing the noise level.

Results and Discussion
This paper uses the PV power output from the UK power network to verify the effectiveness of the proposed method. The time period is the 1 June 2014 to 31 October 2014. Here, only periods between 5 a.m. and 7 a.m. are selected to establish the forecasting model because PV power is almost not generated except during the selected periods. The PV power output is between 0 and 3.5 KW. The data interval is 1 h. The corresponding input variables are six: the time series every day, temperature, wind speed, barometric pressure, rainfall and solar radiation. In addition, because the data including the PV output power and weather conditions are not recorded for each day

Results and Discussion
This paper uses the PV power output from the UK power network to verify the effectiveness of the proposed method. The time period is the 1 June 2014 to 31 October 2014. Here, only periods between 5 a.m. and 7 a.m. are selected to establish the forecasting model because PV power is almost not generated except during the selected periods. The PV power output is between 0 and 3.5 KW. The data interval is 1 h. The corresponding input variables are six: the time series every day, temperature, wind speed, barometric pressure, rainfall and solar radiation. In addition, because the data including the PV output power and weather conditions are not recorded for each day completely, the final time period is 105 days, and the corresponding data length is 1575. The PV power is plotted in Figure 2. completely, the final time period is 105 days, and the corresponding data length is 1575. The PV power is plotted in Figure 2. Here, the data of the first 101 days are set as the training sample. The data of the last 4 days are set as the testing sample. The data length of the testing sample is 60. The autocorrelation function (AF) and partial autocorrelation function (PAF) are used to obtain some insights into whether the PV power output is predictable. The number of lags is set as 60 in the AF and PAF. Figure 3 shows the AF and PAF. It is found that the AF and PAF exhibit significant correlation for most of the numbers of lag. The AF and PAF show obvious periodicity. Especially, the correlation, which belongs to the number of lags at the data point 60, is more than two standard deviations away from 0. This situation, for which the PV power of the first 101 days is used as the training data to predict the PV power of the residue 4 days, is feasible. Next, the parameter optimization method in Section 3.1 is used to determine the critical parameters of CEEMDAN (the amplitude of the added white noise, NAopt). Here, the noise level is set from 10 0 to 10 −10 , with the step 10 −1 . The determined index of the noise level is plotted in Figure 4. The findings are that the determination index is the smallest at the noise level of 10 −3 . Thus, the optimal critical parameters are two trails (ensemble size) and  Here, the data of the first 101 days are set as the training sample. The data of the last 4 days are set as the testing sample. The data length of the testing sample is 60. The autocorrelation function (AF) and partial autocorrelation function (PAF) are used to obtain some insights into whether the PV power output is predictable. The number of lags is set as 60 in the AF and PAF. Figure 3 shows the AF and PAF. It is found that the AF and PAF exhibit significant correlation for most of the numbers of lag. The AF and PAF show obvious periodicity. Especially, the correlation, which belongs to the number of lags at the data point 60, is more than two standard deviations away from 0. This situation, for which the PV power of the first 101 days is used as the training data to predict the PV power of the residue 4 days, is feasible. completely, the final time period is 105 days, and the corresponding data length is 1575. The PV power is plotted in Figure 2. Here, the data of the first 101 days are set as the training sample. The data of the last 4 days are set as the testing sample. The data length of the testing sample is 60. The autocorrelation function (AF) and partial autocorrelation function (PAF) are used to obtain some insights into whether the PV power output is predictable. The number of lags is set as 60 in the AF and PAF. Figure 3 shows the AF and PAF. It is found that the AF and PAF exhibit significant correlation for most of the numbers of lag. The AF and PAF show obvious periodicity. Especially, the correlation, which belongs to the number of lags at the data point 60, is more than two standard deviations away from 0. This situation, for which the PV power of the first 101 days is used as the training data to predict the PV power of the residue 4 days, is feasible. Next, the parameter optimization method in Section 3.1 is used to determine the critical parameters of CEEMDAN (the amplitude of the added white noise, NAopt). Here, the noise level is set from 10 0 to 10 −10 , with the step 10 −1 . The determined index of the noise level is plotted in Figure 4. The findings are that the determination index is the smallest at the noise level of 10 −3 . Thus, the optimal critical parameters are two trails (ensemble size) and  Next, the parameter optimization method in Section 3.1 is used to determine the critical parameters of CEEMDAN (the amplitude of the added white noise, NA opt ). Here, the noise level is set from 10 0 to 10 −10 , with the step 10 −1 . The determined index of the noise level is plotted in Figure 4. The findings are that the determination index is the smallest at the noise level of 10 −3 . Thus, the optimal critical parameters are two trails (ensemble size) and 10 −3 σ (the amplitude of the added white noise), respectively. The σ means the standard deviation of the PV power output. Symmetry 2020, 12, x FOR PEER REVIEW 12 of 20 To prove the decomposition effectiveness, the improved CEEMDAN (ICEEMDAN) is used to decompose the PV power output. Meanwhile, the decomposition result is also used to compare that under noise levels of 10 0 , 10 −1 and 10 −2 . This is shown in Figure 5. The findings are that the decomposition numbers are 13, 12, 10 and 9 under the noise levels of 10 0 , 10 −1 , 10 −2 and 10 −3 , respectively. The decomposition number with the proposed method is the smallest. This indicates that the proposed method (ICEEMDAN) can overcome the spurious mode. It also proves that the proposed method can be more suitable for decomposing the PV power output. In the decomposition results under the other noise levels, because the amplitude of the added white noise is relatively large, the white noise is difficult to eliminate. It follows that the interaction of white noise and PV power output introduces a spurious mode (no real physical meaning) and puts the added noise in the IMFs. Finally, the forecasting model is influenced due to the spurious mode and extra noise.  To prove the decomposition effectiveness, the improved CEEMDAN (ICEEMDAN) is used to decompose the PV power output. Meanwhile, the decomposition result is also used to compare that under noise levels of 10 0 , 10 −1 and 10 −2 . This is shown in Figure 5.  To prove the decomposition effectiveness, the improved CEEMDAN (ICEEMDAN) is used to decompose the PV power output. Meanwhile, the decomposition result is also used to compare that under noise levels of 10 0 , 10 −1 and 10 −2 . This is shown in Figure 5. The findings are that the decomposition numbers are 13, 12, 10 and 9 under the noise levels of 10 0 , 10 −1 , 10 −2 and 10 −3 , respectively. The decomposition number with the proposed method is the smallest. This indicates that the proposed method (ICEEMDAN) can overcome the spurious mode. It also proves that the proposed method can be more suitable for decomposing the PV power output. In the decomposition results under the other noise levels, because the amplitude of the added white noise is relatively large, the white noise is difficult to eliminate. It follows that the interaction of white noise and PV power output introduces a spurious mode (no real physical meaning) and puts the added noise in the IMFs. Finally, the forecasting model is influenced due to the spurious mode and extra noise. The findings are that the decomposition numbers are 13, 12, 10 and 9 under the noise levels of 10 0 , 10 −1 , 10 −2 and 10 −3 , respectively. The decomposition number with the proposed method is the smallest. This indicates that the proposed method (ICEEMDAN) can overcome the spurious mode. It also proves that the proposed method can be more suitable for decomposing the PV power output. In the decomposition results under the other noise levels, because the amplitude of the added white noise is relatively large, the white noise is difficult to eliminate. It follows that the interaction of white noise and PV power output introduces a spurious mode (no real physical meaning) and puts the added noise in the IMFs. Finally, the forecasting model is influenced due to the spurious mode and extra noise.
Next, the decomposition IMFs are combined into identified IMFs (IIMFs) according to the equation in Section 3.2. The IIMFs are plotted in Figure 6. The findings are that the waveform characteristics for IIMF1 to IIMF9 are more obvious than those in Figure 5. It is more effective to identify the related modes and classify the IMFs into corresponding feature groups. The findings are that the first feature group should combine IMF1 to IMF3, and the second feature group is from IMF4 to IMF6, as well as the last group being from IMF7 to IMF9.
Here, to prove the effectiveness of the proposed method, the adaptive identified method (IF) in Section 3.2 is used to distinguish the IIMFs. The identified results are plotted in Figure 7. The findings are that the local maxima are the second index and fifth index of the IIMF. It means that the two local maxima divide the IMFs into three groups. According to Equation (22), IMF1 to IMF3, and IMF4 to IMF6, as well as IMF7 to IMF9, are combined into single groups, respectively. This is shown in Table 2. It proves that the proposed method can classify the related IMFs into corresponding groups effectively. The combined IMFs are plotted in Figure 8. The Figure 8a plots the combined result from IMF1 to IMF3, and the Figure 8b plots the combined result from IMF4 to IMF6, and the Figure 8c plots the combined result from IMF7 to IMF9. Next, the decomposition IMFs are combined into identified IMFs (IIMFs) according to the equation Error!in Section 3.2. The IIMFs are plotted in Figure 6. The findings are that the waveform characteristics for IIMF1 to IIMF9 are more obvious than those in Figure 5. It is more effective to identify the related modes and classify the IMFs into corresponding feature groups. The findings are that the first feature group should combine IMF1 to IMF3, and the second feature group is from IMF4 to IMF6, as well as the last group being from IMF7 to IMF9.
Here, to prove the effectiveness of the proposed method, the adaptive identified method (IF) in Section 3.2 is used to distinguish the IIMFs. The identified results are plotted in Figure 7. The findings are that the local maxima are the second index and fifth index of the IIMF. It means that the two local maxima divide the IMFs into three groups. According to Equation (22), IMF1 to IMF3, and IMF4 to IMF6, as well as IMF7 to IMF9, are combined into single groups, respectively. This is shown in Table  2. It proves that the proposed method can classify the related IMFs into corresponding groups effectively. The combined IMFs are plotted in Figure 8. The figure8a plots the combined result from IMF1 to IMF3, and the figure 8b plots the combined result from IMF4 to IMF6, and the figure8c plots the combined result from IMF7 to IMF9.    Next, the decomposition IMFs are combined into identified IMFs (IIMFs) according to the equation Error!in Section 3.2. The IIMFs are plotted in Figure 6. The findings are that the waveform characteristics for IIMF1 to IIMF9 are more obvious than those in Figure 5. It is more effective to identify the related modes and classify the IMFs into corresponding feature groups. The findings are that the first feature group should combine IMF1 to IMF3, and the second feature group is from IMF4 to IMF6, as well as the last group being from IMF7 to IMF9.
Here, to prove the effectiveness of the proposed method, the adaptive identified method (IF) in Section 3.2 is used to distinguish the IIMFs. The identified results are plotted in Figure 7. The findings are that the local maxima are the second index and fifth index of the IIMF. It means that the two local maxima divide the IMFs into three groups. According to Equation (22), IMF1 to IMF3, and IMF4 to IMF6, as well as IMF7 to IMF9, are combined into single groups, respectively. This is shown in Table  2. It proves that the proposed method can classify the related IMFs into corresponding groups effectively. The combined IMFs are plotted in Figure 8. The figure8a plots the combined result from IMF1 to IMF3, and the figure 8b plots the combined result from IMF4 to IMF6, and the figure8c plots the combined result from IMF7 to IMF9.

Group Order
First Group Second Group Three Group Combined IMFs IMF1 + IMF2 + IMF3 IMF4 + IMF5 + IMF6 IMF7 + IMF8 + IMF9   The proposed method is used to forecast the three groups' PV power output in Figure 8. The modified PSO (MPSO) in Section 3.3 is used to optimize the hyper-parameters in the SVM. Table 3 indicates the parameter settings in the SVM and MPSO. Population number 20 Figure 9 shows the comparison of the modified and traditional inertial weight. The findings are that the modified inertial weight has a relatively big weight value in the early stage and small value in the late stage, obviously. They indicate that the modified inertial weight can enhance the global search ability to make the particle traverse all the space and also local search performance to increase the convergence speed. The proposed method is used to forecast the three groups' PV power output in Figure 8. The modified PSO (MPSO) in Section 3.3 is used to optimize the hyper-parameters in the SVM. Table 3 indicates the parameter settings in the SVM and MPSO. Parameter γ range 0-100 5 Slope a 2 −10 −5 10 Population number 20 Figure 9 shows the comparison of the modified and traditional inertial weight. The findings are that the modified inertial weight has a relatively big weight value in the early stage and small value in the late stage, obviously. They indicate that the modified inertial weight can enhance the global search ability to make the particle traverse all the space and also local search performance to increase the convergence speed. The SVM is used to forecast the sample of data (combined IMFs in Figure 8). The MPSO is used to optimize the parameters of the SVM in Table 3. The forecasting results are plotted in Figures 10-12, respectively. Meanwhile, the error (as shown in Equation (33)), which is the difference between the forecasting result and original series, is used to compare the forecasting effectiveness for each hour's PV power output.
where x i refers to the forecasting PV power output, y i refers to the tested PV power output, and N refers to the number of tested samples. The error is also plotted in Figures 10-12, respectively. The findings are that the forecasting results are all almost consistent with the original tested series of the three groups' PV power output. For the first group's PV power output, the forecasting error for the high-frequency component is bigger than that in the low-frequency components (the second group and the third group). The forecasting effectiveness is the best for the third group's PV power output. Furthermore, it also illustrates the effectiveness of the proposed method; when the different fluctuation components are separated into a single group adaptively, the fluctuation is weaker than in the original series; the forecasting accuracy for each group is better than that for the original series. The final forecasting accuracy from reconstructing the forecasting result of each group should also be better than that for the original series.
12, respectively. Meanwhile, the error (as shown in Equation (33)), which is the difference between the forecasting result and original series, is used to compare the forecasting effectiveness for each hour's PV power output.
where i x refers to the forecasting PV power output, i y refers to the tested PV power output, and N refers to the number of tested samples. The error is also plotted in Figures 10-12, respectively. The findings are that the forecasting results are all almost consistent with the original tested series of the three groups' PV power output. For the first group's PV power output, the forecasting error for the high-frequency component is bigger than that in the low-frequency components (the second group and the third group). The forecasting effectiveness is the best for the third group's PV power output. Furthermore, it also illustrates the effectiveness of the proposed method; when the different fluctuation components are separated into a single group adaptively, the fluctuation is weaker than in the original series; the forecasting accuracy for each group is better than that for the original series. The final forecasting accuracy from reconstructing the forecasting result of each group should also be better than that for the original series.    [42] (as shown in Figure 13a). The MRE, RMSE and R 2 are used to evaluate the forecasting effectiveness for each forecasting method, respectively. Table 4 shows the compared results. The findings are that the MRE and RMSE of the proposed method are the smallest among those of all the methods, and the corresponding R 2 is the biggest. It indicates that the forecasting effectiveness of the proposed method outperforms the other   [42] (as shown in Figure 13a). The MRE, RMSE and R 2 are used to evaluate the forecasting effectiveness for each forecasting method, respectively. Table 4 shows the compared results. The findings are that the MRE and RMSE of the proposed method are the smallest among those of all the methods, and the corresponding R 2 is the biggest. It indicates that the forecasting effectiveness of the proposed method outperforms the other forecasting methods. Additionally, the findings in Figure 13b,c show that the forecasting accuracy for  [42] (as shown in Figure 13a). The MRE, RMSE and R 2 are used to evaluate the forecasting effectiveness for each forecasting method, respectively. Table 4 shows the compared results. The findings are that the MRE and RMSE of the proposed method are the smallest among those of all the methods, and the corresponding R 2 is the biggest. It indicates that the forecasting effectiveness of the proposed method outperforms the other forecasting methods. Additionally, the findings in Figure 13b,c show that the forecasting accuracy for a high amplitude in PV output is better than that for low amplitude.  [42] (as shown in Figure 13a). The MRE, RMSE and R 2 are used to evaluate the forecasting effectiveness for each forecasting method, respectively. Table 4 shows the compared results. The findings are that the MRE and RMSE of the proposed method are the smallest among those of all the methods, and the corresponding R 2 is the biggest. It indicates that the forecasting effectiveness of the proposed method outperforms the other forecasting methods. Additionally, the findings in Figure 13b,c show that the forecasting accuracy for a high amplitude in PV output is better than that for low amplitude.   Figure 13. Comparison of different forecasting methods for PV output. In addition, the forecasting error for each hour is also plotted in Figure 14. The method of error estimation is same as that in Equation (33). The findings are that the error for the proposed method (ICEEMDAN-IF-MPSO-SVM) is the smallest among all the methods. Furthermore, they also prove that the proposed method is more suitable for forecasting the PV power output. In addition, the forecasting error for each hour is also plotted in Figure 14. The method of error estimation is same as that in Equation (33). The findings are that the error for the proposed method (ICEEMDAN-IF-MPSO-SVM) is the smallest among all the methods. Furthermore, they also prove that the proposed method is more suitable for forecasting the PV power output. In addition, the correlation between the forecasting results for the PV power output and the corresponding tested data is also analyzed to evaluate the forecasting effectiveness. Figure 15 plots the correlation. The findings are that the correlation is 0.9910. The forecasting effectiveness has higher accuracy for a high PV power output than that for a low PV power output. Furthermore, this also proves that the proposed method has better forecasting effectiveness.  In addition, the correlation between the forecasting results for the PV power output and the corresponding tested data is also analyzed to evaluate the forecasting effectiveness. Figure 15 plots the correlation. The findings are that the correlation is 0.9910. The forecasting effectiveness has higher accuracy for a high PV power output than that for a low PV power output. Furthermore, this also proves that the proposed method has better forecasting effectiveness.
In addition, the correlation between the forecasting results for the PV power output and the corresponding tested data is also analyzed to evaluate the forecasting effectiveness. Figure 15 plots the correlation. The findings are that the correlation is 0.9910. The forecasting effectiveness has higher accuracy for a high PV power output than that for a low PV power output. Furthermore, this also proves that the proposed method has better forecasting effectiveness.

Conclusions
This paper proposes a novel hybrid method for forecasting PV power output. The following conclusions are drawn: (1) The ICEEMDAN method is introduced to decompose the PV power output into IMFs with physical meaning. The two critical parameters, which include the ensemble size (EN) and amplitude of the added white noise (NA), can be determined by setting the ensemble size as two Forecating PV power output (KW) Figure 15. Correlation between forecasting results and original tested series.

Conclusions
This paper proposes a novel hybrid method for forecasting PV power output. The following conclusions are drawn: (1) The ICEEMDAN method is introduced to decompose the PV power output into IMFs with physical meaning. The two critical parameters, which include the ensemble size (EN) and amplitude of the added white noise (NA), can be determined by setting the ensemble size as two trails and introducing the noise level. This method can avoid the interference of a spurious mode for the forecasting of PV power output. (2) The adaptive identified method of the relative mode is proposed to classify the IMFs into the corresponding feature groups. The method can separate the complex fluctuating components in PV power output into single components to enhance the forecasting accuracy. (3) The modified PSO (MSPO) is proposed to optimize the hyper-parameters in the SVM. The MPSO applies the piecewise inertial weight to enhance the global search ability to make the particle traverse all the space and also local search performance to increase the convergence speed.
Finally, the forecasting result for each group is reconstructed to obtain the desired result. By comparison with the other typical methods, the results indicate that the proposed method is more suitable for forecasting PV power output.

Conflicts of Interest:
The authors declare no conflict of interest.