Hybrid Forecasting Methodology for Wind Power-Photovoltaic-Concentrating Solar Power Generation Clustered Renewable Energy Systems

Forecasting of large-scale renewable energy clusters composed of wind power generation, photovoltaic and concentrating solar power (CSP) generation encounters complex uncertainties due to spatial scale dispersion and time scale random fluctuation. In response to this, a short-term forecasting method is proposed to improve the hybrid forecasting accuracy of multiple generation types in the same region. It is formed through training the long short-term memory (LSTM) network using spatial panel data. Historical power data and meteorological data for CSP plant, wind farm and photovoltaic (PV) plant are included in the dataset. Based on the data set, the correlation between these three types of power generation is proved by Pearson coefficient, and the feasibility of improving the forecasting ability through the hybrid renewable energy clusters is analyzed. Moreover, cases study indicates that the uncertainty of renewable energy cluster power tends to weaken due to partial controllability of CSP generation. Compared with the traditional prediction method, the hybrid prediction method has better prediction accuracy in the real case of renewable energy cluster in Northwest China.


Introduction
As the energy crisis intensifies, renewable energy sources such as wind and solar energy as have been widely concerned. According to statistics, the world's wind power generation in 2020 reached 733 GW which increased by 17.8% over 2019. The world's solar power generation in 2020 reached 714 GW and increased by 21.6% over last year [1]. In fact, both wind and photovoltaic (PV) power generation are fluctuant and intermittent. High renewable energy penetrated power systems bring severe challenges to security and economy of the power grid [2]. As a result, under the premise of ensuring the economic and stable operation of the power grid, we should improve the ability of the power grid to penetrate more renewable energy.
In order to meet this requirement, on the one hand, energy storage devices can be installed on the grid side, such as concentrating solar power (CSP) with heat storage [3,4]. On the other hand, according to the complementary and smoothness of wind and solar energy [5,6], it is also an effective means to raise the renewable energy penetration by bundling multiple types of renewable energy in the same region [7]. Nevertheless, CSP and bundling grid connection methods cannot guarantee the stability of renewable energy generation. Moreover, CSP, wind power and PV power are intermittent, random, and difficult to predict accurately, which lead to more complex uncertainty of power generation cluster and more obvious time and space complementary of power generation. In the context of large scale and multi-types renewable energy clusters, it is necessary to explore the change regularities of regional renewable generation and forms a power forecasting method which is suitable for wind power, PV power and CSP cluster.
With the improvement of computer operation speed, more machine learning and artificial intelligence approaches have been used in forecasting. Normally, compared with the statistical and physical forecasting approaches, machine learning approaches can get better results [8][9][10]. However, machine learning needs a large amount of data in the training process, and the model with large fluctuation of training data is easy to over fit [11][12][13]. Ahmed et al. [14] summarize the recent photovoltaic power forecasting (PVF) methods including physical, statistical, artificial intelligence, ensemble and hybrid approaches, the results show that the ensembles of artificial neural network (ANN) is most suitable for short-term PVF. Ding [15] summarizes and compares several common forecasting methods of wind power generation (WPG) such as the autoregressive integrated moving average (ARIMA) model, Kalman filter (KF), support vector machine (SVM) and ANN. The main advantage of ANN is its ability to study and deal with nonlinearity in WPG data. In order to find a more suitable power generation forecasting method, Sharifzadeh et al. [16] compared ANN, SVM and Gaussian process regression. All these models can forecast wind and solar power, but only the ANN can successfully consider the electricity demand. Although ANN can analyze time series data, when the time series is too long, the forecast accuracy is low. In order to solve this problem, a long short-term memory (LSTM) based on gating unit is proposed [17]. Because of its unique structural characteristics, LSTM has great advantages in processing long time series such as renewable energy generation data. Considering that LSTM can extract and learn the correlation in hours of solar data and even longer term information, Wang et al. [18] proposed a hybrid forecasting strategy which combines LSTM with Gaussian process regression. The results show that this method is better than SVM and other methods in WPG forecasting. Besides, renewable energy may have great changes in different seasons and weather conditions. When the data set is insufficient, the change of weather and season is often ignored in renewable energy forecasting [9]. For example, if the model is trained with the data in summer, the forecasting performance of the model may be better in summer, but it is not sure that the model has the same performance in winter [19]. That's why we should consider the applicability of the LSTM model. Nevertheless, these methods independently forecast one type of power generation such as PV or WPG. The authors in [20] propose that ANN can be used to forecast hybrid renewable energy systems. By comparison, LSTM has better performance than multi-layer perceptron (MLP) and other simple ANN in forecasting of hybrid renewable energy systems. In traditional regional prediction models, most of them are renewable energy generation predictions of the same power generation type [21]. The combined prediction of multiple units greatly improves the accuracy of the prediction model [22]. Consider the application scenarios of renewable energy clusters with multiple power generation types are common in these days [23]. The wind power and PV power generation can be predicted separately, and then the regional power can be obtained by simple accumulation. Because of the complementary between solar and wind energy, when wind turbines and photovoltaic power plants are built in a power plant or region, the combined use of wind and solar data can improve the accuracy of prediction [24,25]. When there is CSP in the area, there is a strong trend correlation between CSP and PV power generation [26]. Due to the flexibility of CSP power generation, the uncertainty of photovoltaic power generation and wind power can be weakened to a certain extent [27,28]. With this just the opposite is, the nonlinear relationship between CSP generation and the other two types of renewable energy power generation is more complicated. For renewable energy clusters including CSP generation, the applicability of typical statistical method and cumulative method cannot be guaranteed.
In view of the above concerns, this paper proposes a short-term hybrid prediction model based on LSTM neural network. In this way, the spatial and temporal comple-mentary characteristics of different units are deeply explored. The advantages and disadvantages of bundling prediction of various types of renewable energy are compared with the traditional independent prediction. Finally, a short-term hybrid prediction model based on LSTM is established. The research work shows that the method can effectively describe the complex uncertainty of the large-scale and multiple types renewable energy, which has great advantages in renewable energy cluster forecasting. The rest of this paper is summarized as follows: The second part introduces the renewable power generation cluster including CSP generation and the structure of LSTM neural network. Part III describes the construction process of short-term hybrid prediction model based on LSTM neural network. In the fourth part, through the measured data of a certain region in China, the method proposed in this paper is compared with traditional independent prediction method. Finally, the main contributions are summarized in Part V.

Wind Power-Photovoltaic-Concentrating Solar Power Cluster
In CSP, wind power and PV power cluster, according to the local power characteristics and climate complementation, the establishment of wind and photovoltaic power cluster can effectively realize the complementary of wind and solar energy [29]. Wind speed and direct normal irradiance (DNI) are meteorological data, which have generating fluctuation in day-night and seasonal cycle. Thus, wind power and photovoltaic power generation are complementary in time. In the hybrid power generation cluster, integrated energy complementary power generation can effectively improve the new energy consumption capacity of power system [30].
At the same time, the CSP generation with adjustable power is mixed in the cluster, which can greatly improve the stability and smoothness of the hybrid cluster power [28]. In the daytime, in addition to normal power generation according to the electricity demand, CSP generation also stores the remaining heat of power generation through the heat storage device, and then generates power by the heat storage device in the case of no sunshine at night, rainy days, etc. This makes the CSP generation units can maintain continuous and stable power supply, which is better than wind power generation and PV power generation. The increase of the proportion of CSP generation will alleviate the negative impact of the renewable energy penetrate to the grid.
A wind power-photovoltaic-concentrating solar power (Wind-PV-CSP) generation cluster will still have a certain impact on the grid, because the integration of a variety of renewable energy brings more complex uncertainty. Although the uncertainty between different renewable energy sources can be offset to reduce the uncertainty after the cluster, it is difficult to judge the uncertainty after the cluster. Part of the uncertainty is offsets and part of the uncertainty is further enhanced. As a result, it is more difficult to predict the power of hybrid generation cluster. The operation of this type of hybrid generation cluster is shown in Figure 1.
In renewable energy cluster, photovoltaic power generation, wind power generation are generally regulated by CSP generation whose power is partially controllable [31], so it is rare to make independent prediction of CSP generation. In fact, the generation of CSP always changes with the generation of other renewable energy. Therefore, when considering the power prediction of renewable energy cluster including CSP, there is a certain correlation between the generation of CSP and other renewable energy generation types. Different from the traditional independent prediction, the new input data of spatial location relationship is included in the hybrid prediction of renewable energy cluster. The spatial location relationship between different renewable energy power generations can be expressed mathematically by the Pearson coefficient.

LSTM Network
LSTM forms the repeating module through three gates (forge gate, input gate, output gate) and a memory unit (cell). The structure of LSTM is shown in Figure 2. The following part gives a brief introduction to the structure of long-term memory and short-term memory.

Short-term memory
In terms of bionics, forgotten gate plays an important role in the model's short-term memory. It can "forget" the unimportant information in time series data processing. Like our short term memory, these useless data are quickly forgotten by the LSTM. From the viewpoint of mathematics, forgotten gate is the reduction of xt and ht-1. If xt contains important information, the value of σ and ft are both close to 0. On the contrary, the value of σ and ft are both close to 1:

LSTM Network
LSTM forms the repeating module through three gates (forge gate, input gate, output gate) and a memory unit (cell). The structure of LSTM is shown in Figure 2. The following part gives a brief introduction to the structure of long-term memory and short-term memory.

LSTM Network
LSTM forms the repeating module through three gates (forge gate, input gate, output gate) and a memory unit (cell). The structure of LSTM is shown in Figure 2. The following part gives a brief introduction to the structure of long-term memory and short-term memory.

Short-term memory
In terms of bionics, forgotten gate plays an important role in the model's short-term memory. It can "forget" the unimportant information in time series data processing. Like our short term memory, these useless data are quickly forgotten by the LSTM. From the viewpoint of mathematics, forgotten gate is the reduction of xt and ht-1. If xt contains important information, the value of σ and ft are both close to 0. On the contrary, the value of σ and ft are both close to 1:  Short-term memory In terms of bionics, forgotten gate plays an important role in the model's short-term memory. It can "forget" the unimportant information in time series data processing. Like our short term memory, these useless data are quickly forgotten by the LSTM. From the viewpoint of mathematics, forgotten gate is the reduction of x t and h t-1 . If x t contains important information, the value of σ and f t are both close to 0. On the contrary, the value of σ and f t are both close to 1: where x t is the input of LSTM at time t, h t−1 is the output of LSTM at time t − 1, f t is the output of forgotten gate at time t, the square brackets indicate that two vectors are joined together, W f is the weight matrix of the gate layer that is forgotten, σ is the sigmoid function, and b f is the bias term of the forgotten gate layer.
• Long-term memory From the viewpoint of bionics, input gate and memory unit give LSTM the ability of long-term memory. Unlike the forgotten gate, the function of input gate and memory unit is "remember". Some important information is stored in the memory unit through the input gate. Like humans' long-term memory, these useful data have been stored in LSTM for a long time. Input gate determines how much x t of the network is saved in the memory unit C t . If the input x t has no useful information, the value of σ and it are close to 0. Conversely, the value of σ and it are close to 1. The mathematical Equation (2) of the input gate is as follows: where i t is the input of input gate at time t, W i is the weight matrix of the first part, and b i is the input gate layer bias terms.
The old information C t−1 and the new information C t are stored in memory unit. The mathematical Equation (3) of memory unit is as follows: LSTM has great advantages in forecasting WPG or PV power generation, which has been confirmed by many ways [32,33]. Nevertheless, there are few studies on hybrid forecasting by LSTM.

Hybrid Forecasting of Different Generation Types
A special case of PV-CSP complementary generation system is given to prove the advantage of LSTM in hybrid forecasting. It is assumed that the CSP model has enough energy storage capacity. In order to balance the intermittent and fluctuation of PV generation, CSP plant with heat storage device stores energy during the day and releases energy at night. The power generation of CSP plant in one day is shown in Figure 3. Where, blue curve and red curve show the DNI and PV power generation during the day respectively. The yellow curve represents the power generated by CSP when it is independently generated, and the purple curve represents the power generated by CSP in complementary generation system.
where xt is the input of LSTM at time t, ht−1 is the output of LSTM at time t − 1, ft is the output of forgotten gate at time t, the square brackets indicate that two vectors are joined together, Wf is the weight matrix of the gate layer that is forgotten, σ is the sigmoid function, and bf is the bias term of the forgotten gate layer.

Long-term memory
From the viewpoint of bionics, input gate and memory unit give LSTM the ability of long-term memory. Unlike the forgotten gate, the function of input gate and memory unit is "remember". Some important information is stored in the memory unit through the input gate. Like humans' long-term memory, these useful data have been stored in LSTM for a long time. Input gate determines how much xt of the network is saved in the memory unit Ct. If the input xt has no useful information, the value of σ and it are close to 0. Conversely, the value of σ and it are close to 1. The mathematical Equation (2) of the input gate is as follows: where it is the input of input gate at time t, Wi is the weight matrix of the first part, and bi is the input gate layer bias terms. The old information Ct−1 and the new information Ct are stored in memory unit. The mathematical Equation (3) of memory unit is as follows: LSTM has great advantages in forecasting WPG or PV power generation, which has been confirmed by many ways [32,33]. Nevertheless, there are few studies on hybrid forecasting by LSTM.

Hybrid Forecasting of Different Generation Types
A special case of PV-CSP complementary generation system is given to prove the advantage of LSTM in hybrid forecasting. It is assumed that the CSP model has enough energy storage capacity. In order to balance the intermittent and fluctuation of PV generation, CSP plant with heat storage device stores energy during the day and releases energy at night. The power generation of CSP plant in one day is shown in Figure 3. Where, blue curve and red curve show the DNI and PV power generation during the day respectively. The yellow curve represents the power generated by CSP when it is independently generated, and the purple curve represents the power generated by CSP in complementary generation system.  It is clear that the power of CSP in the complementary generation system has a long time lag, which is often has a serious impact on the short-term forecast. Because of the It is clear that the power of CSP in the complementary generation system has a long time lag, which is often has a serious impact on the short-term forecast. Because of the shortterm memory, recurrent neural network (RNN) cannot get enough effective components from the daytime DNI. In this case, a complex artificial intelligence model is not even as effective as the simple physical model. Actually, the LSTM network has achieved good results in processing long time series. LSTM network can "remember" enough information from the input data in the daytime, and forecast the power of CSP at night.

Data and Correlation Analysis
The model proposed in this paper takes the renewable hybrid generation system in the northwest of China as the research object. Hybrid generation in the region consists of a PV plant, a CSP plant and a wind farm. Their capacities are 40 MW, 20 MW and 100 MW, respectively. The data set used in Section III and IV are all from this region. This data set includes not only the power data of each unit, but also some meteorological parameters including wind speed and DNI. All the data in the data set reserves a decimal fraction while recording, and the temporal resolution is one hour. The time scale of this data set is one year, so the data set can be considered seasonal. Through reasonable arrangement of training samples and test samples, the applicability of the method in different seasons can be proved.
Descriptive statistics is used to reflect the basic situation of each member variable in the data set and recorded in Table 1, such as maximum value (MAX) and minimum value (MIN), average (AVE), standard deviation (STD). MAX and MIN are used to represent the upper and lower limits of the member variables. AVE is used to describe the average level by each type of data in the year. STD is used to describe the degree of dispersion. Before modeling, correlation among different variables should be analyzed [34]. Compared with meteorological variables, we are more concerned about whether there is significant correlation among different types of power generation. Based on a large number of actual data, the correlation analysis method of Pearson coefficient is used to analyze the correlation among three types of power generation and meteorological variables. Record the obtained results in Table 2. The calculation method of Pearson coefficient is shown in the following: where P CC is the Pearson correlation coefficient among different data types, X i and Y i are two different data types at the i moment, X ave and Y ave are the mean values of X i and Y i respectively, and N is the length of the series. According to the results in Table 2, it is quite obvious that PV and DNI have a high degree of correlation, and the Pearson coefficient between them is more than 0.9. Significant correlation is also found between WPG and wind. There is a certain correlation among the three types of power generation. The correlation coefficient between PV and WPG was 0.42, that between CSP and PV was 0.38, and that between CSP and WPG was 0.11. The reasons for the correlation are as follows: (1) Because wind speed and DNI are correlated meteorological variables, there is a weak correlation between WPG and PV. During the day, the DNI is high and the wind speed is often weak. After the sun sets at night, the wind speed is strengthened because of the great change of the surface temperature. Besides, the wind speed is often higher on rainy days than on sunny days. What's more, In China's monsoon climate region, generally, wind speed and DNI are complementary in season. In short, it is precisely because of these meteorological correlations that WPG and PV power generation show a certain correlation. (2) DNI is the main meteorological variable determining PV and CSP power generation, so there is a trend correlation between PV and CSP. The correlation is very strong when there is no power compensation. (3) In the hybrid power generation system, CSP is used to compensate for the fluctuation of PV and WPG generation, so there is a certain negative correlation between the generation of CSP and the other two types of generation. (4) The three types of power generation may be affected by temperature, air pressure, humidity and other meteorological factors at the same time. There is a certain coupling relationship among the three types of power generation.
All in all, we believe that there are complex but close correlations among the three types of power generation. We think that the hybrid forecasting of different generation types can make good use of these correlations, so as to improve the forecasting ability of the model.

Modeling Process
The modeling process of the hybrid forecasting method based on LSTM is shown in Figure 4. The modeling process is mainly composed of preprocessing, modeling, testing and evaluating, which will be introduced in the following.
The data set including meteorological data and power data cannot be directly used for power forecasting, which means it is necessary to perform some preprocessing operations on the dataset, such as normalization and correction. Correction is mainly used to identify and eliminate the influence of abnormal data. Suppose data P(t) represents the value of dataset P at time t. On the one hand, the missing data P(t) is completed by interpolation. On the other hand, if the value of P(t) is not in the range [1.5 × P(t − 1) − 0.5 × P(t + 1), 1.5 × P(t + 1) − 0.5 × P(t − 1)], P(t) is considered as an outlier. Outliers are removed and replaced by interpolation. Besides, data normalization should be performed. After data normalization, the neural network is easier to converge to the optimal solution in the training process. From the mathematical point of view, normalization is a linear transformation of the original data. In this way, the original data is mapped to the interval [0,1]. The normalization formula is as follows: where A max and A min are the maximum and minimum values of data type A, x is the original value of A, x' is the normalized value of x. Correlation of generation power in renewable energy cluster is related to spatial location and temporal change [35], in order to describe input data from two aspects of time scale and spatial dimension, spatial panel data can be used for modeling. The normalized data are directly used to form spatial panel data and spatial panel data is used to train neural network. The training data is mainly used to train the parameters of LSTM. Validation data is actually a set of samples that used to verify the performance of different models. The different models here are mainly refer to the different super parameters. It can also be said that the role of validation data is to adjust the model parameters. Usually, LSTM will not use all the data in network training and parameters adjusting. Instead, part of the data that does not participate in the training are used to the network testing. For this data set, 11 months of data are used to model and 1 month of data are used to test the effect of the proposed method.
For a trained LSTM, test set is used to objectively evaluate the performance of the network. Nevertheless, the results are not convincing if we only use the data from a certain season. We cannot guarantee the model will have the same effect in other test samples, for example, in different seasons. Cross validation is used to ensure that the model has sufficient applicability. It can objectively judge the compliance degree of LSTM to the data outside the training set. The core idea of cross validation is to use different months to test and the remaining months to train LSTM. The specific arrangement is shown in Table 3.  Figure 4. Flow chart of modeling procedure.
Correlation of generation power in renewable energy cluster is related to spatial location and temporal change [35], in order to describe input data from two aspects of time scale and spatial dimension, spatial panel data can be used for modeling. The normalized data are directly used to form spatial panel data and spatial panel data is used to train neural network. The training data is mainly used to train the parameters of LSTM. Validation data is actually a set of samples that used to verify the performance of different models. The different models here are mainly refer to the different super parameters. It can also be said that the role of validation data is to adjust the model parameters. Usually, LSTM will not use all the data in network training and parameters adjusting. Instead, part of the data that does not participate in the training are used to the network testing. For this data set, 11 months of data are used to model and 1 month of data are used to test the effect of the proposed method.
For a trained LSTM, test set is used to objectively evaluate the performance of the network. Nevertheless, the results are not convincing if we only use the data from a certain season. We cannot guarantee the model will have the same effect in other test samples, for example, in different seasons. Cross validation is used to ensure that the model has sufficient applicability. It can objectively judge the compliance degree of LSTM to the data outside the training set. The core idea of cross validation is to use different months to test and the remaining months to train LSTM. The specific arrangement is shown in Table 3. Generally, there are three standards for evaluate, including root mean squared error (RMSE), mean absolute error (MAE) and mean absolute percentage error (MAPE). RMSE is used to measure the deviation between the forecasting value and the true value. MAE can avoid the problem of offsetting the positive and negative errors, so it can accurately reflect the actual forecasting error. In this paper, we only consider the RMSE of the forecast results when evaluating the results. Because each error in RMSE is square and the outliers have a greater impact on the RMSE. The reason why MAPE is not selected in this paper is that some data in the dataset is equal to 0 and MAPE is not available. As an evaluation index, RMSE can guarantee the forecasting accuracy of the forecasting model in special weather. The Equation for P RMSE is as follows: where P Mi is the observed power of i time, P Pi is forecasting power of i time, n is the number of samples.
Because of different generation capacity, the true value is not comparable. To solve this problem, we calculate the unit value of RMSE, which called P RMSE %: where P N is the rated power of renewable energy power generation.

Case Setup
In addition to theoretical analysis, the effectiveness of the method needs to be verified by cases. Section 3.2 analyzes the correlation among the three types of power generation, and it should be proved by an example that the forecasting efficiency can be significantly improved by hybrid forecasting among the generation types with correlation. On this basis, Wind-PV-CSP hybrid generation bundled into the power system are superior is verified by the evaluation of hybrid forecasting, partial hybrid forecasting and independent forecasting. Finally, the correctness of Section 2 theory should be proved by comparative experiments. In the hybrid forecasting of multiple renewable energy sources, LSTM network do better than the traditional RNN network.
(1) Case 1 In case 1, a hybrid forecasting scheme for CSP and PV power generation is modeled. As a contrast, a scheme for independent forecasting of CSP and PV power generation is established. The main purpose of case 1 is to prove that the hybrid forecasting between PV and CSP with significant correlation can effectively improve the forecasting accuracy. For hybrid forecasting, the power of regional PV and CSP can be obtained directly. The difference is that for independent forecasting, the sum of CSP and PV generation in the region can be obtained by accumulation. The input data of these two models include historical power data and meteorological data (MD). The meteorological data include wind speed and DNI. We record more detailed information about each scheme in Table 4. The models of these two forecasting schemes are shown in Figure 5. WPG and CSP Hybrid Figure 5. Two forecasting schemes of PV and CSP.
Although PV was significantly correlated with CSP, WPG had a weak correlation with CSP and PV. In order to prove whether the hybrid forecasting between generation types with weak correlation can improve the forecasting accuracy, we established the hybrid forecasting of WPG and PV and the hybrid forecasting of WPG and CSP. The details are recorded in Table 4.
(2) Case 2 Based on correlation analysis, hybrid generation bundled into the power system are superior is verified by the evaluation of hybrid forecasting, partial hybrid forecasting and independent forecasting. A total of five experimental groups were set up and recorded in Table 5. Independent forecast details are recorded in group 1. Details of the partial hybrid forecasts are recorded in groups 2, 3, and 4. Details of the hybrid forecasting are recorded in group 5. The differences of the above five prediction models in structure can be expressed more intuitively by mathematics. The mathematical expressions of group 1 to group 5 are expressed by Equations (8)-(12) respectively: Although PV was significantly correlated with CSP, WPG had a weak correlation with CSP and PV. In order to prove whether the hybrid forecasting between generation types with weak correlation can improve the forecasting accuracy, we established the hybrid forecasting of WPG and PV and the hybrid forecasting of WPG and CSP. The details are recorded in Table 4.
(2) Case 2 Based on correlation analysis, hybrid generation bundled into the power system are superior is verified by the evaluation of hybrid forecasting, partial hybrid forecasting and independent forecasting. A total of five experimental groups were set up and recorded in Table 5. Independent forecast details are recorded in group 1. Details of the partial hybrid forecasts are recorded in groups 2, 3, and 4. Details of the hybrid forecasting are recorded in group 5. The differences of the above five prediction models in structure can be expressed more intuitively by mathematics. The mathematical expressions of group 1 to group 5 are expressed by Equations (8)-(12) respectively: P Σ (t + 1) = f (V, D, P WPG , P PV ) + g(D, P CSP ) (9) P Σ (t + 1) = f (V, D, P WPG , P CSP ) + g(D, P PV ) (10) P Σ (t + 1) = f (D, P PV , P CSP ) + g(V, P WPG ) (11) P Σ (t + 1) = f (V, D, P WPG , P CSP , P PV ) (12) where: P ∑ is the total output of the new energy cluster at t + 1 time, V is the row vector of historical wind speed, D is the row vector of historical DNI, P WPG is the historical power data of wind power plant, P PV is the historical power data of PV power plant, and P CSP is the historical power data of CSP plant. The length of time series input into LSTM is 24, then two sets of historical meteorological data and three sets of historical output data are expressed by Equations (13)- (17): (3) Case 3 In case 3, it is verified that the hybrid forecasting model based on a LSTM neural network is superior to other forecasting algorithms. It has been proved that LSTM can extract more effective information from historical data which could be used to forecast future power, the LSTM with long-term memory can be used to analyze the data with obvious time lag. Although Section 2.2 analyze the theory through LSTM models. But in reality, the model becomes extremely complex. We hope to prove the correctness of this theory through the experiment of real data. The specific arrangement of case 3 is shown in Table 6. By changing the length of time series, it is proved that the "long-term memory" ability of LSTM network can effectively improve the forecasting accuracy of hybrid forecasting model. The length of time series is 24 h, 12 h and 3 h respectively. Besides, the other three kinds of forecasting model are also used in this hybrid forecasting method, they are ARIMA, MLP neural network and RNN. Their forecasting performance is compared with LSTM network.

1.
Case 1 Figure 6 shows the actual and forecast power of PV and CSP in December. The RMSE% of hybrid and independent forecasting are 7.09% and 6.46%, respectively. It is obvious that the power of PV and CSP can be forecast by the hybrid forecasting method, which gives better results. It is clear that the result is unconvincing, which because of the same results cannot be guaranteed out of the sample. Twelve-fold cross validation is used to prove the generality and universality of the model. The monthly RMSE obtained by cross validation is recorded in Figure 7. Compared with the forecasting results in December, the hybrid forecasting model based on LSTM still has good forecasting in the other 11 groups of samples. Similar to the hybrid forecasting of PV and CSP, the other two groups have the same process. In order to record the forecast results more clearly, the average RMSE and RMES% obtained by cross validation are recorded in Table 7.
Sustainability 2021, 13, x FOR PEER REVIEW 12 of 16 groups have the same process. In order to record the forecast results more clearly, the average RMSE and RMES% obtained by cross validation are recorded in Table 7.  For any two renewable energy sources among WPG, PV and CSP, the RMSE% of hybrid forecasting is lower than RMSE% of independent forecasting. The simulation results show that the hybrid forecasting between the two types of power generation with correlation can greatly improve the forecasting accuracy.

Case 2
In Case 1, we know that any two combinations of PV, CSP, WPG is better than independent forecasting. We are still curious whether the hybrid forecasting among PV, CSP and WPG can improve the forecasting accuracy. In this case, we have done five comparative groups to test that theory. The groups of case 2 are shown in Table 5, where group 1 is an independent forecasting, groups 2 to 4 are partial hybrid forecasting, and group5 are hybrid forecasting. Their results were added together and the statistical indicators were recorded in Table 8. groups have the same process. In order to record the forecast results more clearly, the average RMSE and RMES% obtained by cross validation are recorded in Table 7.  For any two renewable energy sources among WPG, PV and CSP, the RMSE% of hybrid forecasting is lower than RMSE% of independent forecasting. The simulation results show that the hybrid forecasting between the two types of power generation with correlation can greatly improve the forecasting accuracy.

Case 2
In Case 1, we know that any two combinations of PV, CSP, WPG is better than independent forecasting. We are still curious whether the hybrid forecasting among PV, CSP and WPG can improve the forecasting accuracy. In this case, we have done five comparative groups to test that theory. The groups of case 2 are shown in Table 5, where group 1 is an independent forecasting, groups 2 to 4 are partial hybrid forecasting, and group5 are hybrid forecasting. Their results were added together and the statistical indicators were recorded in Table 8.  For any two renewable energy sources among WPG, PV and CSP, the RMSE% of hybrid forecasting is lower than RMSE% of independent forecasting. The simulation results show that the hybrid forecasting between the two types of power generation with correlation can greatly improve the forecasting accuracy.

2
Case 2 In Case 1, we know that any two combinations of PV, CSP, WPG is better than independent forecasting. We are still curious whether the hybrid forecasting among PV, CSP and WPG can improve the forecasting accuracy. In this case, we have done five comparative groups to test that theory. The groups of case 2 are shown in Table 5, where group 1 is an independent forecasting, groups 2 to 4 are partial hybrid forecasting, and group5 are hybrid forecasting. Their results were added together and the statistical indicators were recorded in Table 8.  Figure 8 records the forecast results of five prediction schemes on a day in December. For better comparison, we recorded the annual average of RMES and RMSE% in descending order in Table 8 and Figure 9. Obviously, the RMSE% of independent forecasting is less than that of partial hybrid forecasting, and the RMSE% of hybrid forecasting model is the smallest. Finally, we believe that hybrid forecasting among PV, CSP and WPG can improve the forecasting accuracy. The effect of hybrid forecasting is best because of the following reasons.   Figure 8 records the forecast results of five prediction schemes on a day in December. For better comparison, we recorded the annual average of RMES and RMSE% in descending order in Table 8 and Figure 9. Obviously, the RMSE% of independent forecasting is less than that of partial hybrid forecasting, and the RMSE% of hybrid forecasting model is the smallest. Finally, we believe that hybrid forecasting among PV, CSP and WPG can improve the forecasting accuracy. The effect of hybrid forecasting is best because of the following reasons.  First of all, because wind speed and DNI are correlated meteorological variables, there is a weak correlation between WPG and PV. During the day, the DNI is high and the wind speed is often weak. After the sun sets at night, the wind speed is strengthened because of the great change of the surface temperature difference. Besides, the wind speed is often higher on rainy days than on sunny days. What is more, In China's monsoon climate region, generally, wind speed and DNI are complementary in season. In short, it is precisely because of these meteorological correlations that WPG and PV power generation show a certain correlation. Besides, DNI is the main meteorological variable determining PV and CSP power generation, so there is a trend correlation between PV and CSP. The correlation is very strong when there is no power compensation. What is more, in the hybrid power generation system, CSP is used to compensate for the fluctuation of PV and   Figure 8 records the forecast results of five prediction schemes on a day in December. For better comparison, we recorded the annual average of RMES and RMSE% in descending order in Table 8 and Figure 9. Obviously, the RMSE% of independent forecasting is less than that of partial hybrid forecasting, and the RMSE% of hybrid forecasting model is the smallest. Finally, we believe that hybrid forecasting among PV, CSP and WPG can improve the forecasting accuracy. The effect of hybrid forecasting is best because of the following reasons.  First of all, because wind speed and DNI are correlated meteorological variables, there is a weak correlation between WPG and PV. During the day, the DNI is high and the wind speed is often weak. After the sun sets at night, the wind speed is strengthened because of the great change of the surface temperature difference. Besides, the wind speed is often higher on rainy days than on sunny days. What is more, In China's monsoon climate region, generally, wind speed and DNI are complementary in season. In short, it is precisely because of these meteorological correlations that WPG and PV power generation show a certain correlation. Besides, DNI is the main meteorological variable determining PV and CSP power generation, so there is a trend correlation between PV and CSP. The correlation is very strong when there is no power compensation. What is more, in the hybrid power generation system, CSP is used to compensate for the fluctuation of PV and First of all, because wind speed and DNI are correlated meteorological variables, there is a weak correlation between WPG and PV. During the day, the DNI is high and the wind speed is often weak. After the sun sets at night, the wind speed is strengthened because of the great change of the surface temperature difference. Besides, the wind speed is often higher on rainy days than on sunny days. What is more, In China's monsoon climate region, generally, wind speed and DNI are complementary in season. In short, it is precisely because of these meteorological correlations that WPG and PV power generation show a certain correlation. Besides, DNI is the main meteorological variable determining PV and CSP power generation, so there is a trend correlation between PV and CSP. The correlation is very strong when there is no power compensation. What is more, in the hybrid power generation system, CSP is used to compensate for the fluctuation of PV and WPG generation, so there is a certain negative correlation between the generation of CSP and the other two types of generation. Finally, the three types of power generation may be affected by temperature, air pressure, humidity and other meteorological factors at the same time. There is a certain coupling relationship among these types of power generation. 3 Case 3 The Wind-PV-CSP hybrid forecasting was repeated by changing the time series length of LSTM neural network. The forecasting results show that with the shortening of time series, the value of RMSE% becomes larger and the forecasting accuracy becomes worse, which because the important information is lost by LSTM. The RMSE% of the LSTM forecasting model under different time series lengths are recorded in Table 9 and Figure 10. WPG generation, so there is a certain negative correlation between the generation of CSP and the other two types of generation. Finally, the three types of power generation may be affected by temperature, air pressure, humidity and other meteorological factors at the same time. There is a certain coupling relationship among these types of power generation.

Case 3
The Wind-PV-CSP hybrid forecasting was repeated by changing the time series length of LSTM neural network. The forecasting results show that with the shortening of time series, the value of RMSE% becomes larger and the forecasting accuracy becomes worse, which because the important information is lost by LSTM. The RMSE% of the LSTM forecasting model under different time series lengths are recorded in Table 9 and Figure 10.  In addition to changing the time series length of LSTM neural network, the hybrid forecasting model of Wind-PV-CSP by other three forecasting model are also modeled, and the RMSE% of these three forecasting models are recorded in Table 9 and Figure 10. The results show that the LSTM method with long-term memory has a higher forecasting accuracy in the hybrid forecasting of Wind-PV-CSP generation.

Conclusions
High proportion of renewable energy clusters connected to the power grid are expected to be an important trend in the future. Improving the forecasting accuracy of these hybrid generation clusters is a great challenge and a difficult task. This paper proposes a short-term forecasting method based on LSTM for hybrid generation cluster composed of three types of renewable energy power generation, namely wind power, photovoltaic power and CSP. A series of conclusions can be drawn from the results of this study.
The generation cluster of CSP and photovoltaic power generation can reduce the uncertainty of renewable energy generation, so that the hybrid forecasting of CSP and PV generation has higher forecast accuracy than the independent forecasting. Besides, complementarity between wind and solar energy is also used for the forecasting of hybrid  In addition to changing the time series length of LSTM neural network, the hybrid forecasting model of Wind-PV-CSP by other three forecasting model are also modeled, and the RMSE% of these three forecasting models are recorded in Table 9 and Figure 10. The results show that the LSTM method with long-term memory has a higher forecasting accuracy in the hybrid forecasting of Wind-PV-CSP generation.

Conclusions
High proportion of renewable energy clusters connected to the power grid are expected to be an important trend in the future. Improving the forecasting accuracy of these hybrid generation clusters is a great challenge and a difficult task. This paper proposes a short-term forecasting method based on LSTM for hybrid generation cluster composed of three types of renewable energy power generation, namely wind power, photovoltaic power and CSP. A series of conclusions can be drawn from the results of this study.
The generation cluster of CSP and photovoltaic power generation can reduce the uncertainty of renewable energy generation, so that the hybrid forecasting of CSP and PV generation has higher forecast accuracy than the independent forecasting. Besides, complementarity between wind and solar energy is also used for the forecasting of hybrid generation cluster. The hybrid forecasting improves the forecast accuracy of Wind-PV-CSP generation cluster. Additionally, LSTM can make better use of the correlation and long time series data among different types of power generation. The memory function makes LSTM perform better in hybrid forecasting than that of ARIMA, MLP and RNN.
For future works, on the one hand, the structure of the hybrid forecasting is relatively simple and we are considering the ensemble method based on LSTM. On the other hand, for renewable energy cluster, only one group of PV, wind power and CSP data is used. The next step is increasing the number of power plants for each type of power generation.