You are currently viewing a new version of our website. To view the old version click .
Energies
  • Article
  • Open Access

15 October 2018

Ultra-Short-Term Wind Power Prediction Based on Multivariate Phase Space Reconstruction and Multivariate Linear Regression

,
and
1
College of Electrical and Information Engineering, Hunan University, Changsha 410082, China
2
College of Automation, Foshan University, Foshan 52800, China
*
Author to whom correspondence should be addressed.
This article belongs to the Special Issue Power Electronics in Renewable Energy Systems

Abstract

In order to improve the accuracy of wind power prediction (WPP), we propose a WPP based on multivariate phase space reconstruction (MPSR) and multivariate linear regression (MLR). Firstly, the multivariate time series (TS) are constructed through reasonable selection of wind power and weather factors, which are closely associated with wind power. Secondly, the phase space of the multivariate time series is reconstructed based on the chaos theory and C-C method. Thirdly, an auto regression model for multivariate phase space is created by regarding phase variables as state variables, and the very-short-term wind power is predicted by using a multi-linear regression algorithm. Finally, a parallel algorithm based on map/reduce is presented to improve computing speed. A cloud computing platform, Hadoop consisting of five nodes, is established as a matter of convenience, followed by the prediction of wind power of a wind farm in the Hunan province of China. The experimental results show that the model based on MPSR and MLR is more accurate than both the continuous method and the simple approximation method, and the parallel algorithm based on map/reduce effectively accelerates the computing speed.

1. Introduction

In the past decades, with the increasing population, industrial need, and energy need [1], a large amount of fuels such as fossil oil, coal, and natural gas have been consumed. However, the fossil fuels can discharge a large amount of greenhouse gas and pollute the environment. What is more, the fossil fuels are non-renewable and diminishing day-by-day. Therefore, researchers have focused on the renewable energy sources, among which wind power generation is one of the most mature renewable energies with lower pollution and greenhouse gas emissions [2]. Wind power generation is affected by wind speed, wind direction, temperature, turbine type, terrain roughness, air density, and so on [3]. The wind power is random and intermittent. To reduce the risk that is caused by the wind power’s fluctuation, both a one-day-ahead (0–24 h) and a real-time (15 min–4 h) wind power predicting report should be submitted to the Grid Dispatch Center in China. Actually, the ultra-short-term wind power prediction (UST-WPP) has been extensively employed in many fields, such as balancing load, optimal operation of reserves [4], and wind farm control [5]. The wind farm owners, power users, and facilities benefit from an improved wind power prediction (WPP).
According to the current law in China, the wind power prediction with horizons of 1–4 h and 24 h are necessary. This paper focuses on forecasting wind power with short horizons. Specifically, we propose a distributed model for ultra-short-term WPP based on multivariate phase space reconstruction (MPSR) and multivariate linear regression (MLR). Compared with other approaches for short-term wind power prediction, such as artificial neural networks (ANNs), support vector machine (SVM) and single-variable phase space reconstruction, the proposed wind power prediction model is more precise and quicker.
The remainder of this paper is organized as follows. The background of our work is introduced in Section 1. Section 2 reviews the related work. The prepared knowledge is given and the parallel model is proposed in Section 3. Section 4 expresses the key points of the parallel algorithm based on map/reduce. Section 5 describes the prediction experiments. Section 6 outlines the conclusions.

3. Prepare Knowledge and System Modeling

3.1. Time-Series Similarity

Time-series similarity [27,28] is measured with many methods, such as Minkowski distance, Euclidean distance, Pearson coefficient, and dynamic time warping (DTW) distance [29,30]. Compared to other methods, DTW is not sensitive to synchronization and can measure the similarity of time-series with different lengths. Thus, DTW is employed to measure the similarity of numerical weather prediction series. DTW is defined as follows:
Definition 1.
Dynamic time warping (DTW) distance. X = { x 1 , x 2 , , x l } and Y = { y 1 , y 2 , , y n } are 2 time-series, where l and n are the lengths of time series X and Y, respectively. The DTW between X and Y is defined recursively as Equation (1).
D d t w ( X , Y ) = { 0 ,       i f   X = Y = { } + ,     i f   X = { }   a n d   Y { } + ,     i f   X { }   a n d   Y = { } d ( x 1 , y 1 ) + min { D d t w ( r e s t ( X ) , Y ) D d t w ( r e s t ( X ) , r e s t ( Y ) ) D d t w ( X , r e s t ( Y ) )
where {} indicates empty time-series, d ( x , y ) = | | x , y | | 2 is the 2-norm [31] of x and y, and r e s t ( X ) = { x 2 , , x l } .
Definition 2.
Time-series similarity. X and Y are 2 time-series, and the time-series similarity between X and Y is defined as Equation (2).
S ( X , Y ) = { 0 , i f     θ < D d t w ( X , Y ) θ D d t w ( X , Y ) , e l s e
where θ is a threshold, D d t w ( X , Y ) is the DTW between X and Y.

3.2. Multi-Variable Phase Space Reconstruction

According to Takens-theorem [32,33], the phase space of chaotic time series x 1 , x 2 , , x n is reconstructed with appropriate embedding dimension m and time delay τ ; the original system and the trajectory of reconstructed phase space are differential homeomorphism. The phase space reconstruction theory [34] of single-variable chaotic time series could be extended to multi-variable chaotic time series. The multi-variable phase space reconstruction is defined as follows.
Definition 3.
Multi-variable phase space reconstruction. X1, X2, …, XK are K chaotic time-series, where Xi = { x 1 i , x 2 i , , x n i } , and i = 1, 2, …, K. The q-th phase point of reconstructed phase space for multi-variable chaotic time series X1, X2, …, X is shown as Equation (3).
V q = { x q 1 , x q τ 1 1 , , x q ( m 1 1 ) τ 1 1 ,                       x q 2 , x q τ 2 2 , , x q ( m 2 1 ) τ 2 2 , ,                       x q K , x q τ K K , , x q ( m K 1 ) τ K K }
where q = j , j + 1 , n , j = max 1 i K { ( m i 1 ) τ i } + 1 and m i and τ i are embedding dimension and time delay of the chaotic time-series X i , respectively. The embedding dimension m i and time delay τ i are obtained using C-C algorithm [33].

3.3. Ultra-Short-Term WPP Modeling

In this paper the ultra-short-term WPP model is based on multi-variable phase space reconstruction, similarity of time-series, and linear regression. Our forecasting model includes steps steps, which are specifically presented in Figure 1.
Figure 1. Flow diagram of ultra-short-term wind power prediction (WPP).
More details about the proposed ultra-short-term WPP model are listed below.
Step 1: Data preprocessing. The historical wind power data and weather data may be missing for some reason, for instance, equipment failure, network interruption, etc. The continuous missing data series M d , whose length is more than 5, is removed from historical data series. Otherwise, data is specified by simple interpolation, as expressed in Equation (4).
x i = x 0 + i × ( x e x s ) N + 1
where x s is the data point before M d , x e is the data point behind M d , i = 1 , 2 , , N , and N is the length of missing data series M d .
Step 2: Data dimensionality reduction. NWP includes some important parameters, for instance, wind speed v, wind direction θ , atmospheric pressure p, and temperature t. The speed and direction of wind are shown in Figure 2.
Figure 2. The decomposition of wind speed.
In Figure 2, the x-axis is the latitude line of the given location, and the east is the positive direction. The y-axis is the longitude of the given location, and the north is the positon direction. In order to conveniently forecast wind power, wind speed ν is decomposed to ν x and ν y .
ν x = ν × cos θ ν y = ν × sin θ
where θ is the angle between wind direction ν and the x-axis.
Only the most important parameters ( ν x , ν y and p) of NWP are selected in the presented model. The historical NWP data can be expressed as a N × 3 matrix M, which is clearly shown as Equation (6).
M = ( u 1 , ν 1 , p 1 u 2 , ν 2 , p 2 u N , ν N , p N )
where N is the number of NWP data points. The matrix M is converted to a N × 1 matrix using the approach in reference [2], as displayed in Equation (8).
C = M T × M
X = M × e = ( w 1 ,   w 2 , , w N ) T
where e is an eigenvector, whose corresponding Eigen value is the maximal Eigen value of matrix C.
Step 3: The most similar NWP series segments searching: NWP0 is the NWP data series during 2L hours, which consist of two equal parts, namely, the former L hours and the latter L hours. NWP0 includes 8L data points (4 points per hour). NWP0 is converted to X 0 , and the whole historical NWP data series is converted to X according to Equations (6)–(8). X i s is the top K sub-series of X, which are most similar to X 0 , and S i is the similarity between X 0 and X i s , i = 1 , 2 , , K .
Step 4: The phase space reconstruction for multivariate time series. τ X and τ P are the delay time of X and P, respectively, and m X and m P are the embedding dimensions of X and P, respectively. τ X , τ P , m X and m P are achieved by C-C method. P i and X i ( i = 0 , 1 , , K ) are the wind power sub-series and reduced NWP sub-series at the same time period, and P h i is the reconstructed phase space of P i and X i .
Step 5: Multivariate linear regression. The multivariate linear regressions are shown as Equation (9)
( y 1 y 2 y m ) = ( c 1 , 1 , c 1 , 2 , , c 1 , m c 2 , 1 , c 2 , 2 , , c 2 , m c m , 1 , c m , 2 , , c m , m ) ( x 1 x 2 x m ) + ( c 1 , 0 c 2 , 0 c m , 0 ) + ( ξ 1 ξ 2 ξ m )
where ξ = ( ξ 1 , ξ 2 , , ξ m ) T is residual error, ( x 1 , x 2 , , x m ) T is the input, and ( y 1 , y 2 , , y m ) T is the output. The coefficient matrix C and constant term C 0 are shown as Equation (10).
C = ( c 1 , 1 , c 1 , 2 , , c 1 , m c 2 , 1 , c 2 , 2 , , c 2 , m c m , 1 , c m , 2 , , c m , m )   and   C 0 = ( c 1 , 0 c 2 , 0 c m , 0 )
The training process of linear regression models is to search for appropriate coefficient matrix C and constant term C 0 , so that residual error ξ minimizes.
Given the residual error equals 0, then Equation (9) is converted into Equation (11).
( y 1 y 2 y m ) = ( c 1 , 0 , c 1 , 1 , c 1 , 2 , , c 1 , m c 2 , 0 , c 2 , 1 , c 2 , 2 , , c 2 , m c m , 0 , c m , 1 , c m , 1 , , c m , m ) ( 1 x 1 x 2 x m )
The Equation (11) is further expressed as follows:
Y = ( C 0 , C ) ( 1 X )
Given a group of training samples < X i , Y i > , ( i = 1 , 2 , , K ), the goal of the training process is to find appropriate C and C0, so that the mean error, namely, Equation (13) minimizes.
i = 1 n ( Y i Y i 2 ) n
where Y i = ( C 0 , C ) ( 1 X i ) .
After the linear regression model is trained, Y is obtained from Equation (12) by assuming X is given.
P h ( i ) is the i-th phase point of reconstructed phase space P h . Considering that the reconstructed phase space is locally linear, P h ( i ) and P h ( i + 1 ) are linearly related as follows.
P h ( i + 1 ) = ( C 0 , C ) ( 1 P h ( i ) )
The Equation (15) is obtained by substituting all the phase points of Ph into Equation (14).
( P h ( i + 1 ) , P h ( i ) , , P h ( 2 ) ) = C (       1 ,                   1 ,               ,         1 P h ( i ) , P h ( i 1 ) , , P h ( 1 ) )
where C = ( C 0 , C ) . C is achieved from the following Equation.
C = ( P h ( i + 1 ) , P h ( i ) , , P h ( 2 ) ) × (       1 ,                   1 ,               ,         1 P h ( i ) , P h ( i 1 ) , , P h ( 1 ) ) 1
where (       1 ,                   1 ,               ,         1 P h ( i ) , P h ( i 1 ) , , P h ( 1 ) ) 1 is the generalized inverse matrix of (       1 ,                   1 ,               ,         1 P h ( i ) , P h ( i 1 ) , , P h ( 1 ) ) . When C is given, the next phase point is obtained from the current phase point and Equation (14).
Step 6: The comprehensive forecasting results. If Ph1,Ph2, …, Phk are the reconstructed phase space of the top K most similar time-series segments respectively, according to step 4, then C k is achieved by substituting Phk into Equation (16). The multivariate linear regression modes are shown as Equation (17).
Ph k ( i + 1 ) = C k (         1 P h k ( i ) )
where Phk (i) and Phk (i + 1) are the input and output of the k-th linear regression model, k = 1, 2, …, k. The predicted phase point is achieved when the phase points of the current NWP and wind power data are input into Equation (17). The predicted value of the k-th WPP model is shown as Equation (18).
P k = P h k ( m P , i + 1 )
where Pk is the predicted value of wind power by the k-th regression model and P h k ( m P , i + 1 ) is the m P -th element of the predicted phase point Phk (i + 1), k = 1 , 2 , , K . The comprehensive forecasting sums the weighed P k as follows.
P = S k i = 1 K S k P k , k = 1 , 2 , , K
where S k is the similarity between X 0 and X i s . The forecasting wind power series will be obtained when the above comprehensive forecasting process is executed iteratively.

4. Parallel Algorithm Based on Map/Reduce

To improve the ultra-short-term WPP, the multivariate phase space reconstruction, the similarities of time-series and the multi-variate linear regression are employed to model wind power prediction. However, the increasing wind power and NWP data increase the time and space complexity and affect the predicting accuracy of the ultra-short-term WPP. In order to accelerate the computing speed, we present a parallel algorithm of ultra-short-term WPP, which is based on the map/reduce programming model.

4.1. Map/Reduce Programming Model

Map/reduce is an extendable parallel programming model, which is widely used in the parallel computation of big data. The diagrammatic layout of map/reduce is shown in Figure 3.
Figure 3. Diagrammatic layout of map/reduce. MAP: the mapping process of hadoop; HDFS: hadoop distributed file system.
Map/reduce includes two phases: mapping and reducing. Every sub-process is highly parallel in these two phases. The specific procedures are as follows:
  • The raw data are input into the key/value pairs (key, value), and the data are processed as much as possible without communication. The intermediate data created in map phase are also saved as key/value pairs (intermediate-key, intermediate-value).
  • The intermediate pairs with the same intermediate-key are transferred to the same reducing process with the completion of the mapping process. The reducing process starts when all intermediate data are transferred. When both mapping and reducing processes are completed, the final results are achieved.

4.2. The Algorithm of Ultra-Short-Term WPP

The map/reduce model is widely used to process big data, however, a single map/reduce job only can complete simple jobs. To solve complex problems, the complex process is usually divided into some sub-processes, and the work is completed together with many map/reduce jobs. Due to the complexity of the ultra-short-term WPP, which is proposed in Section 3.2, the work is completed by five map/reduce jobs (see Algorithm 1).
Algorithm 1. Parallel Ultra-short-term wind power prediction based on Map/reduce.
Job 1: Reducing dimension of NWP matrix.
Map: The NWP matrix M is separated into N sub-matrixes M i , where i = 1, 2, …, N. Each sub-matrix M i is a reduced dimension based on the method in Section 3.2.
(1) Each map process reads sub-matrix M i of M .
(2) The intermediate data pairs ( 0 , i , X i ) is achieved by reducing the data dimension of each sub-matrix M i , according to Equation (8).
Reduce: The reducing dimension series X is created by connecting in order all X i , which are from map process.
(1) X = { } ;
(2) For i = 1:N
                 X i is added to the end of X ;
       end for
(3) Output X ;
        For the sake of convenience, X 0 is the reduced NWP data series during 2L hours, which consists of two equal parts, namely, the former L hours and the latter L hours. X 0 is removed from X , and the remainder of X is divided into N sub-series, X 1 , X 2 , , X N , where | X i | | 8 L , the head of X 1 is the first node in X, and the head of X i + 1 is the (8L − 1)-th node of X i from the bottom, i = 1 , 2 , , N 1 .
Job 2: Searching top k sub-series, which are the most similar to X0.
Map:
(1) Each map process reads the pairs ( i , < X i , X 0 > ) ;
(2) For j = 1:||Xi|| − L % where ||Xi|| is the length of Xi [31].
        (1) We compute the DTW between X 0 and X i j , which is X i ’s sub-series including L elements and starting with the j-th element of Xi;
        (2) The intermediate data are key/value pairs (i, ‘j, DTW’)
End for
Reduce:
(1) k = ( k = 1 i 1 X i ) + j ;
(2) S k = | θ D T W i j | ;
(3) Output ( k , S k ) .
        The sub-series are sorted by S k according to the method in reference [35], and the top k sub-series of NWP and wind power are selected. The top k most similar sub-series of NWP and wind power are denoted as X k and P k , where k = 1, 2, …, k.
        The sub-series are sorted by S k according to the method in reference [35], and the top k sub-series of NWP and wind power are selected. The top k most similar sub-series of NWP and wind power are denoted as X k and P k where k = 1, 2, …, k.
Job 3: The embedding dimension and delaying time of P and X are computed in this job.
        For the sake of convenience, the time series P and X in our algorithm are denoted as P1 and P2, respectively.
Map:
(1) ( i , P i ) C C     a lg o r i t h m ( m i , τ i ) ;  % m i and τ i are obtained by C-C algorithm
(2) J i = ( m i 1 ) τ i ;         % calculating the starting index J i of Pi
(3) The intermediate data of this map are data pairs ( 0 , m i , τ i , J i ) , where m i , τ i and J i are the embedding dimension, delaying time, and starting point of Pi, respectively.
Reduce:
(1) J = max 1 i 2 { J i } + 1
(2) m P = m 1 , τ P = τ 1 , m X = m 2 , τ X = τ 2 ;
(3) Output m P , τ P , m X , τ X , J .
Job 4: Multivariate phase space reconstruction
Map:
(1) ( k , J # m X # τ X # m P # τ P # X k # P k ) r e c o n s t r u c t e d ( 0 , k ) , where Xk and Pk are the most similar sub-series of top K achieved from Job 2, and k = 1 , 2 , , k .
Reduce:
(1) Output reconstructed phase space k .
Job 5: Multi-variables linear regression
Map:
(1) Input < i , k # S k # 0 > , where 0 is the reconstructed phase space during 2L hours.
(2) The linear model is trained by < k , j , k , j + 1 , where k , j is the j-th phase point of reconstructed phase space k , j = 1, 2, …, k 1 ;
(3) After the linear model has been trained, the next phase point is obtained according to Equation (12) and the last point of 0 .
(4) The step 3 is executed iteratively, and then the forecasting reconstructed phase space k P is achieved.
(5) The forecasting power sequence P k j is obtained based on an inverse process of the phase space reconstruction and reconstructed phase space k P .
(6) The intermediate data are < 0 , P k j , S k > .
Reduce:
The comprehensive wind power is given as follows.
P j = S k k = 1 k S k P k j , k = 1 , 2 , , k ; j = 1 , 2 , , B
where j = 1, 2, …, B, and B is the forecasting time-scale

5. Application and Case Study

5.1. The Experimental Data and Environments

The experimental data includes 2 sections: NWP data and wind power data. The NWP data ranged from 1 September 2012 to 31 August 2013 and were from the key laboratory of regional numerical weather predictions in a province of China. NWP data include temperature, humidity, wind speed, wind direction, atmosphere pressure, and so on. The NWP is from the mesoscale numerical prediction model. The temporal resolution of NWP is 1 h. The horizontal resolution of NWP is 3 × 3 km, and it is improved to 1 × 1 km by the dynamic downscaling model [36]. In our experiment, only the data of wind speed, wind direction, and atmosphere pressure were selected. The wind speed and wind direction were first converted into x components and y components, where the east and north are regarded as the positive direction of x-axis and y-axis, respectively. The converted NWP data are shown in Table 3.
Table 3. Historical numerical weather prediction (NWP) data of a wind farm.
In Table 3, each row is an NWP data record, which is composed of the time, the wind speed in the x-component, the wind speed in the y-component, and the atmosphere pressure. simple interpolation method was used to get the 15-min NWP data. The wind power (15-min) from 1 September 2012 to 31 August 2013 was used to train the model. The Figure 4 shows the daily peak wind power in each month.
Figure 4. Historical data of wind power.
In Figure 4, the x axis is time, y axis is the per-unit value of wind power. Because the wind power fluctuates severely at different times, the data are normalized based on Equation (18).
P N = P / P max
where P N is the normalized wind power, P is wind power, and P max is the peak daily wind power from 1 September 2012 to 31 August 2013.
To implement the experiment, we tried to establish an experimental cloudy computing platform Hadoop, which is composed of five nodes including a 4-core central processing unit (CPU) (Intel Core i5) and an Ubuntu operation system. The ultra-short-term wind power was individually predicted by continuous prediction method, simple approximation method, and other methods proposed in this paper.
In terms of continuous prediction method, the nearest observed values were regarded as the next predicting values, namely x i + 1 = x i . The simple approximation method is another kind of continuous method based on phase space reconstruction, and the nearest observed phase points were regarded as the values of next phase point i, namely, X i + 1 = X j + 1 , where X j is the nearest phase point of X i . The error index was the normal mean absolute error (NMAE) in this paper.
N M A E = i = 1 N | x i y i | N × C
where x i is the measured wind power, y i is the forecasting wind power, C is the installed capacity of wind farm, and N is the number of periods being forecasted. More details about NMAE are described in reference [26].

5.2. Results and Analysis of the Experiment

The embedding dimension and delaying time of wind power series P and NWP series X were computed by C-C method with 4000 points and 8000 points, respectively. The embedding dimension and delaying time of wind power series P were 5 and 18, respectively. The embedding dimension and delaying time of NWP series P were 6 and 17, respectively.
The number of the most similar sub-series could affect the ultra-short-term wind power prediction. To optimize the number of most similar sub-series, the number of most similar sub-series ranged from 1 to 10 in the experiment. The experimental results are shown as Figure 5, when the number of most similar sub-series ranges from 1 to 10.
Figure 5. Relation between the number of most similar sub-series and normal mean absolute error.
Figure 5 shows the relation between NMAE and the number of most similar sub-series. When fewer of the most similar segments (especially 1) are used to train a multi-variate linear regression model, the multi-time weighted linear regression becomes a single variable linear regression, and the results have bigger errors. With increasing K, the errors are slowly reduced. When K is near to the threshold, the NMAE keeps getting smaller. However, when the K is larger than a certain number, the errors increase slowly. The reason for this phenomenon is that the dissimilar series could increase the errors of ultra-short-term WPP. A good result can be achieved when K is between 4 and 8.
Figure 6 demonstrates the relation between forecasting time-scale and errors. When the predicting time-scale is less than 6 h, the predicting errors are stable and smaller. With an increasing forecasting time-scale, the average predicting error increases gradually. Figure 6 illustrates that the proposed model is suitable for ultra-short-term WPP, due to its smaller predicting time-scale.
Figure 6. Relation between time-scale and NMAE.
Our model is trained with the historical wind power and NWP data from 00:00 of 1 September 2012 to 24:00 of 31 August 2013. When the five most similar segments are used to train the linear regression model, the experimental results are shown in Figure 7.
Figure 7. Results of wind power predictions. ‘Measured Value’ is the measurement of wind power. ‘PM’ is the forecasting results of persistence method. ‘ARIMA’ is the forecasting results of the autoregressive integrated moving average. ‘BPNN’ is the forecasting result of the back propagation neural networks [37]. ‘SPSR’ is the forecasting of the WPP based on single-variable phase space reconstruction. ‘LSSVR’ is the forecasting results of the WPP based on LSSVR. ‘MPSR-MLR’ is the forecasting results of the proposed model.
Figure 7 shows that most of the forecasting results of the proposed model are near to the actual measurements. It proves that MPSR and MLR are suitable for ultra-short-term WPP. The error of the different models are shown as Figure 8.
Figure 8. The errors of different wind power predictions.
In Figure 8, ‘PM’ is the errors of the persistence method. ‘ARIMA’ is the errors of the WPP based on autoregressive integrated moving average. ’BPNN’ is the errors of the WPP based on back propagation neural networks [37]. ‘SPSR’ is the errors of the WPP based on the single-variable phase space reconstruction. ‘LSSVR’ is the errors of the WPP based on LSSVR. ‘MPSR-MLR’ is the errors of the proposed model. The statistical results of errors for different methods are listed in Table 4.
Table 4. Historical data of a wind farm.
Figure 8 and Table 4 show that PM is not applicable to the seriously fluctuating wind power series. MPSR-MLR can reduce the predicting errors in the seriously fluctuating wind power series. The error of our model is lower than that of other WPP methods. MPSR-MLR outperforms SPSR by mining the correlation of different time series.

6. Conclusions

Various models and methods have been extensively employed in the field of ultra-short-term WPP as no model is suitable for all conditions and all wind farms. In order to improve the ultra-short-term WPP, we have proposed a novel model of the ultra-short-term WPP based on the multi-variates phase space reconstruction and the linear regression in this paper. The proposed model improves the ultra-short-term WPP by mining the correlation of different time series. The performance is particularly good, when the wind power series fluctuate seriously. A map/reduce-based parallel algorithm is implemented on a cloudy computing platform, Hadoop. The research results allow for three conclusions:
  • Wind power is a chaotic time series, and multi-variate phase space reconstruction can improve the ultra-short-term WPP by mining the correlation of these wind power series.
  • It is very difficult to forecast wind power, especially when the wind power series fluctuates seriously. The proposed model improves the performance during the wind ramp drastically.
  • The forecasting speed is accelerated by adopting both the map/reduce-based parallel algorithm and the cloudy computing platform.

Author Contributions

Conceptualization, R.L. and M.P.; methodology, R.L.; software, R.L.; validation, X.X. and M.P.; formal analysis, R.L.; investigation, R.L.; resources, R.L.; data curation, R.L.; writing—original draft preparation, R.L.; writing—review and editing, R.L. and M.P.; visualization, R.L.; supervision, R.L.; project administration, M.P.; funding acquisition, M.P.

Acknowledgments

The authors would like to acknowledge the support provided by the Natural Science Foundation of China No. 61472128 and No. 61173108.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Xue, Y.; Yu, C.; Zhao, J.; Li, K.; Liu, X.; Wu, Q.; Yang, G. A Review on Short-term and Ultra-short-term Wind Power Prediction. Autom. Electric Power Syst. 2015, 39, 141–151. [Google Scholar]
  2. Rafique, M.M.; Rehman, S.; Alam, M.M.; Alhems, L.M. Feasibility of a 100MW Installed CapacityWind Farm for Different Climatic Conditions. Energies 2018, 11, 2147. [Google Scholar] [CrossRef]
  3. Xie, W.; Zhang, P.; Chen, R.; Zhou, Z. A Nonparametric Bayesian Framework for Short-Term Wind Power Probabilistic Forecast. IEEE Trans. Power Syst. 2018. [Google Scholar] [CrossRef]
  4. Khorramdel, B.; Chung, C.Y.; Safari, N.; Price, G.C.D. A Fuzzy Adaptive Probabilistic Wind Power Prediction Framework Using Diffusion Kernel Density Estimators. IEEE Trans. Power Syst. 2018, 958–965. [Google Scholar] [CrossRef]
  5. Wang, Y.; Liao, W.; Chang, Y. Gated Recurrent Unit Network-Based Short-Term Photovoltaic Forecasting. Energies 2018, 11, 2163. [Google Scholar] [CrossRef]
  6. Foley, A.; Leahy, P.G.; Marvuglia, A.; McKeogh, E.J. Current methods and advances in forecasting of wind power generation. Renew. Energy 2012, 31, 1–8. [Google Scholar] [CrossRef]
  7. Khodayar, M.; Wang, J.; Manthouri, m. Interval Deep Generative Neural Network for Wind Speed Forecasting. IEEE Trans. Smart Grid 2018, 1–16. [Google Scholar] [CrossRef]
  8. Pimkumwong, N.; Wang, M.-S. Online Speed Estimation Using Artificial Neural Network for Speed Sensorless Direct Torque Control of Induction Motor based on Constant V/F Control Technique. Energies 2018, 11, 2176. [Google Scholar] [CrossRef]
  9. Lu, P.; Ye, L.; Sun, B.; Zhang, C.; Zhao, Y.; Teng, J. A New Hybrid Prediction Method of Ultra-Short-Term Wind Power Forecasting Based on EEMD-PE and LSSVM Optimized by the GSA. Energies 2018, 11, 697. [Google Scholar] [CrossRef]
  10. Ozkan, M.B.; Karagoz, P. A Novel Wind Power Forecast Model: Statistical Hybrid Wind Power Forecast Technique (SHWIP). IEEE Trans. Ind. Inform. 2015, 11, 375–387. [Google Scholar] [CrossRef]
  11. Shi, J.; Ding, Z.; Lee, W.-J.; Yang, Y.; Liu, Y.; Zhang, M. Hybrid Forecasting Model for Very-Short-Term Wind Power Forecasting Based on Grey Relational Analysis and Wind Speed Distribution Features. IEEE Trans. Smart Grid 2014, 5, 521–526. [Google Scholar] [CrossRef]
  12. Xu, Q.; He, D.; Zhang, N.; Kang, C.; Xia, Q.; Bai, J.; Huang, J. A Short-Term Wind Power Forecasting Approach with Adjustment of Numerical Weather Prediction Input by Data Mining. IEEE Trans. Sustain. Energy 2015, 6, 1283–1291. [Google Scholar] [CrossRef]
  13. Safari, N.; Chung, C.Y.; Price, G.C.D. Novel Multi-Step Short-Term Wind Power Prediction Framework Based on Chaotic Time Series Analysis and Singular Spectrum Analysis. IEEE Trans. Power Syst. 2018, 33, 590–601. [Google Scholar] [CrossRef]
  14. Lee, D.; Baldick, R. Short-Term Wind Power Ensemble Prediction Based on Gaussian Processes and Neural networks. IEEE Trans. Smart Grid 2014, 5, 501–510. [Google Scholar] [CrossRef]
  15. Chen, L.; Lai, X. Comparision between ARIMA and ANN models used in short-term wind speed forecasting. In Proceedings of the 2011 Asia-Pacific Power and Energy Engineering Conference, Wuhan, China, 25–28 March 2011; pp. 1–4. [Google Scholar]
  16. Zeng, J.; Qiao, W. Short-Term Wind Power Prediction Using a Wavelet Support Vector Machine. IEEE Trans. Sustain. Energy 2012, 3, 255–264. [Google Scholar] [CrossRef]
  17. Giorgi, M.G.D.; Campilongo, S.; Ficarella, A.; Congedo, P.M. Comparison between wind power prediction models based on wavelet decomposition with least-squares support vector machine (LSSVM) and artificial neural network (ANN). Energies 2014, 7, 5251–5272. [Google Scholar] [CrossRef]
  18. Wu, Q.; Peng, C. Wind power generation forecasting using least square support machine combined with ensemble empirical model decomposition, principal component analysis and a bat algorithm. Energies 2016, 9, 261. [Google Scholar] [CrossRef]
  19. An, X.L.; Jiang, D.X.; Zhao, M.H.; Liu, C. Short-term preciton of wind power using EMD and chaotic theory. Commun. Nonlinear Sci. Numer. Simul. 2012, 17, 1036–1042. [Google Scholar] [CrossRef]
  20. Zhang, Y.; Lu, J.; Meng, Y.; Yan, H.; Li, H. Wind power short-term forecasting Based on Empirical Mode Decomposition and Chaotic Phase Space Recontruction. Autom. Electric Power Syst. 2012, 36, 24–28. [Google Scholar]
  21. Yang, L.; He, M.; Zhang, J.; Vittal, V. Support-Vector-Machine-Enhanced Markov Model for Short-Term Wind Power Forecast. IEEE Trans. Sustain. Energy 2015, 6, 791–799. [Google Scholar] [CrossRef]
  22. Morshedizadeh, M.; Kordestani, M.; Carriveau, R.; Ting, D.S.-K.; Saif, M. Power production prediction of wind turbines using a fusion of MLP and ANFIS networks. IET Renew. Power Gener. 2018, 12, 1025–1033. [Google Scholar] [CrossRef]
  23. Rarki, R.; Thapa, S.; Billinto, R. A Simplified risk-based method for short-term wind power commitment. IEEE Trans. Sustain. Energy 2012, 3, 498–505. [Google Scholar]
  24. Wen, Y.; Li, W.; Hunag, G.; Liu, X. Frequency dynamaics constrained unit commitment with battery energy storage. IEEE Trans. Power Syst. 2016, 31, 5115–5125. [Google Scholar] [CrossRef]
  25. Bitaraf, H.; Rahman, S.; Pipattanasomporn, M. Sizing energy storage to mitigate wind power forecast error impacts by signal processing techniques. IEEE Trans. Sustain. Energy 2015, 6, 1457–1465. [Google Scholar] [CrossRef]
  26. Zhao, Y.; Ye, L.; Pinson, P.; Tang, Y.; Lu, P. Correlation-Constrained and Sparsity-Controlled Vector Autoregressive Model for Spatio-Temporal Wind Power Forecasting. IEEE Trans. Power Syst. 2018, 33, 5029–5040. [Google Scholar] [CrossRef]
  27. Cui, B.; Zhao, Z.; Tok, W.H. A Framework for similarity search of Time Series Cliques with Natural Relations. IEEE Trans. Knowl. Data Eng. 2012, 24, 385–398. [Google Scholar] [CrossRef]
  28. Schramm, R.; Jung, C.R.; Miranda, E.R. Dynamic Time Warping for Music Conducting Gestures Evaluation. IEEE Trans. Multimed. 2015, 17, 243–255. [Google Scholar] [CrossRef]
  29. Yin, H.; Yang, S.; Ma, S.; Liu, F.; Chen, Z. A novel parallel scheme for fast similarity search in large time series. China Commun. 2015, 12, 129–140. [Google Scholar] [CrossRef]
  30. Han, M.; Zhang, R.; Qiu, T. Multivariate Chaotic Time Series Prediction Based on Improved Grey Relational Analysis. IEEE Trans. Syst. Man Cybern. Syst. 2018, 1–11. [Google Scholar] [CrossRef]
  31. Mercorelli, P. Denoising and Harmonic Detection Using Nonorthogonal Wavelet Packets in Industrial Applications. J. Syst. Sci. Complex. 2007, 20, 325–343. [Google Scholar] [CrossRef]
  32. Johnson, M.T.; Povinelli, R.J.; Lindgren, A.C.; Ye, J.; Liu, X.; Indrebo, K.M. Time-domain isolated phoneme classification using reconstructed phase spaces. IEEE Trans. Speech Audio Proc. 2005, 13, 458–466. [Google Scholar] [CrossRef]
  33. Zhao, F.; Sun, B.; Zhang, C. Cooling, Heating and Electrical Load Forecasting Method for CCHP System Based on Multivariate Phase Space Reconstruction and Kalman Filter. Proc. CSEE 2016, 36, 399–406. [Google Scholar]
  34. Jin, J. Research on Optimization of Sorting Algorithm under MapReduce. Comput. Sci. 2014, 41, 155–159. [Google Scholar]
  35. Chen, N.; Qian, Z.; Nabney, I.T.; Meng, X. Wind Power Forecasts Using Gaussian Processes and Numerical Weather Prediction. IEEE Trans Power Syst. 2014, 29, 656–665. [Google Scholar] [CrossRef]
  36. Han, Y. The Design and Implementation of a Wind Power Forecasting System for Wind Farm in Gansu Province; University of Electronic Science and Technology of China: Chengdu, China, 2017. [Google Scholar]
  37. Castellani, F.; Astolfi, D.; Mana, M.; Burlando, M.; Meißner, C.; Piccioni, E. Wind power forecasting techniques in complex terrain: ANN vs. ANN-CFD hybrid approach. J. Phys. Conf. Ser. 2016, 735, 082002. [Google Scholar] [CrossRef]

Article Metrics

Citations

Article Access Statistics

Multiple requests from the same IP address are counted as one view.