Next Article in Journal
Seasonal Variations of Dissolved Organic Matter in the East China Sea Using EEM-PARAFAC and Implications for Carbon and Nutrient Cycling
Previous Article in Journal
Air Quality Monitoring Network Design Optimisation for Robust Land Use Regression Models
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Hybrid Model Based on A Modified Optimization Algorithm and An Artificial Intelligence Algorithm for Short-Term Wind Speed Multi-Step Ahead Forecasting

1
School of Computer Science and Technology, Hangzhou Dianzi University, Hangzhou 310018, China
2
School of Information Science & Engineering, Lanzhou University, Lanzhou 730000, China
*
Author to whom correspondence should be addressed.
Sustainability 2018, 10(5), 1443; https://doi.org/10.3390/su10051443
Submission received: 22 March 2018 / Revised: 25 April 2018 / Accepted: 26 April 2018 / Published: 5 May 2018
(This article belongs to the Section Energy Sustainability)

Abstract

:
In the last few years, researchers have paid increasing attention to improving the accuracy of wind speed forecasting because of its vital impact on power dispatching and grid security. However, it is difficult to achieve a good forecasting performance due to the randomness and intermittency characteristics of wind speed time series. Current forecasting models based on neural network theory could adapt to various types of time series data; however, these models ignore the importance of data pre-processing and model parameter optimization, which leads to poor forecasting accuracy. In this paper, a new hybrid model is developed for short-term multi-step wind speed forecasting, which includes four modules: (1) the data pre-processing module; (2) the optimization module; (3) the hybrid nonlinear forecasting module and (4) the evaluation module. In order to estimate the forecasting ability of the proposed hybrid model, 10 min wind speed data were applied in this paper as a case study. The experimental results in six real forecasting cases indicate that the proposed hybrid model can provide not only accurate but also stable performance in terms of multi-step wind speed forecasting can be considered an effective tool in planning and dispatching for smart grids.

1. Introduction

Wind energy, as one of the most promising renewable energy resources (RESs), has gradually become a remarkable alternative resource to fossil energy due to being non-polluting, environmentally friendly cost effective [1]. In the past few decades, as a result of policy-driven environmental and energy security concerns, it has become the fastest-growing RES for power generation and has attracted more and more attention worldwide [2]. In late 2016, the Global Wind Energy Council (GWEC) reported that more than 54.6 GW of global wind power capacity was added in 2016, which represents a growth rate of 11.8% and means that the global installed capacity of wind turbines can generate approximately 5% of electricity demand worldwide [3]. However, due to the intermittency and uncertainty inherent to wind sources, the wholesale integration of wind power into the traditional grid system presents a significant challenge in terms of reducing cost and improving reliability in power system operation [4]. Therefore, with the goal of moderating the negative impacts and increasing the utilization efficiency of wind power conversion, it is necessary to increase the accuracy of wind speed forecasting [5].
During recent years a number of technologies have been proposed in the literature for increasing the accuracy of wind speed forecasting, which can be classified into four categories: physical models, statistical models, spatial correlation models, artificial intelligence models [6]. The physical forecasting models are generally based on terrain features, atmospheric pressure, ambient temperature, other meteorological information to obtain an evaluation of the considered site’s wind speed forecasting. These models are very effective for medium-term and long-term wind speed forecasting because of the many relevant variables they include [7]. However, these physical-based approaches require considerable observed data with a limited simulation scale always have a higher cost [8]. In contrast, statistical forecasting models, which include the Auto Regressive Moving Average (ARMA) [9], Auto Regressive Integrated Moving Average (ARIMA) [10], Generalized Autoregressive Conditional (GARCH) [11], are more suitable for handling time series forecasting are simple to implement by utilizing a set of historical data. This makes these techniques time saving in building models. However, statistical models cannot deal with the non-linearity that exists in wind speed time series because of their linear correlation structure [12]. Typically, spatial correlation models primarily utilize the spatial relationships between wind speeds at different sites. In some cases, they can achieve satisfactory prediction accuracy [13]. However, their information requirements, including wind speed and delay times, add complexity and cost to their implementation [6]. With the rapid development of soft-computing techniques, some artificial intelligence approaches are widely used for wind speed forecasting in various fields [14,15], such as artificial neural networks (ANNs), support vector machines (SVMs), fuzzy logical based models [6]. Among these approaches, the most prevalent intelligent wind speed forecasting models are based on ANNs, which including the back propagation (BP) neural network [16], general regression neural network (GRNN) [17], radial basis function neural network (RBFNN) [18] and deep belief network (DBN) [19], among others. These models possess the ability to structure the relationship between input data and output data with higher data error tolerance perform well in non-linear time series forecasting [16]. Nevertheless, the main drawbacks of individual artificial intelligence models are that they may easily get into local optimum and exhibit a relatively low convergence rate due to the random network initial weights and thresholds [19]. From the above discussion, it can be seen that each model has its own specific benefits and drawbacks we summarize the merits and demerits of the four types of wind speed models in Table 1.
Due to the inherent weaknesses of each model, as well as the intermittency and complex fluctuations of wind speed, individual forecasting models cannot always capture the characteristics of a time series, especially when it comes to the non-linear traits of wind speed. Thus, in order to obtain more accurate wind speed forecasting, as well as overcome the shortcomings in the individual models, some hybrid models which incorporate the individually superior features of various algorithms have been developed. In recent years, with the aim of obtaining promising forecasting results, many hybrid models for wind speed forecasting have been successfully developed and have improved the accuracy of forecasting to some degree. For instance, Cadenas et al. [20] proposed a hybrid model, which consists of ARIMA models and ANN models for wind speed forecasting. The results showed that the forecasting achieved with this hybrid outperformed the individual ARIMA and ANN models in three experimental sites in Mexico. Liu et al. [21] developed a hybrid model for wind speed forecasting based on the theories of wavelet, wavelet packet, time series analysis artificial neural networks. The results of three simulation cases indicated that the proposed hybrid model achieved a satisfactory performance in wind speed forecasting. Du et al. [22] developed a wind speed forecasting model based on certain confidence and fluctuation characteristics for 10 min wind speed forecasting at three wind farms in Penglai Shandong Province, eastern China. The experiments indicated that the developed hybrid model can obtain more accurate forecasting results compared with other alternative models. Ma et al. [23] proposed a model based on a de-noising approach with a dynamic fuzzy neural network to improve the accuracy of wind speed forecasting. Three experimental results demonstrated that the model satisfactorily approximated the observed wind speed value. Wang et al. [24] developed an ensemble empirical mode decomposition (EEMD), genetic algorithm (GA), BP neural network for wind speed forecasting. In this proposed hybrid model, the EEMD is employed to analyze and de-noise the nonlinear stochastic signals the BP neural network, optimized by GA (GA-BP), is developed for dealing with each of the decomposed sub-series. The experimental results from a wind farm in Inner Mongolia, China, indicated that the proposed hybrid model can provide more accurate wind speed forecasting than a traditional wavelet neural network, including both the GA-BP and GA-BP with EMD (EMD-GA-BP) models. Ren et al. [25] developed hybrid wind speed forecasting by integrating EMD and a support vector machine (SVR). Several intrinsic mode functions (IMFs) and a residue handled by EMD were separately generated to train the SVR. The case studies showed that the proposed EMD-SVR model outperformed other alternative models involved in their research with respect to computational complexity and accuracy. Zhang et al. [26] presented a hybrid model based on EMD, feature selection with ANN and SVM yield for short-term wind speed forecasting. The results of their study demonstrated that the proposed hybrid approach achieved a satisfactory performance in dealing with wind speed forecasting using three real datasets in China. Zhang et al. [27] developed a hybrid model which combined an extreme learning machine (ELM), based on feature selection parameters optimized by using a hybrid backtracking search algorithm (HBSA). The developed hybrid model was applied to data from two wind farms in Inner Mongolia, China, Sotavento Galicia, respectively. The experimental results indicated that the hybrid model was able to capture the non-linear features of a wind speed series and thus achieve a more promising forecasting performance.
The previous research indicates that there is still no one approach that can provide an elegant solution to wind speed forecasting. Model performance under specific circumstances should be analyzed and understood then incremental improvements should be made based upon the knowledge that is gained [28]. Thus, obtaining a satisfactory wind speed forecasting performance is still a difficult and challenging task it is necessary to further exploit more efficient and accurate models in the field of wind speed forecasting. Due to these factors, a novel wind speed forecasting model which combines wavelet de-noising (WD), a modified ant colony optimization algorithm, BP neural networks (WD-APSOACO-BP) is developed in this paper. The developed hybrid model consists of four modules: a data pre-processing module, an optimization module, a forecasting module, a comprehensive evaluation module. In the data pre-processing module, the WD technique is employed to extract the basic characteristics from the non-stationary wind speed time series fractal extrapolation is used to handle the missing data. The optimization module includes use of a modified adaptive particle swarm optimization algorithm-based ant colony optimization algorithm (APSOACO), which is adopted to optimize the weights and thresholds of individual BP neural networks and overcome the drawbacks of the BP algorithm in terms of converging on a local optimum when the training surface has a multimodal distribution. In this way, we aimed to further improve the forecasting accuracy. In the forecasting module, the hybrid forecasting APSOACO-BP model is established as a forecasting engine for wind speed forecasting. After completing the steps above, the evaluation module is adopted to synthetically assess the developed hybrid wind speed forecasting system. This module consists of the evaluation criteria, forecasting validity, a population stability index four experiments designed in six wind speed sites using time series from the Shandong province of China as well as a comprehensive analysis of these experiments. The significant contributions of this study are summarized as follows:
(1)
Given that the wind speed exhibits uncertainty and randomness, a valid mathematical data preprocessing module, which was derived from the theory of signal time-frequency localization, is adopted to decompose the original wind speed time series into a number of sub-series. This module can effectively de-noise and fully extract the main features of a wind speed time series thus improves the forecasting accuracy.
(2)
A novel modified optimization algorithm, the APSOACO, is proposed to optimize the initial weights and thresholds of the BP neural network, due to its better convergence performance and lower number of required iterations compared with individual traditional optimization algorithms. This module not only provides a new option for solving problems such as computational complexity and easy trapping into a local optimal solution that traditional forecasting engines may encounter also makes a contribution to improving the accuracy of wind speed forecasting.
(3)
In order to estimate the overall performance of the developed hybrid model, a more scientific and comprehensive evaluation module is developed in this paper. This module not only sufficiently analyzes both the accuracy and stability of forecasting results also discusses the effectiveness of the proposed model in terms of the performance of the employed optimization algorithm and wavelet function.
(4)
With the aim of improving the quality of wind speed data and further enhancing the forecasting accuracy, an effective data interpolation technique and a rolling mechanism were also adopted in this paper. These techniques possess the capacity to enrich and improve the information of wind speed observations, which can in turn provide more accurate and stable wind speed forecasting.
The rest of this paper is organized as follows: Section 2 presents a brief description of related methodology. Section 3 describes the back propagation neural network optimized by different heuristic algorithms and describes the structure of the proposed integrated forecasting framework. Six real experiments and their corresponding results are presented in Section 4. The forecasting validity of the proposed hybrid model is discussed in Section 5. Finally, the conclusion of this research is given in Section 6.

2. Methodology

Before utilizing the proposed hybrid model for wind speed forecasting, it is necessary to introduce some of the significant components. Therefore, some operating theories—including the wavelet de-noise algorithm, the fractal interpolation the single nonlinear neural network—are concisely introduced in this section.

2.1. The Wavelet De-Noise Technique

The wavelet transform is a time-frequency representation of a signal. It can be used for noise reduction, feature extraction, or signal compression, for example [29]. It is an essential tool for data pre-processing and has been widely utilized in de-noising and for extracting the basic characteristics from a non-stationary time series. This type of algorithm can be divided into two categories: continuous wavelet transforms (CWTs) and discrete wavelet transforms (DWTs).
In continuous wavelet transforms, a given signal of finite energy is projected onto a continuous family of frequency bands (or similar subspaces of the Lp function space L2(R)). For instance, the signal may be represented on every frequency band of the form [f, 2f] for all positive frequencies f > 0. Then, the original signal can be reconstructed by a suitable integration of all the resulting frequency components.
The frequency bands or subspaces (sub-bands) are scaled versions of a subspace at scale 1. This subspace is in most situations generated by the shifts of one generating function ψ in L2(R), the mother wavelet. For the example of the scale 1 frequency band [1,2] this function is:
ψ ( t ) = 2 sin c ( 2 t ) sin c ( t ) = sin ( 2 π t ) sin ( π t ) π t
In discrete wavelet transforms it is computationally impossible to analyze a signal using all the wavelet coefficients, so one may wonder if it is sufficient to pick a discrete subset of the upper halfplane to be able to reconstruct a signal from the corresponding wavelet coefficients. One such system is the affine system for some real parameters a > 1, b > 0. The corresponding discrete subset of the halfplane consists of all the points (am, namb) with m, n in Z. The corresponding child wavelets are now given as:
ψ m , n ( t ) = 1 a m ψ ( t n b a m )
A sufficient condition for the reconstruction of any signal x of finite energy is given by the formula:
x ( t ) = m z n z x , ψ m , n
in which the functions { ψ m , n : m, n z } form an orthonormal basis of L2(R).
This separates the signal into components at various scales corresponding to successive frequencies. It can be noted that the DWT corresponds to the multi-resolution approximation expressions, which can analyze a signal in a number of frequency bands (or at a number of scales) [30,31].

2.2. The Artificial Neural Network Model and Fractal Representation

Backpropagation is a method used in artificial neural networks to calculate a gradient that is needed in the calculation of the weights to be used in the network, which is a powerful algorithm for apportioning error responsibility through a multi-layer network was formulated in the form of the backpropagation algorithm. The backpropagation algorithm employs the Delta Rule, calculating error at output units in a manner to that used in structure of neural network, while error at neurons in the layer directly preceding the output layer is a function of the errors on all units that use its output. The effects of error in the output node(s) are propagated backward through the network after each training case. The essential idea of backpropagation is to combine a non-linear multi-layer perceptron-like system capable of making decisions with the objective error function of the Delta Rule [32]. As a result, BP neural network is selected as a forecaster to integrate the hybrid model in this study. It is based on a gradient descent method that minimizes the sum of the squared errors between the actual and desired output values. The transfer function is of the neuron type (commonly referred to as the activation function [33]). The output function is between 0 and 1 and can transform input to output for continuous nonlinear mapping.
Definition 1.
Neural network models in artificial intelligence are usually referred to as artificial neural networks (ANNs). These are essentially simple mathematical models defining a function f: XY or a distribution over X or both X and Y sometimes models are also intimately associated with a particular learning algorithm or learning rule.
In order to eliminate the difference of each variable, the input and output data of the neural network should be normalized the normalization formula is as follows:
X = 2 × X i X i , min X i , max X i , min   1 ,   ( i = 1 ,   2 , ,   n )   X     [ 1 ,   1 ]
where X is the normalized data. X i , min and X i , max are the minimum and maximum value of actual data vectors and X i denotes the actual data.
Step 1. Calculate outputs of all hidden layer nodes.
y j = f ( i ω ji x i + b j ) = f ( net j ) ( i = 1 ,   ,   n ;   j = 1 ,   ,   2 n )
net j = i ω ji x i + b j ( j = 1 ,   ,   2 n )
where the activation value of node j is net j , ω ji represents the connection weight from input node i to hidden node j, bj represents the threshold of neuron j, y j represents the output of hidden layer node j. f is the activation function of a node, which is usually a sigmoid function.
f ( x ) = 1 1     e x
Step 2. Calculate the output data of the neural network.
O 1 =   f ( i ω oj y i + b o ) ( i = 1 ,   ,   n ;   j = 1 ,   ,   2 n )
where ω oj represents the connection weight from hidden node j to the output node, b o represents the threshold of the neuron, O 1 represents the output data of the network. f o is the activation function of the output layer node.
Step 3. Minimize the global error via the training algorithm.
M e a n   s q u a r e   e r r o r   =   1 t ( O 1 R t ) 2
where Rt is the real output of the training data in Equation.

2.3. Fractal Interpolation

The fractal interpolation technique was developed by Barnsley and Harrington and provides benign deterministic representation for complicated phenomena [34]. The fractal interpolation function is defined as fixed points of the map between function spaces using iterated function systems (IFSs). The details of the fractal interpolation is as follows:
Theorem 1.
Let [x0, xN] be an interval and x0 < x1 < … < xi−1 < xi < … < xN be a subdivision of this interval (N ≥ 2). Let Fi (i = 0, 1, ..., N) be some arbitrary value attached to the point xi which is to be interpolated over the interval [x0, xN] by a continuous function f: [x0, xN]→ with f(xi) = Fi (i = 0, 1, …, N).
Let ui: [x0, xN]→[xi−1, xi] be the invertible maps:
u i ( x ) = x i x i 1 x N x 0 x + x N x i 1   x 0 x i x N x 0   ( i = 0 ,   1 ,   ... ,   N )
Let αi (i = 0, 1, …, N) be any given number (called the vertical scaling factors) with 0 < |αi| < 1 and hi: [x0, xN]→ be the linear functions.
h i ( x ) = ( F i - F i 1 x N x 0 α i F N F 0 x N x 0 ) x + x N F i 1   x N F i   x N     x 0   α i x N F 0     F N x N     x 0   ( i = 0 ,   1 ,   ... ,   N )
For i = 0, 1, …, N.
Then there exists a unique continuous function f: [x0, xN]→ with f(xi) = Fi (i = 0, 1, …, N) such that the following condition holds true:
f ( u i ( x ) ) = α i f ( x ) + h i ( x )   for   x     [ x 0 , x N ]   and   i = 0 ,   1 ,   ,   N
Proof. 
Let F = {g: [x0, xN]→ be continuous with g(x0) = F0, g(xN) = FN}. F is a complete metric space with the maximum metric. Define the operator T: F F piecewise by:
T ( g ) ( y ) = α i g ( u i 1 ( y ) ) + h i ( u i 1 ( y ) )   for   x     [ x 0 , x N ]   and   i = 0 ,   1 ,   ... ,   N
Then, T(g) is well defined and T(g)(xi) = Fi i = 0, 1, …, N. This operator is a contraction and its unique fixed point is the function f. ☐

2.4. Rolling Mechanism Based Multi-Step Forecasting

The Rolling Mechanism (RM), which inspired the metabolic technique that updates the input data by discarding old data for each loop in the BP neural network, can be applied to perform the forecasting. The Rolling Mechanism forecasting steps are shown below.
(a)
1-step ahead forecasting: The forecasting value x N + 1 forecast is calculated based on the historical values, where N is the sample of training.
(b)
2-step ahead forecasting: The forecasting value is x N + 2 forecast calculated based on the historical values and the previous forecasting value x N + 1 forecast .
(c)
3-step ahead forecasting: The forecasting value x N + 3 forecast is calculated based on the historical values and the previous forecasting value { x N + 1 forecast , x N + 2 forecast }.

2.5. Adaptive Particle Swarm Optimization

An adaptive particle swarm optimization (APSO) is presented that features better search efficiency than classical particle swarm optimization (PSO). More importantly, it can perform a global search over the entire search space with a faster convergence speed. The APSO consists of two main steps. First, by evaluating the population distribution and particle fitness, a real-time evolutionary state estimation procedure is performed to identify one of four defined evolutionary states in each generation; exploration, exploitation, convergence, jumping out. It enables the automatic control of the inertia weight, acceleration coefficients, other algorithmic parameters at run time in order to improve the search efficiency and convergence speed. Then, an elitist learning strategy is performed when the evolutionary state is classified as the convergence state. This strategy will act on the globally best particle to jump out of the likely local optima [35].

3. Back Propagation Neural Network Optimized by Hybrid Optimization Algorithm

In order to optimize the initial weights and thresholds of the BP neural network, a novel APSOACO algorithm is introduced in this section.

3.1. Modified Ant Colony Optimization

The proposed APSOACO was inspired from the velocity update nature of particles from the adaptive particle swarm algorithm [35] and the foraging behavior of the ant colony algorithm [36,37]. We combine the distance metric of the ant colony algorithm with the direction (velocity) metric of the adaptive particle swarm algorithm and generate a new probability metric. The concept of the pheromone update in the ant colony algorithm is replaced with the velocity update from the particle swarm algorithm. A sigmoid function is used to convert distance and velocity into heuristic values. These heuristic values are used in the probability metric with different weights (shown in Section 4). The advantages of the hybrid algorithm include: (1) avoiding convergence to a local optimum; (2) providing a better solution within fewer iterations (fast convergence) and (3) achieving low computational complexity. In APSOACO, we denote the swarm population as particles (as ‘ants’ in the ant colony algorithm and ‘particles’ in the adaptive particle swarm algorithm). The basic steps of the APSOACO algorithm are presented as follows:
Step 1: Initialize a certain number of three-dimensional particle swarms P0, P1, …, Pn randomly.
Step 2: Feed the parameters corresponding to each particle back into the ant colony algorithm. Each particle corresponds to a set of parameters (α, β, ρ). Apply this group of parameters to the ant colony algorithm, then reinitialize the pheromone in the environment. According to the resulting solution, update the pheromone. The ant colony algorithm adopts the pheromone update method combined with the global asynchronous and elitist strategy, which does not change the pheromone with an increasing number of ant colony algorithm iterations. If a more adaptive solution appears, the pheromone is updated and the path of the global optimal solution is enhanced. The pheromone updating formula is as follows:
Δ τ ij ( t ) = { Q / G k 0   if   [ i ,   j ]   is   the   min error otherwise
where τ ij represents the amount of pheromone released by the best ant with the minimum error. Q is the total amount of pheromone; G k is the minimum error; and ρ is the degree of volatilization of the pheromones. The range of values is [0,1]; τ ij 1 is expressed as the update pheromone when it is updated by a basic updating mode and maintains the current algebra when stronger fitness is needed to solve the pheromone quantity. The calculation formula is as follows:
τ ij 1 ( t + 1 ) = ( 1 - ρ ) τ ij 1 ( t ) + Δ τ ij 1 ( t )
Δ τ ij 1 ( t ) = τ ij k ( t )
where Δ τ ij 1 is the released amount of pheromone with the error [i, j] by the kth ant in this cycle and Δ τ ij 1 is the change of the τ ij 1 on the error [i, j] after this cycle. In the ant-cycle model, the formula of Δ τ ij 1 is as follows:
Δ τ ij 1 ( t ) = { Q / L k 0     the   k th   pass   the error   [ i ,   j ]   otherwise
where L k is the error obtained by the kth ant in this cycle. In this step the pheromones are not cleared when the particles and their corresponding parameters are changed.
Step 3: According to the results of ant colony algorithm in judging the location of particles, find and update the optimal particle position. If the corresponding parameters of a particle are returned to the ant colony algorithm, the ant colony algorithm can obtain the best solution, so it calls this particle the optimal particle. A set of corresponding parameters of the optimal particle indicates the location of the particle in the solution space; that is, the best particle position. The location of the optimal particle is Pbest. The optimal particle location found in the whole population is Gbest. The fitness values of Gbest and Pbest are used to evaluate the error of the ant colony algorithm—the smaller the error, the better the quality of the parameters.
Step 4: Update the position and velocity of the particle according to the following equations:
V i ( k + 1 ) = ω   ( k ) V i ( k ) + c 1 r 1 ( P best . i - x i ( k ) ) + c 2 r 2 ( G best - x i ( t ) )
x i ( k + 1 ) = x i ( k ) + V i ( k + 1 )
ω ( k + 1 ) = { λ ω ( t ) + θ f ( G best ) f ( x i ( t ) ) f ( G best ) f ( x min ( t ) ) ω ( t )   > ω min ω min   otherwise
where ω(k) is the adaptive inertia weight, c 1 is the weight that makes the particle move to its own best position. c 2 is the weight that makes the particle move to the global best position. The range of c 1 and c 2 is [0,2]. r 1 and r 2 are two independent random numbers in the range of 0–1; λ and θ are constraint factors in the range of 0–1; ω min is the minimum inertia weight; f(·) is the fitness function; V i ( k ) is the velocity of particle i at iteration k; x i ( k ) is the position of particle i at the iteration; Pbest,i is the current best local position of particle i at iteration k. Gbest is the global best solution of all particles.
Step 5: If the stopping criteria is satisfied (by arriving at the maximum number of iterations or particles to achieve the minimum goal value), the algorithm will stop and return to the current global optimum particle position. (α, β, ρ) is the best parameter combination in the ant colony algorithm. Otherwise, return to Step 2.
When optimizing the parameters (α, β, ρ) of the ant colony algorithm by applying the particle swarm optimization algorithm, introducing some new parameters (such as ω(k), c 1 , c 2 and maximum iteration) is unavoidable. The selection of the four parameters requires a certain level of experience, however the determination of their values is relatively simple and has only a small effect on the final solution. The ant colony algorithm will be completely called one time if the particle moves one time. Therefore, if the number of particle is N and they move M times, the ant colony algorithm will be called N × M times. Each time, the ant colony algorithm will iterate dozens or even hundreds of times.

3.2. Three Different Heuristic Algorithms for Optimizing BP Neural Network

In this section, the adaptive particle swarm algorithm, ant colony algorithm and APSOACO are applied to optimize the weights and the thresholds of the nonlinear BP neural network model by minimizing the mean square error of the BP neural network. However, the structure of the network is confirmed and, in order to achieve optimal parameters of the network, the fitness function and encoding strategy should be explained.
The purpose of the forecasting model is to minimize the mean square error of the BP neural network. According to Equation (21), the fitness value of the ith sample can be calculated as follows:
Fitness ( X i ) = MSE ( X i )
The final step is to arrange the weights and the thresholds of the nonlinear BP network for each ant. In this paper, the structure of the weights and the thresholds matrix in the BP neural network is as follows:
Optimization   matrix = [ w ji , w oj ]
where wji represents the connection weight from input node i to hidden node j. woj represents the connection threshold from hidden node j to the output node. Figure 1 shows the structures of the three different hybrid models.

4. Simulation Experimentation and Forecasting Result

In this study, 10 min ahead wind speed data from January 2014 from six study sites in the Shandong province of China were collected are employed to evaluate the effectiveness of this proposed integrated method. In order to further improve the wind speed forecasting performance, longitudinal data selection will be applied to construct the forecasting dataset.

4.1. Study Area and Data Description

Wind power generation turns the kinetic energy of wind into mechanical energy the mechanical energy is then transformed into electric energy. Wind speed plays a key role in this process. To verify the effectiveness of the proposed model, wind speed data from six observation sites were collected. Shandong is located in the east of China and has high level of wind activity every year. The long-term and stable distribution of wind power in the Shandong province is provided by onshore wind farms. The main types of turbines in the Shandong province are listed in Table 1. The speed in this area reaches 3 m/s more than 70% of the time, resulting in a good level of electricity generation. Table 2 shows the average monthly power generation distribution, where it is indicated that the distribution from January to July has a seasonal impact. The cut-in speed reaches approximately 70–80%. It is noted that in August the rated wind speed declines from 22% to 7%, the primary reason for which is strong tropical cyclone activity in the summer (which is harmful for turbines). Thus, there are some days when the wind farms cannot generate electricity in August.
According to the features of wind speed, six observation sites are used for this case study. However, the wind speed time series is not continuous in some months and has a number of missing data points. Specifically, for the six sites in Shandong, the number of missing data entries is 4464/48, 4464/54, 4464/66, 4464/32, 4464/18, 4464/96, respectively. We used fractal interpolation to deal with the missing data in this study [38].

4.2. Evaluation Criteria of Forecasting Performance

To evaluate the forecasting performance, four forecasting error measures between the actual values and the forecasting values were employed for model evaluation and comparison: the average error (AE), mean absolute error (MAE), mean square error (MSE), mean absolute percentage error (MAPE) [37,38]. The smaller the values of these measures, the better the forecasting performance obtained. The definitions of these criteria can be indicated as follows.
AE = 1 N i = 1 N ( x i - x ^ i ) .
MAE = 1 N i = 1 N | x i - x ^ i | . MSE = 1 N i = 1 N ( x i - x ^ i ) 2 . MAPE = 1 N i = 1 N | x i - x ^ i x i | × 100 % .

4.3. Experimental Process

To effectively forecast the wind speed, three forecasting experiments are implemented in this study. Experiment I aimed to compare the performance of different single artificial neural network models in order to select an individual model with the best performance among well-known forecasting models, with the best structure of each neural network for wind speed forecasting at each site. Experiment II was employed to compare the performances of the BP neural network when its parameters have been optimized by different optimization algorithms. Experiment III analyzed the forecasting performance offered by the proposed hybrid model in terms of multi-step ahead forecasting.
All the named algorithms were run on the platform 3.2 GHz CPU, 8.00 GB RAM, Windows 8, MATLAB R2012a. These experiments all selected from six wind speed data sites, which were described in Section 4.1. To evaluate the applicability, superiority, generality of the proposed novel combined model, wind speed data with a 10 min period from three datasets in Penglai, on the Shandong peninsula of China, were randomly assembled for multistep ahead forecasting (i.e., 1-step ahead, 2-step ahead, 3-step ahead). A large number of data points (2016) were selected from each dataset and these observations were split into two subsets; the training set and the testing set. In this paper, the experience proportion between the training and testing sets is 7:1. That is, the 1008 initial data points act as the training sample and the remaining 144 data points act as the testing sample and are repeated 7 times. In other words, in each of the wind speed data sites, the first 1008 data points are used as the training sample set and the next 1008 data points (from the 1009th to the 2016th) are used as the testing set.

4.4. The Experiment Preparation: Data Preprocessing

In order to enhance the prediction precision of the wind speed forecasting model, a valid mathematical data pre-processing module based on the theory of signal time-frequency localization was adopted. In this study, the Daubechies wavelet filter of order 4 (db4) decomposed the wind speed time series 7 levels and reconstituted the time series for each of the sites. For more detailed information about wavelet transforms and wavelet de-noising techniques, please refer to the research by Authors [39,40,41].

4.5. Structure of the Proposed Forecasting Framework

At the beginning of this paper, fractal interpolation was used to estimate missing values of the original wind speed time series. Then, the wavelet de-noising method was used to pre-process the original time series. However, the wavelet de-noising need to select two parameters: the wavelet function and the number of decomposition level. In this, the wavelet function is Daubechies and the level is 7.
After the data pre-processing, the de-noised wind speed time series data was input into the optimized neural network, in which the parameter of neural network were optimized by three different optimization algorithms (APSO, ACO, APSOACO). The flowchart of the proposed model is shown in Figure 1.
It is worth pointing out that the original wind speed time series is divided into a training set and a testing set, which means the input data of the training data and testing data is de-noising data and the output data of the training data and testing data is the original wind speed time series. The flowchart of data structures of the forecasting models is shown in Figure 2.
For the Figure 2, N is the number of the training sample which is fixed according to the original time series. For example, supposing a study the last 1008 points of the wind speed time series with length of 2016 (N = 1008) will be forecasted.
It must be noting that there are multi-step ahead forecasting h = 1, 2, 3 of different wind speed sites in this paper.

4.6. Experiment I: Selection of the Wind Speed Forecasting Model

There are some artificial intelligence algorithms, such as the BP NN, the Elman NN, WNN, GRNN, ELM, that can be used to forecast the wind speed. The artificial intelligence forecasting models used in different forecasting steps in this experiment and the flowchart of the experiment are shown in Figure 2 and the experimental results follow in a later section.
To determine the number of neurons many experiments were conducted the best trial results were selected [42,43]. In this experiment, the listing technique was used to determine the number of input layers and hidden layers for the different ANN models [44]. From Figure 3 and Table 3, the following conclusions were obtained:
(1)
Table 3 shows the range of the input layer of each neural network is 3–8. With an increasing number of hidden layers, the test accuracy is gradually improved. When the number of hidden layers reaches a certain maximum number, the test accuracy is no longer improved.
(2)
Figure 3B shows that with an increase in the number included in the training sample, the fluctuation of testing accuracy reduces. For example, the confidence intervals obtained by the MAPE of the BP neural network is narrower than in the other four kinds of neural networks.
From the above analysis, it can be seen that the best performance for all sites (shown in Figure 4) among the five different neural networks is offered by the Back-propagation Neural Network.
Remark 1.
Through the above-mentioned analysis, we discovered that it is hard to find the general relationship between the nodes of the input layer and the hidden layer. Different time series have different forecasting structures, which suggests that different time series need to construct different BP neural networks structures. The best forecasting structure of the neural network is obtained by the enumeration method and experimental results.

4.7. Experiment II: Comparison of Three Optimization Algorithms to Fine-Tune the Parameters of Hybrid Model

In order to evaluate the forecasting performance of the hybrid model optimized by three different optimization algorithms (two traditional algorithms and APSOACO), three different optimization algorithms are used to optimize the hidden layer and output layer weight matrix the neural network and to optimize hidden layer and output layer parameter matrix. Two other measures (the Index Agreement and Bias2) are also employed for model evaluation.
The Index Agreement is a dimensionless index that measures the fitness of the model, ranging from 0 (worst fitness) to 1 (best fitness) [45].
I A = 1 - i = 1 T ( x i - x ^ i ) 2 i = 1 T ( | x i - x ^ i | + | x i - x ¯ | ) 2
x ^ i and x i are the forecasting and actual values of wind speed at time point i of the forecasting horizon T, respectively. x ¯ is the mean of the actual values of the wind speed.
VAR ( x ^ ) + Bias 2 ( x ^ ) = E [ x ^ - E ( x ^ ) ] 2 + [ x - E ( x ^ ) ] 2
xi is the actual data, x ^ i is the forecasting data, E( x ^ ) = 1 N i = 1 N x ^ i is the expectation value of the forecasting data, x = 1 N i = 1 N x i is the expectation of the actual data. N is the number of the sample. The bias-variance framework [46] is employed to evaluate the forecasting accuracy and stability of the forecasting models. In this framework, the values of Bias 2 ( x ^ ) and VAR ( x ^ ) reflect the accuracy and stability of the forecasting model, respectively.
10 min wind speed time series were generated from 5–18 January at each site. Figure 5 shows the data structures of the forecasting models involved in this paper. For instance, the total sample size is 2016, data of the former seven days (sample size 1008) are the training sample. The remaining data are the testing sample. The input of this proposed hybrid model is set as the de-noised wind speed time series data, which is shown in Figure 2. This is achieved by employing the wavelet function decomposed (the wavelet is Daubechies (db4) and the decomposition level is 7) to the wind speed series. The final assessment of the 1-step ahead forecasting results is shown in Table 4.
(1)
It is noted that ACO suffers from slow convergence with increasing iterations. If the APSO algorithm falls into the local optimal points the capacity of the parameter optimization will be affected. The APSOACO algorithm combines the global search abilities of ACO with the local search capability of APSO, which significantly improves the parameter optimization ability of the single optimization algorithm. The numerical experimentation results indicate that the proposed hybrid model (WD-APSOACO-BP) outperforms the other hybrid models (WD-APSO-BP and WD-ACO-BP) when compared with the MAE, MSE, MAPE, the variance of MAPE, IA from Site 1 to Site 6. As such, the MAPE values of the WD-APSOACO-BP are 4.592%, 4.045%, 5.021%, 4.891%, 4.214%, 5.234%, corresponding to the Bias2 values which are 1.4400, 1.1852, 1.2405, 1.0029, 1.1860 and 1.3891 from Site 1 to Site 6, respectively. The WD-APSOACO-BP model can obtain higher forecasting accuracy than two single hybrid models. However, the IA values are 0.9619, 0.9627, 0.9456, 0.9584, 0.9545, 0.9456 from Site 1 to Site 6, which demonstrates that the WD-APSOACO-BP can achieve a better forecasting performance than the other hybrid models. Thus, the WD-APSOACO-BP can precisely forecast the future changes of a wind speed time series.
(2)
With regards to further analysis of the forecasting results in terms of each day (the testing sample from Monday to Sunday), Table 5 and Figure 6A1–A4 clearly depict the forecasting values for 0:10 a.m. to 24:00 p.m., from 12 to 18. Part A and B of Figure 6 demonstrate the MAPE and IA from Site 1 to Site 6. The MAPE and IA of WD-APSOACO-BP are the minimum and the maximum, respectively. Part A1 of Figure 6 demonstrates the forecasting results of three different hybrid models and the actual wind speed series from 0:10 a.m. to 24:00 p.m., from 12 to 18 January 2014. Part A2 of Figure 6 demonstrates the 95% confidence intervals (CI) obtained by the WD-APSOACO-BP model. It can be clearly seen that both the upper CI and the lower CI are very close to the actual wind speed time series for Monday. Part A3 of Figure 6 is the Box-Plot of the MAPE from 0:10 a.m. to 24:00 p.m., over the course of a week, from 12–18 January, for the three hybrid models for Site 2. It can be seen that the performance of the WD-APSOACO-BP is better than that of the other two hybrid models. In addition, A4 clearly displays the actual wind speed time series compared with the forecasting results. It is obvious that the forecasting results offered by the proposed WD-APSOACO-BP method are very approximate to the target.
Remark 2.
The above experimental analysis aimed to demonstrate whether the MAE, MSE, VAR(Y), MAPE, Bias2 and IA of the proposed WD-APSOACO-BP hybrid model mean it manifests the best forecasting performance. In brief, the proposed hybrid optimization algorithm has a better capacity for optimization to find the optimum weights and thresholds for the Nonlinear BP Neural Network.

4.8. Experiment III: Comparison Three Different Hybrid Models in Multi-Step Ahead Wind Speed Forecasting

To further verify the feasibility, performance, meliority of the proposed hybrid forecasting model, the wind speed data from six sites are employed for establishing the proposed WD-APSOACO-BP model. In addition, the proposed hybrid WD-APSOACO-BP model is compared with two other forecasting models (WD-APSO-BP and WD-ACO-BP) for multi-step ahead wind speed forecasting. The results are shown in Table 6 and Table 7 and Figure 7 and Figure 8. The index of agreement produced by the proposed hybrid model for the 2-step and 3-step ahead wind speed forecasts of different sites is shown in Part B of Figure 7 and Figure 8.
(1)
Table 6 shows the forecasting performance for each site of the three different hybrid models in 2-step forecasting in terms of six criteria: MAE, MSE, MAPE, Bias2 VAR(Y), and IA. For 2-step ahead forecasting, the proposed hybrid model outperforms the other hybrid models based on each of the evaluation criteria. For example, at Site 1, the proposed hybrid model provides a minimum forecasting error with MAE, MSE, MAPE values of 0.3490, 0.2135 and 4.79%, respectively. The IA and Bias2 achieved by this hybrid model are 0.9456 and 1.1183, respectively. Compared with the 2-step ahead forecasting achieved by the other hybrid models, the forecasting performance of the proposed hybrid model is still superior.
(2)
Table 7 aims to evaluate the forecasting accuracy and stability of the proposed hybrid model in 3-step forecasting. The criterion in terms of evaluating the accuracy is the minimum MAPE values in the 1008 times experiments. This was selected to compare the accuracy of the three different hybrid models. The Bias2 values of the 1008 times experiments are utilized for the stability test. In terms of the accuracy testing, the WD-APSOACO-BP obtains a lower MAPE value than the other hybrid models. The Bias2 values of the proposed hybrid model always remains the lowest of the hybrid models. The proposed hybrid model obtains satisfactory forecasting stability and accuracy. In terms of the value of MAPE, the proposed hybrid model has the lowest value of MAPE among the three hybrid models. For 3-step forecasting at six sites, the proposed hybrid model obtains the smallest values of Bias2 compared to the other two hybrid models.
Part A3 of Figure 7 and Figure 8 shows the MAPE and box-whisker plot of MAPE for the hybrid models of Site 3. For the 2-step and 3-step forecasting values, the proposed hybrid model obtains the lowest MAPE value. The down arrow indicates the lowest value of MAPE in the figure. Part A3 also shows that the performance of the hybrid model is better than that of the other two hybrid models. The MAPE of each point for the hybrid model is not only smaller also has a small discrete degree. Part A2 shows the 95% confidence intervals (CIs), which were obtained using the actual wind speed. The forecasting value of WD-APSOACO-BP is closer to both the upper CI and the lower CI of the observed wind speed time series. In addition, Part A.4 clearly displays the actual wind speed data compared with the forecasting results for Site 3. The multi-step (2-step and 3-step) ahead forecast values from the proposed hybrid model are closer to the target than those of multi-step (2-step and 3-step) ahead forecasting. The minimum criteria of MAE, MSE, MAPE are generated by the hybrid model of 2-step and 3-step ahead forecasting from Site 3, respectively.
Remark 3.
From the above analysis,the best performance among the three-different hybrid forecasting models for each site is obtained by the proposed hybrid model.The proposed hybrid model provided a satisfactory forecasting performance between the WD-APSO-BP and WD-ACO-BP for 10 min wind speed multi-step forecasting. The numerical experimentation results showed that more features of the actual wind speed fluctuations can be obtained by the proposed hybrid model, which shows a satisfactory wind speed forecasting performance at six wind farm sites.

5. Discussion

In this section, the forecasting accuracy (as a reflection of the forecasting performance), the forecasting results stability of the three different hybrid models are discussed. Furthermore, the performance of each optimization algorithm is presented and discussed.

5.1. Discussion the Forecasting Accuracy of the Model

At present, the establishment of the hybrid model is mostly based on the minimum mean absolute error, mean square error, mean absolute percentage error. Such criteria and assumptions cannot be well reflected in the validity of the forecasting method. Thus, the feasibility and validity of the hybrid model are verified by two-order forecasting validity. Forecasting validity is defined as follows:
The actual value is set as {xt,t = 1, 2, …, N}. m kinds of models are now used to forecast; xit is the forecast value in the ith forecasting method at the jth time point (i = 1, 2, …, m, t = 1, 2, …, N); eit is the error in the ith forecasting method at the jth time points; and Ait = 1 − |eit| is the forecasting accuracy of the ith forecasting method at the jth time point. The formula for two-order forecasting validity is M = E(A)(1 − σ(A)), where E(A) represents the mathematical expectation of the forecasting accuracy of the hybrid forecasting method, σ(A) represents the standard deviation of the prediction accuracy of the hybrid forecasting method. M is the forecasting validity. If the value of the forecasting validity is close to 1, that forecasting model is better. Table 8 indicates that the forecasting availabilities offered by the proposed hybrid model outperform those of the other models for all six sites in multi-step ahead wind speed forecasting.
Remark 4.
The two-order validity of the forecasting model, which possesses the characteristics of high computational efficiency and less forecasting error, not only utilizes an approximate solution but also integrates a variety of forecasting methods and thus provides a more reliable index of forecasting validity.

5.2. Verification Stability of Forecasting Results

The population stability index (PSI) is the most commonly used model stability evaluation index. The formula of the population stability index is as follows:
PSI = ( Actual % - Expected % ) × In ( Actual % Expected % )
The expected proportion (Expected%) and the actual proportion (Actual%) represent the forecasting sample and the actual sample of the model, respectively. Generally speaking, the population stability index is less than 0.1, which represents high stability of the model. If the population stability index is greater than 0.1 and less than 0.25 the model stability is moderate, and if the population stability index is greater than 0.25 the model stability is poor. The definition of the PSI guidelines is shown in Table 9.
In Table 10 and Figure 9, by comparing the WD-APSOACO-BP with two other hybrid models, we conclude that the proposed hybrid model has the most stable forecasting results. In addition, as the number of forecasting steps increases, the PSI becomes larger.
Table 11 and Figure 10 show the individual calculations of the PSI for each bin. The percentage of the observations which lie in each bin are shown for both the forecasting value and the actual datasets. The PSI column shows the calculated PSI for each bin, using the formula from above. Using the guidelines as defined in Table 9, the value of the proposed hybrid forecasting model is much less than 0.1, indicating a minimal shift in the population between the actual data and the forecasting values.
Remark 5.
From comparisons among the three hybrid forecasting models, it can be seen that the PSI value of the proposed hybrid model is lower than that of the others. It can be concluded that the proposed hybrid model can achieve the most stable forecasting results. In addition, as the number of forecasting steps increases, the stability of the forecasting results decreases.

5.3. Analysis of the Optimization Performance of Hybrid Optimization Algorithm

A large number of simulation experiments were undertaken for the selection of the parameters of α, β, ρ in the proposed hybrid optimization algorithm. The parameters of the ant colony optimization algorithm optimized by the adaptive particle swarm optimization algorithm are significantly different in terms of forecasting accuracy (as shown in Figure 11). The performance of the hybrid wind speed forecasting model is positively affected by the parameters in the hybrid APSOACO algorithm, which are as follows:
(1)
The inspired factor has large influence on the optimization performance and convergence of the algorithm in the hybrid APSOACO optimization. When the inspired factor is α ∈ [1.0, 4.0], the performance of the algorithm is better.
(2)
The expectation inspired factor of the APSOACO algorithm has notable influence on the performance of the algorithm and the convergence speed of the algorithm. When the expectation inspired factor is β ∈ [3.0, 5.0], the performance of the algorithm is better.
(3)
The pheromone volatilization factor of the APSOACO algorithm affects the convergence of the algorithm. 1 − ρ is a residual factor, and when ρ ≈ 1–0.5 (i.e., p ≈ 0.5), the stability and performance of the algorithm is maximized. For example, in Table 12, the application of the adaptive particle swarm optimization algorithm is employed to obtain the three parameters α = 1.031792, β = 4.483219, ρ = 0.491852, and these values are fed back to the parameters of the ant colony optimization algorithm. Then, the neural network is optimized for Site 1.
Remark 6.
Through a large number of simulation experiments, the parameters of the different forecasting models are solved by the hybrid optimization algorithm. This shows that the new optimization algorithm has a higher optimization performance. It is also shown that the hybrid optimization has the ability to adaptively adjust the parameters and obtain the approximate optimal solution of the forecasting problem.

6. Conclusions

Accurate wind speed forecasting not only plays an important role in the efficient and safe integration of wind energy into the power system, but also has benefits for the environment, economics, and social development. However, frequent fluctuations, autocorrelation, and stochastic volatility in wind speed make it difficult to obtain a satisfactory forecasting performance. In this study, a hybrid WD-APSOACO-BP model for multi-step ahead short-term wind speed forecasting is presented to overcome these problems. A data pre-processing module is exploited to eliminate the stochastic volatility and fully extract the main features of wind speed by removing small fluctuations and retaining only useful information, as well as creating conditions for improvement of the forecasting accuracy. A hybrid APSOACO algorithm which avoids the limitations caused by subjective setting parameters is utilized to tune and optimize the parameters of the BP neural network in the optimization module. Then, a set of high quality parameters is calculated and applied to optimize the parameters of the BP neural network in the forecasting module. In the evaluation module, experiments from six wind speed data sites in China clearly indicate that the proposed hybrid model is superior to the other alternative models that were explored in this paper. In addition to single-step ahead forecasting, multi-step ahead wind speed forecasting is also adopted and discussed in this paper. The obtained testing results with respect to different forecasting horizons suggest that the developed hybrid wind speed forecasting approach based on the BP neural network integrated with the APSOACO algorithm has the ability to yield satisfactory wind speed forecasting accuracy.

Author Contributions

Chen Wang. conceived and designed the experiments; Chen Wang performed the experiments; Chen Wang and Zonggui Yao. analyzed the data; Chen Wang contributed reagents/materials/analysis tools; Chen Wang wrote the paper.

Acknowledgments

This research was supported by the National Natural Science Foundation of China (Grant No. 71171102/G0107).

Conflicts of Interest

The authors declare no conflict of interest.

Nomenclature

RESsrenewable energy sources
GWECglobal wind energy council
ARMAauto regressive moving average
ARIMAauto regressive integrated moving average
GARCHgeneralized autoregressive conditional heteroscedasticity
ANNartificial neural networks
BPback propagation
GRNNgeneral regression neural network
RBFNNradial basis function neural network
DBNdeep belief network
EEMDensemble empirical mode decomposition
GAgenetic algorithm
EMDempirical mode decomposition
SVRsupport vector machine
IMFintrinsic mode function
WPDwavelet packet decomposition
COAcrisscross optimization algorithm
ELMextreme learning machine
HBSAhybrid backtracking search algorithm
WDwavelet de-noising
ACOant colony optimization
ACOant colony optimization
APSOadaptive particle swarm optimization
APSOACOa modified adaptive particle swarm optimization algorithm based ant colony optimization algorithm
WTTwavelet transforms technique
CWTcontinuous wavelet transforms
DWTdiscrete wavelet transforms
IFSiterated function system
RMrolling mechanism
AEaverage error
MAEmean absolute error
MAPEmean absolute percentage error
BPNNback propagation neural network
Elman NNElman neural network
WNNwavelet neural network
ESexponential smoothing
dbNN-order Daubechies wavelet
IAIndex Agreement
CIconfidence interval
PSIpopulation stability index
ELMextreme learning machine
ACOant colony optimization

References

  1. Hu, J.; Wang, J.; Xiao, L. A hybrid approach based on the gaussian process with t-observation model for short-term wind speed forecasts. Renew. Energy 2017, 114, 670–685. [Google Scholar] [CrossRef]
  2. REN21 Renewables 2012 Global Status Report; REN21 Secretariat: Paris, France, 2012.
  3. World Wind Market has Reached 486 GW from Where 54 GW has been Installed Last Year. Available online: http://www.wwindea.org/11961-2/ (accessed on 5 March 2018).
  4. Zhang, Z.S.; Sun, Y.Z.; Cheng, L. Potential of trading wind power as regulation services in the California short-term electricity market. Energy Policy 2013, 9, 885–897. [Google Scholar] [CrossRef]
  5. Tascikaraoglu, A.; Uzunoglu, M. A review of combined approaches for prediction of short-term wind speed and power. Renew. Sustain. Energy Rev. 2014, 34, 243–254. [Google Scholar] [CrossRef]
  6. Wang, J.; Heng, J.; Xiao, L.; Wang, C. Research and application of a combined model based on multi-objective optimization for multi-step ahead wind speed forecasting. Energy 2017, 125, 591–613. [Google Scholar] [CrossRef]
  7. Giorgi, M.G.D.; Ficarella, A.; Tarantino, M. Assessment of the benefits of numerical weather predictions in wind power forecasting based on statistical methods. Energy 2011, 36, 3968–3978. [Google Scholar] [CrossRef]
  8. Cassola, F.; Burlando, M. Wind speed and wind energy forecast through kalman filtering of numerical weather prediction model output. Appl. Energy 2012, 99, 154–166. [Google Scholar] [CrossRef]
  9. Erdem, E.; Shi, J. Arma based approaches for forecasting the tuple of wind speed and direction. Appl. Energy 2011, 88, 1405–1414. [Google Scholar] [CrossRef]
  10. Kavasseri, R.G.; Seetharaman, K. Day-ahead wind speed forecasting using f-arima models. Renew. Energy 2009, 34, 1388–1393. [Google Scholar] [CrossRef]
  11. Shen, Z.; Ritter, M. Forecasting volatility of wind power production. Appl. Energy 2016, 176, 295–308. [Google Scholar] [CrossRef]
  12. Chang, S.G.; Yu, B.; Vetterli, M. Spatially adaptive wavelet thresholding with context modeling for image denoising. IEEE Trans. Image Process. 2000, 9, 1522–1531. [Google Scholar] [CrossRef] [PubMed]
  13. Tascikaraoglu, A.; Sanandaji, B.M.; Poolla, K.; Varaiya, P. Exploiting sparsity of interconnections in spatio-temporal wind speed forecasting using wavelet transform. Appl. Energy 2016, 165, 735–747. [Google Scholar] [CrossRef]
  14. Weron, R. Electricity price forecasting: A review of the state-of-the-art with a look into the future. Int. J. Forecast. 2014, 30, 1030–1081. [Google Scholar] [CrossRef]
  15. Amjady, N.; Keynia, F. Day ahead price forecasting of electricity markets by a mixed data model and hybrid forecast method. Int. J. Electr. Power Energy Syst. 2008, 30, 533–546. [Google Scholar] [CrossRef]
  16. Guo, Z.H.; Wu, J.; Lu, H.Y.; Wang, J.Z. A case study on a hybrid wind speed forecasting method using bp neural network. Knowl. Based Syst. 2011, 24, 1048–1056. [Google Scholar] [CrossRef]
  17. Zhao, W.; Wei, Y.M.; Su, Z. One day ahead wind speed forecasting: A resampling-based approach. Appl. Energy 2016, 178, 886–901. [Google Scholar] [CrossRef]
  18. Chen, B.; Zhao, L.; Wang, X.; Lu, J.H.; Liu, G.Y.; Cao, R.F.; Liu, J. Wind speed prediction using ols algorithm based on rbf neural network. In Proceedings of the IEEE Power and Energy Engineering Conference (APPEEC 2009), Wuhan, China, 27–31 March 2009. [Google Scholar]
  19. Wang, H.Z.; Wang, G.B.; Li, G.Q.; Peng, J.C.; Liu, Y.T.; Yan, J. Deep belief network based deterministic and probabilistic wind speed forecasting approach. Appl. Energy 2016, 182, 80–93. [Google Scholar] [CrossRef]
  20. Cadenas, E.; Rivera, W. Wind speed forecasting in three different regions of Mexico, using a hybrid ARIMA–ANN model. Renew. Energy 2010, 35, 2732–2738. [Google Scholar] [CrossRef]
  21. Liu, H.; Tian, H.Q.; Pan, D.F.; Li, Y.F. Forecasting models for wind speed using wavelet, wavelet packet, time series and artificial neural networks. Appl. Energy 2013, 107, 191–208. [Google Scholar] [CrossRef]
  22. Du, P.; Jin, Y.; Zhang, K. A hybrid multi-step rolling forecasting model based on ssa and simulated annealing—Adaptive particle swarm optimization for wind speed. Sustainability 2016, 8, 754. [Google Scholar] [CrossRef]
  23. Ma, X.; Jin, Y.; Dong, Q. A generalized dynamic fuzzy neural network based on singular spectrum analysis optimized by brain storm optimization for short-term wind speed forecasting. Appl. Soft Comput. 2017, 54, 296–312. [Google Scholar] [CrossRef]
  24. Wang, S.; Zhang, N.; Wu, L.; Wang, Y. Wind speed forecasting based on the hybrid ensemble empirical mode decomposition and GA-BP neural network method. Renew. Energy 2016, 94, 629–636. [Google Scholar] [CrossRef]
  25. Ren, Y.; Suganthan, P.N.; Srikanth, N. A novel empirical mode decomposition with support vector regression for wind speed forecasting. IEEE Trans. Neural Netw. Learn. Syst. 2016, 27, 1793–1798. [Google Scholar] [CrossRef] [PubMed]
  26. Zhang, C.; Zhou, J.; Li, C.; Fu, W.; Peng, T. A compound structure of ELM based on feature selection and parameter optimization using hybrid backtracking search algorithm for wind speed forecasting. Energy Convers. Manag. 2017, 143, 360–376. [Google Scholar] [CrossRef]
  27. Zhang, C.; Wei, H.; Zhao, J.; Liu, T.; Zhu, T.; Zhang, K. Short-term wind speed forecasting using empirical mode decomposition and feature selection. Renew. Energy 2016, 96, 727–737. [Google Scholar] [CrossRef]
  28. Moghram, I.; Rahman, S. Analysis and evaluation of five short-term load forecasting techniques. IEEE Trans. Power Syst. 2002, 4, 1484–1491. [Google Scholar] [CrossRef]
  29. Goswami, J.C.; Chan, A.K. Fundamentals of Wavelets: Theory, Algorithms and Applications; Wiley: Hoboken, NJ, USA, 2011. [Google Scholar]
  30. Wang, J.Z.; Wang, J.J.; Zhang, Z.G.; Guo, S.P. Forecasting stock indices with back propagation neural network. Expert Syst. Appl. 2011, 38, 14346–14355. [Google Scholar] [CrossRef]
  31. Xiao, L.; Wang, J.; Dong, Y.; Wu, J. Combined forecasting models for wind energy forecasting: a case study in China. Renew. Sustain. Energy Rev. 2015, 44, 271–288. [Google Scholar] [CrossRef]
  32. Sun, W.; Xu, Y. Financial security evaluation of the electric power industry in china based on a back propagation neural network optimized by genetic algorithm. Energy 2016, 101, 366–379. [Google Scholar] [CrossRef]
  33. Section, T. Short-term wind speed hybrid forecasting model based on bias correcting study and its application. Math. Probl. Eng. 2015, 2015, 1–13. [Google Scholar]
  34. Navascués, M.A.; Sebastián, M.V. Smooth fractal interpolation. J. Inequal. Appl. 2006, 2006, 78734. [Google Scholar] [CrossRef]
  35. Zhan, Z.H.; Zhang, J. Adaptive particle swarm optimization. In Ant Colony Optimization and Swarm Intelligence; Springer: Berlin/Heidelberg, Germany, 2008; pp. 227–234. [Google Scholar]
  36. Shen, Q.; Jiang, J.H.; Tao, J.C.; Shen, G.L.; Yu, R.Q. Modified ant colony optimization algorithm for variable selection in QSAR modeling: QSAR studies of cyclooxygenase inhibitors. Cheminform 2005, 45, 1024–1029. [Google Scholar]
  37. Stützle, T. Ant colony optimization. Comput. Intell. Mag. IEEE 2003, 1, 28–39. [Google Scholar]
  38. Jiang, P.; Liu, F.; Wang, J.; Song, Y. Cuckoo search-designated fractal interpolation functions with winner combination for estimating missing values in time series. Appl. Math. Modell. 2016, 40, 9692–9718. [Google Scholar] [CrossRef]
  39. Lu, C.J. Integrating independent component analysis-based denoising scheme with neural network for stock price prediction. Expert Syst. Appl. 2010, 37, 7056–7064. [Google Scholar] [CrossRef]
  40. Aussem, A.; Campbell, J.; Murtagh, F. Wavelet-based feature extraction and decomposition strategies for financial forecasting. J. Comput. Intell. Financ. 1998, 6, 5–12. [Google Scholar]
  41. Zheng, G.; Starck, J.L.; Campbell, J.; Murtagh, F.; Zheng, G.; Campbell, J. The wavelet transform for filtering financial data streams. J. Comput. Intell. Finance 1999, 7, 18–35. [Google Scholar]
  42. Zhang, J.; Morris, A.J. A sequential learning approach for single hidden layer neural networks. Neural Netw. 1998, 11, 65–80. [Google Scholar] [CrossRef]
  43. Niu, T.; Wang, J.; Zhang, K.; Du, P. Multi-step-ahead Wind Speed Forecasting Based on Optimal Feature Selection and a Modified Bat Algorithm with the Cognition Strategy. Renew. Energy 2017, 118, 213–229. [Google Scholar] [CrossRef]
  44. Liu, L.; Wang, Q.; Wang, J.; Liu, M. A rolling grey model optimized by particle swarm optimization in economic prediction. Comput. Intell. 2016, 32, 391–419. [Google Scholar] [CrossRef]
  45. Jolayemi, E.T. A Multiraters Agreement Index for Ordinal Classification. Biom. J. 1991, 33, 485–492. [Google Scholar] [CrossRef]
  46. Yu, L.; Lai, K.K.; Wang, S.; Huang, W. A Bias-Variance-Complexity Trade-Off Framework for Complex System Modeling. In Computational Science and Its Applications—ICCSA 2006; Springer: Berlin/Heidelberg, Germany, 2006. [Google Scholar]
Figure 1. The structure of three different hybrid models. (a) the structure of neural network combined with ant colony optimization algorithm (b) the structure of neural network combined with adaptive particle swarm optimization algorithm. (c) the structure of neural network combined with hybrid optimization algorithm (d) the structure of neural network
Figure 1. The structure of three different hybrid models. (a) the structure of neural network combined with ant colony optimization algorithm (b) the structure of neural network combined with adaptive particle swarm optimization algorithm. (c) the structure of neural network combined with hybrid optimization algorithm (d) the structure of neural network
Sustainability 10 01443 g001
Figure 2. The data structures of the forecasting process (the six wind speed time series were generated in the Shandong province. The actual data was pre-processed by a wavelet de-noise. In the forecasting process, the training sample numbers 1008, the output sample numbers 144. The input data is the de-noised time series and the output data is the actual time series).
Figure 2. The data structures of the forecasting process (the six wind speed time series were generated in the Shandong province. The actual data was pre-processed by a wavelet de-noise. In the forecasting process, the training sample numbers 1008, the output sample numbers 144. The input data is the de-noised time series and the output data is the actual time series).
Sustainability 10 01443 g002
Figure 3. Forecasting performance by each neural network.
Figure 3. Forecasting performance by each neural network.
Sustainability 10 01443 g003
Figure 4. Best structure of BPNN for prediction result in each site.
Figure 4. Best structure of BPNN for prediction result in each site.
Sustainability 10 01443 g004
Figure 5. Flowchart of three hybrid forecasting models.
Figure 5. Flowchart of three hybrid forecasting models.
Sustainability 10 01443 g005
Figure 6. The forecasting results of three methods for Site 2 (1-step forecasting).
Figure 6. The forecasting results of three methods for Site 2 (1-step forecasting).
Sustainability 10 01443 g006
Figure 7. The forecasting results of three methods for Site 1 (2-step forecasting).
Figure 7. The forecasting results of three methods for Site 1 (2-step forecasting).
Sustainability 10 01443 g007
Figure 8. The forecasting results of three methods for Site 3 (3-step forecasting).
Figure 8. The forecasting results of three methods for Site 3 (3-step forecasting).
Sustainability 10 01443 g008
Figure 9. PSI values of different models.
Figure 9. PSI values of different models.
Sustainability 10 01443 g009
Figure 10. The expect (%) and actual (%) by each hybrid model.
Figure 10. The expect (%) and actual (%) by each hybrid model.
Sustainability 10 01443 g010
Figure 11. The performance of three different optimization algorithms.
Figure 11. The performance of three different optimization algorithms.
Sustainability 10 01443 g011
Table 1. Theoretical comparison of the strengths and disadvantages exist in present individual wind speed forecasting models.
Table 1. Theoretical comparison of the strengths and disadvantages exist in present individual wind speed forecasting models.
The Type of the ModelMain MethodsThe Features and Strengths of Each ModelThe Disadvantages of Each Model
Physical modelsMesoscale numerical model;
Computational fluid dynamics method [7]
Obtaining good performance in the long-term forecasting;
Having a high parallel efficiency [7].
Requiring considerable observed data with limited simulation scale;
Having higher cost [8].
Statistical modelsTime series approach [9,10,11]A wide application and cost less time to build models [12].Obtaining poor performance in dealing with non-linear time series forecasting [12].
Spatial correlation modelsSpatial correlation models [13]Obtaining a satisfactory wind speed forecasting by vast quantities of information that need be considered and collected [13].Requiring wind speed measurements from multiple spatial correlated sites so that the implementation has difficulty due to the measurements and their time delays [6].
Artificial intelligence modelsNeural network approach;
Support vector machine;
Fuzzy and clustering approach [6,15,16,17,18,19]
Having a high ability of fault tolerance do not require accurate mathematical models with each man-machine interaction;
Obtaining a satisfactory performance in non-linear time series forecasting [18]
Easily getting into local optimum, over-fitting and exhibiting the relatively low convergence rate;
Having a relatively low accuracy and lack for systematization [19].
Table 2. The main type of wind power turbines in Shandong province of China.
Table 2. The main type of wind power turbines in Shandong province of China.
Type ParameterGuodian-UP82CCWE-1500/82.DFGamesa-G58/850Goldwind-GW82/1500
Rated power1500 KW1500 KW850 KW1500 KW
Cut-in Speed3 m/s3 m/s3 m/s3 m/s
Rated wind speed10.5 m/s11 m/s10.5 m/s10.3 m/s
Cut-out Speed25 m/s25 m/s22 m/s22 m/s
Survival wind speed52.5 m/s (3 s)52.5 m/s (3 s)52.5 m/s (3 s)52.5 m/s (3 s)
Swept area of rotor5384 m25278 m22642 m25324 m2
Rotor diameter82.76 m82 m58 m82 m
Table 3. Forecasting result by different Artificial Neural Network and the best forecasting structure.
Table 3. Forecasting result by different Artificial Neural Network and the best forecasting structure.
Forecasting StepOne-Step ForecastingTwo-Step ForecastingThree-Step Forecasting
ModelMetricSite 1Site 2Site 3Site 4Site 5Site 6Site 1Site 2Site 3Site 4Site 5Site 6Site1 Site 2Site 3Site 4Site 5Site 6
BPNNInput655455586488775455
Hidden141816171514151314151714151612141912
MAE0.44510.37980.37990.38170.38280.38510.39120.43600.49620.43030.49020.59150.50800.65350.51140.64850.58720.5746
MSE0.35040.23150.23270.23560.23740.24180.30510.28580.37170.34690.27890.32490.38220.51740.38870.39620.41390.4504
MAPE6.39%6.08%6.07%6.09%6.11%6.14%7.71%8.56%6.85%7.21%7.18%7.80%8.31%9.16%9.36%10.51%10.42%10.93%
ELMInput563367783547733467
Hidden171418141618182414151413211819162214
MAE0.450.49270.47510.39660.46960.49520.42150.53020.51960.54860.48090.60230.59950.72580.7520.70860.94310.8917
MSE0.36610.40560.35830.23050.37870.4770.30590.42830.48860.51110.36060.9260.66540.91461.05310.93681.53511.4202
MAPE6.81%6.79%9.18%6.64%6.88%9.34%7.07%7.73%9.66%8.91%7.45%9.80%10.04%10.20%13.50%11.35%13.60%14.21%
Elman NNInput836535866386745673
Hidden191419142015161917131513252218181816
MAE0.49070.63040.54160.47510.53130.55880.7010.66510.86540.63580.65380.67460.74770.74421.04350.77580.99260.8299
MSE0.41370.64110.57790.3920.48070.58170.78780.77371.31380.72160.66040.85930.90620.97751.97761.10361.40481.2713
MAPE7.99%8.72%9.88%7.67%7.98%10.67%9.72%10.81%15.14%10.91%9.69%13.11%10.26%12.13%18.05%13.53%14.62%14.96%
GRNNInput374636445388667765
Hidden131417161519181816241713231916252419
MAE0.46090.58470.43320.40180.52630.72460.47260.7130.67290.85370.81510.86870.74180.98130.99741.27230.99330.9972
MSE0.34290.55220.2870.28510.44960.94650.37910.94941.23021.31971.27831.32761.02391.60891.75122.741.63641.8682
MAPE7.22%8.87%8.51%7.35%8.73%14.05%7.71%9.57%11.07%12.51%10.52%16.83%11.57%12.95%19.75%17.68%14.28%18.17%
WNNInput467753754533664454
Hidden171620141818252520121516231415141518
MAE0.45210.5140.51260.46520.4890.48540.64580.70380.73230.66460.67080.69340.77720.84860.91130.82070.82520.8531
MSE0.38210.43540.48420.35990.42190.42390.77970.82770.98810.73450.77290.8651.10741.21711.48741.1121.1451.2837
MAPE7.29%7.58%9.14%7.83%8.03%9.30%10.42%9.70%13.06%11.18%9.83%13.29%12.42%11.60%16.18%13.86%12.04%16.34%
Table 4. The forecasting result of each site (1-step forecasting).
Table 4. The forecasting result of each site (1-step forecasting).
Forecasting ModelMetricSite
Site 1Site 2Site 3Site 4Site 5Site 6
WD-GRNNMAE0.39220.41230.41020.42200.41390.4193
WD-Elman0.40390.41410.39130.39810.39480.4219
WD-BP0.40220.39560.38960.40750.40450.4061
WD-WNN0.40990.41120.40420.39360.41190.3903
WD-APSO-BP0.38500.38860.42720.44700.39850.3818
WD-ACO-BP0.39130.41230.43880.49040.41790.4025
WD-APSOACO-BP0.30270.31290.34460.35350.32250.3108
WD-GRNNMSE0.24990.26030.34050.38700.28570.2811
WD-Elman0.25730.26140.32480.36510.27250.2829
WD-BP0.25620.24980.32340.37380.27910.2722
WD-WNN0.26120.25960.33550.36100.28430.2617
WD-APSO-BP0.24530.24310.31960.35310.26570.2581
WD-ACO-BP0.25030.45200.34080.41590.31050.2972
WD-APSOACO-BP0.15280.16180.21630.22390.17590.1753
WD-GRNNMAPE5.96%5.39%6.71%6.85%5.62%6.97%
WD-Elman6.13%5.41%6.40%6.46%5.36%7.02%
WD-BP6.11%5.17%6.37%6.61%5.49%6.75%
WD-WNN6.23%5.37%6.61%6.39%5.59%6.49%
WD-APSO-BP5.85%5.03%6.30%6.25%5.22%6.40%
WD-ACO-BP5.93%5.28%6.47%6.88%5.48%6.72%
WD-APSOACO-BP4.59%4.05%5.02%4.89%4.21%5.23%
WD-GRNNVAR(Y)0.80281.00090.84921.20140.94510.8391
WD-Elman0.82671.00510.81001.13350.90150.8444
WD-BP0.82320.96040.80651.16040.92360.8127
WD-WNN0.83900.99810.83671.12070.94060.7812
WD-APSO-BP0.78810.93460.79701.09620.87910.7705
WD-ACO-BP0.76541.19800.79641.05820.88320.8135
WD-APSOACO-BP0.71010.90900.72170.91920.80090.7405
WD-GRNNBias21.50901.31651.34231.14631.26961.4001
WD-Elman1.55381.32211.28031.08151.21101.4089
WD-BP1.54731.26321.27481.10711.24061.3560
WD-WNN1.57701.31291.32261.06931.26351.3034
WD-APSO-BP1.48131.22931.25981.04591.18091.2856
WD-ACO-BP1.46681.17801.29611.00651.28661.4229
WD-APSOACO-BP1.44001.18521.24051.00291.18601.3891
WD-GRNNIA0.94430.89890.88750.87450.88780.8682
WD-Elman0.91700.89510.93050.92690.93080.8628
WD-BP0.92090.93680.93450.90540.90850.8965
WD-WNN0.90350.90140.90070.93740.89210.9327
WD-APSO-BP0.93980.94330.91960.93200.92990.9144
WD-ACO-BP0.93720.89430.91770.92320.91650.9109
WD-APSOACO-BP0.96190.96270.94560.95840.95450.9456
Note: Marked by bold is the best forecasting model.
Table 5. The forecasting results of Site 2 (1-step forecasting).
Table 5. The forecasting results of Site 2 (1-step forecasting).
WeekForecasting ModelEvaluation Criteria Metric
MAEMSEMAPEVAR(Y)Bias2IA
MONWD-GRNN0.47660.33527.05%1.52633.92150.9158
WD-Elman0.48060.31847.05%1.46064.10740.9723
WD-BP0.44630.32416.53%1.50454.06090.8994
WD-WNN0.48100.33257.05%1.41284.12380.8925
WD-APSO-BP0.44070.31536.43%1.39303.76280.9758
WD-ACO-BP0.43500.32436.20%1.29263.59120.9748
WD-APSOACO-BP0.35350.21565.14%1.12893.57400.9833
TueWD-GRNN0.53500.40524.71%0.35031.12120.8379
WD-Elman0.53950.38484.71%0.33521.17430.8896
WD-BP0.50100.39174.36%0.34531.16110.8229
WD-WNN0.53990.40194.71%0.32421.17910.8166
WD-APSO-BP0.49470.38114.29%0.31971.07580.8928
WD-ACO-BP0.48890.36624.24%0.28241.02020.8902
WD-APSOACO-BP0.40450.25673.49%0.27700.96380.9302
WedWD-GRNN0.37290.18645.97%0.69960.02950.8681
WD-Elman0.37600.17705.98%0.66950.03090.9216
WD-BP0.34920.18025.54%0.68960.03050.8525
WD-WNN0.37630.18495.98%0.64750.03100.8460
WD-APSO-BP0.34480.17535.45%0.63850.02830.9250
WD-ACO-BP0.35050.18285.54%0.61420.03110.9209
WD-APSOACO-BP0.28410.11934.51%0.58350.02220.9504
ThuWD-GRNN0.48600.35795.35%1.97431.27970.9029
WD-Elman0.49010.33995.35%1.88931.34030.9586
WD-BP0.45510.34604.96%1.94611.32520.8868
WD-WNN0.49040.35505.36%1.82741.34570.8799
WD-APSO-BP0.44940.33664.88%1.80191.22790.9621
WD-ACO-BP0.48360.37745.24%1.84771.18670.9576
WD-APSOACO-BP0.36290.22373.90%1.70101.14000.9754
FriWD-GRNN0.35940.17926.41%0.47390.24040.8513
WD-Elman0.36240.17016.42%0.45350.25180.9038
WD-BP0.33650.17325.94%0.46710.24890.8360
WD-WNN0.36270.17776.42%0.43860.25280.8296
WD-APSO-BP0.33230.16855.85%0.43250.23070.9071
WD-ACO-BP0.35030.19306.08%0.37570.21700.8904
WD-APSOACO-BP0.27740.11954.84%0.33010.20330.9355
SatWD-GRNN0.39060.18945.27%2.20040.35820.8230
WD-Elman0.39390.17985.27%2.10570.37520.8737
WD-BP0.36580.18314.89%2.16890.37090.8082
WD-WNN0.39420.18785.28%2.03670.37670.8020
WD-APSO-BP0.36120.17814.81%2.00820.34370.8769
WD-ACO-BP0.42061.53195.30%3.16690.06580.6492
WD-APSOACO-BP0.25070.09773.33%1.08150.20510.9764
SunWD-GRNN0.35360.16594.30%0.96022.16560.9213
WD-Elman0.35660.15754.30%0.91892.26830.9782
WD-BP0.33120.16033.99%0.94652.24260.9048
WD-WNN0.35690.16454.30%0.88882.27740.8979
WD-APSO-BP0.32700.15603.92%0.87632.07800.9817
WD-ACO-BP0.35690.18844.38%0.85642.27910.9770
WD-APSOACO-BP0.25720.10013.11%0.81151.95630.9878
Note: Marked by bold is the best forecasting model.
Table 6. The forecasting result of each site (2-step forecasting).
Table 6. The forecasting result of each site (2-step forecasting).
Forecasting ModelMetricSite
Site 1Site 2Site 3Site 4Site 5Site 6
WD-GRNNMAE0.47430.47070.50430.55650.49990.4446
WD-Elman0.47790.44930.4930.53680.49720.4578
WD-BP0.47720.47290.49550.56970.51560.4558
WD-WNN0.46160.44310.53110.52210.5170.4646
WD-APSO-BP0.44420.44170.49070.52030.47890.4364
WD-ACO-BP0.46310.47860.52490.53560.48790.4797
WD-APSOACO-BP0.36640.3490.39540.41270.38020.3472
WD-GRNNMSE0.35880.36250.45530.50060.4230.3612
WD-Elman0.36150.3460.4450.48290.42070.372
WD-BP0.3610.36420.44730.51260.43620.3704
WD-WNN0.34920.34130.47950.46970.43740.3775
WD-APSO-BP0.3360.34020.4430.46810.40520.3546
WD-ACO-BP0.36170.45150.49270.49940.40730.4275
WD-APSOACO-BP0.23090.21350.28940.30410.25470.2298
WD-GRNNMAPE0.06210.07070.0750.07910.0660.0746
WD-Elman0.06260.06750.07330.07630.06560.0768
WD-BP0.06250.0710.07370.0810.0680.0765
WD-WNN0.06050.06660.0790.07420.06820.078
WD-APSO-BP5.82%6.63%7.30%7.39%6.32%7.33%
WD-ACO-BP6.06%7.34%7.89%7.56%6.43%7.99%
WD-APSOACO-BP4.79%5.25%5.90%5.87%5.07%5.79%
WD-GRNNVAR(Y)1.02720.82860.8421.20890.97780.8358
WD-Elman1.03480.79090.82311.16620.97240.8607
WD-BP1.03340.83250.82731.23781.00840.8571
WD-WNN0.99960.78010.88681.13431.01120.8735
WD-APSO-BP0.96190.77760.81931.13040.93670.8205
WD-ACO-BP0.96180.78180.80961.12960.87970.7911
WD-APSOACO-BP0.93020.77660.78621.02120.86790.7844
WD-GRNNBias21.21051.49091.27591.02361.2191.4092
WD-Elman1.21951.42321.24720.98741.21241.451
WD-BP1.21781.4981.25361.0481.25721.4449
WD-WNN1.17811.40371.34370.96041.26071.4727
WD-APSO-BP1.13361.39921.24150.95711.16781.3833
WD-ACO-BP1.14361.4081.14620.96941.1451.3562
WD-APSOACO-BP1.11831.30641.12450.90191.12951.3219
WD-GRNNIA0.86340.85920.87170.84580.85760.8828
WD-Elman0.85710.90010.89170.87680.86230.8573
WD-BP0.85820.85510.88720.82610.83150.8609
WD-WNN0.88720.91260.82760.90150.82930.8447
WD-APSO-BP0.9220.91550.89580.90460.89520.8993
WD-ACO-BP0.91490.88160.87820.89790.89240.8806
WD-APSOACO-BP0.94560.94680.93010.93820.92990.9356
Note: Marked by bold is the best forecasting model
Table 7. The forecasting result of each site (3-step forecasting).
Table 7. The forecasting result of each site (3-step forecasting).
Forecasting ModelMetricSite
Site 1Site 2Site 3Site 4Site 5Site 6
WD-GRNNMAE0.55270.55640.63840.65550.70580.4943
WD-Elman0.55500.53070.60230.62530.71030.4995
WD-BP0.53030.52840.61660.64050.68360.5279
WD-WNN0.55120.54820.59550.65230.65710.4994
WD-APSO-BP0.51610.52220.58250.60970.64810.487
WD-ACO-BP0.52460.55560.44970.64310.56770.5368
WD-APSOACO-BP0.40710.43030.38980.48970.45140.3834
WD-GRNNMSE0.51010.50440.69090.69430.86770.4683
WD-Elman0.51220.48110.65190.66230.87310.4733
WD-BP0.48940.47900.66730.67850.84030.5002
WD-WNN0.50870.49700.64450.69090.80770.4731
WD-APSO-BP0.47630.47340.63040.64580.79670.4614
WD-ACO-BP0.48390.53090.35350.70790.57930.5746
WD-APSOACO-BP0.29720.33220.2930.43730.36370.2874
WD-GRNNMAPE8.39%7.33%9.56%9.34%9.38%8.34%
WD-Elman8.42%6.99%9.02%8.91%9.44%8.43%
WD-BP8.05%6.96%9.23%9.13%9.08%8.91%
WD-WNN8.36%7.22%8.92%9.30%8.73%8.43%
WD-APSO-BP7.83%6.88%8.72%8.69%8.61%8.22%
WD-ACO-BP7.97%7.31%6.67%9.23%7.53%9.02%
WD-APSOACO-BP6.15%5.71%5.86%6.99%6.03%6.43%
WD-GRNNVAR(Y)0.83241.04220.89651.04800.96610.8162
WD-Elman0.83600.99400.84580.99970.97220.8249
WD-BP0.79880.98970.86591.02410.93560.8718
WD-WNN0.83011.02680.83631.04290.89940.8246
WD-APSO-BP0.77730.97810.8180.97480.88710.8042
WD-ACO-BP0.77170.97870.81481.01740.93780.84
WD-APSOACO-BP0.73280.9420.8030.92260.78390.7756
WD-GRNNBias21.39071.10061.40680.95451.18691.4039
WD-Elman1.39661.04971.32730.91041.19431.4188
WD-BP1.33441.04521.35870.93271.14941.4995
WD-WNN1.38691.08441.31230.94991.10491.4184
WD-APSO-BP1.29861.03291.28360.88781.08981.3832
WD-ACO-BP1.36321.07031.06560.93251.10691.5875
WD-APSOACO-BP1.28571.00431.05580.86451.06121.2428
WD-GRNNIA0.82270.83240.78180.74240.75240.8607
WD-Elman0.81930.87270.82860.77830.74770.8517
WD-BP0.85740.87650.80940.75980.77690.8059
WD-WNN0.82500.84480.83800.74600.80820.8519
WD-APSO-BP0.88110.88690.85680.79820.81940.8736
WD-ACO-BP0.87340.87290.89730.85330.85140.8431
WD-APSOACO-BP0.92680.92270.89930.9070.90550.9207
Note: Marked by bold is the best forecasting model
Table 8. Forecasting validity of each forecasting model.
Table 8. Forecasting validity of each forecasting model.
Forecasting STEPForecasting ModelSite
Site 1Site 2Site 3Site 4Site 5Site 6
1-step forecastingARIMA0.80220.80370.80570.80990.73170.8054
ARMA0.79590.83140.80070.80250.78070.8046
ES(2)0.8310.89940.83920.83370.79340.8287
ES0.78870.78880.79410.7940.72530.7839
GRNN0.81150.808240.81560.81740.80920.8062
WNN0.92320.919150.93110.93190.91610.9271
BPNN0.92090.913690.92260.92570.91680.923
ElmanNN0.92610.919780.92840.92650.92230.9202
WD-APSO-BP0.96040.97020.96450.96810.99690.9932
WD-ACO-BP0.95720.96450.95870.96190.9940.9871
WD-APSOACO-BP0.96310.96810.96680.97110.99160.9886
2-step forecastingARIMA0.80560.81810.81140.74240.80260.795
ARMA0.80580.80770.82920.81270.8020.7903
ES(2)0.80590.83430.83890.80090.80310.7987
ES0.72790.73450.79880.72790.72480.7251
GRNN0.80880.82670.81080.81110.80620.8059
WNN0.92390.9950.92980.93250.92650.9229
BPNN0.92270.95830.93180.92290.91590.9219
ElmanNN0.92790.96190.93170.93220.92130.9251
WD-APSO-BP0.96070.96670.97030.96750.96570.989
WD-ACO-BP0.95920.9630.95960.96420.96330.9784
WD-APSOACO-BP0.96290.96870.96540.97580.98880.9868
3-step forecastingARIMA0.73560.80410.81080.81260.79470.73
ARMA0.78190.72380.80580.8060.79020.7776
ES(2)0.79360.83490.81390.80960.82870.789
ES0.72780.79380.73130.7330.78110.7246
GRNN0.80870.813310.81660.8160.80150.8038
WNN0.92710.961230.92920.93150.91610.9199
BPNN0.92460.936950.92820.93150.91520.9144
ElmanNN0.92210.990820.9250.92460.92610.9215
WD-APSO-BP0.96240.96390.96310.96310.99270.9823
WD-ACO-BP0.95150.96010.95890.95930.98740.9657
WD-APSOACO-BP0.96450.97180.96630.96870.98050.9756
Note: Marked by bold is the best forecasting model
Table 9. The definition of population stability index (PSI) Value guidelines.
Table 9. The definition of population stability index (PSI) Value guidelines.
PSI ValueInferenceStability
Less than 0.1Insignificant changeThe model has high stability
0.1—0.25Some minor changeThe model stability is moderate
Greater than 0.25Major changeThe stability of the model is poor
Table 10. The stability of forecasting by each model in different forecasting horizons.
Table 10. The stability of forecasting by each model in different forecasting horizons.
SitesStability (PSI) of 1-Step ForecastingStability (PSI) of 2-Step ForecastingStability (PSI) of 3-Step Forecasting
ACOAPSOAPSOACOACOAPSOAPSOACOACOAPSOAPSOACO
Site 10.01900.02840.01250.02100.03970.02240.03730.07080.0256
Site 20.02070.03790.01920.04770.03060.02060.05510.03060.0218
Site 30.02870.01850.01660.06110.04880.01910.03070.05500.0252
Site 40.00590.00860.00190.01370.02530.01120.02530.02720.0192
Site 50.05360.05070.06290.09070.07950.05680.09730.09370.0511
Site 60.02830.03160.02320.03330.03140.02230.04950.04490.0208
Table 11. The PSI of forecasting by each model in different forecasting step in Site 1.
Table 11. The PSI of forecasting by each model in different forecasting step in Site 1.
Stability of One Step Forecasting
BinEdgesActual (%)Expected of ACO (%)PSI of ACOExpected of APSO (%)PSI of APSOExpected of APSOACO (%)PSI of APSOACO
[3, 4.1]3.17%2.08%0.0045971.49%0.0127782.58%0.001236
[4.1, 5.2]20.44%21.63%0.00067423.02%0.00306622.02%0.001187
[5.2, 6.3]23.02%22.72%3.87E-0521.92%0.0005321.73%0.000744
[6.3, 7.4]16.96%18.95%0.00219518.95%0.00219518.85%0.001986
[7.4, 8.5]9.13%7.14%0.0048648.33%0.0007228.04%0.00139
[8.5, 9.6]9.03%9.33%9.65E-057.44%0.0030698.13%0.00093
[9.6, 10.7]6.35%5.95%0.0002566.25%1.56E-055.95%0.000256
[10.7, 11.8]7.34%8.43%0.0015129.23%0.0043089.13%0.003888
[11.8, 12.9]4.17%3.67%0.0006293.37%0.0016773.57%0.000918
[12.9, 14]0.40%0.10%0.0041260.00%0.00%0.00%0
PSI =0.018987PSI =0.028361PSI =0.012534
Stability of two step forecasting
BinEdgesActual (%)Expected of ACO (%)PSI of ACOExpected of APSO (%)PSI of APSOExpected of APSOACO (%)PSI of APSOACO
[3, 4.1]3.17%2.68%0.0008431.19%0.0193982.28%0.002949
[4.1, 5.2]20.44%20.24%1.94E-0519.48%0.00045522.12%0.001337
[5.2, 6.3]23.02%23.51%0.00010624.95%0.00156122.12%0.000353
[6.3, 7.4]16.96%19.25%0.00287919.18%0.00273219.05%0.002413
[7.4, 8.5]9.13%8.53%0.0004019.34%5.1E-058.53%0.000401
[8.5, 9.6]9.03%7.74%0.0019887.16%0.0043447.74%0.001988
[9.6, 10.7]6.35%5.75%0.0005865.77%0.0005635.36%0.001686
[10.7, 11.8]7.34%9.13%0.0038889.84%0.0073259.82%0.007219
[11.8, 12.9]4.17%3.08%0.0033143.08%0.0032742.98%0.004006
[12.9, 14]0.40%0.10%00.00%00.00%0
PSI =0.014025PSI =0.039704PSI =0.022352
Stability of three step forecasting
BinEdgesActual (%)Expected of ACO (%)PSI of ACOExpected of APSO (%)PSI of APSOExpected of APSOACO (%)PSI of APSOACO
[3, 4.1]3.17%1.79%0.0079910.50%0.0497222.48%0.001714
[4.1, 5.2]20.44%22.12%0.00133720.93%0.00011920.54%4.8E-06
[5.2, 6.3]23.02%21.53%0.00099524.40%0.00081422.92%4.29E-06
[6.3, 7.4]16.96%20.24%0.00577719.74%0.00421219.44%0.003384
[7.4, 8.5]9.13%10.71%0.0025457.84%0.0019658.83%9.87E-05
[8.5, 9.6]9.03%5.75%0.0147467.34%0.0034887.94%0.001406
[9.6, 10.7]6.35%5.26%0.0020586.94%0.0005335.56%0.00106
[10.7, 11.8]7.34%8.53%0.0017899.42%0.0052049.62%0.006175
[11.8, 12.9]4.17%4.07%2.39E-052.88%0.0047772.58%0.007612
[12.9, 14]0.40%0.00%00.00%00.10%0.004126
PSI =0.037262PSI =0.070834PSI =0.025586
Table 12. The optimal parameter of hybrid optimization algorithm and the optimal solution for each Site.
Table 12. The optimal parameter of hybrid optimization algorithm and the optimal solution for each Site.
SitesInspired Factor: αExpectation Inspired Factor: βPheromone Volatilization Factor: ρOptimal SolutionOperation Time
Site 11.0317924.4832190.49185210.2307579.92981
Site 21.0948824.7329910.5024787.87617570.2076
Site 31.1034794.3257730.4795216.5698570.57872
Site 43.496684.7596320.4856739.2664871.67526
Site 51.0098874.2986470.4967737.570672.82799
Site 63.2068934.9789350.479557.51374274.70964

Share and Cite

MDPI and ACS Style

Yao, Z.; Wang, C. A Hybrid Model Based on A Modified Optimization Algorithm and An Artificial Intelligence Algorithm for Short-Term Wind Speed Multi-Step Ahead Forecasting. Sustainability 2018, 10, 1443. https://doi.org/10.3390/su10051443

AMA Style

Yao Z, Wang C. A Hybrid Model Based on A Modified Optimization Algorithm and An Artificial Intelligence Algorithm for Short-Term Wind Speed Multi-Step Ahead Forecasting. Sustainability. 2018; 10(5):1443. https://doi.org/10.3390/su10051443

Chicago/Turabian Style

Yao, Zonggui, and Chen Wang. 2018. "A Hybrid Model Based on A Modified Optimization Algorithm and An Artificial Intelligence Algorithm for Short-Term Wind Speed Multi-Step Ahead Forecasting" Sustainability 10, no. 5: 1443. https://doi.org/10.3390/su10051443

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop