Smart Urban Wind Power Forecasting: Integrating Weibull Distribution, Recurrent Neural Networks, and Numerical Weather Prediction

Shirzadi, Navid; Nasiri, Fuzhan; Menon, Ramanunni Parakkal; Monsalvete, Pilar; Kaifel, Anton; Eicker, Ursula

doi:10.3390/en16176208

Open AccessArticle

Smart Urban Wind Power Forecasting: Integrating Weibull Distribution, Recurrent Neural Networks, and Numerical Weather Prediction

by

Navid Shirzadi

¹

,

Fuzhan Nasiri

^1,*

,

Ramanunni Parakkal Menon

¹,

Pilar Monsalvete

¹

,

Anton Kaifel

²

and

Ursula Eicker

¹

Gina Cody School of Engineering and Computer Science, Concordia University, 1455 Boulevard de Maisonneuve, Montreal, QC H3G 1M8, Canada

²

Centre for Solar Energy and Hydrogen Research (ZSW), Meitnerstr. 1, 70563 Stuttgart, Germany

^*

Author to whom correspondence should be addressed.

Energies 2023, 16(17), 6208; https://doi.org/10.3390/en16176208

Submission received: 28 July 2023 / Revised: 16 August 2023 / Accepted: 24 August 2023 / Published: 26 August 2023

(This article belongs to the Section G1: Smart Cities and Urban Management)

Download

Browse Figures

Versions Notes

Abstract

:

The design, operational planning, and integration of wind power plants with other renewables and the grid face challenges attributed to the intermittent nature of wind power generation. Addressing this issue necessitates the development of a smart wind power (and in particular wind speed) forecasting approach. This is a complex task due to substantial fluctuations in wind speed. To overcome the inherent stochastic nature of wind speed and mitigate related challenges, traditionally, numerical weather prediction (NWP) models are employed for wind speed forecasting. However, the applicability of NWP models is limited to short-term forecasting due to their computational constraints. In this study, a hybrid AI-based approach is proposed to improve forecast accuracy over a 48 h horizon for the city of Montreal. The results demonstrate that by integrating the probability distribution of wind speed with a deep learning model, the forecasted values align closely with the observed values in terms of seasonality and trend, exhibiting enhanced accuracy. Evaluation metrics reveal a substantial reduction in the root mean squared error (13–31%) across three prediction horizons (summer, fall, and winter) compared to a single long, short-term memory model. Furthermore, integrating the improved model with the numerical weather prediction model yields increased accuracy and decreased error compared to the LSTM–Weibull model.

Keywords:

wind power generation; wind speed forecasting; deep learning; Weibull distribution; numerical weather prediction; smart cities

1. Introduction

According to the World Wind Energy Association, the global wind power generation achieved a new record of 744 Gigawatts by the end of 2020, with an addition of 93 Gigawatts [1]. However, fluctuations in wind power generation lead to a lack of reliability, posing significant challenges and uncertainties for control systems and operators in ensuring a stable power supply [2]. Hence, the importance of reliable and precise wind forecasting cannot be overstated, as it plays a crucial role in various applications, including load following, unit commitment, scheduling, economic viability, and the design and operational planning of renewable energy systems. Nonetheless, the volatile and intermittent nature of wind speed presents a formidable obstacle in achieving accurate short-term predictions.

Various methods exist for wind prediction, which can be broadly classified into four main groups: (1) physical models, (2) spatial correlation models, (3) conventional statistical models, and (4) artificial intelligence (AI) models [3]. Physical models utilize meteorological data, including temperature and physical characteristics, and are commonly employed for large-scale weather prediction [4]. On the other hand, spatial correlation models rely on the predicted wind speeds of nearby sites to estimate wind speeds at new locations.

In recent times, both conventional statistical models and AI models have gained significant popularity for intraday wind speed predictions, particularly in the context of the design and operational planning of integrated renewable energy systems and wind farms. These models have been extensively employed to enhance the accuracy of forecasting in such applications.

Wang et al. [4] introduced a hybrid model consisting of an autoregressive moving average (ARMA) model and a bivariate fuzzy time series model to predict daily wind speed in Hainan province, China. The findings of their study reveal that, compared to conventional models such as ARMA and ARIMA, the hybrid model significantly reduces the mean absolute percentage error (MAPE) for day-ahead wind speed forecasting. The MAPE for the conventional models across four different sites ranged from 18.15% to 22.08%. In contrast, the hybrid model achieved an error range of 16.64% to 18.29%, showcasing its improved performance in wind speed prediction.

In 2017, Yatiyana et al. [5] presented a statistical model utilizing autoregressive integrated moving average (ARIMA) to predict wind speed and direction in Western Australia in 2017. The selection of this method was motivated by its shorter response time. Their findings demonstrated a mean absolute percentage error (MAPE) of 4.9% for wind speed prediction at a 6 h lead time and a MAPE of 15.6% for wind direction forecasting with a 7-day lead time. However, the integration of these two models into a single model for enhanced overall accuracy was not reported, and the ARIMA method’s ability to capture wind speed fluctuations was not explained.

In another study, the application of fractional ARIMA for day-ahead and two-day-ahead wind speed forecasting was explored [6]. The results exhibited a significant reduction in error and improvement in accuracy compared to persistence methods. Several studies have employed seasonal autoregressive integrated moving average (SARIMA) models to account for the seasonality of training data. Wang et al. [7] used SARIMA for daily and monthly wind speed forecasting in four sites in Northwestern China. To enhance accuracy, they hybridized SARIMA with an extreme learning machine (ELM) and Ljung–Box Q-test (LBQ), considering the nonlinearity and non-stationarity inherent in wind speed data. In one of the sites, the mean daily forecast results indicated approximately 34% MAPE for the single SARIMA method, whereas their proposed hybrid model achieved an error of about 14%, demonstrating a significant improvement in accuracy.

In addition to the mentioned methods, researchers have explored the use of fuzzy theory [8,9] and machine learning techniques, such as support vector machine (SVM) [10,11], for day-ahead wind forecasting. However, artificial neural network (ANN) has garnered significant attention and is widely employed for wind speed forecasting, either as a standalone model or as part of a hybrid approach in combination with other models, such as statistical models. In a study by [12], a comparison was made between ANN, autoregressive integrated moving average (ARIMA), and a hybrid model combining ARIMA and ANN for wind speed forecasting in three regions of India. The results indicated that the hybrid model exhibited improved performance in wind speed prediction, regardless of the linear or non-linear behavior of the wind speed. Although the hybrid model demonstrated significantly lower error compared to the ANN-only model, the mean absolute percentage error (MAPE) of the hybrid model forecasts (ranging from 18% to 25%) for various lead times (1 h, 3 h, 8 h, and 24 h) still remained relatively high.

In recent years, deep learning techniques, including recurrent neural networks (RNNs), Elman neural networks, and convolutional neural networks (CNNs), have gained significant attention in time series forecasting due to their ability to handle sequential data effectively [13,14,15,16]. Liu et al. [17] introduced a hybrid model combining an Elman neural network with a long short-term memory (LSTM) network for wind speed forecasting. Their findings indicate that LSTM is suitable for predicting non-stationary wind speeds, and their proposed hybrid model achieved reasonable accuracy in forecasting. In another study by Wang et al. [14], wind power forecasting was performed using a CNN model. The results demonstrated the adequacy of the proposed CNN model for wind power prediction.

Although AI and statistical methods generally yield satisfactory results in various forecasting horizons (short term, medium term, and long term), the utilization of physical approaches becomes imperative, particularly in short- and very-short-term horizons. This is due to the increasing significance of atmospheric dynamics, which have a more substantial impact on wind speed and power generation during these time frames [18].

Numerical weather prediction (NWP) models are mathematical models that provide information about the present and future state of the atmosphere and surface, including the ocean and land. These models typically have a forecast horizon of one to two weeks and are widely used in weather forecasting.

In a study presented in [19], the authors introduced a wind speed forecasting model that combines numerical weather prediction and historical measurements. The model utilizes multiple sources of past physical model outputs to enhance its forecasting accuracy. The developed model was applied to forecast wind speeds in a region near the U.S. Great Lakes. The results demonstrated an improvement in the root mean squared error of the proposed model, indicating its effectiveness in wind speed prediction.

Short-term wind forecasting using numerical weather prediction (NWP) models can be prone to significant errors due to its reliance on initial conditions. These models are slowly updated and may lag behind actual changes, leading to inaccuracies in short-term wind forecasts [20].

While AI-based methods, statistical methods, and hybrid models have been widely utilized for day-ahead wind speed forecasting, they may not be suitable for applications requiring high accuracy, such as operational control of a microgrid. The unpredictable nature of wind behavior and its direct correlation with physical indicators make it challenging for proposed models to achieve the desired level of accuracy in such scenarios. Additionally, NWP models have been employed for wind speed prediction; however, these models only consider current physical conditions and do not learn from past wind speed values or unexpected changes, limiting their predictive capabilities.

This paper aims to contribute to the advancement of knowledge in the field of wind speed forecasting, specifically for applications such as the design and planning of renewable energy systems. The proposed approach introduces a novel hybrid model that combines the Weibull distribution, long short-term memory (LSTM), and numerical weather prediction (NWP) models. The objective is to reduce the error associated with wind speed prediction by incorporating the distribution probability of historical wind speed data and considering the physical characteristics of the area.

The main contributions of this study, with respect to the prior literature, are as follows:

Proposal of a hybrid model that overcomes the limitations of single statistical approaches. The LSTM method, which offers advantages over conventional feed-forward neural networks, is utilized in the proposed model;
Introduction of a Weibull distribution of wind speed to capture the stochastic nature of wind behavior. By combining the probability distribution of wind speed with the LSTM model, the integrated model achieves a lower error compared to using a single LSTM model or a seasonal autoregressive integrated moving average (SARIMA) model with exogenous variables;
Development of a hybrid model that integrates the results of the NWP model with AI models to enhance short-term forecasting accuracy (24–72 h). This hybrid model achieves minimal error and demonstrates the benefits of combining physical and AI-based approaches.

The remainder of this paper is organized into three main sections. Section 2 represents the related methodology of Weibull distribution and development of the LSTM model. Section 3 describes the results, including the comparison between each model and the final hybrid model. Finally, a conclusion providing a summary of the research and suggestions for future works is discussed in Section 4.

2. Methodology

This section provides an overview of the various forecasting models used in the study, as well as the proposed hybrid approach. The framework of the study is depicted in Figure 1, illustrating the flowchart that consists of three main sections.

The first section focuses on feature selection and data preprocessing. This includes tasks such as feature scaling, outlier detection, and handling missing values. Additionally, a grid search technique is utilized to identify the optimal parameters for the statistical model.

In the second section, the preprocessed data are used to train the developed models. Here, the hybrid model is constructed by incorporating the Weibull distribution output as one of the input features for the LSTM model. Furthermore, the numerical weather prediction (NWP) data are extracted from the NWP model and utilized as inputs for the integrated LSTM–Weibull model, resulting in the final hybrid model.

The last section involves evaluating and comparing the accuracy of each model to determine the most suitable one. Hyperparameter optimization for the LSTM model is also performed in this section to fine-tune its performance.

2.1. LSTM

The recurrent neural network (RNN) is a deep learning algorithm commonly used for sequential data analysis, including time series data [21]. RNNs possess a unique feature called short-term memory, which is achieved through feedback connections within the network [22]. However, in practical applications, RNNs face challenges in capturing long-term dependencies in the data [23]. To overcome this limitation, the long short-term memory (LSTM) was developed as a specialized type of RNN [24,25].

The LSTM is a type of RNN proposed by Hochreiter and Schmidhuber in 1997 to deal with long-term dependencies by upgrading the remembering capacity of a simple recurrent cell [26].

An LSTM cell, in contrast to a simple RNN cell that consists of a single tanh layer [27]—which is a type of activation function commonly used in neural networks that squashes the input values to a range between −1 and 1—is composed of multiple layers, as illustrated in Figure 2. The initial layer is known as the forget layer, which determines whether the incoming information should be retained or discarded using an activation function. The name “forget layer” reflects its primary function of regulating the retention or deletion of prior information as new data are processed sequentially. Typically, the activation function used is a sigmoid function, which produces a value between 0 and 1 based on the input. A value of 1 indicates that the input can be added to the cell state, while a value of 0 signifies that the input should be forgotten or disregarded. (

f_{t}

) is the output of the forget layer, and it is determined using the below equation [28]:

f_{t} = φ (w_{f} . [y_{t - 1}, x_{t}] + b_{f})

(1)

where

φ

is activation function,

y_{t - 1}

is the output of the previous module,

x_{t}

is input at time t and

b_{f}

, and

w_{f}

are bias and weight, respectively. This equation captures the role of the forget layer in LSTM, influencing the extent to which the cell retains or discards prior information for current predictions.

In the second step, the update of new values (

I_{t})

, using Equation (2), and a vector of new information (

\tilde{g}

), as shown in Equation (3), are created to add to the cell state by employing a sigmoid and tanh functions, respectively [28]:

I_{t} = φ (w_{i} . [y_{t - 1}, x_{t}] + b_{i})

(2)

\tilde{g} = t a n h (w_{S} . [y_{t - 1}, x_{t}] + b_{S})

(3)

Subsequently, in the third step, a new cell state (

g_{t}

) is expressed as the sum of the previous cell state multiplied by the first step results, and the multiplication of the

I_{t}

and

\tilde{g}

is shown in notational form as in the below equation [28]:

g_{t} = f_{t} \times g_{t - 1} + I_{t} \times \tilde{g}

(4)

In the final step, by employing a sigmoid function, the cell decides what part of the cell state should be the cell’s output and input to the next cell, and by using a tanh function, it regenerates the values between −1 and 1 (Equations (5) and (6) [28]).

\partial_{t} = φ (w_{\partial} . [y_{t - 1}, x_{t}] + b_{\partial})

(5)

y_{t} = \partial_{t} \times \tanh (g_{t})

(6)

where

\partial_{t}

is the portion of the cell’s state that is transmitted as the output.

The presence of the four layers within each cell of an LSTM model makes it a suitable algorithm to be evaluated for handling the unpredictable nature of wind speed.

2.2. Weibull Distribution

Wind speed can be expressed in time series, and the variation of the speed can be described using a probability distribution function (PDF). For many years, the Weibull distribution has been used to fit wind speed data, and it is an explicitly proper fit to average wind speed data [29].

The Weibull PDF can be described with Equation (7) [30]:

F (v) = (\frac{k}{c}) {(\frac{v}{c})}^{k - 1} e x p [- {(\frac{v}{c})}^{k - 1}]

(7)

where

F (v)

is the probability of occurrence of wind speed (

v

), and k is the Weibull shape parameter that is calculated based on the standard deviation (

σ

) and the average (

\bar{v}

) of the wind speed data using Equation (8) [31]:

k = {(\frac{σ}{\bar{v}})}^{- 1.086}

(8)

And

c

is the Weibull scale parameter that is given as follows [29]:

c = \frac{\bar{v}}{Γ (1 + \frac{1}{k})}

(9)

where

Γ

is the gamma function.

2.3. SARIMAX

SARIMAX, which stands for Seasonal Autoregressive Integrated Moving Average with Exogenous Factors, is a statistical model commonly employed for time series prediction, specifically when there is seasonality present. It extends the SARIMA model by incorporating additional exogenous factors or predictors to further reduce the forecasting error. With considering

y_{t}

as the wind speed in time step t, SARIMAX can be modeled as below [7,32,33,34]:

φ_{p} (B) \emptyset_{P} (B^{s}) {(1 - B)}^{d} {(1 - B^{s})}^{D} y_{t} = γ_{q} (B) δ_{Q} (B^{s}) ε_{t}

(10)

where

B

is a lag operator that is responsible for back shifting, and

φ_{p} (B)

and

\emptyset_{P} (B^{s})

are non-seasonal and seasonal autoregressive operators of order p and P, respectively.

γ_{q} (B)

and

δ_{Q} (B^{s})

are non-seasonal and seasonal moving average functions of order q and Q, respectively.

{(1 - B)}^{d}

and

{(1 - B^{s})}^{D}

are non-seasonal and seasonal differencing operators.

s

is the order of the seasonal component, and

ε_{t}

is the residual error that captures the difference between the observed values and the values predicted by the model.

φ_{p} (B) = 1 - φ_{1} (B) - φ_{2} (B^{2}) - \dots - φ_{p} (B^{p})

(11)

\emptyset_{p} (B) = 1 - φ_{1} (B^{s}) - φ_{2} (B^{2 s}) - \dots - φ_{P} (B^{P s})

(12)

γ_{q} (B) = 1 - γ_{1} (B) - γ_{2} (B^{2}) - \dots - δ_{q} (B^{q})

(13)

δ_{p} (B) = 1 - δ_{1} (B^{s}) - δ_{2} (B^{2 s}) - \dots - δ_{P} (B^{Q s})

(14)

Here,

p, d,

and

q

are integer parameters to show the delay order of non-seasonal autoregressive, differencing, and moving average terms, respectively, while

P, D,

and

Q

are integer parameters for indicating the delay order of seasonal autoregressive, differencing, and moving average terms, respectively. An optimum set of these parameters could be specified for the model as inputs using different criteria for parameter selection such as the Akaike information criterion (AIC), Bayesian information criterion (BIC), or Hannan–Quinn information criterion (HQIC) methods. Furthermore, the seasonal length of the model should be estimated using the decomposition of the training data. Afterward, the model is applied to forecast the future wind speed. The forecast horizon has a direct impact on the accuracy of prediction. An increase in the length of the horizon results in a reduction in accuracy [7].

2.4. NWP Model

In this study, the NWP data were obtained from a model created by professionals with expertise in the field. The NWP model is a mathematical representation that characterizes the present and future state of the atmosphere and surface conditions, encompassing factors like temperature, pressure, humidity, and wind speed. It is formulated using established physical principles and numerical algorithms to simulate atmospheric behavior.

Although specific information regarding the development of the NWP model is not provided in this context, it is typically devised and upheld by meteorological agencies, research institutions, and weather forecasting centers. In this research, the NWP data were extracted from the NWP model that was developed by the “Centre for Solar Energy and Hydrogen Research (ZSW)” in Stuttgart, Germany.

2.5. Hybrid Model

In this study, the proposed hybrid model is built upon the foundational structure of the LSTM model. To integrate an additional model, referred to as Model X, with the LSTM model, the outputs of Model X are scaled and combined with other predictors and target variables. This augmented dataset is then used as input for the LSTM model.

By integrating Model X with the LSTM model, the overall input dimension of the neural network is increased by one, effectively adding an additional input parameter. This integration allows for the incorporation of additional information from Model X into the LSTM model, potentially enhancing the predictive capabilities of the hybrid model.

2.6. Preprocessing and Evaluation Metrics

Due to the use of several predictor variables, such as humidity, temperature, and air pressure, along with wind speed as the dependent variable, feature scaling is necessary to eliminate the issues associated with dimensionality caused by a dissimilar range of values. The min–max scaler method is used to scale the data into a similar range:

X_{N} = \frac{X - X_{\min}}{X_{\max} - X_{\min}}

(15)

To find the outliers in the dataset based on an extreme outlier detection procedure, the minimum and maximum bounds were calculated based on Equations (16) and (17), respectively.

Q 1 - 3 (IQR)

(16)

where Q1 is the lower quartile that shows the number that is more than 25 percent of the data, and IQR is the interquartile range.

Q 3 + 3 (IQR)

(17)

where Q3 is the upper quartile that shows the number that is more than 75 percent of the data.

To assess the forecasting models’ performances, root mean squared error (RMSE), mean absolute error (MAE), and mean squared logarithmic error (MSLE) are employed to determine the goodness of fit. RMSE evaluates the error using Equation (18):

RMSE = \sqrt{\sum_{i = 1}^{n} \frac{{({\bar{y}}_{i} - y_{i})}^{2}}{n}}

(18)

where

y

is the observed value, and

\bar{y}

is the predicted value.

And MAE is calculated using Equation (19)

MAE = \frac{\sum_{i = 1}^{n} |{\bar{y}}_{i} - y_{i}|}{n}

(19)

Due to the wind speed, as the target variable is distributed based on Weibull distribution, and the considerable difference between the minimum and maximum value in wind speed data, MSLE could be a proper metric to evaluate the error of a model. It could be calculated using Equation (20):

MSLE = \frac{1}{n} \sum_{i = 1}^{n} {(l o g (y_{i} + 1) - \log ({\bar{y}}_{i} + 1))}^{2}

(20)

3. Case Study and Data Characteristics

In this research, Montreal, the second-most populous city in Canada, is considered as the case study. Montreal is located in the southern part of the province of Quebec, Canada with latitude and longitude coordinates of 45 N and −73 E degrees, respectively [35]. Montreal’s hourly resolved data of temperature and humidity were obtained from NASA’s prediction of worldwide energy resources website [36]. The data for predictors were collected from January 2020 until January 2021 for training and test purposes. Figure 3 shows the variation and trend of the independent variables. Although the ascending and descending trend from the beginning to the end of the year is noticeable for temperature, no notable trend is detected in relative humidity. However, a lower fluctuation range at the beginning of the year (winter) compared to the middle of the year (summer) is perceptible for relative humidity.

Furthermore, the target variable, Montreal’s wind speed (in m/s) at 50 m in height, is also collected from [36] in an hourly resolution from January 2020 to January 2021 for training and testing purposes. To evaluate the behavior of the target variable further, the additive decomposition of the wind speed data is plotted (Figure 4). As it is demonstrated in Figure 4, although there is no notable trend, the seasonality graph shows daily fluctuations (ascending and then descending during a day). However, as the residual graph that shows the error of fitting this seasonality on the real wind speed data is noticeable, this seasonality can be seen as not being strong.

4. Results

4.1. Implementation

LSTM, SARIMAX, and the proposed hybrid methods were developed in the Python programming language (Version 3.7.9). To generalize the results of the testing of the developed models for the whole year, three different test sets from summer (the last 2 days of July 2020), fall (the last two days of October 2020), and winter (the last two days of December 2020) were selected as the representative of different seasons. For summer, the model trained with the data from January 2020 until 29 July 2020, while for Fall and Winter, the training set included data from January 2020 until 29 October 2020 and 29 December 2020, respectively. A few missing values were found in the training sets, and they have all been replaced by the average of the previous and next values. Also, the outlier detection procedure was implemented using boxplot visualization and by calculating quartiles based on the formula explained in the methodology section. The results showed no outlier in the training sets.

To scale up the predictors and target variable into a unique scale, the MinMaxScaler method from preprocessing sub-package of the Sklearn library (Version 0.24.2) was used. All features were scaled into the range between 0 and 1 before feeding to the neural network model.

To form the LSTM layers, the Keras library (Version 2.8.0) with TensorFlow (Version 2.8.0) backend was used. Furthermore, grid search optimization was applied to find the optimum hyperparameters (Table 1). The hyperparameter optimization shows that the combination of Adam optimizer and a batch size of 32 with 150 epochs results fits in with only a minor loss in the training stage. A further increase in the number of epochs results in further reduction in errors with the training data, as shown in Figure 5; however, above 150 epochs, the overfitting tends to cause a reduction in accuracy of the model in forecasting the test dataset. As explained in the previous section, the trained model was then tested on the first two days of July 2020. The training and test datasets are of the same resolution.

The Weibull model was developed by creating a function to generate the wind speed distribution. The Weibull distribution of the wind speed for the year 2020 was calculated in the Python environment by creating a Weibull function using

c

,

k,

and

Γ

parameters that have been explained in the methodology section. The histogram graph in Figure 6 shows the data distribution in the range of 0–20 m/s. Also, the Weibull probability feature was created using the Stats package of the Scipy library (Version 1.7.0).

The parameter selection of the SARIMAX model was made by applying the Autoarima package from the Pmdarima library (Version 1.8.2) and a grid search through 42 different combinations of the

(p, d, q) (P, D, Q, s)

parameters. The last two months of wind speed historical data of the first half of the year 2020 were used for training the Autoarima for parameter selection. The results (Table 2) show that the combination of (2,0,1)(2,1,0,24) yields the minimum AIC and was selected as the optimum set of parameters for the SARIMAX model. All the other combinations that are not mentioned in Table 2 led to AIC equal to infinity.

The results consist of details of the selected combinations, including the AIC, BIC, and HQIC, which are reported in Table 3.

4.2. Discussion

Figure 7 shows the forecasting results of all three models and the hybrid models’ results for the last two days of July, October, and December 2020. At a glance, the results using a single LSTM model do not show proper wind speed forecasting, especially in peak hours that are way over or under actual values. By applying the SARIMAX model, although the mean value of the forecasted wind speeds is nearer to the mean value of the actual wind speed compared with the single LSTM model, it has not captured the fluctuations, peaks, and trends decently. While using the NWP model that resulted in a significant error, especially in winter, applying the proposed integrated model can considerably reduce this error. Furthermore, the seasonality and trend issues seem to be fixed for the whole prediction horizon in different seasons. However, the accuracy is not high in the two major peaks. A quick comparison between the result of the proposed hybrid model and the result of the other models reveals the hybrid model’s ability to better integrate the fluctuations and trends.

To evaluate and compare the models precisely, the RMSE, MAE, and MSLE of each model’s results are calculated based on what was explained in the methodology section. The results are reported in Table 4. The LSTM model results show a 2.21–3.16 root mean squared error in different seasons that depicts the LSTM model’s inability to accurately predict using the three meteorological historical data (temperature and humidity) as the independent variables. Although with the SARIMAX model, the RMSE and MSLE are improved in fall, the error is still high. As explained in the methodology section, the LSTM model with different layers in its cells can deal with unexpected behavior of data. Therefore, a proper feature should be added to the LSTM model for better training. Integrating the probability distribution of the wind speed with the LSTM model and using it as an input feature could be one of the alternatives to boost the LSTM ability. The results in Table 4 show that the proposed integrated LSTM–Weibull model can reduce the RMSE of the single LSTM model in forecasting winter, summer, and fall representative days by about 13, 39, and 31 percent, respectively. These error reductions show that adding a proper feature, such as the Weibull probability of the wind speed, can help LSTM accurately forecast the future. However, in case of any unexpected wind behavior that has not happened before (and it is normal in climatic situations), even the integrated model could lead to a considerable error. Therefore, to solve this challenge, hybridizing the results of the NWP model predictions with the proposed integrated model could be helpful in capturing the unexpected behavior of the wind that was not recorded in the historical data. The single NWP prediction results also show high RMSE and MAE and even higher MSLE compared with other models, especially in winter and fall. However, by hybridizing the NWP model with the integrated model, the RMSE of the proposed model decreased 47%, 17%, and 32%, respectively, in summer, winter, and fall compared with the single LSTM model.

To consider the effect of the prediction horizon on the final accuracy, the prediction periods were extended to 168 h (one week) for all seasons instead of 48 h (two days). Since the look-back period is 48 h, it means that after forecasting the first 48 h into the future, the next hours will be predicted based on the prior predictions. Therefore, the accuracy of the model could be lower with increasing the prediction horizon. The result of the prediction horizon extension is shown in Figure 8 for the hybrid model. It is evident that the hybrid model acts less and less accurately when increasing the prediction period except for fall, which still can predict the third day (until 72 h) correctly, and that could be because of the fewer fluctuations on the third day.

5. Validation

Since training the model with different types of historical data and parameters such as learning rate could lead to different results [37], from what was shown in Table 4 and from significant result changes from changing the forecasting horizon, validating the results of this study with the results of the other research could not be insightful. However, based on the literature [38], the mean absolute percentage error for wind forecasting ranged between 25% to 40%. Furthermore, based on similar research [4] that forecasted wind speed in four different sites in China, the RMSE of their proposed hybrid model for daily wind forecast ranged between 1.6–1.8. Comparing this result with the output of the hybrid model presented in this research (Table 4) for a two-days-ahead forecast shows acceptable prediction accuracy.

6. Conclusions

Wind power stands as a pivotal source of clean energy, poised to synergize with other renewables for fostering a robust future grid. Nonetheless, the intermittent nature of wind resources mandates accurate wind speed or wind power predictions to optimize grid control and unit dispatch. Addressing the challenges posed by the volatile wind speed patterns and the complexities of deciphering genuine daily trends and seasonality in historical data, this study sought to pioneer an innovative hybrid wind speed forecasting model. This model harnesses the prowess of deep learning, probability distributions, and numeric weather prediction techniques to minimize forecasting errors.

Our findings illuminate the limitations of the LSTM model, which, despite its memory mechanisms, falters in precise predictions—particularly during abrupt surges or unforeseen shifts. Our initial innovation, fusing Weibull distribution probabilities with a singular LSTM model, resulted in remarkable error reduction. Specifically, the hybridization yielded an average RMSE reduction of approximately 28% across three diverse prediction horizons throughout the year.

To account for unforeseen wind behavior unrepresented in historical data, we augmented our approach by hybridizing results from the numeric weather prediction model into the LSTM–Weibull integrated framework. The outcomes showcased the final hybridized model’s prowess in diminishing the average RMSE of the solo LSTM predictions by around 32%, especially when confronted with fluctuations occurring in central peaks. This hybrid model emerges as a promising avenue for curbing wind speed forecasting errors, thus offering a potential catalyst for robust management and control of renewable energy systems.

Looking ahead, future research should explore synergizing the proposed model with statistical approaches like SARIMAX or ARIMA to amplify wind speed forecasting precision. Such a comprehensive amalgamation would capitalize on the strengths of distinct models, effectively addressing individual method limitations.

In summation, our proposed hybrid model paves the way toward heightened wind speed forecasting accuracy—a critical stride toward efficient renewable energy system management and control. As the renewable energy landscape evolves, the fusion of pioneering methodologies promises an increasingly sustainable and efficient energy future.

Author Contributions

Conceptualization, N.S.; Methodology, N.S.; Software, N.S.; Validation, N.S.; Formal analysis, N.S.; Investigation, N.S., F.N., R.P.M. and U.E.; Resources, N.S. and A.K.; Data curation, N.S. and R.P.M.; Writing—original draft, N.S.; Writing—review & editing, F.N., R.P.M., P.M., A.K. and U.E.; Visualization, N.S.; Supervision, F.N. and U.E.; Project administration, F.N. and U.E.; Funding acquisition, F.N. and U.E. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by [NSERC Discovery grant] grant number [RGPIN-2016-06727] and also the [Canada Excellence Research Chair in Smart, Sustainable and Resilient Communities and Cities] funded by [Tri-Agency Institutional Program Secretariat].

Data Availability Statement

Not applicable.

Acknowledgments

We would like to acknowledge the financial support of the NSERC discovery grant program; the Gina Cody School of Engineering Faculty Research Support; and the Canada Excellence Research Chair in Smart, Sustainable and Resilient Communities and Cities funded by the Tri-Agency Institutional Program Secretariat We also extend our thanks to the Centre for Solar Energy and Hydrogen Research (ZSW) for generously providing the numerical weather prediction data.

Conflicts of Interest

The authors declare no conflict of interest.

References

Worldwide Wind Capacity Reaches 744 Gigawatts–An Unprecedented 93 Gigawatts Added in 2020-World Wind Energy Association. Available online: https://wwindea.org/worldwide-wind-capacity-reaches-744-gigawatts/ (accessed on 1 September 2021).
Yan, J.; Zhang, H.; Liu, Y.; Han, S.; Li, L.; Lu, Z. Forecasting the High Penetration of Wind Power on Multiple Scales Using Multi-to-Multi Mapping. IEEE Trans. Power Syst. 2018, 33, 3276–3284. [Google Scholar] [CrossRef]
Cadenas, E.; Rivera, W. Wind speed forecasting in three different regions of Mexico, using a hybrid ARIMA–ANN model. Renew. Energy 2010, 35, 2732–2738. [Google Scholar] [CrossRef]
Wang, J.; Xiong, S. A hybrid forecasting model based on outlier detection and fuzzy time series—A case study on Hainan wind farm of China. Energy 2014, 76, 526–541. [Google Scholar] [CrossRef]
Yatiyana, E.; Rajakaruna, S.; Ghosh, A. Wind speed and direction forecasting for wind power generation using ARIMA model. In Proceedings of the 2017 Australasian Universities Power Engineering Conference, AUPEC, Melbourne, VIC, Australia, 19–22 November 2017; pp. 1–6. [Google Scholar] [CrossRef]
Kavasseri, R.G.; Seetharaman, K. Day-ahead wind speed forecasting using f-ARIMA models. Renew. Energy 2009, 34, 1388–1393. [Google Scholar] [CrossRef]
Wang, J.; Hu, J.; Ma, K.; Zhang, Y. A self-adaptive hybrid approach for wind speed forecasting. Renew. Energy 2015, 78, 374–385. [Google Scholar] [CrossRef]
Haque, A.U.; Meng, J. Short-Term Wind Speed Forecasting Based on Fuzzy Artmap. Int. J. Green Energy 2011, 8, 65–80. [Google Scholar] [CrossRef]
An, S.; Shi, H.; Hu, Q.; Li, X.; Dang, J. Fuzzy rough regression with application to wind speed prediction. Inf. Sci. 2014, 282, 388–400. [Google Scholar] [CrossRef]
Zhou, J.Y.; Shi, J.; Li, G. Fine tuning support vector machines for short-term wind speed forecasting. Energy Convers. Manag. 2011, 52, 1990–1998. [Google Scholar] [CrossRef]
Fu, X.; Feng, Z.; Yao, X.; Liu, W. A Novel Twin Support Vector Regression Model for Wind Speed Time-Series Interval Prediction. Energies 2023, 16, 5656. [Google Scholar] [CrossRef]
Nair, K.R.; Vanitha, V.; Jisma, M. Forecasting of wind speed using ANN, ARIMA and Hybrid models. In Proceedings of the 2017 International Conference on Intelligent Computing, Instrumentation and Control Technologies, ICICICT, Kerala, India, 6–7 July 2017; pp. 170–175. [Google Scholar] [CrossRef]
Kuremoto, T.; Kimura, S.; Kobayashi, K.; Obayashi, M. Time series forecasting using a deep belief network with restricted Boltzmann machines. Neurocomputing 2014, 137, 47–56. [Google Scholar] [CrossRef]
Wang, H.; Wang, G.; Li, G.; Peng, J.; Liu, Y. Deep belief network based deterministic and probabilistic wind speed forecasting approach. Appl. Energy 2016, 182, 80–93. [Google Scholar] [CrossRef]
Wang, H.-Z.; Li, G.-Q.; Wang, G.-B.; Peng, J.-C.; Jiang, H.; Liu, Y.-T. Deep learning based ensemble approach for probabilistic wind power forecasting. Appl. Energy 2017, 188, 56–70. [Google Scholar] [CrossRef]
Khadem, S.A.; Rey, A.D. Nucleation and growth of cholesteric collagen tactoids: A time-series statistical analysis based on integration of direct numerical simulation (DNS) and long short-term memory recurrent neural network (LSTM-RNN). J. Colloid Interface Sci. 2021, 582, 859–873. [Google Scholar] [CrossRef]
Liu, H.; Mi, X.-W.; Li, Y.-F. Wind speed forecasting method based on deep learning strategy using empirical wavelet transform, long short term memory neural network and Elman neural network. Energy Convers. Manag. 2018, 156, 498–514. [Google Scholar] [CrossRef]
Jung, J.; Broadwater, R.P. Current status and future advances for wind speed and power forecasting. Renew. Sustain. Energy Rev. 2014, 31, 762–777. [Google Scholar] [CrossRef]
Bessac, J.; Constantinescu, E.; Anitescu, M. Stochastic simulation of predictive space–time scenarios of wind speed using observations and physical model outputs. Ann. Appl. Stat. 2018, 12, 432–458. [Google Scholar] [CrossRef]
Hu, S.; Xiang, Y.; Zhang, H.; Xie, S.; Li, J.; Gu, C.; Sun, W.; Liu, J. Hybrid forecasting method for wind power integrating spatial correlation and corrected numerical weather prediction. Appl. Energy 2021, 293, 116951. [Google Scholar] [CrossRef]
Jozefowicz, R.; Zaremba, W.; Sutskever, I. An empirical exploration of Recurrent Network architectures. In Proceedings of the 32nd International Conference on Machine Learning, ICML, Lille, France, 6–11 July 2015; Volume 3, pp. 2332–2340. [Google Scholar]
Koutník, J.; Greff, K.; Gomez, F.; Schmidhuber, J. A clockwork RNN. In Proceedings of the 31st International Conference on Machine Learning, ICML, Beijing, China, 21–26 June 2014; Volume 5, pp. 3881–3889. [Google Scholar]
Greff, K.; Srivastava, R.K.; Koutník, J.; Steunebrink, B.R.; Schmidhuber, J. LSTM: A Search Space Odyssey. IEEE Trans. Neural Netw. Learn. Syst. 2017, 28, 2222–2232. [Google Scholar] [CrossRef]
Hochreiter, S. Untersuchungen zu Dynamischen Neuronalen Netzen. Master’s Thesis, Institut Für Informatik, Technische Universität, Munchen, Germany, 1991; pp. 1–71. [Google Scholar]
Bengio, Y.; Simard, P.; Frasconi, P. Learning long-term dependencies with gradient descent is difficult. IEEE Trans. Neural Networks 1994, 5, 157–166. [Google Scholar] [CrossRef] [PubMed]
Hochreiter, S.; Schmidhuber, J. Long short-term memory. Neural Comput. 1997, 9, 1735–1780. [Google Scholar] [CrossRef]
Rahman, A.; Srikumar, V.; Smith, A.D. Predicting electricity consumption for commercial and residential buildings using deep recurrent neural networks. Appl. Energy 2018, 212, 372–385. [Google Scholar] [CrossRef]
Wen, L.; Zhou, K.; Yang, S.; Lu, X. Optimal load dispatch of community microgrid with deep learning based solar power and load forecasting. Energy 2019, 171, 1053–1065. [Google Scholar] [CrossRef]
Ozay, C.; Celiktas, M.S. Statistical analysis of wind speed using two-parameter Weibull distribution in Alaçatı region. Energy Convers. Manag. 2016, 121, 49–54. [Google Scholar] [CrossRef]
Kadhem, A.A.; Wahab, N.I.A.; Aris, I.; Jasni, J.; Abdalla, A.N. Advanced Wind Speed Prediction Model Based on a Combination of Weibull Distribution and an Artificial Neural Network. Energies 2017, 10, 1744. [Google Scholar] [CrossRef]
Odo, F.C.; Offiah, S.U.; Ugwuoke, P.E. Weibull distribution-based model for prediction of wind potential in Enugu, Nigeria. Adv. Appl. Sci. Res. 2012, 3, 1202–1208. [Google Scholar]
Alencar, D.B.; Affonso, C.M.; Oliveira, R.C.L.; Filho, J.C.R. Hybrid Approach Combining SARIMA and Neural Networks for Multi-Step Ahead Wind Speed Forecasting in Brazil. IEEE Access 2018, 6, 55986–55994. [Google Scholar] [CrossRef]
Chen, Y.; Tjandra, S. Daily Collision Prediction with SARIMAX and Generalized Linear Models on the Basis of Temporal and Weather Variables. Transp. Res. Rec. J. Transp. Res. Board 2014, 2432, 26–36. [Google Scholar] [CrossRef]
Arunraj, N.S.; Ahrens, D.; Fernandes, M. Application of SARIMAX Model to Forecast Daily Sales in Food Retail Industry. Int. J. Oper. Res. Inf. Syst. 2016, 7, 1–21. [Google Scholar] [CrossRef]
Where Is Montreal, Quebec, Canada on Map Lat Long Coordinates. Available online: https://www.latlong.net/place/montreal-quebec-canada-27653.html (accessed on 30 May 2021).
NASA POWER|Prediction of Worldwide Energy Resources. Available online: https://power.larc.nasa.gov/ (accessed on 11 April 2022).
Valdivia-Bautista, S.M.; Domínguez-Navarro, J.A.; Pérez-Cisneros, M.; Vega-Gómez, C.J.; Castillo-Téllez, B. Artificial Intelligence in Wind Speed Forecasting: A Review. Energies 2023, 16, 2457. [Google Scholar] [CrossRef]
Yang, X.; Xiao, Y.; Chen, S. Wind speed and generated power forecasting in wind farm. Proc. Chin. Soc. Electr. Eng. 2005, 25, 1. [Google Scholar]

Figure 1. Schematic design of the forecasting module.

Figure 2. Schematic design of an LSTM module.

Figure 3. Predictors’ overall trend and seasonality in the training set.

Figure 4. Additive decomposition of training data.

Figure 5. Model convergence plot for single LSTM model.

Figure 6. Weibull distribution of the historical wind speed.

Figure 7. 48 h forecasting results in different seasons.

Figure 8. One week (168 h) forecasting results of the hybrid model in different seasons.

Table 1. Hyper parameters of the LSTM model and parameter selection results.

Parameters	Values/Types	Hyper Parameter Optimization Results
No. hidden layers	3	-
No. neurons per hidden layer	60	-
Activation function	Sigmoid	-
Optimizer types	{Adam, RMSprop}	Adam
Batch size	{1,32,64}	32
No. of epochs	{50,80,100,110,125,150}	150

Table 2. Report on the grid search for SARIMAX parameter selection.

Combination	AIC	Combination	AIC
(0,0,0)(0,1,0,24)	11,271	(1,0,1)(2,1,0,24)	4044
(1,0,0)(1,1,0,24)	4955	(2,0,1)(2,1,0,24)	3959
(1,0,0)(0,1,0,24)	5548	(3,0,1)(2,1,0,24)	3961
(1,0,0)(2,1,0,24)	4709	(2,0,2)(2,1,0,24)	4221
(0,0,0)(2,1,0,24)	10,553	(1,0,0)(2,1,0,24)	4707
(2,0,0)(2,1,0,24)	3998	(1,0,2)(2,1,0,24)	3972
(2,0,0)(1,1,0,24)	4211	(3,0,0)(2,1,0,24)	3962
(3,0,0)(2,1,0,24)	3964	(3,0,2)(2,1,0,24)	3963
(3,0,0)(1,1,0,24)	4175	(4,0,1)(2,1,0,24)	3963
(5,0,0)(2,1,0,24)	3965	(3,0,1)(1,1,0,24)	4176

Table 3. Selected combination information.

Parameter/Metric	Value/Type
Optimum non-seasonal orders	(2,0,1)
Optimum seasonal orders	(2,1,0,24)
No. of observations	2160
Log likelihood	−4270.623
AIC	3959.516
BIC	4004.849
HQIC	3976.106
Covariance type	Outer Product of Gradients (OPG)

Table 4. Evaluation metrics of all the models for wind speed forecasting.

Model	RMSE	MAE	MSLE
July (Summer)
LSTM	2.21	1.86	0.395
SARIMAX	2.64	2.24	0.234
NWP	2.02	1.70	0.227
Integrated LSTM–Weibull	1.35	1.12	0.071
Hybrid LSTM–Weibull–NWP	1.18	0.95	0.066
December (Winter)
LSTM	2.14	1.63	0.106
SARIMAX	2.81	2.3	0.155
NWP	5.58	4.90	0.849
Integrated LSTM–Weibull	1.87	1.55	0.110
Hybrid LSTM–Weibull–NWP	1.78	1.50	0.078
October (Fall)
LSTM	3.16	2.58	0.272
SARIMAX	2.25	2.73	0.294
NWP	4.14	3.11	0.799
Integrated LSTM–Weibull	2.18	1.60	0.100
Hybrid LSTM–Weibull–NWP	2.16	1.67	0.139

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Shirzadi, N.; Nasiri, F.; Menon, R.P.; Monsalvete, P.; Kaifel, A.; Eicker, U. Smart Urban Wind Power Forecasting: Integrating Weibull Distribution, Recurrent Neural Networks, and Numerical Weather Prediction. Energies 2023, 16, 6208. https://doi.org/10.3390/en16176208

AMA Style

Shirzadi N, Nasiri F, Menon RP, Monsalvete P, Kaifel A, Eicker U. Smart Urban Wind Power Forecasting: Integrating Weibull Distribution, Recurrent Neural Networks, and Numerical Weather Prediction. Energies. 2023; 16(17):6208. https://doi.org/10.3390/en16176208

Chicago/Turabian Style

Shirzadi, Navid, Fuzhan Nasiri, Ramanunni Parakkal Menon, Pilar Monsalvete, Anton Kaifel, and Ursula Eicker. 2023. "Smart Urban Wind Power Forecasting: Integrating Weibull Distribution, Recurrent Neural Networks, and Numerical Weather Prediction" Energies 16, no. 17: 6208. https://doi.org/10.3390/en16176208

APA Style

Shirzadi, N., Nasiri, F., Menon, R. P., Monsalvete, P., Kaifel, A., & Eicker, U. (2023). Smart Urban Wind Power Forecasting: Integrating Weibull Distribution, Recurrent Neural Networks, and Numerical Weather Prediction. Energies, 16(17), 6208. https://doi.org/10.3390/en16176208

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Smart Urban Wind Power Forecasting: Integrating Weibull Distribution, Recurrent Neural Networks, and Numerical Weather Prediction

Abstract

1. Introduction

2. Methodology

2.1. LSTM

2.2. Weibull Distribution

2.3. SARIMAX

2.4. NWP Model

2.5. Hybrid Model

2.6. Preprocessing and Evaluation Metrics

3. Case Study and Data Characteristics

4. Results

4.1. Implementation

4.2. Discussion

5. Validation

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI