A Novel Hybrid Method for Short-Term Wind Speed Prediction Based on Wind Probability Distribution Function and Machine Learning Models

: The need to deliver accurate predictions of renewable energy generation has long been recognized by stakeholders in the ﬁeld and has propelled recent improvements in more precise wind speed prediction (WSP) methods. Models such as Weibull-probability-density-based WSP (WEB), Rayleigh-probability-density-based WSP (RYM), autoregressive integrated moving average (ARIMA), Kalman ﬁlter and support vector machines (SVR), artiﬁcial neural network (ANN), and hybrid models have been used for accurate prediction of wind speed with various forecast horizons. This study intends to incorporate all these methods to achieve a higher WSP accuracy as, thus far, hybrid wind speed predictions are mainly made by using multivariate time series data. To do so, an error correction algorithm for the probability-density-based wind speed prediction model is introduced. Moreover, a comparative analysis of the performance of each method for accurately predicting wind speed for each time step of short-term forecast horizons is performed. All the models studied are used to form the prediction model by optimizing the weight function for each time step of a forecast horizon for each model that contributed to forming the proposed hybrid prediction model. The National Oceanic and Atmospheric Administration (NOAA) and System Advisory Module (SAM) databases were used to demonstrate the accuracy of the proposed models and conduct a comparative analysis. The results of the study show the signiﬁcant improvement on the performance of wind speed prediction models through the development of a proposed hybrid prediction model


Introduction
Wind energy is a variable renewable energy source [1] and the power produced by the wind turbine hence fluctuates with the variation of wind speed [2]; therefore, in wind farms, unexpected variations of wind power output may increase the operating costs of the electricity system.So, intermittency of wind is the biggest challenge for a wind farm to implement wind energy as a reliable autonomous source of electric power [3].Moreover, a wind speed forecasting (WSF) system based on an accurate model that reflects the variation of wind speed is critical to effective wind energy harvesting, integration of available wind power into the electrical power grid, and analyzing the efficiency and performance of wind-turbine-based electrical generation systems [4].Despite the development of various WSF methods, accurately predicting wind speed still remains a challenge.Furthermore, the length of the forecast horizons correlates with the accuracy of forecasting techniques.Wind speed prediction can have various implications requiring different time scales.For example, turbine control often necessitates a response time of seconds or fractions of seconds, whereas grid integration production planning and market response require longer time horizons.The time scale of prediction also differs according to the energy markets.
The real-time energy market requires a response in minutes, whereas the day-ahead energy market requires the prediction up to 24 h ahead as it requires information for energy trading for the next day [5,6].There may be a requirement for different time scale forecast horizons in between these two-time scales.For example, economic load dispatching and load increment/decrement decisions require a time scale of 30 min to 6 h ahead [3].
Wind speed prediction models have been classified mainly into four categories in the literature: (a) The persistence model, in which future wind speed is deemed to be equal to the wind speed at the forecasting time [7].It is an economical and simple method that can be adopted by almost everyone to serve as the base model for comparing forecasted values by other methods and its main drawback is its unsuitability for forecasting more than a one-time step of forecast horizons; (b) The physical method, in which numerical weather prediction (NWP) is used by incorporating complex atmospheric characteristics, including temperature, pressure, and wind shear into wind speed predictions [8].For longterm forecasts, NWP produces precise estimates that are generally applied over vast areas.However, since numerical weather prediction models are memory-and time-intensive, they are not ideal candidates for short time horizons forecasting; (c) Statistical methods, in which one can explore the mathematical relationship between the various features of the wind time series data.This method includes the following models: Weibull-probabilitydensity-based forecasting (WEB), autoregressive integrated moving average (ARIMA), and the Bayesian probability density function (BBM) approach.These models are mostly used for short forecast horizons and are not suitable for longer forecast horizons due to their non-linearity assumption for wind data; and (d) Artificial intelligence, which includes neural networks (ANN) [9,10], regression or decision trees (RT) [11,12], support vector regression (SVR) [13][14][15][16], and recurrent neural networks (RNN) [17][18][19].
This study was inspired by the work of Kadhem et al., Kaplan et al.,, where the idea of a probability-density-function-based wind speed prediction model was introduced.In this study, the performance of various univariate models is compared, and an error correction algorithm is proposed for the probability-density-based wind speed prediction model.The contribution of this research is twofold: firstly, the proposed error correction method is a novel method that improves the performance of the previously introduced wind-probability-density-based wind speed prediction; and secondly, it introduces a novel hybrid method that is capable of integrating all the studied methods with an optimized weighted coefficient for both the classical time series method and artificial intelligence methods.
This manuscript is divided into the following sections: literature review, methodology, results and discussion, and conclusion.The literature review section consists of a description of the wind speed prediction methods found in literature, which forms the basis of the proposed methodology.The methodology sections consist of a description of all the methods of wind speed prediction, the method of developing error correction, and making a hybrid wind speed prediction model.The results tables that were obtained are presented under the results and discussion section, along with discussion.Finally, the conclusion of the research and future works on the research area are presented under the conclusion section.

Literature Review
This section is dedicated to the theory related to the current study, the methods developed, and the description of the time scale of WSP, Weibull and Rayleigh probability distribution function, support vector regressions, and LSTM networks.

Time Scale in Wind Speed Prediction
One of the important subjects in wind speed prediction is the time scale requirement of forecast horizons; since the different application of wind speed prediction requires different types of time scale, the classification of the time scale of forecast horizons in wind speed prediction methods is an ambiguous subject [3].Turbine control often necessitates a response time of seconds or fractions of seconds, whereas grid integration production planning and market response require longer time horizons.The time scale of prediction also differs according to the energy markets; the real-time energy market requires the response in minutes, whereas the day-ahead energy market requires the prediction up to 24 h ahead as it requires information for energy trading for the next day [5,6]; also, there may be a requirement of different time scale forecast horizons in between these two time scales.For example, economic load dispatching and load increment/decrement decisions require a time scale of 30 min to 6 h ahead [3].In this study, we focus on short-term wind speed prediction (a few hours ahead prediction, not exceeding 12 h).

Wind-Probability-Distribution-Function-Based Wind Speed Prediction Model
Wind-probability-distribution-function-based WSP models were developed by assuming that wind speed follows the same distribution for the next time period.This approach is aligned with the concept behind the persistence model (PM), according to which, any future wind speed value is equal to its last known value of wind speed due to the high autocorrelation on the behavior of the wind speed [7].Despite its simplicity, the PM produces excellent WSP results and is used to assess the quality of new WSP methods [23].PMs forecast wind speed (u t+h ) at any future time, as t + h, h > 0 is the same as wind speed (u t ) at current time t.
Wind speed follows non-negative and right-skewed distribution rather than the normal distribution [24].Several probabilities distribution functions are right-skewed and nonnegative and are used for modeling wind speed.Weibull distribution and Rayleigh distribution are the most common probability distribution functions for wind speed modeling [25].Although the Weibull distribution function is the most widely used function, there is no consensus on which best describes wind speed data for a specific case study site.Therefore, in this study, we consider both Weibull-and Rayleigh-based wind speed modeling and forecasting.

Weibull-Distribution-Based WSP (WEB)
The Weibull probability density function is a two-parameter distribution with a dimensionless shape parameter k and a velocity scale parameter c in m/s [26].
where f (u) denotes the probability distribution of wind speed u.The quality of wind resources can be evaluated from the parameters c and k.The parameter c is proportional to the wind speed, and k characterizes the shape of the Weibull distribution.Variable wind speeds are indicated by smaller values of k, whilst constant wind speeds are indicated by greater values.Typical values of k are between 1 and 3 [27].Even though there are numerous methods to derive Weibull parameters, such as the graphical method, method of moments, maximum likelihood method, standard deviation method, modified maximum likelihood method, power density method, and equivalent energy method, the maximum likelihood method is deemed to be the best fit [28]; this method employs the following expression to calculate shape parameters (k) using an iterative process [26]: where u i is the wind speed at the time step of i and the number of time steps is given by N.After getting the shape parameter k, the expression below is used to measure the scale parameter c [26].
Similar to the PM method, the Weibull-distribution-based wind speed prediction model (WEB) assumes that wind speed follows the same distribution for the next time period.So, the mean speed (u ) can be used as a point forecast in the WEB [25].
where Γ is the gamma function, defined as: x−1 dx.Median and mode can also be used for forecasting purposes [22].However, mean speed might not provide accurate predictions due to the skewness in the Weibull probability density function.
where mode = 0 when k ≤ 1.It is practically not possible to have a scale factor less than 1 at commercial wind farms [22].In our study, we used mean speed as point forecast in WEB.
The cumulative distribution F(u) is an integral of the probability distribution function given by Equation ( 8), which gives the probability of getting wind speed u or less.
Using inverse transform, we get: where R is a random variable with values between 0 and 1 and, as shown in Equation ( 9), representing the cumulative distribution function [21].The values of random variable R between 0 and 1 should be uniformly distributed.Hence, in this way, we have used Equation (10) to simulate a wind speed using parameters of the Weibull distribution function and the method is represented as WEBS.

Rayleigh-Distribution-Based WSP (RYM)
The Rayleigh probability distribution function is a special case of the Weibull distribution function where k = 2. Therefore, in this case, the scale parameter can be determined using the following expression: Therefore, the probability density function represented by Rayleigh distribution and its cumulative distribution function are given as: Similar to Equations ( 9) and ( 10), using an inverse transform of Equation ( 12), we get, Hence, Equation ( 14) can be used to simulate wind speed using the parameter of the Rayleigh distribution function.

Autoregressive Integrated Moving Average (ARIMA) Model
In ARIMA, a time series model reproduces the patterns of a variable's previous movements across time and uses this information to forecast its future movements [29].Wind speed measurements obtained over time tend to be positively correlated.Many parametric time series models that consider the autoregressive (AR) process exist to account for this autocorrelation [30].In an autoregressive model, we forecast the wind speed using a linear combination of past wind speed values.
Equation ( 15) provides the AR model of order p, where γ is the autoregression coefficient and t is the noise in time t.A moving average term is added in the autoregressive model, and the autoregressive moving average model is developed and described as follows: The autoregressive moving average (ARMA) model is a type of autoregressive model that also adapts the moving average model.It is a statistical model that could be used for time series prediction of future wind speed values using past values and lagged forecast error.A general ARMA is denoted by ARMA (p, q) and can be expressed by the following expressions: where the second term from the right in Equation ( 16) is the moving average (MA) part of the ARMA model, δ is the constant, ∅ j is the j th moving average coefficient, e t is the error term at time period t, and u t is the value of wind speed predicted at time step t.If the differencing is added to the ARMA model, the model is transformed into the ARIMA model.Therefore, the ARIMA model, introduced by Box and Jenkins, includes autoregression (AR), a moving average (MA), and differencing [31].The non-seasonal model structure of ARIMA is expressed in the form of ARIMA (p, d, q), where d is the order of differencing (I) to make the model stationary.Hence, the seasonal time series is stationery in nature and becomes zero, and the ARIMA model is converted to the ARMA model [32].

Support Vector Regression (SVR)
Support vector regression is an extension of a support vector machine and was proposed by Drucker et al. [33].A support vector machine was initially developed for the classification problem.SVR is based on the structural error minimization principle and consists of the 'Kernel Trick' and other optimization features that allow it to perform a noise-robust and non-linear regression [34,35].Its stability and accuracy depend on several aspects, such as parameter tuning and feature selection.Parameter tuning is a procedure consisting of properly selecting the kernel function and its parameters and penalization term [36].Feature selection consists of the selection of the most important variables of the model to describe the behavior of the trend [37].SVR does the best trade-off between Field's empirical error and complexity [38].

Long Short-Term Memory (LSTM) Model
Recurrent neural networks (RNN) are a suitable model for time series forecasting problems.However, RNN are not suitable for long-term dependency tasks due to the vanishing/exploding gradient decent issues [39].Therefore, the LSTM neural network arises, which can learn the long-term dependency jobs very efficiently compared to the general RNN model [40].The LSTM model solves the vanishing/exploding gradient decent issues with gates present within each cell of an LSTM network [41][42][43].LSTM is one of the popular artificial recurrent neural network architectures used in wind speed prediction [44].The LSTM neural network was first proposed by Hochreiter and Schmidhuber [45].An LSTM cell's internal state memory offers the internal storing of pertinent historical information.The flow of information through the cell is controlled by the cell's input, output, and forget gates and the mathematical implementation of each LSTM cell is described using Equations ( 17) to (22).With the help of these gates, LSTM analyzes and saves pertinent data [46].The phrase stacked/deep LSTM is often used to denote the LSTM network, referring to an LSTM network with two or more hidden layers.An LSTM network with a detail structure of an LSTM cell is shown in Figure 1.
RNN model [40].The LSTM model solves the vanishing/exploding gradient decent issues with gates present within each cell of an LSTM network [41][42][43].LSTM is one of the popular artificial recurrent neural network architectures used in wind speed prediction [44].The LSTM neural network was first proposed by Hochreiter and Schmidhuber [45].An LSTM cell's internal state memory offers the internal storing of pertinent historical information.The flow of information through the cell is controlled by the cell's input, output, and forget gates and the mathematical implementation of each LSTM cell is described using Equations ( 17) to (22).With the help of these gates, LSTM analyzes and saves pertinent data [46].The phrase stacked/deep LSTM is often used to denote the LSTM network, referring to an LSTM network with two or more hidden layers.An LSTM network with a detail structure of an LSTM cell is shown in Figure 1.
II Step: Update of new values (I t ) and creation of a vector of new information ( ∼ g) to add to the cell state.
III Step: Final cell state IV Step: Last stage using a sigmoid function and a tanh, regenerating values between −1 and 1.
In Equations ( 17) to ( 22), x t is the input, ∼ g is the state of the network, g t is the temporary state and y t is the output state at time step t.I t denotes the input gate, ∂ t represents the output gate and f t denotes the forget gate.The weight corresponding to the hidden layer, input layer, and output layer is denoted by w f , w i , and w ∂ , respectively.β f , β i β s , and β ∂ represent bias corresponding to the input, sate of network, temporary state, and output layer of the network.∅ and tanh represent the sigmoid and tanh activation function, which is defined by the following expressions: tan h(z) = e z − e −z e z + e −z (24)

Methodology 3.1. Proposed Hybrid Method for WSP
In the hybrid wind speed prediction model, all five models are used, i.e, PM, WEB, ARIMA, SVR, and LSTM.For each time step of a forecast horizon, a weight parameter is assigned.The weight parameter is then optimized using linear optimization by minimizing the loss function.Any one of the performance parameters, such as MAPE, MAE, and RMSE, is taken as a loss function to be minimized.After deriving weight parameters for each time step for each model, the hybrid WSP model is applied to predict the future time step wind speed up to given forecast horizons.

Data Acquisition
Data for this study comprise wind speed data at a hub height of 80 m and were extracted from the National Renewable Energy Laboratory System Advisor Model (SAM) database [46].The hub height of 80 m was chosen as the commercial larger scale wind turbine operates mostly at this hub height.We have collected data from four different regions (South Plains region of Texas, Southern Offshore region of Texas, Hills region of Arizona, and Hills region of West Virginia) in the United States to encompass various weather conditions.The variations of wind speed and direction, being dominant features for WSP, are shown in Figure 2. The dataset contains pressure and temperature data as well.The data were sampled at hourly intervals.
vation function, which is defined by the following expressions:

Proposed Hybrid Method for WSP
In the hybrid wind speed prediction model, all five models are used, i.e, PM, WEB, ARIMA, SVR, and LSTM.For each time step of a forecast horizon, a weight parameter is assigned.The weight parameter is then optimized using linear optimization by minimizing the loss function.Any one of the performance parameters, such as MAPE, MAE, and RMSE, is taken as a loss function to be minimized.After deriving weight parameters for each time step for each model, the hybrid WSP model is applied to predict the future time step wind speed up to given forecast horizons.

Data Acquisition
Data for this study comprise wind speed data at a hub height of 80 m and were extracted from the National Renewable Energy Laboratory System Advisor Model (SAM) database [46].The hub height of 80 m was chosen as the commercial larger scale wind turbine operates mostly at this hub height.We have collected data from four different regions (South Plains region of Texas, Southern Offshore region of Texas, Hills region of Arizona, and Hills region of West Virginia) in the United States to encompass various weather conditions.The variations of wind speed and direction, being dominant features for WSP, are shown in Figure 2. The dataset contains pressure and temperature data as well.The data were sampled at hourly intervals.

Statistical Analysis
Basic statistical analysis is performed using the method presented in Section 2.2.1.The scale and shape parameters are calculated using Equations ( 3) and ( 4) for the WEB.The mean speed is calculated using Equation ( 5) for point forecast using WBM.Scale and shape parameters are used in Equation ( 10) to simulate wind speed data for WSP using

Statistical Analysis
Basic statistical analysis is performed using the method presented in Section 2.2.1.The scale and shape parameters are calculated using Equations ( 3) and ( 4) for the WEB.The mean speed is calculated using Equation ( 5) for point forecast using WBM.Scale and shape parameters are used in Equation (10) to simulate wind speed data for WSP using WEBS.Similarly, the scale parameter for Rayleigh probability distribution is calculated using Equation ( 11) for the RYM.In addition, Equation ( 14) is used to simulate wind speed data for WSP using RYMS.

Error Correction and Wind Speed Generation
The flow diagram in Figure 3 shows the proposed algorithm for error correction for the simulated wind speed using Equations ( 10) and ( 14) for Weibull and Rayleigh distribution functions, respectively.Initially, the sequential variation data of wind speed is calculated, and mean variation is recorded.We assume that the wind distribution follows the persistence model.Hence, the predicted wind speed should also have the same sequential variation.Therefore, the simulated wind speed is checked each time to ensure that it is within the acceptable range (−α and α).If it is not within the limits, the algorithm of Gaussian filtering is applied [21]; this algorithm would prevent unnecessary deviations of the predicted value from the range extracted from historical data.Thus, error-corrected wind speed is generated and is used as the predicted value for WEBS and RYMS and input for other machine learning models.The error-corrected model based on WEBS is abbreviated as WEBSEC and the error-corrected model based on RYMS is abbreviated as RYMSEC.

Data Preprocessing
Many machine learning algorithms compare attributes of data points to detect trends in the data.However, problems would arise when features were on different scales.Therefore, data are normalized before being sent for training in machine learning models.One of the most prevalent methods of data normalization is the min-max method in which values are transformed to possess values between 0 and 1.Therefore, if  and ′ are the actual and normalized values of the feature and max and min are maximum and minimum values of the feature, then the normalization can be represented by Equation ( 25) After the normalization process, the dataset is divided into training and testing datasets.Usually, 70% of the data are used for the training of the model and the remaining 30% are used for testing the model.Out of the testing dataset, 10% of the total dataset is used for the validation of the model performance.However, we have train and test da-

Data Preprocessing
Many machine learning algorithms compare attributes of data points to detect trends in the data.However, problems would arise when features were on different scales.Therefore, data are normalized before being sent for training in machine learning models.One of the most prevalent methods of data normalization is the min-max method in which values are transformed to possess values between 0 and 1.Therefore, if x and x are the actual and normalized values of the feature and max and min are maximum and minimum values of the feature, then the normalization can be represented by Equation (25) x = x − min max − min Appl.Sci.2022, 12, 9038 9 of 18 After the normalization process, the dataset is divided into training and testing datasets.Usually, 70% of the data are used for the training of the model and the remaining 30% are used for testing the model.Out of the testing dataset, 10% of the total dataset is used for the validation of the model performance.However, we have train and test datasets for short forecast horizons of 6 h.For this, we have taken the last six time-step data as a test set and the remaining data as a training dataset.

Performance Evaluation
The evaluation of the performance of the individual models and the hybrid model proposed in this study is performed using popular statistical error indicators such as MAE, RMSE, and MAPE [47].If x j , x p j , and x j indicate the actual, predicted, and mean value of the wind speed, respectively, and n is the number of samples, each error indicator can be expressed using Equations ( 26) to (28) as follows: x j − x p j x j × 100% (28)

Probability Distribution Function Parameter Result
The estimation of Weibull and Rayleigh parameters for year 2010 at the South Plains region of Texas, Southern Offshore region of Texas, Hills region of West Virginia, and Hills region of Arizona are presented in this section.The shape and scale parameters were estimated for the whole year's data for all locations.Details are shown Table 1.First, Weibull and Rayleigh parameters were calculated as shown in Table 1.It is clearly seen in the table that the probability distribution for both Weibull and Rayleigh distribution functions were almost the same as the shape parameters of all three sites, close to two.We then investigated the forecasting accuracy using both Weibull-and Rayleigh-distribution-based models.
The results from the analysis of the probability-density-based wind prediction model are presented in Tables 2-5.The performance of the models was assessed through various performance metrics such as RMSE, MAE, and RMSE.This analysis was performed for short-term forecast horizons.While evaluating the performance of the next seven hours of time step, each hour of the time step was also evaluated to know about the detail of the forecasting model.
The results from Tables 2-5 imply that for all the regions, the error-corrected simulated probability-distribution-based model is more accurate than the general probabilitydistribution-based model and the simulated probability-distribution-based model.In the South Plains region of Texas, the error-corrected model can improve the general Weibullbased model to achieve a MAPE as low as 15%, RMSE as low as 1.5, and MAE as low as 1.5.In the Southern Texas Offshore region, the error-corrected model can improve the general Weibull-based model to achieve a MAPE as low as 17%, RMSE as low as 1.49, and MAE as low as 1.5.In the West Virginia Hills region, the error-corrected model can improve the general Rayleigh-based model to achieve a MAPE as low as 9%, RMSE as low as 0.85, and MAE as low as 0.85.Similarly, in the Arizona Hills region, the error-corrected model can improve the general Rayleigh-based model to achieve a MAPE as low as 30%, RMSE as low as 1.03, and MAE as low as 1.03.
The results show that for the South Plains Texas region and Southern Offshore Texas region, the Weibull-based model gives better results.In contrast, for the West Virginia Hills and Arizona Hills region, the Rayleigh-based model gives a better result.Therefore, it can be concluded that the wind distribution of the first two regions, i.e., South Plains TX and Southern Offshore region, can be more accurately described using the Weibull probability density function than the Rayleigh probability distribution function.In contrast, for the last two regions, i.e., West Virginia Hills and Arizona Hills region, wind distribution can be accurately described using the Rayleigh probability distribution function.Hence, the wind speed prediction model based on the probability distribution function also depends on how accurately that model describes the region's wind speed.

Comparative Analysis of Univariate Models
In this section, short-term forecasting was performed for six hours forecast horizon using seven different models based on the persistence model, classical time series model, and machine learning models.The results of short-term forecasting for four different case study sites are presented in Tables 6-9.The LSTM model emerged as the clear winner for the short-term wind speed prediction for all four case study sites.The LSTM model can produce a result with a MAPE as low as 3.53%, MAE as low as 0.4, and RMSE as low as 0.51.WEBSEC is also competitive with the LSTM model with a MAPE as low as 9.82, MAE as low as 1.09, and RMSE as low as 1.26.However, for site IV, whose wind speed distribution is not well described by the Weibull probability distribution function, the WEBSEC also does not give a good result.The SVR model is also competitive compared to the LSTM and WEBSEC models.

Development of Univariate Hybrid Model
The hybrid model based on a univariate wind forecasting model using persistence, classical time series, and machine learning models was developed after analyzing the performance of the model for predicting each time step of the total forecast horizon.For this, wind data of four case study sites were analyzed.After analyzing the performance of the individual model, a liner optimization was performed to minimize the performance metrics values to obtain thevweight function for each model contributing to the hybrid wind speed prediction model.The test data were divided again into the test and train data to evaluate the performance of the individual model and thus determine the weight function on the forecasted value on the test data.Then, the weight function was applied to the wind speed prediction value on the training dataset, thus determining the wind speed prediction value from the hybrid model.The performance results of the individual model of each time step of the forecast horizon are presented in Tables 6-9.developed using multivariate time series data, together with all of the other three models.This research highlights the competitive performance of the probability-density-based wind speed prediction method; the probability distribution of wind speed, which best describes the wind speed distribution of a given location, is also the best model for the wind speed prediction of the given location.Moreover, a novel method of error correction for wind speed forecasting based on the Weibull-distribution-based WSP model was proposed; this error correction can forecast wind speed accurately with MAPE of 4.7%, MAE of 1.7, and RMSE of 1, which is comparable to the best model out of the five models studied, i.e., MAPE of 5.7%, MAE of 0.43, and RMSE of 1.52.The wind speed of a region can be simulated based on the Weibull distribution parameters.
After analyzing the model's performance in predicting each time step of the whole forecast horizon, a hybrid model based on univariate wind forecasting was developed by incorporating persistence, traditional time series, and machine learning.The weight function that needs to be allocated to each model for each time step was determined by observing how well each model performed for each time step of the forecast window.
This study shows the competitive performance of the univariate model, which can be used where only univariate data are available for wind speed prediction.In this analysis, we suggested that the persistence model is the most accurate for predicting wind speed one or two hours in the future, a Weibull-based model for forecast horizons six hours or longer in the future, and an ARIMA or machine learning model is a good choice for forecast horizons in between two and six hours.As a result, a univariate model based on weight functions performs better and gives more weight to the most accurate approach for each time step.As this method used five different models for wind speed prediction, it might be more time-consuming when compared to a single model.

19 Figure 2 .
Figure 2. Distribution of wind speed and direction at four different sites.

Figure 2 .
Figure 2. Distribution of wind speed and direction at four different sites.

19 Figure 3 .
Figure 3. Error correction algorithm for simulated wind speed using Weibull and Rayleigh distribution function.

Figure 3 .
Figure 3. Error correction algorithm for simulated wind speed using Weibull and Rayleigh distribution function.

Table 1 .
Scale and shape parameters for four locations for the year 2010.