1. Introduction
An outstanding advancement in technology associated with wind power generation is occurring within Asian and European countries [
1]. Wind power generation can help South Africa deal with its ongoing electricity problems, opening the way for a better and more successful future for its people, businesses, and economy. In the pursuit of sustainable energy solutions, Golshani et al. [
2] emphasised the importance of precise planning, execution, and accurate wind speed predictions to ensure stable wind power in the implementation region for sustainable energy solutions.
The complexity of atmospheric processes that affect wind speed is one of the obstacles in forecasting wind speed. Factors including humidity (%), surface roughness (μm), temperature (°C), and atmospheric pressure (n/m
2) gradients all have an impact on wind speed prediction [
3,
4]. In order to provide reliable forecasts, forecasting models must take into consideration these factors and their interactions. Warm air is preferred by wind turbines for wind power generation in order to pull more kinetic energy from the environment [
5,
6].
In recent decades, according to Wang et al. [
7], modern machine learning algorithms have notably enhanced the precision of predicting meteorological features. This improvement in accuracy has been facilitated by the integration of high-resolution weather models into machine learning frameworks, including deep learning, and extreme value theory (EVT), along with advanced data integration techniques. This study looks into the exploration of the
r-largest order generalised extreme value distribution (
), which brings together the heavy-tailed Fréchet distribution, the light-tailed Weibull distribution, and the Gumbel distribution, resulting in a unified distribution [
8]. Moreover, this study explores two machine learning models: the Vanilla long short-term memory (Vanilla LSTM) network, an enhanced form of recurrent neural network (RNN), and convolutional neural networks (CNNs). These models offer significant advantages such as exceptional adaptability and the ability to handle various types of datasets and are applicable to real-world scenarios such as rainfall and temperature prediction, as well as stock market forecasting [
9,
10,
11].
Wind power generation has the potential to substantially stimulate local economic growth, offering a cost-efficient energy source while also reducing unemployment rates due to the need for skilled personnel in the construction and maintenance of wind turbines [
12]. This study underscores the significance of ongoing financial backing from governmental bodies to foster the progression and refinement of wind power infrastructure and technologies. Furthermore, the study aims to heighten awareness among government officials and entities like the Electricity Supply Commission of South Africa (Eskom) regarding the efficacy and benefits of wind power generation. Additionally, advocating for policies that promote renewable energy sources like wind power can contribute to a more sustainable energy landscape and foster greater energy independence for the nation.
The study applies three main methodologies, namely, Vanilla LSTM, CNN, and the
, for modelling wind speed data from 2016 to 2023. The principal highlights of this investigation are outlined herein. The primary contribution of the study lies in the utilisation of machine learning algorithms and the EVT model to forecast wind speed for both short-term and long-term extreme events. Improving understanding of wind patterns in South Africa can assist in preparing for the transition to wind power generation while adapting to climate-change-associated events. The rest of this paper is organised as follows:
Section 2 presents a review of the relevant literature, and
Section 3 discusses the advantages of utilising renewable resources for electricity generation.
Section 4 summarises the key findings from this study.
Section 5 details the analytical methods applied in the study.
Section 6 provides the empirical results and findings for each method used. Finally,
Section 7 concludes the paper and offers recommendations for future research.
2. Related Literature Review
In recent decades, a notable trend has emerged in the literature, shifting towards the use of machine learning, deep learning algorithms, and EVT models, offering significantly accurate results. In fields such as environmental science, cybersecurity, and financial modelling, these approaches have gained prominence [
13]. This study adopts a dual approach, incorporating both machine learning and EVT methodologies to model wind speed, which is crucial for understanding wind turbine performance and maintaining power stability. Elsaraiti and Merabet [
14] conducted a comparative analysis between the autoregressive integrated moving average (ARIMA) and bidirectional LSTM model for forecasting wind speed in Canada to determine the most proficient predictive time series model. The findings of the comparison revealed that the LSTM approach outperformed ARIMA, exhibiting fewer errors and higher accuracy in predictions. Various types of LSTM models, such as Bidirectional LSTM, Vanilla LSTM, Stacked LSTM, and Convolutional LSTM, are available for time series forecasting problems [
15]. This study employ the Vanilla LSTM due to its capability to capture nonlinear trends and dependencies effectively.
A related study by Mutavhatsindi et al. [
16] explored the performance prediction of feed-forward neural networks (FFNNs), LSTM networks, and support vector regression (SVR) models for predicting short-term solar energy in South Africa. The findings identified the FFNN model as the robust suitable forecasting model. This current study bridges a gap, highlighting wind energy’s distinct advantages over solar energy, as wind is both cost-free and environmentally friendly. The study focuses on wind speed as the main variable, aiming to leverage its advantages compared to other renewable energy sources.
CNN is primarily known for its ability to detect patterns in spatial data such as images, but recent literature shows that CNN can be adapted to time series forecasting tasks, including wind speed prediction [
17,
18]. Chen et al. [
19] used a hybrid CNN-LSTM model to predict short-term wind speeds. The CNN layers were employed to extract spatial dependencies from meteorological data, while the LSTM layers were used to model the temporal dynamics. This hybrid architecture outperformed traditional machine learning methods, achieving lower mean absolute error (MAE) and root mean square error (RMSE) compared to conventional models. This method is also adopted in our study to compare it with the Vanilla LSTM, as applied by other authors to forecast different climate conditions.
Sikhwari et al. [
20] developed a framework to model extreme rainfall events in the Limpopo province from 1960 to 2020. They employed both generalised Pareto distribution (GPD) and the
modelling techniques. The optimal
r-largest order was determined to be 8, rendering
a suitable model for representing the data. Their findings predicted a 50-year return level of 368 mm, indicating a 0.02 probability of exceeding this threshold in 50 years in the Limpopo province. Similarly, this study incorporates the
and determines the optimal
r-largest order through Akaike information criteria (AIC) and Bayesian information criteria (BIC) values. Sigauke and Bere [
21] examined the importance of choosing the optimal technique for parameter estimation, with a specific focus on the parameter estimation process. In a related study, Diriba et al. [
22] focused on the peaks of yearly and daily maximum wind speeds in Port Elizabeth, South Africa. They explored the dependency effects within extreme value distributions. To estimate the parameters of EVT models, they employed both the maximum likelihood estimation (MLE) method and the Markov chain Monte Carlo (MCMC) approach, employing the Metropolis–Hastings algorithm [
8,
23]. Similarly, this study also employs the MLE method to estimate the location, scale, and shape parameters. This approach allows for a comprehensive examination of the parameters essential for understanding extreme wind events.
Furthermore, Salehin et al. [
10] investigated the capabilities of artificial intelligence models for rainfall prediction. The study applied LSTM and RNN to predict monthly and annual rainfall in Bangladesh, using meteorological data from 12 regions, spanning variables such as wind direction, temperature, wind speed, and pressure. The LSTM model achieved an accuracy of 76%. The findings indicated that precise rainfall prediction could greatly enhance agricultural planning and mitigate the potential damage caused by excessive rainfall. In this study, the same model, Vanilla LSTM, will be applied to the focused area to test its efficiency in predicting short-term wind speed.
Mora et al. [
24] conducted a comparative study focusing on the short-term wind power forecasting using various deep learning models, including Vanilla, Bidirectional, Stacked, Convolutional, and Autoencoder LSTM architectures. The models were tested on a wind power dataset from the Spanish electricity market, collected over six years. The analysis revealed that vanilla LSTM models demonstrated superior performance in one-step ahead forecasting, with a minimum MAPE of 4.30%. The Stacked LSTM model, though more computationally intensive, yielded comparable results for specific configurations (MAPE of 4.37%). This study aligns with existing literature where LSTM and CNN models have been shown to outperform traditional statistical methods in capturing the nonlinear dynamics of wind power generation [
25,
26,
27]. Furthermore, this study highlight the advantage of incorporating CNN layers into LSTM models, such as the Conv-LSTM. However, although more accurate, Conv-LSTM models exhibited higher computational costs compared to simpler LSTM and CNN configurations.
Nadarajah and Bell [
28] investigated the spatial modelling of wind directions in South Africa, addressing the under representation of African data in the existing literature. A spatial model was developed using data from 100 stations across the country, with records spanning from 1 January 1940 to 8 August 2023. Their model used a k-mixture of von Mises distributions fitted via a Markov chain Monte Carlo algorithm to capture the multimodal nature of wind directions. The analysis included parameters such as major mode, minor mode, concentration parameters, mean angle, mean resultant, circular variance, circular skewness, and circular kurtosis. They recommended further research on African data to address regional environmental challenges, with the GEVD and spatial model suggested for optimising applications in renewable energy, agricultural planning, and emergency response in South Africa.
Xie et al. [
29] investigated the use of artificial intelligence models for short-term wind speed forecasting. Their study employed an LSTM model to predict wind speed using meteorological data from two stations in Beijing, incorporating variables such as temperature, humidity, and barometric pressure. The LSTM model outperformed the traditional method ARIMA, achieving lower RMSE, MAE, and MAPE values. The study concluded that this model significantly improves the accuracy of short-term wind speed prediction. Additionally, LSTM is applicable to other fields, including finance and engineering [
30,
31].
Mugware et al. [
32] conducted a comparative study evaluating wind speed forecasting models, including CNN, Dynamic Architecture for Artificial Neural Networks (DAN2), Random Forest, and XGBOOST, across diverse South African weather conditions. The study applied these models to data from three stations with varying climates: Napier station in the Western Cape, and the Noupoort and Upington stations in the Northern Cape. The results demonstrated that CNN outperformed DAN2, primarily due to its advanced feature extraction and spatial hierarchy learning capabilities. This finding is consistent with several studies that highlight CNN’s ability to capture spatial dependencies and outperform several machine learning models in complex environmental data settings [
33]. Furthermore, models combining CNN and quantile regression have proven effective in reducing forecast errors and improving robustness in volatile atmospheric conditions [
34]. The combined forecasts using quantile regression models, such as generalised additive quantile regression (GAQR) and quantile regression neural networks (QRNNs), showed improved accuracy compared to individual models, with the QRNN model yielding slightly better results than GAQR.
Kamel et al. [
35] conducted a comprehensive study comparing the performance of Artificial Neural Networks (ANNs) and the GEVD for predicting wind speed. The study utilised the MLE method alongside MCMC to estimate GEVD parameters, assessing the robustness and accuracy of these models. The results revealed that the MLE GEVD distribution provided more accurate predictions of extreme wind speeds, outperforming traditional ANN models based on Multi-Layer Perceptrons (MLPs) across key metrics such as MAE, MRE, and RMSE. This study aims to assess Vanilla LSTM and CNN models for short-term wind speed prediction while using the
for long-term wind speed forecasting. As noted by Kang et al. [
36], extreme value distributions like Gumbel are often ideal for modelling high-impact, low-frequency events such as wind speed extremes. These models are particularly valuable in risk assessment and long-term forecasting, where accurately capturing rare yet severe occurrences is critical for designing resilient systems and making informed safety decisions.
This study addresses a significant methodological gap in the existing literature, wherein most studies tend to separately apply EVT and machine learning techniques [
20,
29]. Additionally, there exists a notable geographic gap in the literature, with many studies predominantly concentrating on developed countries and financial hub cities. This study stands out by focusing on the semi-developed province of Limpopo in South Africa, providing valuable insights into a region that has received comparatively less attention. By integrating machine learning and EVT, this study contributes not only to meteorology but also extends its implications to fields such as finance, engineering, and many others.
3. Advantages Renewable Resources
In recent decades, renewable energies have emerged as a compelling solution, providing clean, abundant, and competitive energy sources [
37,
38,
39]. The growth of clean energies is clearly evident in the latest statistics from the International Energy Agency (IEA). Global projections indicate a huge increase in the share of renewable energy generation, expected to rise from 28.7% in 2021 to a significant 43% by 2030 [
40]. Notably, renewable resources are positioned to contribute two-thirds of the amplified electricity demand during this period, driven by wind, solar, and geothermal energies. A significant merit lies in the fact that renewable energies, such as wind and solar, eliminate the emission of greenhouse gases, which is a significant contributor to severe weather conditions. Additionally, these renewable energies are cost-free compared to fossil fuels [
41].
Arent et al. [
42] mentioned that beyond environmental benefits, the use of renewable resources brings substantial economic advantages, particularly in job creation. The installation of solar panels requires human involvement, and the maintenance of wind farms calls for technicians. As a result, the renewable energy sector generates more jobs per unit of electricity produced compared to fossil fuels. In the United States of America, for instance, renewable energy already supports a substantial workforce [
43]. In 2016, the wind energy industry alone provided employment for over 100,000 full-time equivalent positions, in manufacturing, project development, construction, turbine installation, maintenance, transportation, and logistics [
44]. The adoption of renewable energies not only contributes to environmental well-being but also stimulates economic growth by creating employment opportunities across different sectors.
The transition to renewable energy resources is not only a critical step towards mitigating climate change but also a robust driver of economic development. The projected growth in renewable energy generation highlights a future where clean energy will play a pivotal role in meeting global electricity demands sustainably. By eliminating greenhouse gas emissions and offering cost-effective energy solutions, renewables significantly reduce the environmental footprint. Moreover, the renewable energy sector’s potential to generate a substantial number of jobs underscores its importance in fostering economic resilience. As the world advances towards a greener future, the widespread adoption of renewable resources will be instrumental in achieving a balanced and prosperous global ecosystem.
6. Empirical Findings and Analysis
Table 1 outlines the descriptive statistics for wind speed data collected in Polokwane, South Africa, using an anemometer over the sampling period spanning from 1 January 2016 to 1 December 2023, providing essential insights into the variability and characteristics of wind patterns in the region.
Table 1 provides crucial insights into the variability and characteristics of wind patterns in the region. The table includes key statistical measures such as the minimum, median, mean, maximum, skewness, kurtosis, and standard deviation of the wind speeds.
Figure 4 presents the seasonal wind speed trend analysis for 2023.
A positive skewness value of 0.68 indicates that the distribution is skewed to the right, meaning there are fewer lower wind speeds and more higher wind speeds. The distribution of the sampled data is leptokurtic, meaning it has heavier tails and a more peaked shape compared to a normal distribution, with a kurtosis value of 4.01. The standard deviation of wind speeds, representing the amount of variation or dispersion of wind speeds around the mean, was 2.86 m/s. This indicates how much individual wind speed measurements deviate from the average wind speed.
In terms of wind turbine performance for electricity generation across the seasons,
Table 2 reveals distinctive patterns in each season. Summer exhibits moderate wind conditions, with a median wind speed of 7.34 m/s, indicating a reasonably efficient period for wind power generation. Winter stands out as the most wind efficient season with a higher median wind speed of 7.96 m/s, although it also presents the potential for extreme wind events, as indicated by a positively skewed distribution and higher kurtosis. Autumn showcased a favourable median wind speed of 8.90 m/s, offering another promising season for wind power generation, while its distribution remains relatively stable. These findings underscore the seasonal dynamics of wind energy potential, highlighting the need for tailored strategies to optimise electricity generation throughout the year.
6.1. Seasonal Wind Speed Trends for 2023
Figure 4 provides the seasonal wind speed trend analysis for the four seasons of 2023, namely autumn, spring, summer and winter. In autumn, the wind speed trend is slightly downward while there is a clear downward wind speed trend in spring and winter. On the contrary, summer has an upward wind speed trend.
6.2. Test for Stationarity
Before presenting the ADF test results, it is pertinent to introduce its application in assessing the stationarity of wind speed data in our focus area. The ADF test serves as an important test in determining whether the observed wind speed series exhibits a stationary behaviour, a crucial consideration for a range of analytical applications.
The hypotheses for the ADF test are as follows:
:
The time series data has a unit root (i.e., it is non-stationary).
:
The time series data does not have a unit root (i.e., it is stationary).
The results of the ADF test revealed that the wind speed in Polokwane is stationary (), indicating the data is suitable for time series modelling and further analysis.
6.3. Theil–Sen Non-Parametric Trend Plot
Figure 5 presents the outcomes of the Theil–Sen estimator within the framework of our focused geographical area. The Theil–Sen estimator value of −0.0002 indicates the potential linear relationship between the two variables, wind speed and date/time. The negative test statistic of −3.4644 suggests that the estimated slope differs from zero, indicating a potential negative association between the two variables. With a degree of freedom
2555, this statistical analysis captures the variability in the data. In the context of wind power generation, this slightly negative slope could imply a slight reduction in wind power output or efficiency as wind speed increases, though the magnitude of this effect is minimal (−0.0002).
6.4. Machine Learning
6.4.1. Vanilla LSTM and CNN Results
This section presents the findings obtained through utilising the Vanilla LSTM and CNN models. The model forecasting accuracy was assessed using the traditional metrics such as MAE, RMSE, and MAPE. Furthermore, predictions for the next five years (2024–2028) along with 95% confidence intervals are included. The following are the prediction results for both the training and testing sets after fitting to the Vanilla LSTM and CNN models.
Figure 6 illustrates the performance of the Vanilla LSTM and CNN models in predicting wind speed. The blue line represents the actual wind speed, while the green and orange lines denote the Vanilla LSTM and CNN predictions, respectively. Both models were trained on historical wind speed data and tested on previously unseen data to evaluate their performance. The close alignment between the actual values and the model predictions indicates that both models perform well, with minimal prediction error. This strong correlation highlights the ability of the models to accurately capture the underlying patterns in wind speed data, proving their reliability for forecasting purposes. The high accuracy and low error rates suggest that these models can be valuable tools for various applications, such as renewable energy management, where accurate wind speed forecasting is crucial.
Figure 7 and
Table 3 provide a comprehensive evaluation of the performance of both the Vanilla LSTM and CNN models on the testing set using various metrics such as MAE, RMSE, MAPE, and model accuracy. The metrics illustrate a strong alignment between predicted and actual wind speed values, demonstrating the robustness and precision of both models in forecasting. Specifically, the CNN model performed slightly better, achieving a monthly MAE of 0.53, compared to the 0.70 of the Vanilla LSTM model, with a corresponding RMSE of 0.75 for CNN versus 0.94 for Vanilla LSTM, indicating lower overall error in the CNN model. Similarly, the CNN’s MAPE values of 0.09 further highlight its minimal deviation from actual values, as opposed to 0.27 for the Vanilla LSTM. Accuracy-wise, CNN outperformed LSTM, with 88.66% for the monthly-time-step model, compared to Vanilla LSTM’s 85.56%.
The boxplots of residual errors (
Figure 7) further demonstrate the comparative performance across different wind speed ranges. In the low wind speed range (0–3 m/s), both models exhibit minimal residual errors, with CNN having a slight advantage. In higher ranges (3–5 m/s and 5–10 m/s), the CNN model again shows more consistency, with smaller variations in residual error, underscoring its superior performance in handling higher wind speeds.
These findings affirm the applicability of both models in real-world wind speed forecasting scenarios, with CNN offering a marginally better overall performance. The results suggest that these models can be effectively used for future predictions, potentially leading to more accurate and reliable wind speed forecasts, particularly for renewable energy management. This not only establishes a solid foundation for the use of both Vanilla LSTM and CNN models in forecasting endeavours but also paves the way for further advancements in predictive analytics and time series forecasting.
6.4.2. Training and Validation Loss between Vanilla LSTM and CNN Models
Figure 8 presents a comparison of the training and validation loss for the Vanilla LSTM and CNN models across 50 epochs. The left subplot depicts the Vanilla LSTM model, where both training and validation loss exhibit a steady decline throughout the epochs. The gap between the training and validation loss is minimal, suggesting that the model generalises well without significant overfitting. However, a slight plateau is observed towards the latter epochs, indicating diminishing improvements in the loss reduction.
The right subplot illustrates the CNN model performance, which similarly shows a consistent reduction in both training and validation loss. However, the CNN model demonstrates slightly superior performance in terms of both lower training and validation losses compared to the Vanilla LSTM. The losses are closely aligned, indicating robust generalisation and effective training.
Overall, both models show effective learning over time, though the CNN appears to converge slightly faster and achieves marginally better loss metrics. This suggests that the CNN model may provide more accurate predictions within 50 epochs, making it a more efficient model for forecasting wind speed in Polokwane, South Africa.
6.5. Model Performance Comparison: Vanilla LSTM and CNN Wind Speed Forecasts
Figure 9 presents a comparison between the forecasting performance of Vanilla LSTM and CNN models for predicting wind speed. The findings suggest that the CNN model outperforms Vanilla LSTM in terms of stability and reliability. The Vanilla LSTM predictions align well with historical wind speed trends, indicating a moderate ability to generalise daily and yearly patterns; however, its wider confidence interval (CI), particularly from 2024 onwards, reflects higher uncertainty in long-term forecasts. This broader CI suggests that the Vanilla LSTM model may be overfitting certain patterns or anomalies in the data, resulting in more pronounced fluctuations in its predictions for 2025–2028. By comparison, the CNN model demonstrates a narrower, green-shaded CI, implying a lower degree of uncertainty in its forecasts. The CNN yearly average predictions remain consistent and stable, with limited fluctuations, as evidenced by the narrow confidence bounds extending from 2023 to 2028. This suggests the CNN model is more reliable for long-term forecasting, as it maintains smoother predictions and exhibits greater confidence in future wind speed trends. The gradual increase in predicted wind speeds, coupled with the narrower CI, highlights the model’s capacity for stable and confident forecasting over an extended period.
Overall, while Vanilla LSTM proves effective in capturing variability and following historical data, its higher uncertainty makes it less dependable for long-term forecasting. Meanwhile, the CNN model provides a more robust and confident forecast, making it a more suitable choice for reliable, long-term wind speed predictions. The comparison underscores the value of the CNN model in offering tighter confidence bounds and more consistent performance, particularly when precision in extended forecasts is essential. Therefore, this study recommend that the government incorporate these forecasts into long-term energy and infrastructure planning, using the CNN model for reliable projections. Investing in renewable energy, particularly wind farms, based on these forecasts, could not only boost energy independence but also contribute to South Africa climate action goals.
6.6. Short-Term Vanilla LSTM and CNN Predictions
Table 4 presents the predicted average wind speeds for the years 2024 to 2028 using the Vanilla LSTM and CNN models. The forecasts indicate a fluctuating trend in wind speeds over the five-year period, with a notable decline in 2027 and a recovery in 2028. These findings suggest that Polokwane will experience moderate wind speed events, with some variability across the forecasted years. A 95% confidence interval (CI) is included to account for uncertainty, providing both lower and upper bounds for each prediction. The upper bound for 2028 is particularly high, suggesting the possibility of extreme wind speed events in that year. When comparing the two models based on the confidence intervals, the CNN model exhibits narrower intervals in most years, particularly in 2025 and 2026, where the range of possible outcomes is smaller. This suggests that the CNN model provides more precise predictions and may be considered more reliable for forecasting wind speeds during this period. Conversely, the Vanilla LSTM model tends to have wider intervals, indicating greater uncertainty in its predictions.
These predictions demonstrate the value of both models in forecasting wind speed trends. The CNN model, with its tighter confidence intervals, may offer better precision, while the Vanilla LSTM captures broader potential variations. The ability of these models to accommodate uncertainty and predict trends allows for better preparation and planning, particularly for extreme weather events. The inclusion of confidence intervals further strengthens the robustness of the predictions, supporting more informed decision-making in managing wind-related risks.
6.7. Extreme Value Theory (EVT)
Fitting the r-Largest Order Statistics
This section presents the findings obtained from applying the
r-largest order
to the dataset. The
r-largest order statistics ranged from 1 to 5. Each order was subsequently fitted to the
to estimate the location, scale, and shape parameters. The analysis aimed to determine the optimal
r-order statistic. The parameters were estimated using the MLE method. The standard deviation for each parameter was computed, and the results are presented in
Table 5.
Table 4 summarises the
optimal order results with
r-order sizes ranging from 1 to 5, denoted by
r. These insights shed light on the statistical traits of extreme events, crucial for their comprehension and modelling. The estimated location parameter (
) gradually decreased after
r = 2 increased, indicating a shift towards lower central tendencies in extreme event data. Standard errors
decreased with increasing
r, enhancing confidence in location parameter estimates for larger order.
The scale parameter () also gradually decreased after the second order. For instance, ranged from 2.526 to 2.496 across the r-order sizes. Conversely, the shape parameter () became less negative after the second order, indicating a shift towards heavier-tailed distributions for larger r values. These findings are vital for understanding extreme events and tail behaviour, assisting in the selection of optimal r-order size for prediction and risk assessment.
6.8. Goodness of Fit
Table 6 shows that increasing the order statistics from second order leads to higher negative log-likelihood, AIC, and BIC values as
r increases from size two to five, indicating poorer fits to the data. Striking a good balance between model goodness of fit and complexity is crucial for selecting the optimal
r. In this context,
emerges as the preferred fit, offering significantly lower AIC and BIC values compared to other models, while maintaining a lower negative log-likelihood. Thus, choosing
r = 2 effectively balances data explanation and complexity, making it the most favourable choice among the options.
6.9. Diagnostic Plots for
Another method for determining the optimal order size involves a comprehensive approach that combines visual inspection of the data using graphical diagnostic plots with insights derived from the results presented in
Table 6. This approach particularly emphasises selecting the model with the smallest AIC and BIC values.
Figure 10 presents the diagnostic plots for
. These plots provide a visual assessment of the model fit, showcasing empirical quantiles against model quantiles, quantiles from model-simulated data against empirical quantiles, the density of empirical versus modelled data, and return levels for various return periods. These visual tools are crucial for evaluating the model performance and ensuring the robustness of the parameter estimates. By leveraging both statistical and graphical diagnostics, we aim to refine our model selection process, enhancing the accuracy and reliability of our extreme value predictions.
Figure 10 provides compelling evidence supporting the suitability of the
model for describing wind speed behaviour. This is clearly demonstrated by the strong alignment observed in both the P–P and Q–Q plots, indicating a close fit between the model’s predicted values and the empirical data. Additionally, the return level plot shows a clear increasing trend, further confirming the model’s effectiveness in capturing the behaviour of extreme wind events. Moreover, the density plot illustrates a close match between the fitted density curve and the empirical density distribution, highlighting the robust agreement between the model and the actual data. This consistency underscores the reliability of the
model in accurately representing wind speed characteristics.
The findings in
Table 6 and
Figure 10 suggest that the
model has the potential to significantly enhance our understanding and prediction of extreme wind events, which is crucial for effective wind power generation strategies. This is particularly crucial for the development of resilient and efficient wind energy systems, as it provides a reliable framework for analysing and forecasting extreme weather conditions.
6.10. Return Levels for
Table 7 provides valuable insights into return levels across different return periods, ranging from 5 to 300 years. These levels act as significant indicators in the distribution, offering predictions regarding the severity of extreme events over specific time frames. For instance, the 5-year return level is recorded at 13.03 m/s, suggesting an event of this magnitude is anticipated approximately every 5 years. This indicates a relatively low probability of
, suggesting a 20% chance of extreme wind speed event occurring in the next 5 years compared to the maximum rare event observed during 2016 to 2023. The 100-year return level estimate is 20.14 m/s, which is lower than the maximum observed wind speed of 22.86 m/s for the period 2016 to 2023.
The 250-year return level, standing at 22.10 m/s, represents an exceedingly rare event expected approximately once every 250 years. This event is closely comparable to the maximum event observed in the past decade. Furthermore, the 300-year return level, indicating an event occurring once every three centuries, underscores an exceptionally rare event with a very low probability of a 0.3% chance of the rare extreme wind event occurring. Compared to the highest recorded wind speed of 22.86 m/s in
Table 1, the 300-year return level denotes a higher magnitude event. Therefore, the 300-year return level becomes a crucial measure for assessing the potential risks associated with encountering extremely severe wind events in the Polokwane region. It is worth noting that such rare events have a high probability of occurrence during the winter and autumn seasons, as highlighted in
Table 2, with high maximum wind speed magnitudes present in the seasonal summary statistics.
Combining the results from
Table 4 and
Table 7, it is evident that both the Vanilla LSTM and CNN models provide comparable predictions for the average wind speed in Polokwane over the next five years. Specifically, in 2028, the average wind speed is projected to be 9.43 m/s with confidence bounds of 6.82 and 13.44 according to the Vanilla LSTM model. Similarly, the CNN model predicts a slightly higher average wind speed of 9.59 m/s, with confidence bounds of 7.63 and 12.11. This overlap in their respective confidence intervals indicates that both models are effective in capturing wind speed variability, thereby supporting the robustness of these deep learning approaches in forecasting wind dynamics. Moreover, the
model, which focuses on predicting extreme wind events, forecasts a return level of 13.03 m/s for a 5-year return period, and 22.89 m/s for a 300-year return period. This extreme value prediction of 13.03 m/s for the 5-year period aligns well within the confidence bounds provided by the Vanilla LSTM and CNN models for the year 2028, thereby reinforcing the consistency and reliability of the three models under different forecasting conditions. The consistency observed between these models suggests that a hybrid modelling approach may be beneficial. Specifically, incorporating the Vanilla LSTM and CNN models for short-term average wind speed forecasting, alongside the GEVD model for extreme event prediction, would provide a more comprehensive view of the wind climate in Polokwane. This combination would allow for enhanced decision-making in the design and implementation of wind energy systems, ensuring that both typical and extreme wind conditions.
7. Conclusions and Recommendation
This study conducted an in-depth analysis of wind speed forecasting in Polokwane, shedding light on statistical properties, seasonal variations, and future predictions. The primary aim was to forecast short-term wind speed events using Vanilla LSTM and CNN and long-term wind speed events using the . Historical data from Polokwane suggests a low likelihood of extreme wind events. Seasonal analysis revealed that winter and autumn are the most efficient seasons for wind power generation and also indicate the potential for extreme wind events. The ADF test confirmed that the time series data is stationary, suggesting a constant mean and variance over time, which is essential for accurate modelling and forecasting.
The CNN network model demonstrated accurate wind speed forecasts with low error rates and high predictive accuracy, supported by low evaluation metrics such as MAE, RMSE, and MAPE. Given the robustness of both Vanilla LSTM and CNN in forecasting average wind speeds and the ability of the GEVD model to capture extreme wind events, it is recommended to adopt a hybrid forecasting strategy that leverages the strengths of all three models. This approach would enable practitioners to more effectively plan for both regular and extreme wind scenarios, thereby improving the resilience and efficiency of wind energy systems. Furthermore, the minor overestimation of wind speeds by the Vanilla LSTM and CNN models warrants a continuous model refinement strategy and an evaluation against real-world data to enhance predictive accuracy. Overall, by integrating the predictive strengths of deep learning with the extreme event analysis of the GEVD model, it is possible to develop a more resilient wind energy strategy tailored to the local wind climate.
In conclusion, this study lays a robust foundation for wind power generation, offering valuable insights for stakeholders in renewable energy and environmental planning. These findings contribute to more sustainable and informed decision-making in Limpopo province. This study offers a set of recommendations in the domain of energy management and sustainability. The following recommendations are proposed for future research direction:
Employ advanced machine learning techniques, such as Kolmogorov–Arnold networks (KANs) and temporal convolutional networks (TCNs). These methods can capture complex relationships within meteorological variables and geographical features, thereby enhancing the accuracy and efficiency of wind forecasting.
Investigate the use of modern EVT techniques, such as blended GEVD (bGEVD) and generalised autoregressive conditional heteroskedasticity-GEVD (GARCH-GEVD), to present a refined approach to analysing extreme events in wind speed data. By incorporating bGEVD and GARCH-GEVD, future research can gain a deeper understanding of the tail behaviour of wind speed distributions, thereby improving the assessment of potential risks associated with extreme wind events.
Expand the geographical scope to encompass all nine provinces of South Africa for a clear understanding of regional wind patterns. A broader geographical scope will enable the research to capture diverse wind patterns, ensuring a more robust and representative analysis.
Encourage collaboration and knowledge sharing among research institutions, government entities, and industry stakeholders. By encouraging collaboration among research institutions, government entities, and industry stakeholders, research outcomes and data can be leveraged collectively. The exchange of knowledge and data can facilitate a more comprehensive understanding of the challenges and opportunities in wind power generation.
8. Limitations
Wind is a magnificent, boundless, and cost-free clean source of renewable energy for electricity generation. At the same time, it is essential to recognise specific constraints within wind power generation. This study focused on Polokwane, the capital city of Limpopo province, limiting the geographical scope to a semi-developed urban area with specific meteorological conditions. This geographical focus presents a limitation, as addressing the issue of load-shedding requires a broader perspective encompassing all cities across South Africa. Additionally, there is not much existing research globally, especially in South Africa, that combines machine learning and EVT to forecast wind speed. This lack of literature makes it difficult to compare and contextualise the findings of this study.
Another limitation is the availability and quality of wind speed data in South Africa. This presents a constraint on the generalisability of the findings, as a more comprehensive dataset spanning several decades would provide a stronger foundation for EVT-based models to accurately capture extreme occurrences and establish more robust patterns. Without such extensive data, the ability to draw conclusive insights on wind behaviour and its long-term trends is limited, which could affect the reliability of the predictions. Despite these limitations, the study provides a foundational approach to combining machine learning and EVT for wind speed forecasting in a South African context, highlighting the potential for enhancing wind power generation and addressing the energy challenges faced by the region.