1. Introduction
Sudan is an enormous reservoir of renewable energy resources, with an evident annual wind energy yield, which is verified by GIS analysis [
1], as shown in in
Figure 1 [
2,
3] and
Figure 2 [
4]. The government-endorsed plans that encourage the switch to clean energy generation were included in the 2015 Intended Nationally Determined Contribution (INDC) declaration that was presented to the United Nations Framework Convention on Climate Change (UNFCCC). Among them, 1000 MW
p of grid-connected wind power plants is planned to be constructed in high-potential regions [
5]. In 2021, the country witnessed the arrival of the first wind turbine, which will have its electricity generation fed into the national network serving 14,000 people, and has already been installed in Dongola, a rich in wind energy northern city [
6].
The Weibull distribution is the most commonly used statistical distribution to model wind speed [
7,
8,
9,
10,
11,
12,
13,
14,
15], which served this purpose for the first time in the late 1970s. It then established many wind energy regulations worldwide and modeled software, such as WASP and HOMER [
16]. The two parameters of the Weibull distribution are well-known for being straightforward and fitting the actual wind speed readings perfectly. Hence, if these two parameters are accurately predicted, the Weibull-based forecasting model will represent the exact wind speed variations [
17,
18]. The work of [
19] depended on the Weibull distribution to obtain wind characteristics in the Alaçatı region, Turkey, and found that its shape and scale parameters equal 2.05 and 9.16, respectively. Additionally, the mean wind speed based on the dataset was
. [
10]. They estimated the same distribution parameters using different analytical methods and compared them with the results of other parametric and non-parametric models. In Pakistan, ref. [
11] used six analytical estimation methods to identify the shape and scale estimates of the Weibull distribution and accordingly obtained an average wind speed range from
for different weather mast heights. In Sudan, [
20] estimated the annual average wind speeds and their corresponding Weibull shape parameters at a 10 m height.
Table 1 presents the mentioned estimates for major cities in the country.
The wind speed prediction models provide the information needed for industry development [
21]. There are four main modeling techniques. (1) Physical models show better forecasting performance with long-term data; still, the precision is low. Still, suppose the resolution is high enough, and the initialization is perfect. In that case, the accuracy can be increased. (2) Statistical models include the conventional model that relies on famous probability distributions, such as Weibull, but with a limited capacity in front of nonlinear wind speed data. (3) Third, spatial algorithms, which require a large amount of information. (4) Lastly, artificial intelligence or metaheuristic algorithms, which are gaining more popularity, especially with nonlinear data [
21,
22,
23,
24,
25,
26,
27,
28,
29,
30,
31].
Statistical models or analytical methods are widely used to extract the parameters of the Weibull distribution [
12,
32]. Among these techniques, the most popular ones are the maximum likelihood method (MLM), the least square method (LSM), the method of moment (MOM), and the graphical method (GM) [
15]. Additionally, the metaheuristics or stochastic processes, mainly swarm intelligence algorithms, are successively used in the literature to evaluate the wind speed Weibull parameters, including the particle swarm optimization, Cuckoo search, Gray wolf algorithm, Firefly algorithm (FA), Ant Colony optimization, and many others [
33]. Since FA was put forward in this study, we have to mention that many researchers developed multiple variants of FA and applied them to estimate the parameters of the Weibull distribution effectively. Ref. [
34] hybridized the FA with the support vector machine method to obtain a novel, efficient way for Weibull parameter estimation. Ref. [
35] used a hybrid technique containing the FA and a back-propagation neural network to evaluate the Weibull distribution parameters for wind data in China, yielding a better prediction performance.
To link points together, our work here targets utilizing several analytical methods in addition to the FA to estimate the parameters of the Weibull distribution for the wind speed data collected in Khartoum. The importance of the city as the country’s capital and the availability of the research means from meteorological stations that save long-term data and the suitable equipment made us focus our attention on this region to help us build a model or a mathematical template that will guide similar future work for the rest of the country. The novelty in this work lies in using the FA with wind speed data in Sudan and the Weibull distribution, and then comparing the results with conventional estimation methods whose quality varies location-wise. For instance, the graphical way showed a good performance, while it has proven ineffectiveness in analyzing the data in some areas in the Republic of Korea, as claimed by [
36]. We placed particular attention to the FA methodology here because if we want to obtain a powerful Weibull-based forecasting model, an accurate estimation of the shape and scale parameters is of paramount importance, and this accuracy is only obtained through stochastic algorithms, precisely the swarm-intelligence set of techniques that the FA methods belong to. Accordingly, the significance of this study is reflected in forming a solid foundation for wind speed forecasting, which will comply with different sites within the surrounding area and in a larger circle involving the whole country.
Therefore, the rest of this article is organized as follows.
Section 2 presents the experimental setup, which sheds information on the meteorological mast station system, and
Section 3 outlines the analysis methodology, including data filtering, missing data imputation, Weibull distribution description, parameter estimation techniques, and goodness-of-fit metrics.
Section 4 delivers and discusses the results, and
Section 5 concludes.
2. Experimental Setup
The measurement tower from which the data were taken is located inside the National Energy Research Center (NERC) campus in the Soba suburb, Khartoum, and has a wind monitoring tower. At the top of the tower, a first anemometer, which measures wind speed, is fixed. Right beneath it, a wind vane that detects the wind direction is placed at . Below the wind vane at , a second anemometer is fixed, which provides a separate measurement for the wind speed. Additionally, there is a second wind vane at , reading another measurement for wind direction.
A barometer, an air pressure measuring device, and a thermocouple sensor are placed at . Moreover, a hygrometer is placed at the same height to measure the water content in the air (i.e., relative humidity). Finally, a pyranometer is connected to the tower at .
All the sensors or measuring devices are connected to a data acquisition system (Model: Meteo-40L, manufactured by Ammonit (Ammonit Measurement GmbH, Berlin, Germany) or data logger, in which the data are continuously collected at a sampling rate of
or higher. Each sensor measures and records its relevant reading once in a minute or less, and for every 10 min, the data logger saves the minimum, maximum, mean, and standard deviation of the set of measurements. It is worth mentioning that the analysis depended on retrieved data that belongs to the anemometer and wind vane set at a height of
.
Figure 3 and
Figure 4 show a schematic diagram and picture of the wind mast station within the NERC premises.
4. Results and Discussion
Figure 10 illustrates the monthly average wind speeds at the NERC site, as drawn from the collected data. Additionally,
Table 3 lists the average wind speed, frequency, frequency percentage, and cumulative frequency percentage corresponding to equal-width wind speed classes.
Table 2 provides the parameters specific to the FA technique, among them the stopping criterion, which was 1000 iterations, that we used to obtain high-accuracy results. It took the FA algorithm less than 200 iterations to reach the steady state, meaning that the fitness value converges or an exact solution is reached.
Figure 11 shows that the convergence to the optimal solution occurred approximately after 100 iterations. It is worth mentioning that the researchers in this study relied on Python code to execute the artificial intelligence algorithm.
Following the statistical analysis and the parameter estimation methods described above,
Table 4 delivers the shape and scale estimates associated with every technique. Consequently, it is clear from the demonstrated results that the FA has the best prediction potential regarding every testing metric used, followed by the MOM and GM. For example, using the FA method, the K–S number corresponding to the extracted shape and scale parameters is the most accurate as it tends to zero. The LSM and EPFM showed a weak performance looking at the R
2PP and R
2QQ. If we rely on visual comparison and inspection to determine the quality of the methods used in estimating the parameters of the Weibull distribution, then
Figure 12,
Figure 13,
Figure 14 and
Figure 15 explicitly show the superiority of the artificial intelligence technique. Additionally, these figures prove that the Weibull distribution perfectly describes the wind data at the NERC site. We can understand the dominance of the FA over the rest of the prediction methods due to the apparent nonlinearity in the wind speed data by looking at the regression analysis results in
Table 5, which favors the stochastic metaheuristic methods in such cases. Moreover, the average wind speeds were evaluated by employing
and
to Equation (3), and are given in
Table 4. The average speed corresponding to the FA method equals
.
The average speed for the wind energy assessment must also consider the dataset’s power content. Hence, the weighted average expression represented by Equation (23) was used [
43], and a
was obtained.
It is worth mentioning that although the FA method has a high prediction accuracy, the average wind speed produced by the classic techniques (i.e., MOM and GM) are closer in magnitude to the weighted average speed. Even after the visual inspection of the wind speed data given in
Figure 10, it is noticeable that the classically produced numbers rest within the region of most data points. On the other hand, looking at the frequency percentages delivered in
Table 2, the wind speed class that embraces the average speed produced by the FA method has the highest frequency, making the FA values more realistic in forecasting.
where
is the average wind speed,
is the sample size, and
are the observations or wind speeds.
Finally, the frequency percentage data in
Table 6, which corresponds to the wind direction, indicates that the wind primarily blows from the north, which is again noticeable in the rose diagram in
Figure 16. The wind direction illustrated in the rose diagram precisely simulates reality according to our on-field remarks that we have noticed monitoring our currently operating 2 kW
p wind turbine recently installed at the NERC campus.
5. Conclusions
In this research, we presented the results of the statistical analysis performed on wind speed and direction data collected by a weather mast station installed at the NERC, Soba, Khartoum. Limited data filtering was applied to detect and remove error codes, and then filling in the gaps was performed using the simple arithmetic mean of the remaining data points, as the distorted data fields were considered trivial (i.e.,
) compared to the whole dataset (i.e.,
) in terms of size. Firstly, we obtained the Weibull distribution parameters using analytical and stochastic methods, shown in
Table 4. In this table, the FA method outperforms the others in the goodness-of-fit. The MOM and the GM rank second and third, while the EPFM performs weakest. The best value for the shape parameter is 2.197, while the worst value is 1.918. The best value for the scale parameter is 4.211, while the worst value is 9.174. Secondly, the nonlinearity in wind speed data, which was conveyed in the regression analysis results provided in
Table 5, explains the advantage of the FA method over the conventional ones, as the artificial intelligence methods best fit nonlinear data. Thirdly, the rose diagram shows wind mainly blowing from the north, which complies with the real-world scenario. Finally, the average wind speed, according to the FA results, is equal to
, while the weighted average speed equals
. Still, the weighted average value looks more realistic from the field measurements point of view, but the FA average figure better serves the forecasting purpose due to the high accuracy applied while obtaining this number coupled with the high frequency of occurrence of this speed. The novelty in this work is reflected in the use of data generated in Sudan to forecast local wind speeds using the FA technique, which is widely used in solar PV modeling. Additionally, since classic estimating approaches execute differently location-wise, assessing their efficacy becomes new, which was achieved here. The authors can present several recommendations regarding the assessment of the local wind energy resource and the optimal use of this potential based on the cutting-edge technologies available in the global market:
A higher and multi-anemometer mast wind station must be installed locally to facilitate the vertical extrapolation of wind speed in heights compatible with utility-scale power production.
Figure 15 demonstrates the high capacity of stochastic methods, in particular the swarm intelligence algorithms, in predicting the wind speed in the region, making this technique the best choice for domestic meteorological and forecasting research.
Private sector participation in power generation from clean energy resources such as wind can fill the energy demand gap in Sudan. Hence, soft financing means provided by the stakeholders and international institutions will be the base for such contributions.
Wind turbine manufacturers need to deploy pilot projects in the country, preferably under the supervision of the NERC, to inspect the prospects of this investment.