1. Introduction
Governments, researchers, and corporations are taking a serious interest in electricity consumption, production, and supply due to their crucial role in livelihoods and global economic development. Conventionally, electricity is generated using primary energy sources such as fossil fuels, nuclear, and renewable energy. Along with an increasing population, urbanization, the growth of industry, and technological advancement, electricity demand is also rising [
1]. A power supply units’ excessive number is activated when load estimates are higher than electricity demands, causing an excessive amount of electricity to be used and giving extra reserves. On the other hand, lower load projections can force the system to operate within limits, leading to insufficient supply [
2]. Nevertheless, load and demand projections serve as the foundation for several energy market choices, enabling electricity system markets to be planned and administered in an effective, transparent, and dependable manner and to meet the sector’s needs [
3]. The demand for energy is impacted by various factors. Import, export, and GDP are significant impacts on estimating energy demand. Population not only affects the overall energy demand but also has an influence on the way energy is utilized and the occupancy per person [
4]. According to Wang et al.’s [
5] threshold regression model, as economies mature, the size of the working population has a weaker influence on energy demand. Electricity demand is more affected by urbanization than primary energy. Demographic and land urbanization’s effects on energy demand were estimated by Yu et al. [
6], while Liu et al. [
7] analyzed the long-term monthly electricity energy demand forecast by examining the relationships among climate, socioeconomics (GDP, populations) and electricity consumption variables. Mutschler et al. [
8] also explored the complex interplay between population growth, technology adoption, climate change, and energy demand to plan future energy systems policies.
Over the past 20 years, there has been a wealth of specialized literature about demand and load forecasting. There are different types of energy demand models, including static or dynamic, univariate or multivariate, and techniques that use time series or hybrid models. This literature seeks to categorize and analyze studies applicable and relevant to Turkey. The goal is to identify which methods have been considered, the input variables selection, and the arrangement of the parameter values [
9].
This systematic review considered the following factors:
To evaluate the effectiveness of different methods, an analysis of Key Performance Indicators (KPIs) is used to assess prediction accuracy. In this context, commonly used measures such as MAE in the literature result in the omission of crucial quality factors such as the highest forecast error and the distribution of error. By avoiding the mutual counteraction negative and positive errors in the prediction, RMSE, and MAE evaluate, respectively, how closely the anticipated value difference resembles the true value. While MAPE emphasizes the accuracy of the forecasting methodologies, MSE illustrates the difference between the actual data and the anticipated value. When different data sets are utilized, MAPE aids in examining how well the estimating methods function.
The model hyper-parameters fine-tuning, data pre-processing methods, the validation and training data set selection, and the outcomes graphical display.
The findings and precision verification of the large dataset collected for the mainland.
This research covers a mix of methods used to estimate electricity demand, including artificial neural networks (ANN), auto-regression, evolutionary algorithms, linear multivariable regression, metaheuristic algorithms, and fuzzy logic techniques, which can be applicable to Turkey.
Table 1 provides an overview of the research conducted on predicting electricity demand.
Kazemzadeh et al. [
20] developed a support vector regression (SVR)-based prediction algorithm approach. The input samples dimension and the SVR technique parameters were both optimized using the particle swarm optimization (PSO) method. For yearly peak load and long-term total electrical energy demand, a hybrid forecasting method was considered to reduce forecasting error. The combination of ANN, autoregressive integrated moving average (ARIMA), and suggested SVR methods served as the foundation for the proposed hybrid method, which was used to estimate total electricity demand and annual peak load. It has been observed that the proposed hybrid methods and PSO-SVR give more precise results than ANN and ARIMA methods. The PSO-SVR method can estimate electricity and load demand with small errors without any sensitivity to seasonal patterns in the initial time series. On the other hand, the artificial bee colony (ABC) algorithm was utilized by Hao et al. [
25] to demonstrate and use a unique ensemble forecasting model for predicting electricity consumption. As independent input variables, several historical time variables, including consumer price index, GDP, industrial structure, urbanization rate, technical innovation, and population, were employed. To train the model, China’s primary electricity demand data was used, and it was determined that the suggested ensemble model delivers more precise results in forecasting hypothesis and accuracy testing than both the benchmark forecasting model and the basic mean ensemble estimation model.
Real et al. [
21] have investigated deep learning application methods to predict energy demand. The authors suggest combining an ANN with a convolutional neural network (CNN) hybrid architecture. French energy demand forecast was trained and given context using Action de Recherche Petite Echelle Grande Echelle (ARPEGE), estimating meteorological data. The outcome demonstrates that this method outperforms the standard subscription-based service provided by Réseau de Transport d’Electricité (RTE, a French transmission system operator). Additionally, the proposed approach performs best when results are contrasted with other options, such as ARIMA and conventional ANN models. Using a deep learning-based system, Bedi and Toshniwal [
22] calculated the demand for power by taking into account long-term historical dependencies. Based on all months’ worth of electricity usage data, cluster analysis is used to create data segments based on the season. To classify load trends, it was necessary to have a thorough understanding of the metadata that belonged to each cluster. It was determined that the suggested method (D-FED) outperforms ANN, SVM, and recurrent neural network (RNN) regression models and can be implemented to estimate electricity demand efficiently.
Using a hybrid model built on the ARIMA and least-square support vector machine (LSSVM), Kaytez [
26] shows how to estimate Turkey’s electricity consumption. Results from this suggested approach were evaluated against official prediction data, a single ARIMA, the body of literature, and a multiple linear regression approach. The results demonstrate that the model responds better to some unexpected reactions in the time series. Di Leo et al. [
27] applied regression analysis (RA) to forecast trends in energy consumption in end-use industries. The suggested method was used to describe the positions of the long-term electricity demand by statistically characterizing the relationships between independent factors such as GDP, population, transportation, business, and residential energy demands. Traditional statistical tests have been used to assess and validate the non-linear and linear regression models’ effectiveness for energy demand forecasting. The outputs demonstrated a close and logical correlation between transport and residential electricity demand with GDP and population, while the commercial energy demand is correlated with GDP.
To estimate the Rodrigues and Mauritius Islands’ peak monthly electricity demand, Ramsami and King [
19] used three techniques: the adaptive network-based fuzzy inference system (ANFIS), data handling group method, and ANN (recurrent and feedforward). Nine error measures were used in conjunction with the suggested models. The optimal value for seven error metrics was created using the proposed model with grid segmentation. As a result, it proved to be better suited for determining the peak electrical demand. For calculating peak demand, an ANFIS model with grid segmentation is more effective. Angelopoulos et al. [
28] published the long-term predictions for electricity demand in Greece and also made use of the connection between time series and effective multiple criteria. Greece’s value estimate model is examined using training-related data collected from 1999 to 2013. When determining the annual total net electricity consumption, the suggested approach was used for an interconnected power system in Greece during the following test period between 2014–2016. The results of the suggested model reveal that, after increasing the efficiency of electrical energy, taking into account the country’s general weather conditions and economic growth as measured by the national GDP, those factors have the most impact on electricity demand. The regression models exhibit superior performance compared to the least squares MLR model regarding predictive dependability, with the minimum MAPE outcome being 0.74%.
The impact of European nations’ power generation during the lockdown period was examined by Sahin et al. [
17], who also reconstructed energy generation in these nations. The total monthly electricity generation from renewable and non-renewable sources in the UK, France, Germany, Turkey, and Spain was examined and compared from January 2017 to September 2020. Machine learning (ML) techniques and grey prediction (GP) models for seasonal periods were employed to predict future trends. Hou et al. [
29] investigated the electricity output effects, GHG emissions, and consumption on climate change. They forecast electricity demand using an optimized ANN. The ANN approach was enhanced using the Improved Pathfinder algorithm. For estimating the demand for electricity, a more sensitive model with lower error numbers was produced by the ANN method’s optimization technique.
Baba [
12] compared and evaluated the effectiveness of three different strategies to forecast the daily energy consumption of the nearby industrial area. A probabilistic approach called the Multiple Model Particle Filter (MMPL) method was suggested. Then, ANNs with one and two hidden layers were developed and examined. Pegalajar et al. [
30] used data from the Spanish Electricity Network between 2007–2019 to predict electricity demand. RNN, multilayer perceptron, linear regression, regression trees (RT), gradient boosting regression (GBR), and random forests (RF) were used among the six estimate models. These experiments show positive results in all scenarios, with a worst-case improvement of 12% and a best-case improvement of 37% over predictions made by the Spanish Electric Grid. By utilizing load profiles, Bendaoud et al. [
31] explored a load forecasting approach (LPs). The hourly temperature profiles used to incorporate seasonal data fluctuations and demand changes have been used to analyze Algeria’s power consumption. For LP-based forecasting, daily, weekly, and annual LP propagation were employed. Mid–short-term load forecasting models have been developed using a variety of artificial intelligence (AI) techniques.
By utilizing ensemble ML techniques, Porteiro et al. [
32] have produced a model for anticipating the electricity demand for residential and commercial buildings. In this research, computational intelligence models were created to estimate the demand for power one day in advance. An ensemble technique was also used to develop a day-ahead forecasting model. Standardization, addressing missing values, and outlier removal were the three processes in the pre-processing of the data. The real data sets for Burgos Industrial Park, Uruguay’s total energy demand, and Montevideo’s distribution substation electricity demand have been chosen for examination. The proposed models were assessed using common performance indicators. The primary findings demonstrate that the best day-ahead proposed model has a MAPE of 5.17% on total consumption data, 9.09% on substation value, and finally, 2.55% on industrial data based on Extra Trees Regressor. Gokceada Island’s electricity demand was estimated using ANN, PSO, and MLR methods. Input values for car numbers, exports, imports, and passenger numbers related to tourism were based on the period from 2014 to 2019 [
11]. The results obtained from these methods were analyzed using statistical errors such as RMSE, R
2, MAE, and MSE. The methods were analyzed by examining their confidence intervals. The correlation matrixes are utilized to indicate the association between the method outputs and actual values, as well as the relationship between dependent and independent parameters. Input parameters are separated into subsets, multi-regression equations related to these parameters, and
p-value performances and R
2 were demonstrated. The results showed that the ANN method had the widest confidence interval of 95% among the techniques used, and the statistical error metrics had the strongest correlation with the actual data and electricity demand output for the ANN method [
11].
A new approach to power load forecasting is introduced by Lu et al. [
33], which involves utilizing an SVR model in combination with WOA that incorporates elite and chaotic opposition-based learning (ECWOA) to enhance forecasting results. The results of experiments indicate that incorporating electricity price information results in higher forecasting accuracy. To enhance the accuracy of CO
2 emissions and energy demand predictions in the transportation industry, Javanmard et al. [
34] implement a mixed method that combines a mathematical model with multiple objectives with ML algorithms that use data-driven approaches. An estimation of the energy consumption demand in China is conducted by Rao et al. [
35]. Firstly, a two-stage model based on least absolute shrinkage and selection operator-random forest (Lasso-RF) is introduced to determine the factors affecting energy demand. Secondly, the SVR-compositional data second exponential smoothing (SVR-CDSES) model is developed to predict the demand for primary electricity, oil, natural gas, and coal. The results indicate that primary electricity will experience significant growth at an annual rate of 8.05%, reflecting a growing focus on clean energy.
Li et al. [
36] propose a hybrid approach that utilizes Manta ray foraging optimization to optimize the parameters of SVM for short-term load forecasting. To assess the precision of this approach, five other optimizers, the Satin Bowerbird optimizer, Tug of War optimization, Fruit-fly optimization, Moth Flame optimization, and Slime Mould algorithm, are utilized to compare the proposed method’s superiority. Huang et al. [
37] developed a transformer-based model for estimating energy consumption in an actual university library and compared it to a baseline model, SVR. To account for the various factors that can influence the development and computation of building electricity energy models, advanced ML models driven by the inherent electricity consumption patterns are proposed by Huang et al. [
38]. In this study, the performance of three ML algorithms, LSTM, SVR, and XGBoost, are evaluated using one-year datasets with sub-hourly temporal granularity to determine the most accurate predictor. Additionally, the performance and robustness of the ML model, assessed by the coefficient of variation (CV), are compared across XGBoost and LSTM trained with the same datasets that include data size, temporal granularity, and building type attributes.
The literature contains various case studies on electricity demand forecasting. Velasquez et al. [
39] utilized regression with seasonality to predict the long-term electricity demand in Brazil. Pallonetto et al. [
40] compared the performance of SVM and LSTM to forecast commercial buildings’ hourly electricity demand in Ireland and found that LSTM outperformed SVM. May et al. [
41] utilized ANN and ANFIS to forecast Mexico’s electricity market hourly electricity demand and observed that ANN outperformed ANFIS. Meanwhile, Niu et al. [
42] suggested a hybrid model to forecast the aggregated four-grid electricity load in China for quarter-hour periods. Their approach involved population-based metaheuristic algorithms and integrating a signal decomposition tool with an ML model. Luzia et al. [
43] conducted a study to assess the suitable application of ANN, ARIMA combined with Wavelet Transform, or Fourier Transform for predicting electricity demand at various time horizons and frequencies. The findings indicate that when considering both time frequencies, ANN proves to be a superior method for short-term predictions. Işık et al. [
44] employed DL techniques to predict the electricity demands of select Fortune 500 companies in Turkey. The study focused on LSTM and MLP techniques, which have demonstrated effectiveness in previous research. Additionally, for the first time in electricity demand forecasting, the Multiple Seasonal-Trend Decomposition using Loess (MSTL) technique was utilized, and the results of MSTL outperformed better.
Albuquerque et al. [
45] employed regularized Lasso Lars and RF models to predict Brazil’s electricity consumption. Energy forecasting model based on a DL approach demonstrated by Rick and Berton [
46] that combined CNN, LSTM, and auto-encoder (AE) for time series with unequal lengths. Maaouane et al. [
47] utilized ANN modelling to estimate and predict energy demand in the transport sector of developing countries. Recently, Chaturvedi et al. [
48] conducted a comparison of several time-series models, such as SARIMA, and LSTMRNN models, for the purpose of predicting India’s peak and total monthly energy demand. Unlike the ANN model, the SVR model is designed to avoid local overfitting and minimization. In this study, the electricity demand of Turkey’s mainland was projected through the year 2040 using MNN, SVM, and WOA. Both error measures (RMSE, R
2, MSE, and MAE) and confidence interval and
p-value analysis statistical techniques were used to compare the prediction performances of the different methods. The correlation matrix was employed to demonstrate the association between the observed value and method-predicted values, as well as the relationship between the independent parameters (import, GDP, population, export) and the dependent data (electricity consumption). The outputs of correlation matrices have revealed which variables influence the result and by how much. Subsets of multiple regression equations for the input variables (import, export, population, and GDP) were developed. The parameters affecting the output with R-squared and
p-value performances were provided and compared in the resulting equations. A statistical procedure known as the confidence interval analysis of the methodologies was also carried out. Overall, the electrical energy demand forecasting performance of MNN and SVM among ML methods and WOA chosen as optimization methods were compared using error metrics (RMSE, MSE, MAE), correlation matrices, and multi-regression equations. In addition,
p-value and confidence interval analysis of statistical methods was performed to determine which method was more effective. The contributions of this study can be summarized in the following five points:
In this study, multi-objective forecasting models were created using various traditional ML methods and a new optimization method, WOA, to improve forecasting accuracy. In the Turkey case study, forecast performances were verified with error metrics by using inter-year data in electrical energy demand forecasting. The predicted results provided reliable and informative references for annual energy demand for the coming decades.
The effect of independent inputs used for electrical energy demand forecasting on forecast output has been investigated with MLR subsets and different combinations.
Statistical performance error metrics are included to effectively improve forecast accuracy and demonstrate the effectiveness of the method used.
It includes the technical analysis of determining the optimal parameters of methods by means of input-output correlation matrices. Thus, it is determined how much the independent variables affect the dependent variable.
The effective electricity demand estimation made in this study prevents extra reserves and limited operation of the system.
The rest of this paper is organized as follows.
Section 2 demonstrates the data sources, pre-processing, and exploration. MNN, SVM, WOA, and Error Metrics are given in
Section 3 under Materials and Methods. The Analysis and Results part under
Section 4 presents electricity demand forecasting results, error metrics, multi-regression equations, and correlation matrix by using the proposed methods. The results discussion, novelty of this study, limitations, and future works are given in the Discussion section.
3. Materials and Methods
The historical energy consumption data was utilized for training, testing, and validating forecasting methods such as MNN, SVM, and WOA, as shown in
Figure 1 of the flowchart of the method. By incorporating demographic input data from the past, the electricity demand was predicted until the year 2040.
According to statistical analysis results, the best-forecasted method output will be used for the next step as future electricity demand data. To compare the performance of the methods, error metrics such as MAE, R2, MSE, and RMSE were used along with statistical methods such as confidence interval and p-value analysis.
The correlation matrix was employed to demonstrate the correlation between the real data and the predicted data derived from the methods, as well as the relationship between the independent (GDP, import, population, and export) and the dependent variable (electricity consumption) for Turkey. The correlation matrices have demonstrated the extent to which variables affect the magnitude of the output. The input parameters, GDP, population, export, and import, were split into subsets, with multiple regression equations calculated for each parameter. The equations that were generated demonstrated the extent to which each parameter impacted the output, and their R2 and p-value performances were compared and presented. Furthermore, a statistical method known as confidence interval analysis was employed to evaluate the reliability of the methods used.
There are three distinct scenarios (low, base, and high) for each method. These scenarios are created by considering the relative differences among the input data’s historical values employed in the proposed methods. Thus, diverse scenario ratios and input values are utilized for the mainland. For the population, the official scenarios of the Turkish Statistical Institute were considered. In addition, for the future estimation of economic data, percentages were designed by considering previous studies in the literature, International Monetary Fund (IMF), and World Bank scenarios [
53,
54,
55].
Table 2 demonstrates the mainland’s assumption and scenarios to estimate future electricity demand.
3.1. Medium Neural Networks (MNN)
MNNs are computer-based systems that have been created to imitate the functions of the human brain, such as learning, comprehension, and the discovery of new information through experience [
56]. Inspired by the human brain structure, MNN performs information analysis by learning, relating, and generalizing over data. MNNs consist of layers connected to each other in parallel [
57]. In this study, the neural networks consist of three layers, including the input, hidden, and output layers. The function of the network is the connections between these layers. By adjusting the weight values that the layers are connected to each other, the network is trained to perform a certain function. Thus, an output is produced in response to an input in the network.
With MNN, predictions can be made in multiple time intervals, including short-, medium-, and long-term, in electricity demand forecasting [
58]. Outputs are transmitted via synapse connections. Synaptic link weights are expressed in numerical values. The architecture of the MNN model is depicted in
Figure 2. The connection between all layers in the ANN, which consists of 3 components, the input, hidden, and output layer, is provided by weights. When inputs from the input layer are transmitted to the hidden layer, they are multiplied by the link weights between the hidden layer and the input layer. The inputs to the neurons in the hidden layer are summed. Then, this expression is multiplied by the link weights between the hidden layer and the output layer and transmitted to the output layer. During the learning process, the weight values of the connections are determined [
59].
In this study, the electricity demand forecasting model for Turkey is modelled using a feedforward multilayer perceptron neural network. Layers and neurons arranged in a feedforward way are evaluated in multilayer perceptron neural networks. In this structure, the input layer of the neurons collects the information outside the system, while the output layer calculates the values according to the input values [
60].
Within this research, the backpropagation method with gradient descent is employed to train the model. This method, also known as backpropagation, begins by comparing the target value with the output value. Then, moving back through the input, the network adjusts the neuron weights to minimize the error, which is the difference between the target and output data. The Levenberg–Marquardt algorithm is utilized in the backpropagation process as a gradient-based algorithm.
The backpropagation method determines the output by finding the optimal combination of weights that minimizes the error function. To ensure continuity and differentiability of the error function, an interneuron transfer function is employed. In feedforward multilayer networks, linear transfer functions are utilized in the output neuron layer [
61].
In a neural network, each neuron, except for the neurons in the input layer, is connected to another neuron in the subsequent layer through weights. The neuron calculates a weighted sum of all the neuron values in the preceding layer. This situation is added to the weighted sum transfer function, and the output value is calculated. The neuron output is calculated using the following Equations (1) and (2) [
11].
where
a is the weighted sum of the inputs,
j is the neuron number,
f represents the transfer function,
w is the weight,
i is the input number, and
y resembles the output value. The sigmoid transfer function is determined as in Equation (3) [
11,
62]:
To simulate, train, and design the system, MATLAB R2021b–Neural Network Toolbox was utilized. By employing this software, an MNN model was trained, constructed, and evaluated using 40 distinct raw data samples obtained from official sources. The input data for the MNN model consists of four parameters, import, GDP, population, and export, while the output parameter comprises electricity consumption, as depicted in
Figure 3.
The MNN model architecture comprises an input layer with four neurons, an output layer with one neuron, and each hidden layer containing five to one neuron. The collected data from the experimental setup are split into two sections, namely, testing and training. The design, training, and simulation of the MNN model were performed using MATLAB-Neural Network Toolbox. Levenberg–Marquardt algorithm used the MNN training algorithm in this study.
3.2. Support Vector Machine
In this research, the SVM algorithm, which is a non-parametric supervised classification method, was utilized. The SVM was originally designed for the classification of two-class linear data but was insufficiently developed for non-linear and multi-class data and applied to engineering applications [
63,
64].
The approximate function in the SVM algorithm is given in Equation (4) [
63].
where
denotes the higher dimensional feature space transformed from the input vector
x, and
ω and
b denote the weight vector and bias term, respectively. The risk function obtained by minimizing
b and
ω values is given in Equation (5) [
64].
where
term represents the regulation term of SVM, and
C represents the compensator parameter, which indicates the error rate in the optimization.
The most important difference between the Vapnik linear function and the classical regression functions is shown in the Novell loss function
in Equation (6) [
63]. Two variables (
ξ and
ξ*) with positive values are defined to avoid unexpected outliers. Lagrange multipliers (
a,
a*) are added to solve the optimization problems.
After calculating the Lagrange multipliers, Equation (4) can be written as Equation (7) [
63].
where
is called the kernel function and is illustrated in Equation (8) [
64].
After these regulations, the main function of SVM is [
64]
Here,
and
b are the parameters of the SVM,
K is the kernel function,
N is the number of training data,
x is the independent vector, and
is the vectors used in the training process. The selection of an appropriate function is important in terms of obtaining more accurate results from the data to be used. In this study, the linear SVM function given in Equation (10) was determined [
64].
3.3. Whale Optimization Algorithm
WOA is a swarm-based metaheuristic algorithm inspired by the hunting behaviour of humpback whales by Mirjalili and Lewis [
65]. Humpback whales, which feed on small fish near the surface, hunt these creatures with the bubble net method shown in
Figure 4. With the water bubble method, humpback whales create a narrowing circle and spiral bubbles while rising to the water’s surface, and they collect their prey together and reduce the target [
65].
WOA is used in classification, image processing, network, and other engineering problems [
66]. In WOA, each solution is represented by a whale (agent), while the prey represents the assumed best solution to the problem. The algorithm starts with the number of whales determined by the user, and the positions of these whales are updated according to the position of the best whale found so far, and the search for the best solution to the problem continues until the number of iterations determined by the user is reached (
Figure 5). The algorithm is mathematically modelled in three parts, namely, prey wrapping, bubble net attack, and prey search [
67].
In this study, WOA is applied to forecast electricity demand. The recommended model applies the errors as a WOA objective function to measure its solutions in the training section. Firstly, WOA initializes the position vector and score of the leader and the search whales’ positions. Secondly, optimization returns the search whales that go out of the search space boundaries and compute each search whale’s objective function. The leader is updated if the current objective function is not at the desired value. This situation continues until the maximum iteration is reached. Ultimately, the best solution is obtained [
68]. The number of whales, location coordinates, and launch parameters were randomly generated. To make predictions with WOA, the number of search agents was determined as 40, and the Maximum Number of Iterations was selected as 1000.
3.3.1. Encircling Prey
Once the search algorithm has identified the search agent with the most optimal solution discovered thus far, the positions of the remaining search agents are adjusted based on that of the best search agent. Thus, the prey that represents the best solution is surrounded. This behaviour is mathematically expressed in Equations (11) and (12) [
67].
Here,
represents the direct distance between the food and the whale,
represents the best humpback whale position so far,
t represents the number of iterations available,
represents the whale position, while
and
are the specific coefficients shown below in Equations (13) and (14) [
67].
Additionally, denotes a random vector with values in the range [0, 1] and denotes a vector that decreases from 2 to 0 over the iterations.
3.3.2. Bubble-Net Attacking Method
During a bubble net attack, agents can perform either the constricting motion or the spiral motion around the prey with equal probability, as shown in
Figure 5. While the narrowing of the circle around the prey is provided by Equation (10), the spiral movement is provided by Equation (15).
Here,
shows the difference between the search agent and the best agent obtained so far [
67].
3.3.3. Search for Prey
During the bubble net attack stage, each whale updates its position towards the best whale, for which a linearly decreasing parameter (
A) is considered. Instead of relying on the best-known point, the update of new locations for prey search agents is based on a search agent selected at random. Thanks to this move, the algorithm avoids the risk of getting stuck with the best local solutions and performs a global search. Equations (16) and (17) express the hunting movement mathematically [
67].
In WOA, local search is provided by encircling the prey, and holistic search is provided by the act of searching for prey. The type of search is decided according to the value of the vector .
3.4. Error Metrics
In this paper, several statistical criteria were used to assess the accuracy of the MNN-SVM-WOA model’s predictions. These criteria included commonly used error metrics such as RMSE, MAE, MSE, MAPE, and Pearson’s correlation coefficient (R). The MAE and RMSE were used to evaluate the difference between the predicted and true values without considering positive or negative errors or their mutual counteraction. The MSE indicated the discrepancy between the actual and estimated values, while the MAPE measured the precision of the forecasting methods, particularly when diverse data sets were used. The goal is to achieve low RMSE, MAE, and MAPE data. Additionally, the R value represented the correlation between the estimated and actual data [
69].
R
2 is a statistical parameter used to determine the extent to which changes in the independent variable can explain changes in the dependent variable, and its value ranges from 0 to 1. If the R
2 value is close to 1, it indicates that the regression line fits well, implying that the changes in the dependent parameter are mostly due to changes in the independent variable. Equations (18)–(21) provide the formulas for R
2, RMSE, MSE, and MAE [
10,
11,
70,
71,
72,
73,
74].
Note:
denote the estimated data, real value, sample size, mean predicted data, and mean actual data, respectively [
11].
5. Discussion
Forecasting electricity demand is crucial for the efficient planning of capacity and the establishment of electricity networks with minimal expenses. To achieve accurate predictions, policymakers must assess various alternative methods and determine which one will yield the most advantageous results.
This paper utilized MNN, SVM, and WOA models to estimate the electricity demand in Turkey by implementing three distinct scenarios. The architecture of the MNN model comprises an input layer, a hidden layer, and an output layer. The input parameters of the three methods are classified into two primary categories: demographic and economic. The output layer produces the forecasted electricity demand. The model performance is evaluated using various indices, including RMSE, MSE, MAE, and R2. The training process and evaluation algorithm are presented and analyzed, along with the limits of the 95% confidence intervals. Four independent variables, which are export, population, GDP, and import, are identified as the possible electricity demand predictors from 1980 to 2019. Equations for the mainland were derived using data spanning from 1980 to 2019. These equations were subsequently utilized to estimate Turkey’s future electricity demand under various scenarios.
Electricity consumption between 1980 and 2019 is obtained from Turkish Electricity Transmission Corporation. The population data is collected from the TSI. The World Bank Open Dataset was utilized to obtain values for imports, GDP, and export. These variables were then subjected to stepwise regression to state which of them best predicted the dependent variable.
To determine the significance of the correlation coefficient, statistical techniques were employed, and the results were compared. The SVM, WOA, and MNN methods’ correlation coefficient ranges were established using a 95% confidence interval. Based on the statistical analysis, it was determined that the MNN method is more reliable and consistent than the others, with better results at a 95% confidence level. Nevertheless, all the proposed method frameworks demonstrate promise as models that can assist engineers and policymakers in developing energy-related budgets more effectively. The MNN method’s adaptability renders it suitable for identifying optimal solutions regarding forecasting future trends in electricity demand. Moreover, the high regression values obtained from the models demonstrate that MNN is an effective tool for electricity demand prediction. Specifically, the R regression values for the testing, training, and validation of datasets in the prediction of electrical energy consumption using MNN were 0.99661, 0.99903, and 0.99697, respectively. The overall R regression value was calculated as 0.99448, which indicates that MNN is highly reliable when estimating electricity consumption. In addition, the R2 values for MNN, SVM, and WOA in the prediction of electricity consumption were shown as 0.9984, 0.9978, and 0.9966, respectively. These results indicate that MNN is highly reliable when estimating electricity consumption. The RMSE, MSE, and MAE values for the MNN method are 5.325 × 10−14, 28.35 × 10−28, 2.5 × 10−14, respectively. These results show that the electrical energy estimation performance of MNN is successful.
The official findings closely matched the predicted results of MNN, which is crucial for the advancement of productive and practical electrification systems policy planning. These findings demonstrate that the proposed model can be employed effectively and actively for Turkey’s long-term electricity demand forecasting. The obtained results can also serve as a guide for future electricity system network designs. In addition, statistical methods were utilized to determine the confidence intervals of the algorithms used to estimate electricity demand, and the findings were compared. The proposed model’s performance was assessed using various metrics such as RMSE, MSE, MAE, R2, the independent and dependent variables correlation matrix across the different methods, and the multiple regression equations correlation. The metrics for measuring errors provided a clear indication of how accurate and precise the estimation techniques were.
Typically, a single hidden layer is utilized for developing the MNN architecture; however, the determination of MNN’s architecture, including hidden layers and neurons, is frequently based on trial and error in most articles. The accuracy of the recognition process and training speed is influenced by the number of neurons in the hidden layer. In future studies, the number of neurons in the hidden layer, as well as the number of intermediate layers, can be modified to assess their performances in forecasting electricity demand. The R2 and R2 adj values indicate that the proposed regression model fits very well. The T-statistic for the historical electricity demand is over 2, indicating that it is statistically significant. The electricity demand and past data both have significant p-values, indicating that they are valuable additions to the model as they serve as significant regressors.
The correlation matrix depicts the relationship between input values (import, population, GDP, and export) and output values (electricity consumption). A strong linear correlation was observed between electricity consumption and export (0.991). In addition, imports and export show a strong linear relationship (0.9895). Import, export, GDP, and population affect the real data positively and significantly. Their positive effect continues with the different dependent variables used interchangeably. The
p-values in
Table 4 are less than 0.001, and therefore there is a very high level of statistically significant difference. The equations in the second and ninth rows have the lowest
p-values.
The Levenberg–Marquardt technique is an effective learning approach that outperformed other learning methods. Though Levenberg–Marquardt is the quickest training method, it is only employed for small- and medium-sized networks. While WOA is a sufficiently efficient algorithm, it does have limitations in terms of exploration ability, slow solution generation, and susceptibility to becoming trapped in local solutions. In addition, the issue of inadequate convergence accuracy and speed is present in the original WOA algorithm. There are some general shortcomings in the optimization process, including a tendency to get stuck in local optima and a low level of convergence accuracy. Additionally, when the data exceeds a certain number, the prediction performance of SVM decreases. In future studies, hybrid and machine learning approaches will be incorporated, and various optimization techniques will be developed to assess the accuracy of the forecasting models.