1. Introduction
During the past decades, energy markets have experienced considerable changes. The energy liberalization in many countries dissolved a centralized integrated market and transformed it into a deregulated competitive one. Market participants now include energy generators, distribution companies, traders, and large consumers who interact bilateral or through various exchange trading power spots, like European Power Exchange and Nodal Exchange and PJM in the United States [
1,
2].
The aim of an electricity market is to reduce the electricity cost through this liberalization framework. From this point of view, electricity could be considered as a “commodity” like other “trading” products in market. However, this specific “commodity” is different from other financial services because of its unique characteristics. As electrical energy cannot be stored, a continuous stable condition between power generation and demand is always required. In addition, this energy “commodity” is a typical day-ahead market that does not permit for continuous trading like other financial markets. This is due to the fact that system operators need in advance information for checking the power schedule is in line with any transmission constraints. In a day-ahead market situation, trading agents on day (
d − 1) are required to submit their bids/offers for electricity supply for each hour of the next day (i.e., day
d) before a specific market closing time [
3]. The goal for an energy market is to balance both the electricity demand and the supply and thus determine eventually the market clearing price (MCP). MCP is considered as the price for the entire system, however in case of transmission congestion issues, a different factor, the locational marginal prices (LMP), has to be defined. The prediction of LMP factor is considered as very significant for market participants, however because of its level of complexity such forecasting is assumed to be more difficult compared to MCP factor [
4]. An independent system operator, who acts as a transmission service provider, controls the overall transmission network.
Electricity price forecasting (EPF) is thus an important financial-related process for any decision-making system in power sector. This financial characteristic has initiated substantial research contributions in energy forecasting. A large portion of these contributions is associated with electricity load forecasting and particularly short-term load forecasting (STLF) [
5]. However, MCPs/LMPs are considered to have more volatile characteristics than hourly loads and thus EPF prediction is more complex than the STLF. MCP/LMP is dependent on a number of features, such as historical hourly price and load data, system operating conditions (which include any possible predicted power shortfalls and generation outrages), weather conditions, time indices (including hours, week days, holidays, and multiple seasonality), bidding strategies, and operating reserves/imports [
6]. Even though electricity price curves retain similar changes with load curves in a transaction day, they depend on more uncertain factors. As a result, prediction error for the MCP/LMP is generally greater than the hourly load under similar conditions [
7]. The difficulty in MCP/LMP modelling is also influenced by another important factor. Because of the recent changes in power market, there is a growing interest for the use of renewable energy sources (RES). At the same time, market is interested in those technologies that provide flexibility and dispatch-ability to the power grid, in other words technologies to be able to balance changes in power supply/demand in cases when RES are not offered. Coal and gas-fired power plants are considered as flexible technologies, thus an additional important factor is the difference between the market electricity price and its cost of production. For gas-fired cases, this parameter is called spark spread, while for coal-fired assets, dark spread. The importance of spark spread forward curve is high to energy industry planners, as it helps them to address generation profits [
8].
EPF models generally can be classified based on their horizon duration, mainly in three groups, namely, short-term price forecasting (STPF), medium-term price forecasting (MTPF), and finally long-term price forecasting (LTPF) schemes. STPF approach includes next hour and day-ahead price prediction schemes and is important for a quick decision-making process, so that producers and consumers can schedule their bidding strategies. MTPF includes prediction for next week, next month, up to one year. The performance of this specific scheme is affected by seasonal effects, such as rise in electricity price in summer because of higher load consumption and decline in winter. MTPF information can be used by suppliers to optimize their production cost by planning an efficient resource allocation for generation of electricity. Finally, LTPF horizon varies from couple of year to decades. Such models are used by policy-makers to plan pricing schemes and management of resources, while investors utilize them for analyzing recovery of investment in power plant construction, production, type of energy sources, and transmission.
The importance of EPF as well as its complexity level, especially for the STPF case has motivated research, and several data-driven approaches have been proposed by researchers. Following the restructuring process in energy sector, time-varying autoregressive (AR) models have been explored in analyzing price time-series in Japan Electric Power Exchange spot market. The estimation of AR’s coefficients was performed via Kalman Filter algorithm [
9]. A seasonal auto regressive integrated moving average (ARIMA) model has been also used for the prediction of day-ahead electricity prices in Elspot market in Sweden, which is considered as one of the main energy trading “day-ahead” market in the world [
10]. Additionally, other time-series models, utilizing dynamic regression methodologies have been proposed for electricity price prediction for PJM market [
11]. An interesting feature of this approach is the adoption of a clustering scheme as a way to minimize the resulting model complexity. While an ARIMA model is suitable for forecasting applications, its main weakness is that it is not able to handle efficiently high levels of nonlinearity and volatility that exist in electricity price time-series. A combination of generalized autoregressive conditional heteroskedastic (GARCH) models and ARMA has been successfully demonstrated for predicting the electricity price in ISO New England market [
12]. Alternatively, the combination of ARIMA models and signal processing techniques, such as wavelets, has been explored for the Singapore electricity market [
13]. Although some promising results have been achieved using these linear techniques, they cannot model adequately the non-linear characteristics of the electricity price. In cases where high-frequency changes of the electricity price occur, the application of these methods could be considered as problematic [
14]. In recent years, alternative methodologies based on computational intelligent techniques, including neural networks (NN) have been investigated by researchers for EPF. Multilayer perceptrons (MLP) and radial basis function (RBF) NNs have been successfully used in forecasting the electricity price volatility in PJM market, the United States [
15].
Electricity price signal characteristics, such as non-stationarity and time variance, have been addressed with the aid of appropriate feature selection techniques [
16]. Similarly, MLP-based models have been explored to predict next-week prices in California as well as Spanish electricity markets [
17]. These NN approaches outperformed the ARIMA as well as the naïve methods used in all cases. This reveals the superiority of NN-based systems over methods which have been traditionally used in energy forecasting systems. An Elman recurrent NN has been also utilized to EPF for the mainland Spain market, using samples captured during winter and summer weeks, achieving a superior to MLP-based NNs performance [
18]. The further development of NNs has led to the evolution of deep learning (DL)-based systems. Long-short term memory (LSTM) and convolutional neural network (CNN) based models have been applied and compared against ARIMA and single hidden-layer MLP models for the day-ahead price market in Belgium. Results revealed a clear advantage of DL models over traditional approaches; however their performance against MLP networks was rather comparable [
19].
As an alternative methodology to NNs, support vector machines (SVMs) and probabilistic-based techniques have grown in popularity in recent years. SVMs are not prone to local minima problems like MLPs, as they provide a nonlinear mapping of the original data into a higher dimension space. This desired characteristic has been addressed in research papers associated to the electricity load and price area [
20]. Generally, NNs and SVM methods focus on determining the predicted electricity price for a specific given horizon. However, in some cases, such information is not enough to provide an efficient risk analysis. Thus, instead of predicting the future electricity price, there is an interest in estimating the uncertainty involved in that specific price, knowing the degree of spreading of the forecasted value and their probabilities [
21]. Such uncertainty in price is influenced by the level of demand, which in turn depends on the intensity of business activities and weather. With the incorporation of renewable energies into electric grid, the effect of uncertainty has been growing significantly, especially with the wind power generation, where in periods of high generation, the electricity price is likely to be lower [
22]. A methodology for probabilistic forecasting, where a Bayesian stochastic volatility model with double exponential distribution of spikes and exogenous variables for the PJM market has been recently explored [
23].
One of the fundamental problems for efficient system identification using MLP, which is a well-known NN, is the lack of neurons’ multi-dimensional “characteristics,” as the multi-dimensional hypersphere of the input domain for each neuron node cannot be defined though the use of sigmoid activation function. On the other hand, the use of bell-shaped activation function, like the Gaussian function, includes such “multi-dimensional” characteristics, as its center parameter could hold
n “dimensions” equal to the number of input variables. This is also explained by the local vs. global learning styles involved in different learning-based architectures. Fuzzy logic systems alternatively can offer such input space association; nevertheless, no learning ability is taking part for rules definition. Hybrid intelligent systems provide some modelling advantages and an adaptive-network-based fuzzy inference system (ANFIS) neurofuzzy (NF) model has been implemented for electricity price forecasting [
24]. Such models proved to have superior performance against related simulation studies that utilize ARIMA and MLP schemes [
14,
18]. Based on the existing research studies in STPF, it seems that the investigation of hybrid neurofuzzy (NF) models needs further consideration.
Obviously, the area of EPF is too complex and a number of parameters should be considered for an efficient forecasting process. These include three areas: Pre-processing analysis of data signals, feature selection of input variables and finally, the design of the prediction model. Pre-processing analysis involves consideration of any trend/seasonality/outliers (spikes), stationarity issues, and decomposition mode techniques [
25]. Feature selection is a rather complicated task, as the number and the type of input variables need to be carefully chosen. This includes historical values of prices as well as a number of exogenous factors such as electricity load, day type identification codes (i.e., weekdays/holidays) temperature, fuel prices. The implementation of an efficient prediction model and its associated issues (type of model—MIMO, MISO, recurrent) complete the whole design of the forecasting framework. The main objective of this paper is aiming at supplementing research in the area of NF systems and their application in hourly ahead EPF. In this specific research, as a proof of concept, we focus mainly on the prediction model and more precisely on the development of novel clustering-based NF modelling schemes and their application in the hourly step ahead EPF. The equally important issues of pre-processing analysis with the incorporation of decomposition mode techniques and the inclusion of additional exogenous parameters, in addition to electricity load, will be considered in a future work.
In general, two main forecasting approaches have been explored for the hourly ahead EPF. In the former case, one MIMO model is used as a 24-hourly prices predictor, while in the latter case a recurrent NN configuration has been investigated. However, the MIMO approach incorporates, by nature, a network structure with relative high complexity as it has to “accommodate” a large number of input variables to model adequately all 24-hourly next-step-ahead prices [
26,
27]. With this type of complexity, such MIMO model may result under-fitting forecasts for specific hours, while in the same time-release over-fitting forecasts for different hours. This problem is explained by the fact that it is a complicated process and a difficult task to simultaneously model all the dynamics of 24-hourly prices via a single MIMO scheme. Alternatively, although a recurrent model could result in a less complex structure, in practice the feedback error accumulation could deteriorate its performance. In order to avoid such intrinsic problems in recurrent NNs, one option is to configure the input vector of a recurrent NN with a nonlinear autoregressive exogenous structure (NARX); however the overall complexity of the network will be increased dramatically [
28].
Alternatively, a modular structure with 24 MISO models, where each hour is modelled via a dedicated modelling learning-based unit, has been adopted in this paper for EPF [
29]. As each hourly-model is modelled independently, its complexity, including the number of input variables is much reduced. The proposed scheme is different from existing approaches and in this paper, as a pilot study, we have implemented and demonstrated two MISO models of real data obtained from New England, USA system operator associated only to 22:00 h and 04:00 h which correspond to the hours where electricity prices have the maximum and minimum values respectively. The proposed novel NF scheme utilized as a one-step-ahead prediction model is the asymmetric-based Gaussian Fuzzy Inference Neural Network (AGFINN) architecture. This specific architecture has been designed by taking into consideration the following sub-objectives:
A NF model should have a small as possible structure, i.e., rules.
A NF model could be generic in terms of architecture, acting either as MISO or MIMO.
A NF model may need to have a different nonlinear membership function.
Clustering analysis could be utilized for the definition of the number of fuzzy rules as well the estimation of the initial membership function parameters in the premise/fuzzification part. In the proposed AGFINN design, this task has been assigned to the Fuzzy C-means (FCM) method. One of the most successful, in terms of applicability, NF architectures is the Takagi-Sugeno-Kang (TSK), with ANFIS scheme as its main representative [
30]. However, ANFIS’s performance is deteriorated in cases of large number of fuzzy rules. In the proposed AGFINN model, the main problem of “curse of dimensionality” is considerably reduced, as the numbers of membership functions (MF) for each input variable and fuzzy rules are equal. Additionally, AGFINN uses an asymmetric MF rather the standard symmetric Gaussian MF usually employed in NF systems. It has been considered from literature, that variability and flexibility features are higher in asymmetric Gaussian functions compared to the standard one [
31]. Thus, the main innovation in this proposed architecture is the development of a clustering-based non-symmetric based NF system. Although the concept of non-symmetric MF exists already in literature, it is much more complicated and requires more parameters to be tuned compared to the approach proposed in this paper [
32]. The RLS learning training algorithm utilized in ANFIS model has been also adopted here, as AGFINN shares with ANFIS also a TSK defuzzification layer. In this research study, AGFINN has been implemented acting either as an MISO or MIMO system. In the MISO configuration, a Takagi-Sugeno based defuzzification scheme has been adopted, while for the MIMO case, a “center average” (CA) scheme has been utilized as defuzzifier. The proposed models are compared against classic ARIMA models as well as adaptive fuzzy logic systems (AFLS) [
33], ANFIS, and MLP models in order to evaluate their forecasting performance. The comparison against alternative NF models is considered essential as these specific methodologies have proved to provide robust performances for EPF based on literature review.
3. The Short-Term Electricity Price Forecasting
The performance of the proposed AGFINN models are examined on the hourly day-ahead electricity market of New England, USA (ISONE—Independent System Operator of New England), commonly used as a benchmark case study in related literature [
34]. ISO New England (ISONE) (
http://www.iso-ne.com) is a not-for-profit independent corporation responsible for managing the daily operation of the power grid in New England, USA.
In order to improve the system’s reliability and to mitigate prices volatility, ISONE manages day-ahead and forward capacity markets by scheduling future supply and demand needs. ISONE is the power system regulator for the six-state region of New England and serves 7.1 million retail electricity customers in a population of 14.7 million. In total, ISONE has control over 350 electricity generators (coal, oil, nuclear power, natural gas, and renewable energy) within the region and maintains energy ties with neighboring regional grids in New York, Quebec, and New Brunswick. Existing power units have approximately 30,500 MWs of installed electricity generating capacity that covers the non-constant power demand in the region with high electricity peak of 28,000 MWs in the summer period, and smaller peak of 22,800 MW in the winter.
In this research, ISONE data from the period 2006–2007 was used for the development of the proposed prediction models. Initially, data were divided into 24 hourly time-series and for each hourly time-series the first 600 data were used for training purposes, while the remaining 123 ones for testing. Based on the proposed forecasting framework, each one of these 24 MISO models is scheduled to provide a next-day forecast for a specific hour. In this paper, only prediction results associated with the maximum (22:00 h) as well as the minimum (04:00 h) electricity price are shown. Similarly, the same process can be expanded for each hour. The specific New England price time-series used in this research are shown in
Figure 4 and
Figure 5 respectively. The specific choice is justified by the fact that the maximum price has the highest average value among remaining hours, while its plot of the raw data reveals nonlinear characteristics with arbitrary amplitude. On the other hand, minimum price reveals a rather smoother response, however with the presence of frequent spikes.
In this research study, as the primary aim is to explore the efficiency of AGFINN models against other learning-based approaches, analysis was performed only on raw data provided by ISONE. It has to be added that the presence of nonlinear characteristics in electricity price curves appears generally in any benchmark datasets and it is independent of any specific time period [
35]. Obviously, pre-processing analysis (incl. seasonality, trend, stationarity issues and decomposition mode methods) as well as selection of appropriate input variables will be considered in a future work, in order to provide a complete assessment of the applicability of neurofuzzy systems in this area. An efficient forecasting methodology in electricity prices depends heavily on the specific input parameters used in the system identification process. In general, internal historical price values and external factors such as temperature and/or electricity load as well as day-type (i.e., weekends, holidays) are considered as potential input variables. This problem therefore involves the consideration of appropriate feature selection techniques for choosing the suitable input variables. In this research study, as an initial pilot study, the well-known Boruta feature selection method has been employed to explore the level of importance of adopted input variables [
36]. Based on the existing literature, electricity load has been assumed as an important exogenous factor [
37]. Hence, in this research, only next day’s predicted electricity load together with historical electricity prices have been considered as potential input variables. Therefore, in order to investigate further the performance of AGFINN models, three different case studies have been implemented in this research. As it is very interesting to determine whether there is any significant difference between the performances of AGFINN models in these studies, the Diebold-Mariano test [
38] has been used to explore performances obtained from AGFINN models.
4. Results and Discussion
In order to evaluate the performance of AGFINN NF models, all studies were carried out based on the same New England electricity price datasets. For the evaluation of the forecasting performance of the all developed models, various error measures have been employed as assessment criteria. These include root mean square error (RMSE), absolute percentage error (APE), mean absolute percent error (MAPE), mean absolute error (MAE), standard error of prediction (SEP), and the coefficient of determination (R
2) [
37]. Relative measures are another type of evaluation indices, with Theil
U1 coefficient as one of the most used measure [
37]. The Theil coefficient is scale invariant and its range is between one and zero. If
U1 equals zero then a perfect fit is assumed. The main idea of utilizing such relative measures is to assess the performance of a forecast relative to a benchmark forecast [
39]. Measures may result in very big values because of either outliers or poor modelling, which in turn make the comparison of different forecasts not feasible or even not consistent. Relative measures could remove the bias introduced by possible trends, outliers or even seasonal components, provided that the benchmark forecast handles these issues appropriately.
Initially, an approach utilizing ARIMA models has been attempted. ARIMA scheme, a well-known forecasting method, consists of three components, the autoregressive (AR), integral, and the moving average (MA) part. AR explores the use of the time-delayed inputs to predict the future value. Integral component is related to differencing required to transform the data stationary, while MA is similar to AR, as it is used to predict the error using historical time-delayed errors [
40]. The fact that ARIMA models are able to predict the target variable only based on its historical data, make them very powerful. Especially, for short run forecast of hourly or daily data, they can provide with high level of accuracy small ups and downs. On the other hand, in cases where the target variable increased or decreased dramatically and unexpectedly, ARIMA may provide much less accurate predictions.
ARIMA model’s parameters are associated to the number of past (time-delayed) input variables incorporated in the model (i.e., p), to the number of times that inputs are differenced (i.e., d) and finally to the size of the moving average window (i.e., q). In this study ARIMA models have been created using the XLSTAT software (v.2019.2, Addinsoft) for both 4 h and 22 h cases. After some trials using classical time series ARIMA, two models with two input variables have been developed in both case studies. More specifically, for the 4-h case, an ARIMA(3,1,0) has been implemented while an ARIMA(4,1,0) was constructed for the 22 h respectively. Testing results are shown in
Figure 6, while the values for RMSE, SEP, and MAPE indices for the 4-h case were 5.79, 12.96%, and 10.90 respectively, while the corresponding values for 22 h were 8.67, 7.17%, and 5.45. Based on the graphical illustrations in
Figure 6, both curves clearly reveal to have different characteristics, with the 22-h case to retain a much smoother output compared to the 4-h case. However, as the main aim of this paper is to propose new NF architectures, existing NF schemes such as AFLS [
33] and ANFIS [
30] were implemented and tested under the same conditions. In addition, a comparison with an MLP NN has been explored, as this specific network has been widely used in the past alongside the classic ARIMA models. Our approach toward the MLP network however is different, as our MLP models were implemented with a two-hidden layer structure instead of one layer. The standard gradient descent algorithm (GD) has been adopted as the MLP’s learning method. It is well-known that historical information of electricity prices and load demand are considered as important inputs for forecasting the electricity price. In this research, the proposed forecasting architecture has been based on an hourly horizon plan. In all presented models, only historical prices and forecasted demands have been chosen as appropriate input variables, while three case studies based on different input selection schemes have been considered for all used learning-based methods. The choice of this input configuration, as showing following case studies, has been influenced from previous related studies with NF and NN models respectively [
37,
41]. A series of trials has been carried out in order to define AGFINN’s optimal configuration in terms of internal structure, number of clusters, etc.
4.1. Case Study 1
In this specific scenario, the forecasted price depends not only on the historical prices (days and hours), but also on the predicted electricity load for that targeted hour/day. Hence, the proposed modeling configuration for a specific hour (
i) and day (
j) is
where in
,
i is related to the specific hour, while
j to the specific day.
This specific scenario has been considered based on the assumption that of the price (i, j − 1) position as the main variable is known, while additional variables were selected from its direct “neighborhood.” More specifically, prices on the same hour (i) at two previous days and prices on the same day (j − 1) at two previous hours have been considered.
In this case study, additionally it is considered that the forecasted demand Load(i,j) is also available via a dedicated “load” forecasting model. In a complete energy forecasting framework, an additional model for the prediction of load demand is also required. Such “load” model will provide the predicted Load(i,j) information required by the “price” model. Obviously, the predicted load input variable will introduce an additional error to the overall error generated by the “price” model. However, as it is necessary to associate input Load(i,j) with output Price(i,j), it is of great importance that a high accurate “load” model needs to be developed as well. AGFINN models have been utilized in prediction of the maximum (22 h) and minimum (04 h) price, using this specific input configuration. Best results were produced by AGFINN-RLS models.
After trying various internal structures, the optimal one utilized 20 and 15 rules for the cases of 22 h and 04 h respectively. The combination of GD and RLS algorithms as learning methods resulted in a fast training process, which was concluded in less than 500 epochs. Performances for these two specific hours, based on testing datasets, are shown in
Figure 7 and
Figure 8 for all simulated forecasting models, while
Table 1 and
Table 2 summarize their statistical performances.
A close inspection of these curves reveals that AGFINN-RLS model fitting with historical price data managed to forecast the result very satisfactory.
Although the “observed price” curves in both figures have different dynamic characteristics, it is significant to indicate the existence of some “volatile” price spikes appeared in both cases. From the statistical tables, the clear advantages of AGFINN-based models against other models implemented for this specific problem are clearly shown. Case Study 1 can be used as a benchmark for all implemented models for the next two case studies, where alternative input variable schemes will be presented. In fact, AGFINN-CA’s performance could be considered to be close to AFLS’s one, while the superior performance of TSK-based model is due to their specific defuzzification scheme. Although AGFINN models performed very satisfactory in both cases (22 h and 4 h), they need to be compared against other NF and neural network models, in order to extract more useful conclusions.
One such competitor is the adaptive fuzzy logic system (AFLS) architecture. AFLS shares like AGFINN the CA defuzzifier output as well as the AGFINN’s premise part, without however the pre-processing clustering stage. A classic Gaussian function has been used in its MFs. After trying different number of MFs/Rules and the GD method as learning scheme, eventually 25 and 20 fuzzy rules were chosen as the optimal ones for the cases of 22 h and 04 h respectively. Additionally, MLP and ANFIS models have been developed for this case study. Both structures have a great range of applicability, thus such comparison is considered as a benchmark. After a few trials, an MLP neural network with 20 and 8 nodes employed at the first and second hidden layer respectively while the adopted ANFIS NF scheme utilized 32 rules. The proposed AGFINN architecture has certain structural advantages over the ANFIS model, as the number of its MFs in fuzzification layer is equal to the numbers of rules. The increased number of MFs increases the localization of the input signal while simultaneously retains low the number of fuzzy rules. The main problem with ANFIS design is the excessive number of fuzzy rules which is a result of combination of input variables/number of MFs. AGFINN-RLS scheme shares the same hybrid learning algorithm as in the case of ANFIS.
4.2. Case Study 2
It has been investigated through research literature, that there is high level correlation of the current hourly electricity price with related prices at two and seven days before respectively [
41].
This issue implies the existence of a daily and weekly periodicity. Hence, the aim in this case study is to explore this hypothesis.
No external variables (i.e., load) have been considered as potential input variables. The proposed modeling configuration for a specific hour (
i) and day (
j) is
In this proposed case study, the number of input variables was increased to six, as weekly information of the electricity price for that specific hour was also considered. Performances for these two specific hours, based on testing datasets, are shown in
Figure 9 and
Figure 10 for all simulated forecasting models, while
Table 3 and
Table 4 summarize their statistical performances.
Best results were produced again by AGFINN-RLS models. From both forecasting graphs and statistical performances, an improvement in results accuracy, compared to case study 1, has been noted. The proposed hypothesis in this case study, based on the produced improved results, was verified, thus the characteristics of periodicity could be included for case study 3.
The optimal structure for AGFINN-RLS included 20 fuzzy rules for both cases. As it is illustrated from these statistical tables, AFLS models utilizing 20 and 25 rules for 4 h and 22 h respectively, achieved a satisfactory performance, although inferior to AGFINN-CA model. Both AFLS and AGFINN-CA shared the same type of defuzzification scheme. Both MLP and ANFIS forecasting results, in this case study, were problematic as they revealed a number of unexpected “spikes” and 4 h and 22 h respectively. The MLP retained the same network configuration as in case study 1, while in the case of ANFIS, a larger in size network with 64 fuzzy rules has been created, increasing thus the computational cost.
4.3. Case Study 3
The objective of this last case study was to expand the case study 2 by incorporating external input variables such as predicted electricity load from case study 1. It was considered as very important to evaluate the combined effect of exogenous input together with the weekly periodicity characteristics of the price. The proposed modeling configuration for a specific hour (
i) and day (
j) is
In this proposed case study, the number of input variables was increased to seven because of the inclusion of the forecasted value of electricity load.
Performances for these two specific hours, based on the testing datasets, are shown in
Figure 11 and
Figure 12 for all simulated forecasting models, while
Table 5 and
Table 6 summarize their statistical performances.
As it is illustrated from
Figure 11, AGFINN and AFLS models managed to capture the dynamics (i.e., trend) at 4:00 h with different levels of accuracy. For this experiment, AGFINN-RLS utilized 20 rules, while the optimal number of rules for AFLS was 25. On the other hand, forecasting via ANFIS model resulted in unexpected spikes. This issue is due to the complexity of the specific ANFIS model. With seven input variables and two MFs per input, a massive number of 128 rules required for training purposes.
ANFIS’s performance was significantly improved for the case of 22 h, as shown in
Figure 12. MLP’s performance was deteriorated however for this case. AGFINN models in both experiments outperformed “rival” models, as shown also from the related statistical tables. Although, case study 3 proved to be the most accurate compared to the previous cases, even in this case, there were variations in performance between the two AGFINN-based models.
All models utilize FCM clustering units, as well as asymmetric functions as MFs. The difference in defuzzification part is mainly responsible for such variations. It was verified that TSK-based models are more accurate/suitable for forecasting applications compared to CA-based systems. Although this specific case proved to provide superior results compared to the other cases, it is also important to assess the performance of all these learning-based methods against the ARIMA regression scheme.
Results revealed that for both electricity price datasets, ARIMA models performed quite satisfactorily and outperformed the performance of MLP neural networks. Such results are comparable with existing studies in literature, and can be explained by the relatively smooth (without any abrupt spikes) characteristics of both hourly datasets.
Overall, the electricity price forecasting problem is a difficult task that involves the implementation of many related subtasks, such as pre-processing analysis, efficient input variables selection and accurate “price” forecasting models either for one or multi steps-ahead predictions. In addition, the implementation of an accurate “load/demand” prediction model would benefit the forecasting performance of the “price” model. In this research study, the performance of AGFINN model against existing learning-based models, justified the concept of its development. However, there are a number of open questions related to the problem of accurate electricity price forecasting. Prediction models need to be incorporated in the whole framework of analysis, which means that issues of pre-processing and input variables selection need to be addressed. Although these issues will be considered in a future research, at this stage it is important to assess the performances of RLS-based AGFINN models as they have been applied in these three case studies. Practically, model 1 included information related to the predicted load, while model 2 included information of the previous weekly electricity price. Model 3 incorporated both of them.
Figure 13 illustrate the performances of RLS-based AGFINN models for these three case studies for the case of 22 h.
Table 1,
Table 3 and
Table 5 summarize their performances based on established indices, however two issues need to be discussed: How different are these three RLS based models and which of the chosen input variables are more important.
To test the statistical significance of the results shown in
Figure 13, the Diebold-Mariano (DM) test, which is the state-of-the-art method to evaluate significance was implemented using Excel. Furthermore, the DM test is used to show the difference of these three RLS-based AGFINN models through the calculation of the related
p-values for the DM test. The main objective is to explore whether this difference shown is significant (for predictive purposes) or simply because of the specific choice of data values in the sample [
42]. By comparing model 1 and model 2, the
p-value was equal to 0.36 > 0.05, thus no significant difference exists between the two forecasts. The comparison of model 2 and model 3 resulted in a
p-value equal to 0.0915, which was slightly above the 0.05 threshold. However, the comparison of model 1 with model 3 resulted in a
p-value equal to 0.012 which was less than the 0.05 threshold, indicating thus a significant difference. In fact, the difference between these two models was the incorporation of the past weekly price information in model 3, which is an indication that this variable could be considered as an important factor.
Therefore, in order to identify the level of importance of those seven input variables employed in the case study 3, the Boruta method has been utilized [
43,
44]. Boruta is an ensemble-based feature selection algorithm that utilizes random forest (RF) principles with additional mechanisms for finding better results. This technique follows an all-relevant feature selection process where it captures all features which are in some conditions related to the desired output. In contrast to Boruta, other “classic” feature selections algorithms follow a minimal optimal process where they depend on a small set of features which results in a minimal error on a selected predictive model. This specific feature selection technique has been built in an R-language environment (v.3.6.2), and results are illustrated through a graphical boxplot graph. Red and green boxplots represent Boruta output-scores of rejected and confirmed features respectively, while blue boxplots correspond to minimal, average, and maximum output-score of a shadow feature.
Dataset corresponding to prices at 04:00 h and 22:00 h have been processed through this mechanism, and results are shown in
Figure 14 and
Figure 15.
A common characteristic in both figures is that although all chosen input variables were considered as appropriate ones, the load factor is considered as the one with the highest level of importance. This is another justification of the close relationship between load and price. Obviously, the current electricity price is considered an important factor (i.e., 2nd factor in both boxplots) however the 7th past price variable has been clearly defined as the 3rd most important factor. This indicates that the hypothesis of the second case study was correct.