You are currently viewing a new version of our website. To view the old version click .
Forecasting
  • Article
  • Open Access

19 July 2023

A Hybrid Model for Multi-Day-Ahead Electricity Price Forecasting considering Price Spikes

,
,
and
1
Arcus Power, Calgary, AB T2P 3C5, Canada
2
Department of Electrical and Software Engineering, University of Calgary, Calgary, AB T2N 1N4, Canada
*
Author to whom correspondence should be addressed.
This article belongs to the Collection Energy Forecasting

Abstract

This paper proposes a new hybrid model to forecast electricity market prices up to four days ahead. The components of the proposed model are combined in two dimensions. First, on the “vertical” dimension, long short-term memory (LSTM) neural networks and extreme gradient boosting (XGBoost) models are stacked up to produce supplementary price forecasts. The final forecasts are then picked depending on how the predictions compare to a price spike threshold. On the “horizontal” dimension, five models are designed to extend the forecasting horizon to four days. This is an important requirement to make forecasts useful for market participants who trade energy and ancillary services multiple days ahead. The horizontally cascaded models take advantage of the availability of specific public data for each forecasting horizon. To enhance the forecasting capability of the model in dealing with price spikes, we deploy a previously unexplored input in the proposed methodology. That is, to use the recent variations in the output power of thermal units as an indicator of unplanned outages or shift in the supply stack. The proposed method is tested using data from Alberta’s electricity market, which is known for its volatility and price spikes. An economic application of the developed forecasting model is also carried out to demonstrate how several market players in the Alberta electricity market can benefit from the proposed multi-day ahead price forecasting model. The numerical results demonstrate that the proposed methodology is effective in enhancing forecasting accuracy and price spike detection.

1. Introduction

Spot prices in competitive electricity markets exhibit seasonality, volatility, and price spike occurrence [1,2,3]. The occurrence of price spikes is associated with different system events, for instance, the scheduled and forced outage of generation and transmission assets, transmission congestion, and extreme weather events leading to increased demand or shortage of supply [2,4,5]. Accurately forecasting price spikes holds paramount importance [6] due to their potential to inflict significant financial losses upon unsuspecting business owners. Moreover, traditional forecasting models encounter significant challenges in effectively accounting for and managing these volatile market conditions [7]. Even though price volatility can be observed in all power markets, real-time markets are more prone to the price risks associated with such unforeseen spikes [1]. Higher price volatility in real-time electricity markets adds to the difficulty of producing reasonably accurate electricity price forecasts [8], while forecasting errors for day-ahead markets are often reported to be single-digit, comparable studies often report significantly higher errors for real-time markets [9,10].
There are plenty of studies in the literature that have undertaken the challenge of forecasting electricity market prices. Between a very recent review [11] and one that was published in 2014 [4], the interested reader can find a comprehensive summary of applied methods, best practices, and tips and recommendations for future research in this domain. There are, however, a few observations that are worth noting in the context of the present work. First, the vast majority of available methods are applied to day-ahead market prices. This is understandable because most electricity markets have a day-ahead market component in which most of the energy is traded [3,12,13]. There are a few jurisdictions with only a real-time market mechanism, including Alberta and Ontario provinces in Canada, Australia, and New Zealand. Second, almost all available studies have limited their forecasting horizon to the next day, with a few exceptions, e.g., [14,15], that look beyond the next day. Longer forecast horizons are useful, and sometimes necessary, for optimizing operation plans and portfolio management for market participants. For example, in Alberta, participants in the ancillary services market rely on energy price forecasts generated before 1:00 p.m. on Fridays for the following Monday to optimize their portfolio [16]. Third, only a limited number of papers have taken price spikes into account when designing their models or when evaluating the performance of their proposed models. For example, ref. [5] used an auto-regressive conditional hazard model to predict the probability of one-step-ahead price spikes in the Australian electricity market. In [8], system demand forecasts along with weather information were used in a hybrid neural-network-based model to forecast next-day normal and spike prices for Ontario’s electricity market. In [17], other exogenous variables, such as reserve capacity, variable generation, and interconnection flows, were examined to predict the probability of extremely high or low prices for the next day in the Australian market.
In this paper, we propose a hybrid model to predict the real-time electricity market prices in Alberta for up to four days in advance. The model is designed particularly to handle the extended forecasting horizon and the high volatility of prices in this market caused by frequent price spikes. This is achieved through two main contributions. The first contribution is that the proposed hybrid model combines multiple models at two dimensions. More specifically, at a “vertical” dimension, we stack up two forecasting models, namely, long short-term memory (LSTM) [18] deep neural network and extreme gradient boosting, also called XGBoost [19]. LSTM networks have proven effective in time series forecasting with nonlinear long-term patterns [20]. However, neural networks are generally prone to over-fitting and are computationally taxing. Particularly, when deployed on cloud platforms, frequent training of deep learning models leads to high computation fees [21]. XGBoost, on the other hand, is a scalable machine learning method with comparably better training speed and resilience against over-fitting. In addition, at a “horizontal” dimension, we cascade five models, each with different inputs to predict prices along the 96 h forecasting horizon. Each model takes advantage of available features, depending on its forecasting horizon. Observe that this approach can be considered as an ensemble model. Ensemble forecasting, in general, enhances forecast accuracy by leveraging the strengths of multiple models [22]. However, we not only combine the predictions of the LSTM and XGBoost models, we also concatenate the outputs of multiple models to generate the final 96-hour-ahead forecast string. This approach distinguishes our method as a hybrid model rather than a conventional ensemble model. The second contribution is to identify and successfully deploy a new input that has not previously been explored in the context of enhancing price spike forecasting. In particular, we identify the real-time ramping of large baseload units within the system as effective inputs for predicting price spike events.
The rest of this paper is organized as follows: Section 2 presents a literature review on the analyzed market, the algorithms used in the proposed electricity price forecasting model, and the actual state of research in hybrid models for electricity price forecasting. The methodology of the proposed electricity price forecasting system and research contributions are described in Section 3. Section 4 discusses the results in terms of forecast accuracy and price spike detection. In Section 5, we conduct an economic application of the proposed electricity price forecasting system in Alberta’s ancillary services market. Finally, Section 6 summarizes the findings of the proposed research work.

3. Methodology

Let us denote the hourly market price at time, t, with p t . Furthermore, we refer to the feature space as X = { x j , j = 1 , , N } , where x j is the j t h feature. The proposed method is composed of five horizontally cascaded forecasting models, i.e., E i , { i = 1 , , 5 } . Each model uses a subset of X , referred to here as Φ i . Furthermore, each Φ i contains a set of observed and predicted inputs that are selected depending on their availability and usefulness. E i s are only different in their corresponding set of features and the associated forecasting horizons. More specifically, E 1 to E 5 are used to generate 1-hour-ahead, 2- to 3-hour-ahead, 4-hour-ahead, 5- to 24-hour-ahead, and 25- to 96-hour-ahead price forecasts, respectively. Let us refer to the final k i -step-ahead price forecasts generated by model E i using the feature set, Φ i , as { p ^ t + k i | Φ i } ; thus, k i can be specified as k 1 = 1 , k 2 { 2 , 3 } , k 3 = 4 , k 4 { 5 , , 24 } , and k 5 { 25 , , 96 } . This horizontal cascading of the five models is shown in Figure 3. Observe from Figure 3 that a set of 96 price forecasts are generated at each time step, i.e., hourly, at the forecasting origin, t.
Figure 3. The horizontal dimension of the forecasting system. The width of each model represents how many forecasts are generated at each forecasting horizon. Moreover, each E i uses a different input set of features, Φ i .
Each model, E i , is composed of two predictors, i.e., an LSTM and an XGBoost. The predictors are “vertically” stacked up and independently produce what we call preliminary price forecasts for each of the associated forecasting horizons. The final price forecasts, i.e., p ^ t + k i , are defined accordingly as follows:
p ^ t + k i = p ˜ t + k i ( X G B ) if p ˜ t + k i ( L S T M ) < T 1 p ˜ t + k i ( L S T M ) if T 1 p ˜ t + k i ( L S T M ) < T 2 p ˜ t + k i ( X G B ) if p ˜ t + k i ( L S T M ) T 2
In (12), we refer to the preliminary forecasts produced by the two predictors for the k step ahead as p ˜ t + k i ( L S T M ) and p ˜ t + k i ( X G B ) . The preliminary price forecasts generated by the LSTM models are compared against two price spike thresholds, T 1 and T 2 , where T 1 < T 2 . Essentially, the final forecasts are picked from either the LSTM model or the XGBoost models, depending on if they are normal, i.e., p ˜ t + k i ( L S T M ) < T 1 ; a spike, i.e., T 1 p ˜ t + k i ( L S T M ) < T 2 ; or a super-spike, i.e., p ˜ t + k i ( L S T M ) T 2 . The observation is that the LSTM model performed better in predicting the trends and the sequences of prices, as expected. However, the XGBoost models outperformed the LSTM models for both low and very high prices. In this paper, finding the exact values of the two thresholds is conducted by examining the models’ performance on the training data sets with some trial and error. However, determining the thresholds and even the weight of combining the two forecasts could be automated and optimized, which is left for future work. Figure 4 presents how the LSTM models and XGBoost models are integrated in this methodology and how they contribute to obtain the final price forecasts, p ^ t + k i , for each E i model along each forecasting horizon, k i (i.e., the horizontal dimension). Here, a different input set, ϕ i , is used (i.e., the vertical dimension) depending on k i .
Figure 4. Integration of XGBoost and LSTM models in the proposed methodology to obtain final price forecasts, p ^ t + k i , for each E i model that deploys a different input set of features, Φ i .
A key factor to enhance the performance of the proposed model is to identify as many publicly input features that are available for a given forecast horizon. Here, we describe the features that we identified for Alberta’s market. Electricity price time series show significant autocorrelation, and the literature is accordingly rich on how to choose the most informative lags using tools such as autocorrelation functions [48,56]. We define the subset feature, x 1 = p t l , to include the autocorrelated lagged values of the prices [56,57] up to 168 previous hours. The AESO [27] publishes hourly load forecasts [9,11,58], wind energy production forecasts [11], and the total generation availability forecasts [20] for each technology for up to seven days ahead; we refer to these features as x 2 , x 3 , and x 4 , respectively. On the other hand, the AESO publishes the forecast imports/exports of energy into and from Alberta for the next 48 h. This feature is referred to here by x 5 . Although they do not use forecasts of the imports/exports, some works have used historical values of this variable (for example, [37,59,60]).
We propose to employ a new feature for price forecasting in this paper. Thermal generators often play a key role in electricity markets where they make up a significant portion of the supply side. This is the case, for example, in Alberta where coal and gas units make up 75% [26] of the total installed capacity (end of 2021). A sudden change in the output of the thermal units could indicate a change in the market supply curve and could help predict sharp changes in the prices in the short term. Thus, we propose to use the slope of the output of the online thermal generation capacity as a new feature, and we refer to it as x 6 . The slope is defined as the change in the output power over the past 10 min period. Moreover, the AESO also publishes pool price forecasts [61] for the next three hours. We, hence, define this feature subset as x 7 . Additionally, the latest system marginal price (SMP) values are found to be a useful input [61] in price forecasting models. The modeler may choose to feed its model with one or more of the latest available SMP values before the model is run. This feature is referred to as x 8 here.
Based on the availability of the identified features, the following feature subsets, Φ i , are proposed:
Φ 1 = x 1 , x 2 , x 3 , x 4 , x 5 , x 6 , x 7 , x 8
Φ 2 = x 1 , x 2 , x 3 , x 4 , x 5 , x 6 , x 7
Φ 3 = { x 1 , x 2 , x 3 , x 4 , x 5 , x 6 }
Φ 4 = { x 1 , x 2 , x 3 , x 4 , x 5 }
Φ 5 = { x 1 , x 2 , x 3 , x 4 }
In (13a)–(13b), Φ i corresponds to a multiple input to each E i , and depending on the forecasting horizon, the output of E i is single (i.e., k = 1 , 4 -hour-ahead) or multiple (i.e., k = 2–3-, 5–24-, or 25–96-hour-ahead) point forecast(s). E 1 takes advantage of all the information that we have, including the latest SMPs, the slope of the thermal generation, and the AESO price forecasts. On the other hand, we discovered that including the latest SMP does not improve model accuracy; thus, x 8 is excluded from E 2 . AESO price forecasts are not available for a 4-hour-ahead forecasting horizon; thus, the model for this horizon does not include x 7 . However, it is often observed in Alberta that price spikes last longer than one hour; thus, the slope of thermal generators, i.e., x 6 is included in the input set for E 3 . When it comes to import/export forecasts, their accuracy often drops for the second day. Thus, their forecasts are only considered for the next 24 h. This is the main reason for creating model E 4 , to focus on 5 to 24 hours ahead, where x 5 is included in the input set of this model. The last model, i.e., E 5 , benefits from the least available information, simply because not all of the identified features are available or useful for a long horizon for up to 96 h.

4. Numerical Results

In this section, we present the numerical results of the proposed hybrid forecasting system. Furthermore, we show the convenience of using the slope of the online thermal generators’ output power as a feature, i.e., x 6 . Alberta’s electricity market dataset comprises three years of data, ranging from January 2017 to December 2019. The training set considers data from January 2017 to July 2018; likewise, a validation set considers data from August 2018 to December 2018. The rest of the data, i.e., January 2019 to December 2019, is used for testing. In general, we use two years of data for training and validation, and one year for testing [11]. Moreover, k i -step-ahead hourly price forecasts are generated following a forward rolling window. This is a common practice in electricity price forecasting [11,62]. However, in our case, the computational burden makes it challenging for the model to be recalibrated daily; thus, monthly recalibration results in a better choice.
The input data from where the LSTM [18] and XGBoost [19] models are estimated is scaled between [ 1 , 1 ] [20]. The inverse of this transformation is applied to the price forecasts at the error calculation stage [20]. Furthermore, hyperparameter tuning for each E i is conducted via grid search and evaluated on the validation set [11]. Likewise, due to its simple implementation, computational efficiency, and small memory requirements, the Adam algorithm [63] is selected to optimize the LSTM models. The corresponding hyperparameter search spaces for the LSTM and XGBoost models are presented in Table 2 and Table 3, respectively.
Table 2. Hyperparameter search space for the LSTM models.
Table 3. Hyperparameter search space for the XGBoost models.
The grid search algorithm systematically explores all possible combinations of hyperparameter values from Table 2 and Table 3. For each combination of hyperparameters, the model is trained on the training set, and its performance is evaluated using the mean squared error as the chosen metric on the validation set. The hyperparameter combination that yields the best result, based on the chosen evaluation metric, is selected. Finally, once the best hyperparameters are identified, the model is retrained using the training dataset, incorporating the optimized hyperparameter values.
The forecasting accuracy evaluation is conducted using two performance metrics, i.e., the root-mean-square error (RMSE) and the mean absolute error (MAE). Both can be expressed as follows:
MAE = 1 N h e = 1 N h p m e p ^ m e
RMSE = 1 N h e = 1 N h p m e p ^ m e 2
In (14a) and (14b), m { 1 , 2 , 3 , , 96 } , N h is the number of hours in the evaluation period, i.e., the test period. Likewise, p m e and p ^ m e are the actual and predicted pool prices, respectively. Finally, the fixed price thresholds [5,53] presented in (12) correspond to T 1 C A D 150 /MWh and T 2 C A D 400 /MWh.

4.1. Electricity Price Forecasting with the Proposed Model

The proposed multi-horizon price forecasting system generates forecasts using a different set of regressors, E i (i.e., the vertical dimension), throughout the forecasting horizons (i.e., the horizontal dimension) k 1 = 1 , k 2 { 2 , 3 } , k 3 = 4 , k 4 { 5 , , 24 } , and k 5 { 25 , , 96 } . Furthermore, each E i uses a different input set, Φ i , depending on the market data availability at the forecasting origin (see Section 3).
Figure 5 displays a comparison of the RMSE and MAE between the proposed forecasting system and a set of benchmark models. These benchmark models consist of a decision tree regressor (DTR), an LSTM, and an XGBoost. All models have the same inputs and training periods. Observe that the proposed hybrid forecasting system consistently outperforms the benchmark models along all the forecasting horizons, i.e., k 1 = 1 hour ahead, k 2 { 2 , 3 } hours ahead, k 3 = 4 hours ahead, k 4 { 5 , , 24 } hours ahead, and k 5 { 25 , , 96 } hours ahead. As expected, shorter forecasting horizons present the lowest error rates. Observe, for example, the RMSE and MAE for k 1 = 1 hour ahead, k 2 { 2 , 3 } hours ahead, and k 3 = 4 hours ahead are somewhere around CAD 38-65/MWh and CAD 10-24/MWh, respectively. The reason is because these forecasts contain the most recent market information (i.e., close to real time) from x 6 , x 7 , and x 8 (see Section 3).
Figure 5. Performance comparison between the proposed model (dashed line) and a set of benchmark models (dotted lines) for the testing period. (Top): root-mean-square error (RMSE). (Bottom): mean absolute error (MAE).
Beyond k 3 = 4 hours ahead, i.e., from k 4 { 5 , , 24 } hours ahead up to k 5 { 25 , , 96 } hours ahead, both the RMSE and MAE increase by approximately CAD 15/MWh and CAD 10/MWh, respectively. Observe that once they reach somewhere around CAD 80/MWh (RMSE) and CAD 35/MWh (MAE), both error metrics are kept almost constant. This demonstrates the capabilities of the proposed hybrid forecasting system to forecast long sequences of data without considerably decreasing its performance, i.e., significantly increasing the error.
Likewise, we demonstrate the usefulness of deploying different forecasting regressors following the proposed methodology. To do so, we compare the proposed hybrid forecasting system against a single LSTM forecaster, i.e., a single regressor that generates pool price predictions p ^ t + k { 1 96 } ( L S T M ) . Observe that these predictions are made for all the forecasting horizons, from k 1 = 1 hour ahead up to k 96 = 96 hours ahead. The single forecaster is trained, validated, and tested using all input variables. Figure 6 shows the 96-hour-ahead forecasts made by the proposed forecasting system and the single forecaster for an arbitrary test period. Observe that the proposed model could effectively predict more price spikes (i.e., p t > T 1 ) than the single forecaster model. For example, the group of spikes between August 14 and 15 in Figure 6 is better captured by the proposed multi-horizon forecasting system than from the single forecaster. Similarly, this can be observed on 16 August 2019.
Figure 6. Forecast comparison between the proposed model and a single forecaster for the 96-hour-ahead forecasting horizon between 8 August 2019 and 22 August 2019.
Based on the results shown in Figure 6, we estimate and compare the average RMSE and MAE along all k i s for the proposed forecasting model and a single forecaster. The average RMSE for the proposed forecasting model is CAD 66.41/MWh, compared to CAD 70.81/MWh for the single forecaster. Likewise, the average MAE for the proposed forecasting model is CAD 20.06/MWh, compared to CAD 21.78/MWh for the single forecaster. Overall, the proposed hybrid cascading forecasting model enhances the accuracy of the price spike predictions.

4.2. Impact of the Slope of Power Production of Thermal Generators on the Performance of the Developed Forecasting System

The slope of power production of thermal generators during the initial minutes of the first predicted hour is a novel feature (i.e., x 6 ) of the proposed forecasting system. For example, coal generators usually offer blocks of power at low prices; thus, a reduction in output power could be indicative of price spikes in Alberta’s electricity market. Furthermore, thermal generators (e.g., coal generators) have a steady ramp-up and ramp-down curve; therefore, their power production slope can help to determine if they are going to be available for the next two or three hours.
Figure 7 (left) shows an example case where three price spikes occurred in Alberta’s market on 8 August 2019. Each price spike is shown with a value of CAD 762.83/MWh, CAD 507.4/MWh, and CAD 801.68/MWh. Furthermore, their occurrence is within the on-peak hours window, i.e., between 2:00 p.m. and 5:00 p.m. The main cause of these price spikes is associated with a reduction in thermal generation availability. Observe in Figure 7 (left) how the thermal units Thermal Gen 1 and Thermal Gen 2 reduced their power production considerably after midday. Thermal Gen 1 goes completely offline at 2:15 p.m., and Thermal Gen 2 reduces its production around 2:00 p.m., 3:00 p.m., and, again, at 4:00 p.m. during that day.
Figure 7. The arbitrarily selected period of 8 August 2019. (Left): Pool price dynamic reaction to changes in thermal generation. (Right): One-hour-ahead forecast (i.e., p ^ t k + 1 ) with and without using x 6 as a feature.
For the same period shown in Figure 7 (left), the dispatched energy of the economic merit order table from the AESO [27] indicates that high-price power blocks of other thermal generation units set high spot prices during these hours. To demonstrate the contribution that the slope of power production of thermal generators (i.e., x 6 ) has on the model’s capabilities to better predict price spikes, Figure 7 (right) compares the one-hour-ahead forecasts (i.e., p ^ t + 1 ) produced by E 1 for the same period on 8 August 2019. As an example, we present the RMSE and MAE for this specific period. When using x 6 , the RMSE and MAE are CAD 140.63/MWh and CAD 46.74/MWh, respectively. Likewise, if x 6 is not used, the RMSE and MAE are CAD 197.79/MWh and CAD 75.09/MWh, respectively. This example shows the impact of x 6 on the proposed model’s performance. In other words, using x 6 enhances the model’s ability to forecast price spikes.

4.3. Price Spike Predictions Accuracy Assessment

As previously emphasized, the proposed forecasting model is designed to enhance the prediction of price spikes. Thus, we analyze its performance to detect price spikes for the one-hour-ahead and four-hour-ahead forecasting horizons. The selection of these forecasting horizons is because, in Alberta’s market, different market participants benefit from these forecasts (see Section 1). For example, large load consumers benefit from the one-hour-ahead forecasts to decide if they need to reduce their demand. Likewise, generators benefit from the four-hour-ahead price forecasts to decide if they need to modify their bidding strategies. To conduct the analysis, we use the spike prediction confidence (SPC) and spike prediction accuracy (SPA) metrics [60]:
SPC ( % ) = N c o r r N a s
SPA ( % ) = N c o r r N s p
In (15) and (16), N c o r r corresponds to the number of correctly predicted price spikes. Similarly, N a s is the total number of predicted price spikes, and N s p is the actual number of price spikes. Moreover, prices greater than or equal to CAD 150/MWh (see definition of T 1 in Section 4) are used to decide whether a price is considered a spike, normal, or otherwise. The SPA and SPC metrics are computed for each month [61] in the test period, i.e., the year 2019. In each case, the final spike detection performance of the models is assessed with the average over this period. The higher the values of both SPA and SPC, the better the performance of the model.
Table 4 shows the results for the SPC corresponding to the one-hour-ahead forecasts. Observe that the proposed model has the highest SPC compared to the benchmark models. In other words, the proposed model outperforms the benchmark models in not labeling normal prices as price spikes. Similarly, Table 5 shows the performance of the proposed model and the benchmark models on the SPA metric for the one-hour-ahead forecasts. Here, the proposed model has the highest SPA compared to the benchmark models. Consequently, the proposed model is better at detecting price spikes compared to the benchmark models.
Table 4. SPCs (%) for one-hour-ahead price forecasts for the different models.
Table 5. SPAs (%) for one-hour-ahead price forecasts for the different models.
Likewise, Table 6 and Table 7 show the four-hour-ahead SPC and SPA of the proposed model and the benchmark models. As expected, the performance of all the models in terms of the four-hour-ahead SPC and SPA is lower than the performance shown for the one-hour-ahead. The reason for this is that as we move forward to further forecasting horizons, the spike detection capability of the models tends to decrease. Moreover, we can observe from Table 6 and Table 7 that the proposed model has again the highest SPC and SPA when compared to the benchmark models. For the SPC, the proposed model is better than the benchmark models in not labeling normal prices as price spikes. Similarly, for the SPA, the proposed model is better at detecting price spikes. Observe that for both of the analyzed forecasting horizons, the proposed model demonstrates a superior balance between SPC and SPA when compared to the benchmarks.
Table 6. SPCs (%) for four-hour-ahead price forecasts for the different models.
Table 7. SPAs (%) for four-hour-ahead price forecasts for the different models.
Finally, observe that the differences between the proposed model and the XGBoost are relatively small in terms of the RMSE and MAE ( see Figure 5). However, when evaluating their performance based on the SPC and SPA metrics (Table 4, Table 5, Table 6 and Table 7), it is evident that the proposed model outperforms the XGBoost. Thus, when enhancing the capabilities of a forecasting model to predict price spikes, it is important to evaluate not only the accuracy of the point predictions but the spike detection ability of the model.

5. Evaluation of the Economic Value of the Developed Price Forecasting Model in Alberta

In this section, we evaluate the economic advantage of the proposed price forecasting model in the ancillary services market, in particular, to the spinning reserve offers of one generation facility. Moreover, an optimization problem is proposed to maximize the revenues from the contingency reserve volume offers based on the electricity price forecasts, p ^ t + k i , of each E i .
For the generation facility, the optimization considers the maximum power (MW) to distribute active and standby operating reserves across the daily on-peak and off-peak periods. Likewise, the prices of each offer are not changed in the optimization problem and are assumed to be low enough to be dispatched first on their respective merit order tables. In other words, the generation facility is assumed to be a price-taker, so its operation in the market does not affect the prices. Unlike the two optimized models, which both consider maximum power for the offers in the spinning reserves market, we consider the other three approaches, i.e., the first, second, and third approach. The first approach considers 10 MW and 5 MW of the volume offers for the active and standby spinning market, respectively. Similarly, the second (third) approach uses 12 MW (8 MW) and 3 MW (7 MW). The offers for the second and third approaches also correspond to the active and standby spinning market.
The total actual revenues from January to December 2019 are calculated using daily volume offers and the actual electricity prices from both the energy and ancillary services markets. The total possible revenues are obtained using the actual electricity price to solve the optimization problem, i.e., the perfect approach. Similarly, we solve, again, the optimization problem using price forecasts, p ^ t + k i , instead of the actual electricity price to calculate the revenues in this scenario. We call this approach the proposed model.
The difference between the perfect approach and the proposed model revenues represents the effect of the price forecast inaccuracies in the proposed forecasting system. Likewise, the total actual revenues are also compared against the other three approaches that do not consider any optimization techniques on their volume offers, i.e., the first, second, and third approaches.
Table 8 presents the total obtained revenues from each approach at the spinning reserves market. Observe that the proposed model provides the highest revenues when compared with the first, second, and third approaches. Its total revenues represent 89.67% of the maximum revenues that could have been obtained from the perfect approach. The difference in revenues between the developed model and the perfect approach is approximately CAD 450 , 000 per year. This difference represents the economic losses brought by the forecast inaccuracies, e.g., approximately equivalent to 10% per year of the maximum possible revenues. We recognize these inaccuracies are related to the ideas presented in Section 1 on the challenges of producing single-digit price forecast errors in real-time electricity markets.
Table 8. Total revenues from the spinning reserve volume offers between January and December 2019, i.e., the test period.
Observe also from Table 8 that the proposed model outperforms, on average, by more than 30 % or approximately CAD 1 , 197 , 000 per year compared to the revenues obtained from fixed volume offers, i.e., first, second, and third approaches. Table 8 also shows that for the case of the fixed volume offers, it is more profitable to increase the offers in the active spinning reserve market and decrease those in the standby spinning reserve market. Observe, for example, the difference in revenues of approximately more than CAD 500 , 000 per year between the second and third approaches.
Finally, Figure 8 shows the combined cumulative revenues for both types of spinning reserves (i.e., active and standby) for the analyzed period in 2019. It is possible to observe that the revenues between the perfect approach and the developed model significantly differ from the other three approaches approximately after February 2019. Likewise, observe the impact of the forecast error between the perfect approach and the developed model after March 2019. Such a tendency prevails and increases towards the end of the analyzed period.
Figure 8. Total cumulative revenues from the spinning reserve volume offers in the test period. These results correspond to the economic evaluation of the proposed forecasting system compared to other approaches.

6. Conclusions

A hybrid electricity price forecasting model that takes advantage of a two-dimension strategy was presented in this paper. At the vertical dimension, the different forecasting regressors generated price predictions based on a stack array of an XGBoost and an LSTM model. Likewise, at the horizontal dimension, the proposed model took advantage of the available market information at the forecasting origin to generate price forecasts from 1 h ahead up to 96 h ahead. In addition, the novel input feature based on the slope of the output power production of thermal generators demonstrated and improvement in the price spike prediction accuracy of the proposed forecasting model.
Extensive numerical results were conducted to evaluate the accuracy of the proposed point forecasting model, and also to evaluate its ability to detect the occurrence of price spikes in the Alberta electricity market for the year 2019. In terms of MAE and RMSE, the proposed forecasting model outperformed the other benchmark models along all the forecasting horizons. The MAE and RMSE errors for price predictions from one to four hours ahead range between CAD 10–24/MWh and CAD 38–65/MWh, respectively. From five to ninety-six hours ahead, the MAE and RMSE increased by approximately CAD 10/MWh and CAD 15/MWh, respectively. Both error metrics also kept almost constant during this forecasting horizon, which demonstrates the capabilities of the proposed hybrid forecasting system to forecast long sequences of data without considerably decreasing its performance. Furthermore, our experiments showed that the proposed forecasting model enhanced the prediction of price spikes at further forecasting horizons when compared to a single forecaster. Moreover, in terms of the price spike detection assessment, we showed that the proposed forecasting model outperformed, on average, the benchmark models for both the SPC and SPA metrics. This demonstrates the importance of conducting a specific spike detection evaluation of the models to assure their performance not only in predicting, but also in detecting the occurrence of price spikes at different forecasting horizons.
Finally, the proposed price forecasting model was effectively employed to allocate, in the most profitable way possible, the contingency reserve volume offers of a hypothetical generation facility in the ancillary services market in Alberta. The output of the optimization problem showed that the proposed forecasting model was able to provide higher revenues for the generation facility when compared to the other presented approaches. Moreover, we also demonstrated that the impact of the forecast error in our study case represents a loss in revenues of approximately 10% per year of what could have been if a perfect approach was used.
One limitation of this study is that the proposed price forecasting model does not directly consider the historical offers and bids submitted by market players. By not considering these crucial inputs, the model may overlook important market dynamics and potential strategic behaviors, which could impact the accuracy of its forecasts. Incorporating the offers and bids of market players into the model has the potential to enhance its performance by capturing the influence of participant strategies and optimizing the forecasting outcomes. However, it is important to note that such types of input data are not publicly available in all markets. Future research should explore the integration of these additional inputs to further refine and strengthen the predictive capabilities of the model. Several other limitations should also be noted:
  • The input of recent variations in the output power of thermal units as an indicator of unplanned outages or shifts in the supply stack may be more relevant to coal generators, as their ramp-up and ramp-down times are typically longer compared to gas generators. The proposed methodology was tested using data from Alberta’s electricity market, where more than five coal generators were still operational (the year 2019).
  • The literature presents some benchmark models for comparison for day-ahead market price forecasts [11]. However, it lacks a standard benchmark model for testing price forecasting methods for real-time markets focused on price spike detection. Thus, we have reported the results of comparing the proposed method with two baseline models that we have developed to evaluate the performance of the proposed method.
  • The price spike thresholds ( T 1 and T 2 ) will vary among different electricity markets and depend on the frequency and magnitude of price spikes. The optimal selection of these thresholds should be based on the training dataset specific to the electricity market under study. Automating and optimizing the process of determining these thresholds is recommended for future research.
  • The proposed methodology was only tested on the Alberta electricity market, which features the presence of more than five coal generators (the year 2019). However, it can be applied to other electricity markets where the presence of large generation units can have an impact on prices.
Furthermore, it is essential to acknowledge the significance of quantifying uncertainties associated with deterministic forecasts. As part of future research, developing a probabilistic forecasting model based on the proposed strategy would be valuable in addressing this aspect.

Author Contributions

Conceptualization, H.Z. and M.Q.; methodology, D.M.J.; software, D.M.J. and M.Z.L.; validation, D.M.J., M.Z.L. and H.Z.; formal analysis, D.M.J. and M.Z.L.; investigation, D.M.J.; resources, D.M.J.; data curation, D.M.J. and M.Z.L.; writing—original draft preparation, D.M.J.; writing—review and editing, D.M.J. and M.Z.L.; visualization, M.Z.L.; supervision, H.Z. and M.Q.; project administration, D.M.J. and M.Q.; funding acquisition, H.Z. and M.Q. All authors have read and agreed to the published version of the manuscript.

Funding

This work was partially supported by NSERC Discovery Grants.

Data Availability Statement

Not applicable.

Acknowledgments

The authors would like to thank Arcus Power and NRGStream for granting access to their databases.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Mayer, K.; Trück, S. Electricity Markets around the World. J. Commod. Mark. 2018, 9, 77–100. [Google Scholar] [CrossRef]
  2. Maryniak, P.; Weron, R. What is the Probability of an Electricity Price Spike? Evidence from the UK Power Market. In Handbook of Energy Finance: Theories, Practices Furthermore, Simulations; Goutte, S., Nguyen, D., Eds.; World Scientific: Hackensack, NJ, USA, 2019; pp. 231–245. [Google Scholar]
  3. Ciarreta, A.; Martinez, B.; Nasirov, S. Forecasting electricity prices using bid data. Int. J. Forecast. 2022, 39, 1253–1271. [Google Scholar] [CrossRef]
  4. Weron, R. Electricity price forecasting: A review of the state-of-the-art with a look into the future. Int. J. Forecast. 2014, 30, 1030–1081. [Google Scholar] [CrossRef]
  5. Christensen, T.; Hurn, A.; Lindsay, K. Forecasting spikes in electricity prices. Int. J. Forecast. 2012, 28, 400–411. [Google Scholar] [CrossRef]
  6. Haben, S.; Caudron, J.; Verma, J. Probabilistic Day-Ahead Wholesale Price Forecast: A Case Study in Great Britain. Forecasting 2021, 3, 596–632. [Google Scholar] [CrossRef]
  7. Tschora, L.; Pierre, E.; Plantevit, M.; Robardet, C. Electricity price forecasting on the day-ahead market using machine learning. Appl. Energy 2022, 313, 118752. [Google Scholar] [CrossRef]
  8. Sandhu, H.; Fang, L.; Guan, L. Forecasting day-ahead price spikes for the Ontario electricity market. Electr. Power Syst. Res. 2016, 141, 450–459. [Google Scholar] [CrossRef]
  9. Rodriguez, C.; Anders, G. Energy Price Forecasting in the Ontario Competitive Power System Market. IEEE Trans. Power Syst. 2004, 19, 366–374. [Google Scholar] [CrossRef]
  10. Aggarwal, S.K.; Saini, L.M.; Kumar, A. Electricity Price Forecasting in Ontario Electricity Market Using Wavelet Transform in Artificial Neural Network Based Model. Int. J. Control Autom. Syst. 2008, 6, 639–650. [Google Scholar]
  11. Lago, J.; Marcjasz, G.; De Schutter, B.; Weron, R. Forecasting day-ahead electricity prices: A review of state-of-the-art algorithms, best practices and an open-access benchmark. Appl. Energy 2021, 293, 116983. [Google Scholar] [CrossRef]
  12. Gomez, T.; Herrero, I.; Rodilla, P.; Escobar, R.; Lanza, S.; de la Fuente, I.; Llorens, M.L.; Junco, P. European Union Electricity Markets: Current Practice and Future View. IEEE Power Energy Mag. 2019, 17, 20–31. [Google Scholar] [CrossRef]
  13. Aggarwal, S.K.; Saini, L.M.; Kumar, A. Electricity Price Forecasting in Deregulated Markets: A Review and Evaluation. Int. J. Electr. Power Energy Syst. 2009, 31, 13–22. [Google Scholar] [CrossRef]
  14. Sgarlato, R.; Ziel, F. The Role of Weather Predictions in Electricity Price Forecasting Beyond the Day-Ahead Horizon. IEEE Trans. Power Syst. 2022, 38, 2500–2511. [Google Scholar] [CrossRef]
  15. Lehna, M.; Scheller, F.; Herwartz, H. Forecasting Day-Ahead Electricity Prices: A Comparison of Time Series and Neural Network Models Taking External Regressors into Account. Energy Econ. 2022, 106, 105742. [Google Scholar] [CrossRef]
  16. Complete Set of ISO Rules. Available online: https://www.aeso.ca/rules-standards-and-tariff/iso-rules/complete-set-of-iso-rules/ (accessed on 7 July 2020).
  17. Liu, L.; Bai, F.; Su, C.; Ma, C.; Yan, R.; Li, H.; Sun, Q.; Wennersten, R. Forecasting the occurrence of extreme electricity prices using a multivariate logistic regression model. Energy 2022, 247, 123417. [Google Scholar] [CrossRef]
  18. Hochreiter, S.; Schmidhuber, J. Long Short-Term Memory. Neural Comput. 1997, 9, 1735–1780. [Google Scholar] [CrossRef]
  19. Chen, T.; Guestrin, C. XGBoost: A Scalable Tree Boosting System. In Proceedings of the 22nd ACM SIGKDD International Conference On Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; pp. 785–794. [Google Scholar]
  20. Lago, J.; De Ridder, F.; De Schutter, B. Forecasting spot electricity prices: Deep learning approaches and empirical comparison of traditional algorithms. Appl. Energy 2018, 221, 386–405. [Google Scholar] [CrossRef]
  21. International Institute of Forecasters. Available online: https://forecasters.org/foresight/beyond-error-measures/ (accessed on 18 July 2022).
  22. Wu, H.; Levinson, D. The ensemble approach to forecasting: A review and synthesis. Transp. Res. Part Emerg. Technol. 2021, 132, 103357. [Google Scholar] [CrossRef]
  23. AESO. Electricity in Alberta. Available online: https://www.aeso.ca/aeso/electricity-in-alberta/ (accessed on 23 March 2020).
  24. AESO. Understanding the Market. Available online: https://www.aeso.ca/market/understanding-the-market/ (accessed on 24 March 2020).
  25. AESO. Guide to Understanding Alberta’s Electricity Market. Available online: https://www.aeso.ca/aeso/understanding-electricity-in-alberta/continuing-education/guide-to-understanding-albertas-electricity-market/ (accessed on 23 March 2020).
  26. AESO. 2021 Annual Market Statistics. Available online: https://www.aeso.ca/market/market-and-system-reporting/annual-market-statistic-reports/ (accessed on 28 September 2022).
  27. AESO. Available online: http://www.aeso.ca (accessed on 19 August 2022).
  28. IESO. Available online: http://www.ieso.ca (accessed on 28 July 2022).
  29. NYISO. Available online: http://www.nyiso.com (accessed on 28 July 2022).
  30. Weron, R. Modeling and Forecasting Electricity Loads and Prices: A Statistical Approach; John Wiley & Sons: Chichester, UK; Hoboken, NJ, USA, 2006; pp. 25–64. [Google Scholar]
  31. Zareipour, H.; Bhattacharya, K.; Cañizares, C.A. Electricity market price volatility: The case of Ontario. Energy Policy 2007, 35, 4739–4748. [Google Scholar] [CrossRef]
  32. Uniejewski, B.; Weron, R.; Ziel, F. Variance Stabilizing Transformations for Electricity Spot Price Forecasting. IEEE Trans. Power Syst. 2018, 33, 2219–2229. [Google Scholar] [CrossRef]
  33. Ugurlu, U.; Oksuz, I.; Tas, O. Electricity Price Forecasting Using Recurrent Neural Networks. Energies 2018, 11, 1255. [Google Scholar] [CrossRef]
  34. Jiang, L.; Hu, G. Day-Ahead Price Forecasting for Electricity Market using Long-Short Term Memory Recurrent Neural Network. In Proceedings of the 2018 15th International Conference on Control, Automation, Robotics and Vision (ICARCV), Singapore, 18–21 November 2018; pp. 949–954. [Google Scholar] [CrossRef]
  35. Bengio, Y.; Simard, P.; Frasconi, P. Learning long-term dependencies with gradient descent is difficult. IEEE Trans. Neural Netw. 1994, 5, 157–166. [Google Scholar] [CrossRef] [PubMed]
  36. Le, X.H.; Ho, H.V.; Lee, G.; Jung, S. Application of Long Short-Term Memory (LSTM) Neural Network for Flood Forecasting. Water 2019, 11, 1387. [Google Scholar] [CrossRef]
  37. Li, W.; Becker, D.M. Day-Ahead Electricity Price Prediction Applying Hybrid Models of LSTM-based Deep Learning Methods and Feature Selection Algorithms under Consideration of Market Coupling. Energy 2021, 237, 121543. [Google Scholar] [CrossRef]
  38. Chang, Z.; Zhang, Y.; Chen, W. Electricity Price Prediction Based on Hybrid Model of Adam Optimized LSTM Neural Network and Wavelet Transform. Energy 2019, 187, 115804. [Google Scholar] [CrossRef]
  39. Lu, X.; Dong, Z.Y.; Li, X. Electricity market price spike forecast with data mining techniques. Electr. Power Syst. Res. 2005, 73, 19–29. [Google Scholar] [CrossRef]
  40. Gaillard, P.; Goude, Y.; Nedellec, R. Additive Models and Robust Aggregation for GEFCom2014 Probabilistic Electric Load and Electricity Price Forecasting. Int. J. Forecast. 2016, 32, 1038–1050. [Google Scholar] [CrossRef]
  41. Hong, T.; Pinson, P.; Fan, S.; Zareipour, H.; Troccoli, A.; Hyndman, R.J. Probabilistic Energy Forecasting: Global Energy Forecasting Competition 2014 and Beyond. Int. J. Forecast. 2016, 32, 896–913. [Google Scholar] [CrossRef]
  42. Agrawal, R.K.; Muchahary, F.; Tripathi, M.M. Ensemble of Relevance Vector Machines and Boosted Trees for Electricity Price Forecasting. Appl. Energy 2019, 250, 540–548. [Google Scholar] [CrossRef]
  43. Xie, H.; Chen, S.; Lai, C.; Ma, G.; Huang, W. Forecasting the Clearing Price in the Day-Ahead Spot Market Using eXtreme Gradient Boosting. Electr. Eng. 2022, 104, 1607–1621. [Google Scholar] [CrossRef]
  44. Weron, R.; Bierbrauer, M.; Trück, S. Modeling electricity prices: Jump diffusion and regime switching. Phys. Stat. Mech. Its Appl. 2004, 336, 39–48. [Google Scholar] [CrossRef]
  45. Motamedi, A.; Geidel, C.; Zareipour, H.; Rosehart, W.D. Electricity price forecasting considering residual demand. In Proceedings of the 2012 3rd IEEE PES Innovative Smart Grid Technologies Europe (ISGT Europe), Berlin, Germany, 14–17 October 2012; pp. 1–8. [Google Scholar] [CrossRef]
  46. Zhang, J.; Tan, Z.; Wei, Y. An Adaptive Hybrid Model for Short Term Electricity Price Forecasting. Appl. Energy 2020, 258, 114087. [Google Scholar] [CrossRef]
  47. Filho, J.C.R.; Affonso, C.d.M.; De Oliveira, R.C. Energy Price Prediction Multi-Step Ahead Using Hybrid Model in the Brazilian Market. Electr. Power Syst. Res. 2014, 117, 115–122. [Google Scholar] [CrossRef]
  48. Zareipour, H.; Canizares, C.; Bhattacharya, K.; Thomson, J. Application of Public-Domain Market Information to Forecast Ontario’s Wholesale Electricity Prices. IEEE Trans. Power Syst. 2006, 21, 1707–1717. [Google Scholar] [CrossRef]
  49. Chen, X.; Dong, Z.Y.; Meng, K.; Xu, Y.; Wong, K.P.; Ngan, H.W. Electricity Price Forecasting With Extreme Learning Machine and Bootstrapping. IEEE Trans. Power Syst. 2012, 27, 2055–2062. [Google Scholar] [CrossRef]
  50. Xiong, H.; Mamon, R. A higher-order Markov chain-modulated model for electricity spot-price dynamics. newblock Appl. Energy 2019, 233–234, 495–515. [Google Scholar] [CrossRef]
  51. Janczura, J.; Trück, S.; Weron, R.; Wolff, R.C. Identifying Spikes and Seasonal Components in Electricity Spot Price Data: A Guide to Robust Modeling. Energy Econ. 2013, 38, 96–110. [Google Scholar] [CrossRef]
  52. Afanasyev, D.O.; Fedorova, E.A. On the Impact of Outlier Filtering on the Electricity Price Forecasting Accuracy. Appl. Energy 2019, 236, 196–210. [Google Scholar] [CrossRef]
  53. Christensen, T.; Hurn, S.; Lindsay, K. It Never Rains but It Pours: Modeling the Persistence of Spikes in Electricity Prices. Energy J. 2009, 30. [Google Scholar] [CrossRef]
  54. Manner, H.; Türk, D.; Eichler, M. Modeling and Forecasting Multivariate Electricity Price Spikes. Energy Econ. 2016, 60, 255–265. [Google Scholar] [CrossRef]
  55. Vu, D.H.; Muttaqi, K.M.; Agalgaonkar, A.P.; Bouzerdoum, A. A Multi-Feature Based Approach Incorporating Variable Thresholds for Detecting Price Spikes in the National Electricity Market of Australia. IEEE Access 2021, 9, 13960–13969. [Google Scholar] [CrossRef]
  56. Contreras, J.; Espinola, R.; Nogales, F.; Conejo, A.J. ARIMA models to predict next-day electricity prices. IEEE Trans. Power Syst. 2003, 18, 1014–1020. [Google Scholar] [CrossRef]
  57. Conejo, A.J.; Contreras, J.; Espínola, R.; Plazas, M.A. Forecasting Electricity Prices for a Day-Ahead Pool-Based Electric Energy Market. Int. J. Forecast. 2005, 21, 435–462. [Google Scholar] [CrossRef]
  58. Marcjasz, G.; Uniejewski, B.; Weron, R. Probabilistic Electricity Price Forecasting with NARX Networks: Combine Point or Probabilistic Forecasts? Int. J. Forecast. 2020, 36, 466–479. [Google Scholar] [CrossRef]
  59. Van Der Heijden, T.; Lago, J.; Palensky, P.; Abraham, E. Electricity Price Forecasting in European Day Ahead Markets: A Greedy Consideration of Market Integration. IEEE Access 2021, 9, 119954–119966. [Google Scholar] [CrossRef]
  60. Zhao, J.H.; Dong, Z.Y.; Li, X.; Wong, K.P. A Framework for Electricity Price Spike Analysis With Advanced Data Mining Methods. IEEE Trans. Power Syst. 2007, 22, 376–385. [Google Scholar] [CrossRef]
  61. Chitsaz, H.; Zamani-Dehkordi, P.; Zareipour, H.; Parikh, P.P. Electricity Price Forecasting for Operational Scheduling of Behind-the-Meter Storage Systems. IEEE Trans. Smart Grid 2018, 9, 6612–6622. [Google Scholar] [CrossRef]
  62. Marcjasz, G.; Uniejewski, B.; Weron, R. On the Importance of the Long-Term Seasonal Component in Day-Ahead Electricity Price Forecasting with NARX Neural Networks. Int. J. Forecast. 2019, 35, 1520–1532. [Google Scholar] [CrossRef]
  63. Kingma, D.P.; Ba, J. Adam: A Method for Stochastic Optimization. In Proceedings of the 3rd International Conference on Learning Representations, San Diego, CA, USA, 7–9 May 2015. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Article Metrics

Citations

Article Access Statistics

Multiple requests from the same IP address are counted as one view.