1. Introduction
Forecasting electricity prices is one of the key challenges in modern energy markets, particularly owing to the dynamic increase in the share of renewable energy sources (RESs) such as wind and solar energy. High price volatility, the occurrence of negative prices, and complex relationships between the economic, weather, and technological factors require advanced predictive models that combine high accuracy, the ability to model nonlinearities, and practical utility in risk management and operational planning. In previous studies, various approaches, from simple neural networks to advanced deep-learning models, were employed, revealing both significant progress and substantial gaps that limit the effectiveness of the existing methods.
Electricity price forecasting (EPF) was introduced in the 1990s when markets began to be deregulated, thereby increasing the price volatility and the need for accurate forecasts [
1]. Electricity requires constant balance in the power system [
2], and the best predictive models for balancing supply and demand had to be identified. Over the years, various models have been used for price forecasting, including agent-based simulation models, statistical models, and computational intelligence models [
1,
3]. The foundations of the computational intelligence (CI) models were explained in [
4], which discussed the principles of electricity price determination and forecasting. These principles primarily depended on the price formation, volatility, and exogenous variables. The CI model comprises a range of computational methods, primarily artificial intelligence and machine learning algorithms, which are employed in this article. A comprehensive review of point forecasting models for electricity, including an assessment of model performance in day-ahead, intra-day, and balancing markets, is presented in [
5].
Currently, in electricity markets, excluding forward contracts, energy can be contracted for delivery one day in advance [
6]. Essentially, accurate forecasting presents considerable savings for the energy consumers and indirectly contributes to the balance in the power system. EPF typically can be categorised by the forecasting horizons: short-term, medium-term, and long-term. Short-term forecasting typically includes up to a few days ahead, medium-term spans from a few days to several months, and long-term forecasting includes months, quarters, or years [
7].
Forecast evaluation typically employs the mean absolute error (MAE) and mean absolute percentage error (MAPE) [
8]. However, several other evaluation indicators are used in the price prediction articles. The EPF community has not standardised the evaluation methods [
7], which creates challenges in interpreting the results of various prediction methods.
In recent years, artificial intelligence (AI) has become embedded in nearly every aspect of daily life. In electricity price forecasting, AI is used to improve time-series analyses, among other applications [
9,
10,
11]. Typically, forecasting is applied to enhance the prediction accuracy, which is expressed through the sum of squared errors [
12]. One of the most frequently used methods is the artificial neural network (ANN). The ANN computational procedures are based on the learning mechanisms of the human brain. The performance of ANNs depends on the network structure [
13]. Multilayer perceptron (MLP) is the most commonly used neural network in price forecasting [
14]. Other algorithms applied in this field include support vector machines (SVMs) and Gaussian process regression. These are kernel-based machine learning methods used for data analysis [
15]. SVMs are widely used supervised-learning algorithms [
16]. Support vector regression is a generalisation of SVM used for regression problems [
17]. Additionally, some studies employed decision tree algorithms, particularly regression trees, for price forecasting problems [
18].
Several studies focus on short-term day-ahead forecasts (24 h) [
19], which support trading decisions on energy markets such as PJM, EPEX SPOT, or TGE [
20,
21,
22,
23]. For example, Miller and Bućko [
21] employed a simple three-layer MLP to forecast prices on the Polish TGE market, achieving a MAPE of 4.05%, although it was limited to historical prices, omitting exogenous variables such as weather or system load. Similarly, Hong and Wu [
24] employed a hybrid principal component analysis (PCA)–multilayer feedforward (MLF) model on the PJM market, where PCA reduced the data dimensionality, and MLF forecasted prices with an R
2 of up to 0.904 and an MAE of USD 13.66/MWh. Their approach included temporal variables and load but lacked weather variables, thereby reducing its effectiveness in markets with high RES penetration [
24]. Meanwhile, Dias et al. [
25] used an MLP with two hidden layers for medium-term forecasts (4 weeks) in the Brazilian market, thereby achieving a MAPE of 14.65%. Although this performance was satisfactory for long-term contexts, it does not satisfy the hourly precision requirements of the dynamic European markets. An innovative approach was presented in a previous study [
26], where hybrid MLP topologies (parallel and cascaded) with six to eight hidden layers trained using Levenberg–Marquardt, scaled conjugate gradient, and gradient descent with momentum algorithms, achieved a MAPE of 1.42% on the Australian market (AEMO). A longer horizon than 24 h is crucial for medium-term operational planning, RES balancing, and energy portfolio management. However, limited research has been conducted in the context of MLP models.
Although MLP networks are effective in specific conditions, they have limited capabilities in modelling complex nonlinearities and dependencies between variables such as the wind speed, cloud cover, or temperature, which are crucial in markets with high RES penetration [
7,
24,
25,
26]. The model presented in [
21] did not consider weather variables, thereby limiting its accuracy under variable market conditions. By contrast, the AEMO study [
26] included exogenous variables such as load, weather, and gas prices. However, the lack of statistical tests and restriction to a single market reduced the generalisability of the results. Another study [
27], which combined MLP with a genetic algorithm and machine committees on the Brazilian market, achieved high accuracy but required 9 days of training, making it impractical for real-time applications. Moreover, most MLP models, such as those proposed in [
21,
24,
25], provided only point forecasts, whereas probabilistic forecasts, which are crucial for risk management in the uncertainty of RESs, are rarely applied in these approaches. Notably, in [
8], the researchers demonstrated that MLP models are suitable for forecasting electricity prices on the Italian market owing to their high accuracy and lower risk of large errors, thereby achieving the best MAE of EUR 3.873/MWh for hourly forecasts. The results in [
6] demonstrate that increasing the number of hidden layers or nodes in an MLP network does not improve the accuracy and may even reduce it.
An additional challenge is the limited automation of feature selection in the MLP models, which introduces subjectivity and reduces the reproducibility of the results. Miller and Bućko [
21] manually selected the features, thereby increasing the risk of errors, whereas the model proposed by Hong and Wu [
24] partially automated the process using PCA. However, full automation was not achieved, as observed in the more advanced AEMO approach [
26]. Furthermore, most MLP-based studies [
21,
25,
26] tested models on a single market; thus, assessing the universality of the models is difficult. Furthermore, the absence of rigorous statistical tests, such as Diebold–Mariano or Giacomini–White, which were conducted by Hong and Wu [
24], reduces the reliability of the results when compared with more advanced models. Another approach, based on PCA for averaging ARX model forecasts on the EPEX SPOT market [
23], achieved a reduction of 3.84% in the MAE for day-ahead forecasts, thereby demonstrating the potential of automated data processing; however, its application was limited to short-term horizons. A study on the EPEX DE/AT market [
28], conducted using a deep neural network (DNN) with embedding layers for calendar variables, achieved an MAE of EUR 4.10/MWh, thereby incorporating wind and photovoltaic (PV) forecasts, highlighting the significance of RES variables in price modelling. These limitations demonstrate the requirement for new MLP models that combine simplicity with advanced techniques, thereby enabling accurate forecasts over longer time horizons.
Furthermore, the literature highlights the increasing importance of more advanced methods that can better handle nonlinearities and temporal dependencies. A hybrid Wavelet Transform + long short-term memory (LSTM) model, which was applied on the PJM market [
20], achieved a MAPE of 0.40% for short-term load forecasts using automatic feature selection via mutual information and interaction gain. However, its practical application was limited by the high computational requirements. Additionally, a convolutional neural network (CNN)–gated recurrent unit (GRU) model with an attention mechanism applied in the German market [
22] achieved a MAPE of 6.33%, thereby incorporating exogenous variables such as RES generation and load, although its computational complexity limited its real-time applications. An extreme learning machine model on the New York market [
29] achieved low RMSE (0.0697–0.1301); however, the lack of exogenous variables limited its universality. In another study [
30], distributed neural networks were employed for the probabilistic forecasting of electricity prices for the next day, along with a simple but effective aggregation method for these networks to increase the stability of forecasts. The autoregressive multivariate linear model with exogenous variables and LASSO for variable selection and regularization was introduced in [
31]. The potential of meteorological forecasts to improve the accuracy of price forecasts, resulting in a 10–20% improvement in RMSE, was presented.
The specificity of the energy markets, such as the Polish TGE market, requires the consideration of local conditions such as the high installed RES capacity (approximately 44% in 2024, according to PSE [
32]), variable weather conditions, and limited historical data. A study demonstrating the significance of weather variables such as the temperature and wind speed and NWP data in RES forecasting [
33] highlighted the requirement for standardised evaluation metrics. The Polish energy market is characterised by an increasing RES share, gas, high price volatility, and dependencies on external factors such as coal prices or EUA emission units; thus, it requires models capable of considering these specific conditions [
23,
28]. A study conducted on the German–Luxembourg market [
22] demonstrated that the RES variables such as wind generation significantly affect the forecast accuracy, indicating the effectiveness of a similar approach in the Polish market.
The increasing share of RESs in the energy mix, particularly in Europe, presents additional challenges such as price unpredictability due to the dependence of supply and demand on the weather conditions [
19]. MLP networks can effectively model such dynamics owing to their ability to realise nonlinear dependencies. Maciejowska [
34] clearly demonstrated that both the wind and solar energy reduced the electricity prices on the market. The models based on probabilistic quantile regression averaging methods [
35] presented low Pinball Loss values, indicating their effectiveness in risk management in markets with high RES penetration.
Expanding this analysis, electricity price forecasting requires the consideration of both weather variables and macroeconomic factors, such as gas, coal, or EUA prices, which affect the price dynamics. A study conducted on the AEMO market [
26] reported that gas prices were a key exogenous variable, demonstrating their importance in markets dependent on fossil fuels, though the lack of statistical tests limited the reliability of the results. Another study [
23] conducted based on PCA for averaging ARX forecasts demonstrated error reduction and could promote the development of models accounting for macroeconomic conditions. The European Union’s climate policy, including the emission trading system (ETS), contributes significantly to determining the electricity prices on the Polish market. The increasing EUA prices in recent years have increased the cost of fossil fuel-based energy production, thereby affecting electricity price dynamics. A study conducted on the EPEX DE/AT market [
28] demonstrated the significance of RES variables in the context of climate policy, which is particularly relevant for Poland, where the energy transition is accelerating.
Geopolitical factors such as instability in the gas supplies or fluctuations in the raw material prices further complicate electricity price forecasting in Poland. Dynamic changes in the energy mix, such as the planned increase in the RES share to 23% by 2030 (in line with the National Energy and Climate Plan), require models that can adapt to new market conditions [
33]. The energy transition, which was based on EU regulations such as the ‘Fit for 55’ package, introduces additional requirements for forecasting models, which must consider the changing emission costs, new RES technologies, and the development of energy storage systems. Energy storage technologies, such as lithium-ion batteries or flow systems, are beginning to contribute significantly to balancing RESs [
36,
37], necessitating their inclusion in the modelling of price dynamics [
22,
32]. Limited historical data on the Polish market complicates the modelling process, requiring approaches that efficiently utilise the available information, such as the proposed MLP model with PCA [
21,
24]. A study conducted on the Russian electricity market [
38] using DNN with LSTM layers demonstrated that in some cases, DNNs such as LSTM or CNN can achieve slightly better accuracy in forecasting the energy consumption and energy prices when compared with the MLP.
EU regulations, such as directives on the energy efficiency and renewable energy sources, further complicate price forecasting on the Polish market, which involves high dependence on coal despite the increasing RES share. A study conducted on the German–Luxembourg market [
22], based on CNN-GRU with an attention mechanism, demonstrated that RES variables are crucial for accurate forecasts, which is relevant for Poland, where wind and solar generation are becoming increasingly significant [
22]. Models based on simpler methods, such as MLP, must be enhanced with advanced data processing techniques to address these challenges [
21,
25,
26]. A study conducted on the PJM market [
20] using Wavelet Transform + LSTM demonstrated that automatic feature selection can improve the accuracy; however, its computational complexity limits its application in resource-constrained environments.
Previous studies [
36,
37] reported that energy storage systems present considerable potential to support the energy transition and decarbonisation of energy systems. Determining the optimal storage capacity is crucial for fully realising this potential, considering factors such as the electricity demand, renewable energy generation, and energy costs. The day-ahead market provides the price of electricity only 24 h beforehand. This forecast is insufficient for some energy storage users. Several applications equipped with energy storage facilities do not fully utilise their installed capacity and discharge under unfavourable conditions. Determining the expected energy prices for the next few days enables better usage of the storage capacity. An accurate 72 h forecast helps in providing the relevant data to optimise the operation of storage facilities. The simplicity and low computational requirements of models based on MLP and SVM make them suitable for use in complex energy storage systems.
In summary, the literature highlights the need to develop new, more integrated approaches to electricity price forecasting that combine the simplicity and interpretability of MLP and SVM models with modern feature selection methods, probabilistic analysis, and consideration of a wide range of exogenous variables. Only such models can satisfy the increasing demands of the energy market; support operators, traders, and decision makers in making informed decisions; and effectively contribute to the development of a sustainable energy system.
This study proposes an approach for forecasting the electricity prices 72 h in advance based on MLP and SVM networks enhanced with linear correlation analysis. A methodology for analysing, processing, and partitioning the training data is proposed for hourly electricity price forecasting. Various types of data are analysed: time-related, weather, and energy data from the Polish market. Two time periods are distinguished owing to the variability and repeatability of the electricity prices. Unique results are obtained based on two data periods and three sets of different data types, enabling a comprehensive comparison and evaluation of the capabilities of models based on the SVM and MLP networks.
The novel aspects of this study are as follows:
- -
A novel method is proposed for processing the cloud cover data using the average daily and annual profiles.
- -
Rules for dividing data into the training, validation, and testing subsets are established. Furthermore, principles for selecting the training parameters using a sliding window based on the problem of price forecasting with hourly resolution are determined.
- -
A multivariate comparison of the 72 h electricity price prediction using MLP and SVM networks is performed with analysis of the 3-day, monthly, and 2-year average errors for two periods with different price profiles and three sets of training data types.