1. Introduction
Adequate forecasting of the required volumes of electricity generation and sufficient power for integrated power systems (IPS) is crucial given the rapid development of industries and consumption volumes and the growing demands to reduce the harmful impact on the environment by reducing greenhouse gas emissions [
1,
2,
3,
4,
5,
6,
7,
8,
9,
10]. As the demand for electricity is also growing due to economic growth, urbanisation and the increasing electrification of industries, reliable long-term forecasting of the required parameters of integrated power systems is becoming important to ensure energy security and grid stability. Integrated power systems, which often connect several regional or national grids, require an integrated approach to accurately forecast demand, supply capabilities and potential problems.
The global trend of the rapid deployment of renewable energy sources and the growing focus on reducing carbon emissions are challenging traditional forecasting methods. The successful functioning of the IPS largely depends on balancing electricity availability and peak demand, making accurate long-term forecasting a cornerstone of strategic planning and infrastructure development in the energy sector.
Ensuring the reliable operation of integrated power systems in the context of growing energy consumption, technology development and the integration of renewable energy sources is currently a critical task. Accurate forecasting of required and sufficient power can be the basis for strategic planning, improving the efficiency of the power system and preventing energy shortages or surpluses.
Models based on regression analysis methods that take into account the dependence of power on socio-economic factors (gross domestic product (GDP), population, etc.) are presented in [
11,
12,
13,
14,
15,
16,
17,
18].
Paper [
11] presents a simple model based on regression analysis that includes population and GDP per capita for the long-term forecasting of electricity demand. The model, based on energy consumption data for 1990–2012 by sector, generates projected energy demand by sector until 2025.
To predict future demand for regional energy, Ref. [
12] analyses the applicable environment and forecasting conditions, performs energy forecasting for Liaoning Province from 2010 to 2019 and compares it with actual data and the elasticity coefficient. The comparison shows that the forecasting accuracy of the proposed model is higher than that of the traditional forecasting method using the elasticity coefficient, which proves the scientificity and effectiveness of the model for regional energy demand forecasting. It is argued that the method and data can provide a reliable database for the planning and development of the future power grid.
The main study in [
13] is the forecasting of energy demand using Multivariate Adaptive Regression Splines (MARSs) as a non-parametric regression technique. Energy demand was modelled for the period 1975–2019 based on a combination of factors, including GDP, population, etc. Five models were created and compared with real data collected by the Ministry of Energy and Natural Resources (MENR). The third model, MARSs, was recognised as the best, demonstrating the highest predictive accuracy in forecasting energy demand.
In [
14], three years of demand data and its categorical characteristics are divided into four seasons and fed into an efficient regression model called Machine Learning Categorical Boosting (ML CatBoost) to predict demand for the next year. The model uses a new gradient boosting algorithm that works efficiently with categorical features. Five different machine learning models were developed, analysed and tested to predict the hourly total electricity demand using the same data. The proposed model is compared with a long short-term memory neural network and five other ML models using performance evaluation matrices. The importance of accurate forecasting for the integration of clean energy into the grid is discussed.
Traditional demand forecasting models use linear regression, exponential smoothing, pick-up approach and other models for forecasting [
15]. These models can be viewed as time series. They have a weak ability to generalise and, ultimately, demonstrate low adaptability to random changes. The proposed support vector regression (SVR) model, which is derived from machine learning, is highly adaptive to nonlinear random data changes and can adaptively account for random disturbances. The model is trained on a vector composed of historical data. Numerical results show that the SVR model significantly improves forecasting accuracy compared to traditional models.
Study [
16] aims to apply a first-order model with one variable without using statistical assumptions to forecast energy demand. To improve the accuracy of forecasting, it is necessary to solve the problem arising from the collected samples, which are often based on uncertain estimates. One approach to handling these uncertain and imprecise observations is to use nonlinear interval regression analysis with neural networks to generate upper and lower bounds for individual samples. The study confirmed the high applicability of the proposed model for energy demand forecasting.
Article [
17] proposes a two-stage model based on the method of least absolute reduction and selection of variables to determine the factors of energy demand. A support vector regression model with composite data and second-order exponential smoothing (SVR-CDSES) was created to forecast demand for coal, oil, natural gas and primary electricity. The results of the empirical analysis show that China’s energy demand will grow at an annual rate of 2.68% over the next decade. Demand for natural gas and primary electricity will grow rapidly, reaching a maximum annual growth rate of 8.05%, indicating an increased focus on clean energy.
Study [
18] created multivariate linear regression models to estimate the expected annual heating energy demand in different building configurations and tested their accuracy. The models were modified in such a way that the complexity increased only to the point where the accuracy of the approximation remained sufficient. The result was a multivariate linear model that estimated the expected outcome for unknown descriptive variables with a relative error of 0% and a standard deviation of 1.6%.
Time series models, such as ARIMA (Auto Regressive Integrated Moving Average) and exponential smoothing [
19,
20,
21,
22], use historical data to predict future demand trends based on observed patterns. These models are particularly effective in accounting for seasonal fluctuations and short-term changes in demand.
Study [
19] considers the problem of modelling the load of electricity demand. The provided actual load data were deseasonalised and the offline autoregressive moving average model (ARMA) was applied to them using Akaike’s adjusted information criterion (AICC). The obtained results show that the proposed method, based on the theory of multimodel partitioning, successfully solves the task. The developed model can be useful in studies related to forecasting electricity consumption and electricity prices.
Study [
20] proposes a vector ETS (VETS) method as a suitable alternative to ARIMA for smoothing and forecasting MF time series. The accuracy of the method’s predictions was investigated using Monte Carlo simulations. The results show that the proposed method is suitable for short- and medium-term forecasting.
Study [
21] proposes a method for short-term forecasting of the load on the power grid based on an improved exponential smoothing model. Based on a smoothed sequence that corresponds to an exponential trend, a grey forecasting model with an optimised baseline value is created. The inverse exponential smoothing method is used to restore the predicted values. The model takes into account the impact of factors affecting the load on the power grid. The simulation results show that the proposed algorithm has a satisfactory forecasting effect and meets the requirements of short-term load forecasting.
In [
22], a multivariate forecasting method of nonlinear exponential smoothing with different weighting factors is investigated to realise short-term power output prediction. The simulations and experimental discussions show that the algorithm proposed in this paper can effectively implement power allocation compared to the traditional one.
New machine learning techniques, such as neural networks and ensemble models, offer improved accuracy by identifying complex patterns in large data sets. These models are becoming increasingly popular due to their adaptability and accuracy in dynamic CPS environments. Examples of the use of machine learning and neural networks for more accurate and adaptive electricity demand estimation are presented in [
23,
24,
25,
26,
27,
28,
29,
30,
31,
32].
Scenario-based forecasting takes into account different future conditions, such as policy changes or technological developments, to provide a range of supply and demand projections. This approach is particularly useful when quantitative data is limited. Approaches [
33,
34,
35] are based on scenario modelling to account for the uncertainty of external factors (economic changes, climate impacts).
Reviews of scientific articles devoted to the development and improvement of modelling systems for energy complexes are presented in [
36,
37,
38,
39,
40].
Even a brief review of the works devoted to the improvement of energy system modelling methods confirms the conclusion about the relevance of updating existing and developing new software and information tools for these purposes. Thus, papers [
41,
42,
43,
44,
45,
46,
47,
48,
49,
50,
51,
52,
53,
54] describe the most common modern software and information tools aimed at solving these problems.
The COHYBEM—Conceptual Hybrid Energy Model for different scales of energy potential [
41]—is dedicated to bridging the gap between the theoretical foundations of hybrid renewable energy systems and their practical implementation at different scales through the new Conceptual Hybrid Energy Model (COHYBEM). The main goal is to develop a multivariate model that will allow for a new complete and comprehensive technical and economic analysis of the performance of possible hybrid renewable energy systems at different scales. The task is to determine the impact of critical parameters by changing the key variables in the developed model and analysing their consequences. The study includes big data analyses and modelling and optimisation of hybrid energy solutions that combine wind, solar and hydro power with energy storage technology in the form of a pumped storage plant. The study also shows a Pareto curve with increasing installed power.
The CYME International T&D software [
42] (CYME 9.0 Rev. 4) is, from the authors’ point of view, an advanced and comprehensive world-class modelling package developed for transmission, distribution and industrial power systems. The new generation of CYME 9 software supports utilities for modernising the long-term network planning structure. It is built on the basis of such core components as time series analysis, distributed energy resource (DER) optimisation and no-wire alternatives (NWA). CYME 9.0 aims to integrate the power planning process with the distributed resource planning process. It emphasises the power and versatility of the user interface, combining simplicity with efficiency.
Power System Solutions DIgSILENT PowerFactory 2023 [
43] is a software for analysing power systems: generation, transmission, distribution and industry. It covers a full range of functionality from standard functions to highly complex and advanced applications, including wind energy, distributed generation, real-time modelling and performance monitoring for system testing.
EMMA [
44] is an optimisation model for the European integrated power system. It minimises the total system costs, taking into account dispatch and investments in generation, energy storage and interconnections with hourly granularity. The model includes fourteen generation technologies and two energy storage technologies. It takes into account a large number of technical constraints related to short-term system balancing, cogeneration, cyclical operation of thermal power plants and operational limitations of hydropower facilities. The demand for electricity must be met at every hour of the year, taking into account the cost of under-supply, which is EUR 1000/MWh. Border crossings are subject to net power constraints. However, the model does not include the task of discrete unit commitment and load flow calculations. EMMA is a linear programme with approximately 2 million non-zero elements and is routinely solved using CPLEX 22.1, a software package designed to solve linear and quadratic programming problems, including integer programming.
SOPS [
45] is a software and information complex of the General Energy Institute of the NAS of Ukraine. The main advantages of the complex are its capabilities:
- -
Modelling the power system as a complex hierarchical quasi-dynamic system;
- -
Modelling a multi-node integrated power system with integer variables;
- -
Minimising the weighted average cost of electricity generation and storage;
- -
Long-term technological upgrading of power system components.
The complex allows for optimal selection of power units and their operating modes, ensuring the generation and redistribution of energy in accordance with the consumption schedule. The authors emphasise the versatility of SOPS, which makes it possible to explore various models of power system optimisation in a short time. The user can develop, edit, save and debug optimisation models. The connection of input data, sets, parameters, constants and variables used in the model is organised in a convenient way. You can edit parameters and input data and then run the model. The modelling results can be displayed on the model worksheets or output as separate files. Another advantage of the software and information complex [
45] is the ability to conveniently compare many models, since each of the worksheets can contain its own model.
Enertile [
46] is an energy system optimisation model developed at the Fraunhofer Institute for Systems and Innovation Research (Fraunhofer ISI). The model focuses on the electricity sector. Enertile is primarily used for long-term scenario studies and is specifically designed to capture the challenges and opportunities associated with the growing share of renewable energy sources. The main advantage of the model is its high technical and temporal granularity, which allows for a detailed analysis of power systems, taking into account the dynamics of technology development and RES integration.
ENTIGRIS [
47] is an electricity market model designed to analyse energy systems from an economic perspective. It takes into account various factors, including market participant configurations and economic indicators, to provide insights into the development of energy systems.
TIMES [
48,
49] (acronym for The Integrated MARKAL-EFOM System) is an economic model generator for local, national, multi-regional or global energy systems that provides a technologically rich representation of the dynamics of energy processes over a multi-period time horizon. The model is typically used to analyse the entire energy sector but can also be used to study specific sectors such as electricity or district heating. TIMES is used to generate energy development scenarios taking into account economic, technological and resource constraints.
TIMES-Ukraine [
50] is a linear optimisation model of the energy system that belongs to the MARKAL/TIMES class of models. It provides a multi-technology (bottom-up) representation of the energy system to assess energy dynamics in the long term. The Ukrainian energy system is divided into seven sectors in the model. The structure of the TIMES-Ukraine model follows the methodological approach of the State Statistics Service of Ukraine, which is aligned with the Eurostat and IEA methodology for energy statistics. More than 1.6 thousand technologies are represented in the model.
GENESYS (GENeration Evaluation SYStem) [
51] is a model developed to assess the adequacy of energy supply in the Pacific Northwest region under uncertain future conditions. GENESYS is a constrained economic dispatch model that uses the Monte Carlo method to assess the effects of uncertainty. GENESYS is used for analysing the adequacy of energy supply, modelling the impact of uncertainties in generation and demand.
ARTEMiS [
52] is designed for real-time modelling of power systems and power electronics. ARTEMiS provides advanced solvers and algorithms to ensure reliable, accurate computations with fixed time steps, which is essential for real-time simulations. In addition, ARTEMiS uses an advanced decomposition method that does not introduce artificial delays, which is important when modelling large-scale power grids.
LIBEMOD [
53] is a multi-country energy equilibrium model based on a set of competitive markets for eight energy commodities: electricity, natural gas, oil, coking coal, lignite, steam coal, biofuels and biomass. All energy commodities are produced and consumed in each of the model countries: EU27, Iceland, Norway and Switzerland.
PowerGAMA [
54] optimises generation dispatch, i.e., the power produced by all generators in the power system, based on marginal costs for each time step over a certain period, e.g., one year. It takes into account the variable power available for solar, hydro and wind generators, as well as demand variability.
The problems of existing methods are as follows:
Most of the models are designed to assess and forecast the economic and technological parameters of the elements of the IPS and are aimed at solving the problems of minimising the current and total costs of operation. At the same time, the issues of the sustainability of the IPS and ensuring the necessary volumes of energy consumption are considered secondary factors affecting the minimisation of these costs and the payback period of capital investments.
Other models focus on the problems of step-by-step (daily, hourly, minute-by-minute, etc.) functioning of the energy system or its elements in the presence of external influences.
Thus, the problem of determining the generalised parameters of the IPS, which determine the necessary stability of the system, which is determined by the values of the projected volumes of consumption and, therefore, generation, fades into the background.
This paper proposes a methodology and a mathematical model for determining the necessary and sufficient powers in integrated power systems according to their forecast volumes of electricity production, which are aimed at solving specific problems that arise in the process of assessing the current state and quality of scenarios for the development of integrated energy systems.
The need to develop a comprehensive methodology and mathematical model is becoming an urgent task for the integrated power systems of Ukraine and the world.
Hypothesis 1. By calibrating the structural characteristics of historical load profiles with a scaling factor, it is possible to predict the required zonal powers with minimal error. This approach leverages persistent features observed in historical data to enable robust future forecasts.
5. Conclusions
Electricity is an essential, technologically advanced commodity. Activities in the electricity sector provide high added value. Power generation equipment is high-tech and expensive. The residual life of the equipment of the power system, which is comparable to the Ukrainian one, is estimated to be around USD 80–100 billion. In the context of the post-war restoration of the power system of Ukraine, the required investment will increase at least threefold. Under such conditions, the requirements for the accuracy of the models used to forecast the future structure of the power system’s generating powers will be significantly increased. While in peacetime the permissible error in determining zonal powers could be up to 5%, in the context of the power system restoration, such an error could result in USD 12–15 billion of additional investments. This would be extremely undesirable in the post-war period.
Therefore, over the course of this study, a methodology and a mathematical model were developed and synthesised to determine the necessary and sufficient zonal powers that ensure predetermined volumes of electricity production with an accuracy limited only by the accuracy of the initial information.
Two methods were used to achieve these results within the same model. In the first stage, the least squares method (averaging) was used to identify the reference ELG from the set of base ELGδi, . The detected ELGie has the lowest errors in power and generation modules compared to other base ELGδi. In this way, the errors of the initial information are filtered (minimised).
In the second stage of calculations using this model, the calibration method was used. This new method virtually identically transfers the indicators from the accuracy of the reference ELG
ie to the ELG
xmax of the daily maximum electricity generation at its annual production
Wx in year
x. It is the organic combination of these methods within one model that ensures the obtaining of zonal power indicators and, as a result, the generation of the daily ELG
xmax with an accuracy limited only by the errors of the initial information of the retrospective daily base ELG
δi,
. The application of the developed model to the real data of the Ukrainian power system (
Table 1 and
Table 2) showed that it forms decisions on zonal power
Pxl and generation
Wxl with high accuracy, namely, the error of zonal power by module does not exceed 1.02% and generation −0.39% (
Table 5 and
Table 6).
In the future, to solve the problem of optimising the structure of power system generating powers for a given period with a given electricity production, it is necessary to use this model to determine the necessary and sufficient zonal powers and then to optimise the structure of its generation for each zone using universal software systems.
While our model demonstrates very low forecasting errors, it relies on the assumption that historical load patterns will persist into the future. Changes due to increased renewable penetration, evolving economic conditions or other unforeseen factors may affect its accuracy. Future work should explore integrating external variables such as meteorological or policy factors and incorporate more dynamic, stochastic models to further enhance forecast robustness and scalability across different integrated power systems.