Usage of the Pareto Fronts as a Tool to Select Data in the Forecasting Process—A Short-Term Electric Energy Demand Forecasting Case

: Transmission, distribution, and micro-grid system operators are struggling with the in-creasing number of renewables and the changing nature of energy demand. This necessitates the use of prognostic methods based on ever shorter time series. This study depicted an attempt to develop an appropriate method by introducing a novel forecasting model based on the idea to use the Pareto fronts as a tool to select data in the forecasting process. The proposed model was implemented to forecast short-term electric energy demand in Poland using historical hourly demand values from Polish TSO. The study rather intended on implementing the range of different approaches—scenarios of Pareto fronts usage than on a complex evaluation of the obtained results. However, performance of proposed models was compared with a few benchmark forecasting models, including naïve approach, SARIMAX, kNN, and regression. For two scenarios, it has outperformed all other models by minimum 7.7%.


Introduction
After decades of a rather stable situation in the electric power engineering sector, almost every year brings new challenges. This sector faces dynamically growing intermittent sources of production of electric energy (wind, PV) [1], changes in energy storages [2], changes in electricity demand (e-mobility, standard of living, social influence) [3,4], changes in existing electricity production methods (pollution, climate) [5]. All of them cause the forecasting to be more complicated and more unreliable.
The obvious point is that accurate forecasting is an important part of any successful planning process. In the electric energy sector, transmission system operators (TSOs), distribution system operators (DSOs), commercial operators (COs), or commercial and technical operators (CTOs) perform numerous different forecasts to plan their activities in an optimal way.
The high accuracy forecasts enable, particularly in the short-term, the TSOs to provide more secure power system operation, i.e., to balance demand with production and minimalize its costs. The impact of forecasts' accuracy is multilevel: it influences not only the security, reliability of the power system, and cost of electrical energy production, but also procurement of both electrical energy and frequency reserves. Moreover, the accuracy influences the comfort of dispatchers' everyday work, which is due to operation on more reliable sets of data about the foreseen situations in the power system for the following, future time steps, used by them in the security management of power systems. In the long run, forecasts' accuracy will influence the possibility of green energy inclusion in energy mix and proper management of energy storage which can give benefits both for customer and energy provider [6]. tool due to the combination of its simplicity and accuracy. It has been provided in this article, as the kNN algorithm had been chosen as one of the benchmark models. In the kNN models k represents the model's parameter, i.e., the number of data (facts) selected from the training data set, which are the k closest ones in terms of the explanatory variables (e.g., calculated as the Euclidean distance) to the forecast one. The forecast value is calculated based on the selected k nearest training data values, e.g., as their arithmetic average (as in the Figure 1). Hence, in this approach, the number of similar data is the main factor determining the chosen data from the training data set (set items are used to make the forecast) [30]. The authors' aim was to challenge kNN's approach and verify a similar concept. The new model proposed in this article assumed that the process of choosing similar data, from the training data set, was based on their belonging to the Pareto fronts rather than on the predefined number of nearest data points.
The general idea of using Pareto fronts for forecasting is presented in Figure 2.
Construction of a set of facts from historical data Choosing of a time quantum for which the forecast is to be determined and generating a subset of the values of explanatory variables Selection, according to the assumptions, a subset of facts using Pareto fronts, taking into account the explanatory data of the forecasted quantum Calculation of forecast based on a subset of facts using the assumed method  The authors' aim was to challenge kNN's approach and verify a similar concept. The new model proposed in this article assumed that the process of choosing similar data, from the training data set, was based on their belonging to the Pareto fronts rather than on the predefined number of nearest data points.
The general idea of using Pareto fronts for forecasting is presented in Figure 2.
tool due to the combination of its simplicity and accuracy. It has been provided in this article, as the kNN algorithm had been chosen as one of the benchmark models. In the kNN models k represents the model's parameter, i.e., the number of data (facts) selected from the training data set, which are the k closest ones in terms of the explanatory varia bles (e.g., calculated as the Euclidean distance) to the forecast one. The forecast value is calculated based on the selected k nearest training data values, e.g., as their arithmetic average (as in the Figure 1). Hence, in this approach, the number of similar data is the main factor determining the chosen data from the training data set (set items are used to make the forecast) [30]. The authors' aim was to challenge kNN's approach and verify a similar concept. The new model proposed in this article assumed that the process of choosing similar data from the training data set, was based on their belonging to the Pareto fronts rather than on the predefined number of nearest data points.
The general idea of using Pareto fronts for forecasting is presented in Figure 2.
Construction of a set of facts from historical data Choosing of a time quantum for which the forecast is to be determined and generating a subset of the values of explanatory variables Selection, according to the assumptions, a subset of facts using Pareto fronts, taking into account the explanatory data of the forecasted quantum Calculation of forecast based on a subset of facts using the assumed method   The entire process of forecasting begins with the construction of a set of facts from historical data. A single fact (as it is in the case of neural networks) is understood as a set consisting of a subset of the values of the explanatory variables and the corresponding one value of the dependent variable. Facts are constructed from historical data. The method of including the explanatory data in the subset depends on the adopted data scenario on which the forecast is to be based.
Then the time quantum for which the forecast is to be determined should be chosen. For this quantum, the values of the explanatory variables should be determined (in the same way as a subset of the explanatory variables for individual fact).
The key step is to select, according to the assumed scenario, a subset of facts using the Pareto front. Central point location for determining of Pareto fronts is stated by coordinates-values of the explanatory variables of the predicted quantum.
The last step is to determine the appropriate forecast using the assumed method and using the subset of facts selected in the previous step. The methods/scenarios for selecting Pareto fronts and the forecast calculation are described in following sections.

Theory and Examples of Pareto Fronts
In optimization the Pareto front represents the set of nondominated solutions being chosen as optimal if no objective can be improved without sacrificing at least one other objective [31,32].
Nondominated solution is defined in [33] as "A point, x * ∈ X, is Pareto optimal if there does not exist another point, x ∈ X, such that F (x) ≤ F (x * ), and Fi (x) < Fi (x * ) for at least one function." In the authors' approach to forecasting and choosing nearest neighbors, Pareto front represents training data points that are nondominated in sense of distance (in many dimensions) to testing point. Each explanatory variable creates one space dimension.
In other words, in case of problem with two explanatory variables (two dimensions ∆X,∆Y) and when sought values for Pareto front are minimalized, as in the Figure 3, the points on the Pareto front (represented by the red dots in the Figure 3) are:

•
Those elements whose values are the closest to the zero value in dimension ∆X or ∆Y; • Those elements which with one another have one value closer to zero in one dimension than other elements and one value further than other elements in the second dimension.
Mentioned above two-dimensional elements have their values calculated: • For ∆X dimension as differences between testing data value representing value connected with first explanatory variable X and relevant corresponding training example' value of this variable; • For ∆Y dimension as differences between testing data value representing value connected with the second explanatory variable Y and relevant corresponding training example value of this variable.
As a result there is no unequivocal methodology to compare the points on Pareto front to one another in situations when parameters represented by the two dimensions (∆X,∆Y) are so different from each other (for example price and color). The testing data represents data for which the forecast value is calculated.  In the proposed model, as compared to kNN algorithm, there is no parameter directly defining the number of training data points belonging to the Pareto fronts. In consequence, the selection of the data for the forecast computation is, while still structured, randomized. This can be viewed as an advantage, as the users are not required to determine the optimal number of 'k' neighbors themselves, and a disadvantage considering the lack of control over the data selection process. However, the proposed model contains a new, different parameter defining the number of Pareto fronts, from which training data are used to set forecast values. An example of three Pareto fronts is presented in Figure 4.

Implementation Options
There are many methods of applying proposed Pareto fronts model to select data from training data set and to use them further in the forecasting process. Those taken into account by authors, implemented and verified by them are described and presented below.
Scenario options and their combination on which Pareto fronts as model has been verified as described in Section 3: Application and Results.
Option 1: The set of training data contains all available training data (there are no limitations added, all historic data from the set are treated as the training data). In the proposed model, as compared to kNN algorithm, there is no parameter directly defining the number of training data points belonging to the Pareto fronts. In consequence, the selection of the data for the forecast computation is, while still structured, randomized. This can be viewed as an advantage, as the users are not required to determine the optimal number of 'k' neighbors themselves, and a disadvantage considering the lack of control over the data selection process. However, the proposed model contains a new, different parameter defining the number of Pareto fronts, from which training data are used to set forecast values. An example of three Pareto fronts is presented in Figure 4.  In the proposed model, as compared to kNN algorithm, there is no parameter directly defining the number of training data points belonging to the Pareto fronts. In consequence, the selection of the data for the forecast computation is, while still structured, randomized. This can be viewed as an advantage, as the users are not required to determine the optimal number of 'k' neighbors themselves, and a disadvantage considering the lack of control over the data selection process. However, the proposed model contains a new, different parameter defining the number of Pareto fronts, from which training data are used to set forecast values. An example of three Pareto fronts is presented in Figure 4.

Implementation Options
There are many methods of applying proposed Pareto fronts model to select data from training data set and to use them further in the forecasting process. Those taken into account by authors, implemented and verified by them are described and presented below.
Scenario options and their combination on which Pareto fronts as model has been verified as described in Section 3: Application and Results.
Option 1: The set of training data contains all available training data (there are no limitations added, all historic data from the set are treated as the training data).

Implementation Options
There are many methods of applying proposed Pareto fronts model to select data from training data set and to use them further in the forecasting process. Those taken into account by authors, implemented and verified by them are described and presented below.
Scenario options and their combination on which Pareto fronts as model has been verified as described in Section 3: Application and Results. Option 1: The set of training data contains all available training data (there are no limitations added, all historic data from the set are treated as the training data).
Option 2: The set of training data is limited exclusively to data representing historical hours equal to the forecast hour.
Option 3: The forecast value is calculated as the arithmetic average of testing data belonging to Pareto front/s. Option 4: The forecast value is obtained as the result of the linear regression calculated on testing data belonging to Pareto front/s. Option 5: Explanatory variables are: (a) Historical hourly demand values for different historic hours (for example training data is characterized by two explanatory variables, i.e., on ∆X dimension by the demand in the previous hour (h-1) and on ∆Y dimension by the demand on the same hour but on the previous day (h-24)). (b) Days of the week (values are in range from −3 to 3, i.e., value equals 0 when the forecast hour comes from the same day of the week as the historic hour, for example, both are Wednesdays). (c) Hours (values are in range from −11 to 12, i.e., value equals 0 when the forecast hour is the same as the historic hour from historic day, for example, both are 10 a.m.). (d) Meteorological data (historical temperature demand values, for example, when training data is characterized by three explanatory variables and on ∆Z dimension is by the temperature in the previous hour (h-1)).
Option 6: There are differences in the size of set of training data. All presented above scenario options have been verified and are presented in Section 3: Application and Results with necessary examples and results provided.
Additionally, to compare achieved results from Pareto front model the following benchmark approach, algorithm, and models have been implemented: • Naïve approach (used as benchmark in [34,35], • SARIMAX model (different variations of ARIMA model i.e., ARIMA, SARIMA and SARIMAX used to forecast demand has been described in [36][37][38], • K nearest neighbors algorithm, • Additional 'no data selection' model.
Above-mentioned approach, algorithm, and models have been characterized by the following sets of parameters: (1) Naïve approach in three variants: Forecast value for the hour equals previous hour's value (h-1); Forecast value for the hour equals the value for the same hour but the previous day (h-24); (c) Forecast value for the hour equals the arithmetic average of values from points a and b (h-1, h-24).
(2) SARIMAX model with following values verified and giving the best results for the analyzed sets of data: (a) Trend parameters: • autoregression order (p) equals 2; • difference order (d) equals 1; • moving average order (q) equals 2.
(3) K nearest neighbors algorithm with the k parameter's value is from set (1, 7).  (4) 'No data selection' model in which forecast value is calculated equally to the corresponding Pareto front scenarios mentioned above, but the forecast calculation involves all available training data values without any kind of selection.
Scenario options for Pareto front model verification and benchmark approach, algorithm and models described above have been implemented and described both in twoand three-dimensional spaces (in Section 3: Application and Results; Sections 3.1 and 3.2, respectively). Each verification case contains set of results and detailed information about forecast period, range of available historic data, methodology of forecast value calculations.
For cases in two-dimensional spaces, Pareto fronts have been obtained in all quadrants individually.
The example of the first Pareto front visualization is presented in Figure 5 and of the first, the second, and the third Pareto fronts in Figure 6, respectively. Scenario options for Pareto front model verification and benchmark approach, algorithm and models described above have been implemented and described both in twoand three-dimensional spaces (in Section 3: Application and Results; sections 3.1 and 3.2, respectively). Each verification case contains set of results and detailed information about forecast period, range of available historic data, methodology of forecast value calculations.
For cases in two-dimensional spaces, Pareto fronts have been obtained in all quadrants individually.
The example of the first Pareto front visualization is presented in Figure 5 and of the first, the second, and the third Pareto fronts in Figure 6, respectively.    Scenario options for Pareto front model verification and benchmark approach, algorithm and models described above have been implemented and described both in twoand three-dimensional spaces (in Section 3: Application and Results; sections 3.1 and 3.2, respectively). Each verification case contains set of results and detailed information about forecast period, range of available historic data, methodology of forecast value calculations.
For cases in two-dimensional spaces, Pareto fronts have been obtained in all quadrants individually.
The example of the first Pareto front visualization is presented in Figure 5 and of the first, the second, and the third Pareto fronts in Figure 6, respectively.    The values of the elements in basic verification cases represents differences between the pairs of historical hourly demand values from h-1 and h-24 as explanatory variables (as described in this Section's scenario Option 5a). For example, when the demand for 6 p.m. on 1.06.2018 is forecast, the points in two-dimensional space are characterized by the two following values:

•
The first one as the parameter representing ∆X dimension equals the difference between the value of demand for one hour before the forecast one which is for 5 p. Based on information from this Section's scenario Options 3 and 4, forecast value might be calculated in different ways. Illustration of one of the approaches, which is when the forecast value is calculated based on linear regression modeled based on values located on the Pareto fronts, is presented in Figure 7. The presented orange rectangle represents the plane that most closely fits values (green dots) of the points on Pareto front (here the red dots) used to calculate forecast value. The green rhomb marks the point on the rectangle surface, when values of parameters ∆X and ∆Y equal zero, representing the forecast value.
The values of the elements in basic verification cases represents differences bet the pairs of historical hourly demand values from h-1 and h-24 as explanatory vari (as described in this Section's scenario Option 5a). For example, when the demand p.m. on 1.06.2018 is forecast, the points in two-dimensional space are characterized b two following values: • The first one as the parameter representing ∆X dimension equals the differenc tween the value of demand for one hour before the forecast one which is Based on information from this Section's scenario Options 3 and 4, forecast v might be calculated in different ways. Illustration of one of the approaches, which is w the forecast value is calculated based on linear regression modeled based on valu cated on the Pareto fronts, is presented in Figure 7. The presented orange rectangle r sents the plane that most closely fits values (green dots) of the points on Pareto front the red dots) used to calculate forecast value. The green rhomb marks the point o rectangle surface, when values of parameters ∆X and ∆Y equal zero, representing the cast value.

Data
The proposed model was implemented to forecast short-term electric energy dem in Poland using historical hourly demand values from Polish TSO. Polish TSO publ historical hourly demand values for Poland on its website https://www.pse.pl/obs dzialalnosci/krajowy-system-elektroenergetyczny/zapotrzebowanie-kse (accessed September 2018). The exact forecasting task was to predict next hour demand ha available data of hourly demand for previous hours and some additional explanatory iables-depending on case considered. For each case, the data was specially collecte all methods compared. The results depicted in this paper are only a small, more inte ing part of numerous experiments.
The set of weather data was provided to authors directly by employees of the I disciplinary Centre for Mathematical and Computational Modelling at the Univers Warsaw.

Data
The proposed model was implemented to forecast short-term electric energy demand in Poland using historical hourly demand values from Polish TSO. Polish TSO publishes historical hourly demand values for Poland on its website https://www.pse.pl/obszarydzialalnosci/krajowy-system-elektroenergetyczny/zapotrzebowanie-kse (accessed on 5 September 2018). The exact forecasting task was to predict next hour demand having available data of hourly demand for previous hours and some additional explanatory variables-depending on case considered. For each case, the data was specially collected for all methods compared. The results depicted in this paper are only a small, more interesting part of numerous experiments. The set of weather data was provided to authors directly by employees of the Interdisciplinary Centre for Mathematical and Computational Modelling at the University of Warsaw.

Results
In this Section, as mentioned in Section 1: Introduction, authors present the range of different approaches to Pareto fronts usage in forecasting rather than the complex evaluation of the obtained results. However, to present and compare different scenarios' results (verification cases) obtained from Pareto front model and benchmark approach, model, and algorithm within the same verification case and between different cases the mean absolute percentage error (MAPE) was calculated and used: where: A t -The actual hourly value F t -The forecast hourly value n-The total number of the forecast values The percentage of the error for every forecast horizon is averaged resulting in the MAPE value for the Pareto front and benchmark approach, model, and algorithm. The mean absolute percentage error is considered as standard measurement, for electric forecasting, in the process of verifying the accuracy of the models [36].

Results for the Two-Dimensional Spaces
Results for the first verification case are given in Tables 1 and 2. The first verification case assumptions (in line with Section 2 scenario Options 2, 3, and 5a):  Results for the second verification case are given in Tables 3 and 4  Results for the third verification case are given in Tables 5 and 6. The third verification case assumptions (in line with Section 2 scenario Options 2, 4, 5a, and no limitations in case of Option 6), with longer forecast period than in the previous cases:   Results for the first verification case are given in Tables 9 and 10  Results for the sixth verification case are given in Tables 11 and 12. The sixth verification case assumptions (in line with Section 2 scenario Options 1, 4, combination of Options 5a and 5c, and no limitations in case of Option 6):  Results for six verification cases described above are illustrated in Figures 8 and 9. In the case of proposed approach (Figure 8) best results were obtained for second and third cases. Other cases gave inferior forecast quality. For mentioned best cases it can be pointed out that MAPE is maximal for one Pareto front, having minimum for two Pareto fronts, and goes little up for three Pareto fronts. This is probably due to the fact that points from the third Pareto front are different from the forecasted point than those from first and second Pareto fronts. For kNN method (Figure 9) a similar phenomenon is visible. For the best three cases (first, third, and sixth), best results are obtained for two neighbors. Obviously, this can be called only a "similar phenomenon" because operation of these two methods is different.

h-1 h-24 h-1 h-24
(2,1,2) (1,1,0,24) k = 1 k = 2 k = 3 k = 4 k = 5 k = 6 k = 7 Results for six verification cases described above are illustrated in Figures 8 and 9. In the case of proposed approach (Figure 8) best results were obtained for second and third cases. Other cases gave inferior forecast quality. For mentioned best cases it can be pointed out that MAPE is maximal for one Pareto front, having minimum for two Pareto fronts, and goes little up for three Pareto fronts. This is probably due to the fact that points from the third Pareto front are different from the forecasted point than those from first and second Pareto fronts. For kNN method (Figure 9) a similar phenomenon is visible. For the best three cases (first, third, and sixth), best results are obtained for two neighbors. Obviously, this can be called only a "similar phenomenon" because operation of these two methods is different.   Table 2, Table 4, Table 8, Table 10,  and Table 12). Additionally, the impact of the data history length on the forecast accuracy has been under investigation with the results as in Table 13 (in line with Section 2 scenario Options  1  In case of calculation involving three explanatory variables (presented in Section 3.2) the historical hourly demand values from 2015, not from 2018, were used due to having more available adequate meteorological data for that year. Therefore, one extra simplified simulation was run to obtain results for Pareto front model based on two explanatory variables from 2015 (Table 14) required to have a benchmark for the three-dimensional one.
The benchmark simplified 2015 verification case assumptions (in line with Section 2scenario Options 1, 4, 5a and no limitations in case of Option 6.   (Table 1, Table 3, Table 7, Table 9, and Table 11).
Additionally, the impact of the data history length on the forecast accuracy has been under investigation with the results as in Table 13 (in line with Section 2 scenario Options  1  In case of calculation involving three explanatory variables (presented in Section 3.2) the historical hourly demand values from 2015, not from 2018, were used due to having more available adequate meteorological data for that year. Therefore, one extra simplified simulation was run to obtain results for Pareto front model based on two explanatory variables from 2015 (Table 14)

Results for the Three-Dimensional Spaces
Results for the three-dimensional verification case are given in Tables 15 and 16

Two-and Three-Dimensional Spaces' Quadrants and Octants Analysis
All Pareto fronts results presented in Sections 3.1 and 3.2 were calculated based on all nondominated solutions situated in all four quadrants, for two explanatory variables in Section 3.1 and all eight octants in Section 3.2 for three explanatory variables. The focus of further analysis was on investigating results in individual quadrants and octants and their potential impact on the final forecast value. The following two options were considered and verified in terms of searching possible enhancement of Pareto front model: (4) Pareto fronts model uses historical demand values from h-1 and h-24 (for four quadrants analysis) and additional temperature from h-1 as explanatory variables (for eight octants analysis); (5) Forecast value is calculated based on linear regression modeled based on values located on the Pareto fronts.
A meticulously performed analysis, for both options described above, exhibit instability and unpredictability in terms of the number of historic data points appearing in each quadrant and octant. This instability and unpredictability resulted in the occurrence of hours for which the number of the data points on Pareto fronts in one or more quadrants/octants were not sufficient to use linear regression to calculate the forecast value; in an extreme example, that number equaled zero. When the number of data points was not sufficient to use linear regression but was still different from zero, that issue could be solved by using an alternative method to calculate the forecast for those hours, for instance the mean average. However, there were hours in which the number of training examples in a quadrant/octant equaled zero. In consequence there was no method allowing the forecast value calculation to apply, which could solve this issue. The simulations showed that the issue with the insufficient number of selected training examples was not connected with any particular quadrant nor octant. The insufficient number of selected data points occurred occasionally in each quadrant/octant depending on the data set. In consequence, the first option of enhancement proposed in this subsection cannot be treated as a potential possibility to achieve that, even though the analyzed cases proved that there was potential in this approach due to hours in which forecast values, calculated based on results from one of the spaces instead of from all of them, are more accurate.
As far as the second option of enhancement was concerned, the abovementioned instability and unpredictability foreclosed constructing an enhancing universal adjustment to the proposed Pareto front model. Moreover, the same obstacles, as described for the first option, apply to that one. For the second option, verified forecast calculation as mean average value from forecast values calculated for each quadrant/octant (for four and eight spaces) and tried to find equation coefficients for the following equation (for quadrants): where: a-the first quadrant forecast value's coefficient b-the second quadrant forecast value' coefficient c-the third quadrant forecast value's coefficient d-the fourth quadrant forecast value' coefficient f 1 -forecast value obtained based on the Pareto front in the first quadrant f 2 -forecast value obtained based on the Pareto front in the second quadrant f 3 -forecast value obtained based on the Pareto front in the third quadrant f 4 -forecast value obtained based on the Pareto front in the fourth quadrant f 5 -final forecast value Both attempts to did not enhance the Pareto front model, therefore further work concerning them was not continued.

Discussion
The authors presented the new forecasting model based on the idea to use Pareto fronts as a tool to select data in the forecasting process with its various variants. The proposed model was implemented to forecast short-term electric energy demand in Poland.
The publication aimed to present the range of different approaches to Pareto fronts usage in forecasting, thus the descriptions of eight basic verification cases and two additional developed approaches were provided in the text. Naïve approach, SARIMAX model, and kNN algorithm, and 'no data selection' model were implemented as benchmarks to the proposed Pareto front model. As far as two-dimensional spaces, i.e., with two explanatory variables concerned, the first verification case focused on using the set of training data that was limited exclusively to data representing historical hours equal to the forecast hour. Simple forecast value calculation as the arithmetic average of values on Pareto fronts did not deliver accurate results, comparing not only to SARIMAX but also with kNN and naïve approach's results. However, the change in forecast value calculation, like in the second and third cases, from obtaining arithmetic average to linear regression modelling based on values located on the Pareto fronts increased results accuracy. This additional training data preselection resulted, in the second verification case, in the smallest mean absolute percentage error not only among all benchmark models' results for that specific case but also as compared to results obtained in all other scenarios. Therefore, the second case's assumptions are the most promising for further investigation. On the other hand, the difference between MAPE value for Pareto front with two fronts and 'no data selection' models equaled 0.031% (MAPE), but it also means improvement by 4.3% in forecast quality. In this case, training data selection provided by Pareto fronts improved the accuracy of the obtained results, but not significantly.
The third case results confirmed those obtained in the second verification case presenting the Pareto front method advantage again. The difference between MAPE value for Pareto front with two fronts and 'no data selection' models equaled 0.059% (MAPE), but it also means stronger improvement than in previous case-by 7.7% in forecast quality. However, that case showed also that the set of training data containing data representing longer history than in the second case, may negatively influence the accuracy, e.g., by 0.016% in MAPE for two fronts.
Further investigation will focus both on the Pareto front model development and identifying this method's advantages over 'no data selection'.
Further verification cases, i.e., from number four to six, showed that lack of the additional training data preselection and, using the difference between days of the week or hours as explanatory variables, decreased the Pareto fronts models' forecast accuracy.
For the case aimed to verify results for three-dimensional spaces, i.e., with three explanatory variables both historical demand and temperature values, adding the third dimension improved the proposed method's forecast accuracy by 0.09% in MAPE for the best one front calculations. This slight improvement causes that three-and moredimensional cases will be implemented and verified in further investigation.

Conclusions
In all analyzed cases so far, all Pareto fronts results were calculated based on all nondominated solutions situated in all four quadrants/eight octants.
Additionally, the authors proposed and examined the Pareto front model to obtain forecasts based on nondominated solutions situated in each quadrant/octant individually or on nondominated solutions situated in each of the quadrants/octants (four or eight) in order to calculate the final forecast demand value. Even though those approaches seemed intriguing, they both had a disadvantage of instability and unpredictability in terms of the number of historic data appearing in each quadrant and octant. The extreme examples when the number of data equaled zero were the reason why those ideas were not developed and are not planned to be in the future.
The new forecasting model based on the idea to use the Pareto fronts as a tool to select data in the forecasting process was presented and its various variants were analyzed. Twodimensional cases assuming additional training data preselection and forecast obtained from linear regression modelling based on values located on the Pareto fronts provided promising results' accuracy. For two investigated cases (second and third) proposed model gave best results among all benchmark methods. The forecast quality was better by 4.3% and 7.7%, respectively, comparing to next best method.
As stated, the authors find the presented idea to be very promising. In this article they have presented more interesting results than they have obtained so far. These results give rise to many subsequent questions which the authors intend to analyze. As the most interesting, the authors indicate the following issues: - Tests of the approach for other time series (e.g., wind energy forecast), -Development of other methods of final forecast calculation from Pareto fronts sets, -Tests of the approach in more than three-dimensional spaces, -Development of a new approach for more than three-dimensional spaces, -Development of hybrid and ensemble approaches. Funding: This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors. The research was carried out as part of the statutory activity.