Modeling the Photovoltaic Power Generation in Poland in the Light of PEP2040: An Application of Multiple Regression

Rybak, Aurelia; Rybak, Aleksandra; Kolev, Spas D.

doi:10.3390/en16227476


by Aurelia Rybak ¹,*, Aleksandra Rybak ² and Spas D. Kolev ³

¹ Department of Electrical Engineering and Automation in Industry, Faculty of Mining, Safety Engineering and Industrial Automation, Silesian University of Technology, 44-100 Gliwice, Poland

² Department of Physical Chemistry and Technology of Polymers, Faculty of Chemistry, Silesian University of Technology, 44-100 Gliwice, Poland

³ School of Chemistry, The University of Melbourne, Parkville, VIC 3010, Australia

* Author to whom correspondence should be addressed.

Energies 2023, 16(22), 7476; https://doi.org/10.3390/en16227476

Submission received: 7 October 2023 / Revised: 29 October 2023 / Accepted: 31 October 2023 / Published: 7 November 2023

(This article belongs to the Special Issue Demand-Side Management and the Sustainable Energy Transition)


Abstract

This paper presents the results of research on the development of photovoltaic systems in Poland. The authors’ goal was to identify factors that can potentially shape the dynamics of solar energy development in Poland and that will affect the implementation of the PEP2040 goals. The authors also wanted to find a forecasting method that would enable the introduction of many explanatory variables (a set of identified factors) into the model. After an initial review of the literature, the ARMAX and MLR models were considered. Finally, taking into account MAPE errors, multiple regression was used for the analysis, the error of which was 0.87% (compared with a minimum of 3% for the ARMAX model). The model was verified based on the Doornik–Hansen, Breusch–Pagan, and Dickey–Fuller tests, information criteria, and ex post errors. The model indicated that LCOE, CO₂ emissions, Cu consumption, primary energy consumption, patents, GDP, and installed capacity should be considered statistically significant. The model also allowed us to determine the nature of the variables. Additionally, the authors wrote the WEKR 2.0 program, which made it possible to determine the amount of critical raw materials needed to build the planned PV energy generating capacity. Solar energy in Poland currently covers about 5% of the country’s electricity demand. The pace of development of photovoltaic installations has exceeded current expectations and the forecasts included in the Polish Energy Policy until 2040 (PEP2040). The built model showed that if the explanatory variables introduced into the model continue to follow the same trends, a dynamic increase in photovoltaic energy production should be expected by 2025. The model indicates that the PEP2040 goal of increasing the installed capacity to 16 GW by 2040 could be achieved as early as 2025, when the PV production volume could reach 8921 GWh. Models were also built taking into account individual critical raw materials such as Cu, Si, Ge, and Ga. Each of them showed statistical significance, which means that access to critical raw materials in the future will have a significant impact on the further development of photovoltaic installations.

Keywords:

PV power generation; forecast; multiple regression; energy transition

Graphical Abstract

1. Introduction

Energy security is currently one of the most important factors that directly shapes other dimensions of state security, including, above all, the military security of EU countries, including Poland. In light of the events in Ukraine, it has become clear that excessive dependence on fossil raw materials, and above all reliance on raw materials from Russia, may lead to an energy crisis in a short time. This, in turn, may lead to an economic crisis, and a long-term economic crisis may directly disturb the military security of EU countries. For many years, the European Union has been taking steps to make member countries independent of fossil fuels by replacing them with renewable energy sources. Depending on the country, its history, and climatic conditions, the share of renewable energy sources in primary energy consumption in EU countries ranges from 6% for the Czech Republic to 28% for Finland [1]; in Poland, it is about 9%. There are many factors that can influence the further development of renewable energy sources in EU countries. They also differ depending on which renewable energy source is taken into account. Energy security will, therefore, be shaped by many factors; one of them being access to critical raw materials. In the case of PV, raw materials are mainly Cu, Ga, Ge and Si. However, when thinking about solar energy, the rare elements necessary to produce energy storage systems should also be mentioned. REE (rare earth metals), which are also key to the development of wind turbines, will be particularly important in this case. REEs are mined mainly in China, and the rapidly growing global demand for these metals may result in their sources being depleted. The solution may be alternative sources of obtaining REE, such as fly ash generated during coal combustion [2].

In the presented research, the authors aimed to identify factors that could potentially shape the dynamics of solar energy development in Poland and that will affect the implementation of the goals of Poland’s Energy Policy until 2040 (PEP2040). The factors taken into account were assigned to several categories, namely, economic, ecological, energy, and raw materials factors. The factors adopted at the initial stage of the research were verified in terms of their actual impact on the explained variable. The dependent variable in the research was solar power generation. Explanatory variables that turned out to be statistically insignificant were rejected, because such variables had no direct impact on the volume of electricity generation. After eliminating these variables, a long-term forecast of electricity production was made. Using this methodology, the authors wanted to identify the factors that will have the greatest impact on the future development of photovoltaic power (PV) in Poland. Additionally, it was determined whether a given factor has a positive or negative impact on this development. This made it possible to identify those variables that should receive special attention when planning and implementing Poland’s energy policy in the coming years. The methodology of the presented research is as follows:

Development of a preliminary set of factors shaping PV power generation in Poland;
Obtaining statistical data for analysis;
Determining the demand for critical raw materials using the WEKR 2.0 program;
Verification of the statistical significance of the factors selected for analysis using a multiple regression model;
Forecast of PV power generation until 2025.

2. Literature Review

Poland’s energy policy until 2040 assumes a 75% increase in solar power capacity by 2040 [3]. Solar energy is a type of renewable energy that has undergone continuous dynamic development since the 1980s. Therefore, many examples of PV power forecasting can be found in the literature. By time horizon, forecasts can be divided into very short-term forecasts, covering seconds to hours [4]; short-term forecasts, typically daily [5]; medium-term forecasts, covering weeks or months; and long-term forecasts, covering years [6]. PV power forecasting methods can also be divided into direct and indirect ones. Direct methods use historical PV power data, whereas indirect methods forecast PV power generation from weather data and historical data on solar irradiation [7].

To predict PV power, methods such as artificial neural networks (ANN) were used [8]. The ANN method has also been subjected to numerous modifications to optimize it. Short-term neural networks (STM), long short-term memory (LSTM) [9], support vector machine (SVM) [10], or fuzzy logic (FL) [11] have also been applied.

The Grey prediction model was used for long-term forecasts; its advantage is that it requires only a small amount of historical data [12,13]. Due to the nature of the solar energy consumption time series, a modified Grey model was also applied, which had a positive effect on the accuracy of the prediction [14]. Markov chains [15] as well as ARIMA [16], SARIMA [17], vector autoregression (VAR) [18], and support vector regression (SVR) [19] models were also used for PV power prediction.

In the literature, examples of the use of linear regression to forecast all aspects related to the production of PV power can also be found [5], including multiple regression [20]. Yang and Meng used 12 independent weather variables from the European Centre for Medium-Range Weather Forecasts [21].

In turn, the authors of this publication used the indirect method and built their own set of explanatory data, taking into account economic, ecological, and technological factors, as well as energy and raw materials factors. The forecast proposed by the authors constitutes a long-term forecast until 2025. The prediction and model verification methods used are described in the Methods section.

3. Advantages and Disadvantages of the Model Used to Forecast PV Power

The main advantage of ANN models is the self-learning ability of the neural network, thanks to which it can improve the forecasts it generates. Neural networks find solutions that are often not obvious to humans and are able to adapt to changing explanatory variables. Their main disadvantage is that they work like black boxes, i.e., it is not entirely clear why they gave a specific result. An additional problem arises in the case of unique and complex tasks, which require a lot of time and resources to complete. Moreover, training a neural network requires a large amount of data, which in the case analyzed in this article is not entirely possible due to the limited length of the PV power generation time series [22,23]. Despite the numerous advantages, significant differences have been observed between real data and the results obtained with ANN. The MAPE error in this case ranges from 5% to 8% [24].

SVM methods usually provide more accurate results than independent methods, but their disadvantage is the need to perform a large number of calculations related to repeated network training. With large amounts of data, model estimation may take a long time, and estimating the correct model requires some knowledge. Because a support vector classifier works by placing data points above and below the classification hyperplane, there is no probabilistic explanation for the classification [25,26]. The MAPE errors of the SVM models ranged from 5% for annual forecasts to 16% for daily forecasts [27].

Because FL models belong to the group of expert systems, they lead to imprecise data. Qualitative analysis based on fuzzy data enables assessments and summaries to be made in natural language, in a form understandable to an average user. FL models are suitable for solving problems where high accuracy is not needed, and there is no systematic method for designing these systems. The MAPE based on the fuzzy logic algorithm ranges from 13.87% to 20.22% for solar radiation forecasting [28].

The main advantages of grey models include the ease of calculations and short forecast preparation time. The main disadvantages include the failure to meet the conditions set for the residuals of the econometric model [29]. The MAPE error for the Grey model found in the literature ranged from 3% to 7% [30,31].

Markov models can be used when there are multiple causes of the phenomenon under study, as well as in the case of qualitative variables. Using Markov chains, short-, medium-, and long-term forecasts can be created [32]. The best forecasting results for PV power were obtained using a combination of the Markov model and the generalized fuzzy model. In this case, the MAPE error was approximately 7% [33].

In turn, ARMAX class models are popular due to the automation of the time series decomposition process. At the same time, they provide great flexibility in selecting the right model, which makes it possible to take into account variables that are important from the point of view of the analyzed process. Automation of the estimation process allows quick determination and comparison of many potential models [34]. ARMA was used for short- and medium-term forecasts at the microgrid level, and it has been noted that it works well for very short-term forecasting [35]. The MAPE error for the models made with this method was approximately 16% [36].

Inspection of statistical criteria showed that the lowest RMSE values were recorded when the MLR model included extraterrestrial radiation, the difference between the maximum and minimum daily temperature, and relative humidity as input variables [37]. One of the main advantages of multiple regression is that it can capture the complex and multifaceted nature of real-world phenomena. This gives a more accurate and detailed picture of the relationship between each specific factor and the outcome. The greatest advantage of linear regression models is linearity, because it simplifies the estimation procedure and linear equations are easy to understand at the modular level. Another advantage is the ability to identify outliers, i.e., anomalies, and the ability to determine the relative impact of one or more predictor variables on the criterion value. The disadvantages of using a multiple regression model usually lie in the data used. First, a problem may arise when incomplete data are used. Attention should also be paid to the assumptions and conditions of multiple regression, such as linearity, normality, and homoscedasticity. These should be verified using statistical tests. If these assumptions are violated, the results may be inaccurate or misleading.

The authors decided to choose multiple regression for their research, which is suitable for long-term forecasts. The simple and transparent structure of the model makes the interpretation of the obtained results much easier than, for example, in the case of neural networks. The MLR model allowed operating on relatively short time series, which would be impossible in the case of many of the other methods mentioned. Additionally, the calculations in this case are much faster than for models that undergo a learning process. The MLR model also meets the assumptions and requirements set for the residuals of econometric models. Appropriate statistical tests make it possible to verify the assumptions of normality, lack of autocorrelation, and homoscedasticity of residuals. The authors also assumed that the MLR model would be characterized by a high level of accuracy of the obtained forecasts. Table 1 presents a summary of the features of the models most commonly used to forecast PV power generation in the literature, together with the features that prevented each model from being used for the time series of PV power generation in Poland.

To summarize the above considerations, the aim of the research was to find a simple model, easy to use, and with a transparent structure, which would allow the identification of factors influencing the size of PV power generation in Poland in the long term (in years).

Before looking for a nonlinear model, it is best to first use a linear model such as ARMAX or MLR. MLR is most commonly used to examine the relationship between power output and climatic factors [38]. In turn, the authors took into account not climatic factors, but ecological, economic, raw material, and energy factors. At the same time, it was important in the presented research to determine a model that would be characterized by high accuracy of the forecasts created. Based on their forecasting experience, the authors initially selected two models, i.e., the ARMAX and MLR models. Both of them meet the conditions set. The authors assumed that by applying all the requirements for time series and model residuals, they would be able to find a model with MAPE error values at least at the level obtained for more complicated tools, such as ANN or SVM.

4. Methods

Multiple linear regression (MLR) is a statistical method that uses many explanatory variables to predict the value of the explained variable. It enables the study of the linear relationship between the dependent variable and the independent variables that influence it.

Multiple regression makes it possible to describe the relationship between one explained variable and many independent variables. As a result, it is possible to examine which of them best describe the explained variable [39].

The multiple regression model can be described by the following equation:

z = β_{0} + β_{1} x_{1} + \dots + β_{i} x_{i} + ε

(1)

where:

z: dependent variable,

ε: random variable,

β_{i}: regression coefficient,

x_{i}: explanatory variables.

The regression coefficients of the model β characterize the contribution of each explanatory variable to the forecast of the explained variable. The sign of the regression coefficient determines the nature of the relationship between the individual variables. A positive sign means that the variable is a stimulant, i.e., an increase in the independent variable has a positive effect on the value of the explained variable. A negative sign, in turn, means that an increase in the explanatory variable has a negative impact on the explained variable (destimulant). A regression coefficient of 0 would mean that there is no linear relationship between the variables.
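As an illustration of Equation (1) and of the stimulant/destimulant reading of the coefficient signs, the following minimal sketch estimates the coefficients by ordinary least squares through the normal equations. The helper names (`solve_linear_system`, `fit_mlr`, `nature`) and the synthetic data are assumptions made for this example; they are not the authors' software or data.

```python
# Sketch of Equation (1): least-squares estimation via the normal equations.
# All names and data below are illustrative placeholders, not the study's data.

def solve_linear_system(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [list(row) + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):  # back substitution
        tail = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - tail) / m[i][i]
    return x


def fit_mlr(rows, z):
    """Return [b0, b1, ..., bk] minimizing the sum of squared residuals."""
    design = [[1.0] + list(r) for r in rows]  # prepend the intercept column
    k = len(design[0])
    xtx = [[sum(d[i] * d[j] for d in design) for j in range(k)] for i in range(k)]
    xtz = [sum(d[i] * zi for d, zi in zip(design, z)) for i in range(k)]
    return solve_linear_system(xtx, xtz)


def nature(coefficient):
    """Classify an explanatory variable by the sign of its coefficient."""
    if coefficient > 0:
        return "stimulant"
    if coefficient < 0:
        return "destimulant"
    return "no linear relationship"


# Synthetic example: z depends positively on x1 and negatively on x2.
rows = [(1, 2), (2, 1), (3, 4), (4, 3), (5, 6), (6, 5)]
z = [10 + 3 * x1 - 2 * x2 for x1, x2 in rows]
beta = fit_mlr(rows, z)
for name, b in zip(["x1", "x2"], beta[1:]):
    print(name, round(b, 6), nature(b))
```

Because the synthetic z is an exact linear function of x1 and x2, the fit recovers the generating coefficients, so x1 is classified as a stimulant and x2 as a destimulant.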

The primary methodological limitation underlying MLR is that it can only be used to establish the existence of an association, but cannot determine the causes underlying that association. Another limitation is the number of explanatory variables that can be introduced into the model. It is recommended to include at least 10–20 times as many observations as the number of variables in the analysis. This may lead to the omission of important variables. MLR also only allows for the analysis of linear relationships between variables, which is also a serious limitation.

Forecasting based on a nonstationary time series may produce erroneous results. Since the time series used by the authors to build the MLR model may be characterized by a trend, the results obtained may be questionable and lead to the appearance of the phenomenon of spurious regression. Therefore, before searching for the optimal MLR model, it is necessary to verify the hypothesis that the time series is stationary. For this purpose, for example, the Dickey–Fuller test can be used.

The statistics of the Dickey–Fuller test (DF) for the existence of a unit root are represented by the formula:

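DF = \frac{δ}{S(δ)}

(2)

where δ is the estimated coefficient on the lagged level of the series in the test regression and S(δ) is its standard error.

The DF test makes it possible to verify whether there is a unit root in a time series. The test requires verification of the following hypotheses:

H0: there is a unit root in the time series, δ = 0,

H1: the time series is stationary, δ < 0.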

If the Dickey–Fuller test (or its augmented variant, ADF) confirms that the time series is nonstationary, the nonstationarity can be eliminated by differencing the series or by introducing a time variable into the model [40].
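The statistic in Equation (2) can be sketched for the simplest form of the test regression, Δy_t = δ·y_{t−1} + ε_t, with no constant, trend, or lagged differences. This simplification, the function name `dickey_fuller_statistic`, and the two short series are assumptions made for illustration. The resulting value has to be compared with tabulated Dickey–Fuller critical values, not with Student's t quantiles.

```python
import math


def dickey_fuller_statistic(y):
    """DF = delta / S(delta) for the regression dy_t = delta * y_{t-1} + e_t."""
    y_lag = y[:-1]
    dy = [y[t] - y[t - 1] for t in range(1, len(y))]
    sxx = sum(v * v for v in y_lag)
    delta = sum(v * d for v, d in zip(y_lag, dy)) / sxx
    residuals = [d - delta * v for v, d in zip(y_lag, dy)]
    # Residual variance with one estimated parameter (delta).
    s2 = sum(r * r for r in residuals) / (len(dy) - 1)
    standard_error = math.sqrt(s2 / sxx)
    return delta / standard_error


# Illustrative series (not the study's data).
trending = [1, 2, 4, 7, 11, 16, 22]                          # grows steadily
mean_reverting = [1.0, -0.5, 0.8, -0.9, 0.4, -0.6, 0.7, -0.3]  # oscillates around 0

print(dickey_fuller_statistic(trending))
print(dickey_fuller_statistic(mean_reverting))
```

For the trending series the statistic is positive, so H0 (a unit root) cannot be rejected; for the oscillating series it is strongly negative, which points toward stationarity.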

Building an econometric model burdened with autocorrelation may also lead to conclusions based on an incorrect model. To verify the occurrence of autocorrelation, the Durbin–Watson test was used [41]. This is one of the most frequently used tests, which is based on the assumption that if random disturbances contain autocorrelation, the residuals of the model will also be correlated. The test requires the null hypothesis H0—the residuals of the model are not correlated with each other—and the alternative hypothesis H1—there occurs autocorrelation in the residuals of the model.

DW = \frac{\sum_{i = 2}^{T} (e_{i} - e_{i - 1})^{2}}{\sum_{i = 1}^{T} e_{i}^{2}}

(3)

where:

DW: Durbin–Watson test statistic,

e_{i}: the residuals of the model,

T: length of the sequence of residuals.
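Equation (3) translates directly into code. The sketch below uses a hypothetical function name and toy residual sequences; a DW value close to 2 suggests no first-order autocorrelation, while values toward 0 or 4 suggest positive or negative autocorrelation, respectively.

```python
def durbin_watson(residuals):
    """Durbin-Watson statistic of Equation (3) for a sequence of model residuals."""
    numerator = sum(
        (residuals[i] - residuals[i - 1]) ** 2 for i in range(1, len(residuals))
    )
    denominator = sum(e ** 2 for e in residuals)
    return numerator / denominator


# Toy residuals (illustrative only).
alternating = [1, -1, 1, -1]   # strong negative autocorrelation
constant = [1, 1, 1, 1]        # strong positive autocorrelation

print(durbin_watson(alternating))
print(durbin_watson(constant))
```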

One of the assumptions of the regression model is also the normality of the residual term. Therefore, the normality of the distribution of model residuals was confirmed using the Doornik–Hansen test statistics DH [42]:

DH = [Z(\sqrt{b_{1}})]^{2} + [z_{2}]^{2}

(4)

where:

Z(\sqrt{b_{1}}): transformed sample skewness,

z_{2}: sample kurtosis transformed with the Wilson–Hilferty transformation.

The heteroscedasticity of a time series means that at least one random variable in the sequence differs from the others in variance or its variance is infinite [43]. The occurrence of heteroscedasticity may indicate an incorrect form of the model or omission of important variables.

The Breusch–Pagan test for heteroskedasticity is used when it is caused by more than one variable. The test requires two hypotheses: H0—there is homoscedasticity in the model—and hypothesis H1—there is heteroscedasticity in the model. The formula for the Breusch–Pagan test statistics is as follows:

LM = \frac{1}{2} ESS

(5)

where:

LM: Breusch–Pagan statistic,

ESS: explained sum of squares in the auxiliary regression.
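A minimal sketch of Equation (5) follows, assuming the classical form of the test in which the squared residuals, scaled by their mean, are regressed on the explanatory variables and LM is half the explained sum of squares of that auxiliary regression. The function names and data are hypothetical. Under H0, LM is compared with a chi-square distribution whose degrees of freedom equal the number of regressors in the auxiliary regression.

```python
def solve_linear_system(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [list(row) + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        tail = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - tail) / m[i][i]
    return x


def ols_fitted(rows, y):
    """Fitted values of an OLS regression of y on rows (with intercept)."""
    design = [[1.0] + list(r) for r in rows]
    k = len(design[0])
    xtx = [[sum(d[i] * d[j] for d in design) for j in range(k)] for i in range(k)]
    xty = [sum(d[i] * yi for d, yi in zip(design, y)) for i in range(k)]
    beta = solve_linear_system(xtx, xty)
    return [sum(b * v for b, v in zip(beta, d)) for d in design]


def breusch_pagan_lm(residuals, rows):
    """LM = ESS / 2, with ESS taken from the auxiliary regression."""
    n = len(residuals)
    sigma2 = sum(e * e for e in residuals) / n
    g = [e * e / sigma2 for e in residuals]  # scaled squared residuals
    fitted = ols_fitted(rows, g)
    g_mean = sum(g) / n
    ess = sum((f - g_mean) ** 2 for f in fitted)
    return 0.5 * ess


# Illustrative regressors and residuals (not the study's data).
rows = [(1, 2), (2, 1), (3, 4), (4, 3), (5, 6), (6, 5)]
homoscedastic = [1, -1, 1, -1, 1, -1]                # constant spread
heteroscedastic = [0.1, -0.2, 0.3, -0.4, 0.5, -0.6]  # spread grows with x1

print(breusch_pagan_lm(homoscedastic, rows))
print(breusch_pagan_lm(heteroscedastic, rows))
```

With residuals of constant magnitude the auxiliary regression explains nothing and LM is essentially zero; when the spread grows with a regressor, LM becomes positive.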

In addition, the models created were analyzed in terms of information criteria [44]. When comparing several similar models, the one with the lowest Akaike (AIC), Schwarz (BIC), and Hannan–Quinn (HQ) criteria values should be chosen:

AIC = -2 \ln L(\hat{θ}) + 2K

(6)

BIC = -2 \ln L(\hat{θ}) + K \ln(n)

(7)

HQ = -2 \ln L(\hat{θ}) + 2K \ln(\ln n)

(8)

where:

n: number of observations,

L(\hat{θ}): maximized value of the model likelihood function,

K: number of model parameters; the second term of each criterion is a penalty that grows with K.
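Equations (6)–(8) can be evaluated once the maximized log-likelihood of a model is known. The sketch below uses a hypothetical function name and made-up inputs, and only shows how the penalty terms differ between the three criteria.

```python
import math


def information_criteria(log_likelihood, k, n):
    """Return (AIC, BIC, HQ) for a model with k parameters and n observations."""
    aic = -2 * log_likelihood + 2 * k
    bic = -2 * log_likelihood + k * math.log(n)
    hq = -2 * log_likelihood + 2 * k * math.log(math.log(n))
    return aic, bic, hq


# Made-up inputs: log-likelihood of -10 with 3 parameters and 10 observations.
aic, bic, hq = information_criteria(-10.0, 3, 10)
print(aic, bic, hq)
```

When several candidate models are fitted to the same data, the one with the lowest criterion values is preferred, as stated above.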

The model was also selected based on the value of ex post errors. Several of them were taken into account, namely:

Mean Absolute Error (MAE) [45]:

MAE = \frac{\sum_{t = 1}^{n} |e_{t}|}{n}

(9)

Root Mean Square Error (RMSE) [46,47]:

RMSE = \sqrt{\frac{\sum_{t = 1}^{n} e_{t}^{2}}{n}}

(10)

Mean Absolute Percentage Error (MAPE) [48,49]:

MAPE = \frac{\sum_{t = 1}^{n} |\frac{e_{t}}{y_{t}}|}{n}

(11)

Theil’s inequality coefficient (U) [50]:

U = \frac{RMSE}{\sqrt{\frac{1}{n} \sum_{t = 1}^{n} y_{t}^{2}} + \sqrt{\frac{1}{n} \sum_{t = 1}^{n} \hat{y}_{t}^{2}}}

(12)

where:

y_{t}: empirical value of the explained variable in period t,

\hat{y}_{t}: forecast of the explained variable in period t,

e_{t}: forecast error, i.e., the difference between y_{t} and \hat{y}_{t},

n: number of observations.
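The ex post errors of Equations (9)–(12) can be computed together from a series of empirical values and forecasts. The function name and the three data points below are assumptions made for illustration; MAPE is returned as a fraction, so it is multiplied by 100 when reported in percent.

```python
import math


def ex_post_errors(actual, forecast):
    """Return MAE, RMSE, MAPE (as a fraction) and Theil's U."""
    n = len(actual)
    errors = [y - f for y, f in zip(actual, forecast)]
    mae = sum(abs(e) for e in errors) / n
    rmse = math.sqrt(sum(e * e for e in errors) / n)
    mape = sum(abs(e / y) for e, y in zip(errors, actual)) / n
    theil_u = rmse / (
        math.sqrt(sum(y * y for y in actual) / n)
        + math.sqrt(sum(f * f for f in forecast) / n)
    )
    return {"MAE": mae, "RMSE": rmse, "MAPE": mape, "U": theil_u}


# Illustrative values only (not the study's data).
actual = [100.0, 200.0, 400.0]
forecast = [110.0, 190.0, 400.0]
result = ex_post_errors(actual, forecast)
print(result)
print("MAPE in percent:", 100 * result["MAPE"])
```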

5. Results and Discussion

ARMAX and MLR models were initially built. However, the ARMAX models were ultimately rejected because, depending on the combination of explanatory variables and the adopted model parameters, their MAPE error ranged from 3% to 30%.

The research began by collecting statistical data on factors that could influence the volume of PV power generation in Poland. The analysis was aimed at identifying not the weather factors and technical parameters of the solar panels, which, as discussed in the Introduction, are most frequently the subject of interest, but ecological, economic, raw material, and energy factors. The factors that were finally taken into account and the sources of the data are presented in Table 2.

The value of the explained variable was approximated by a mathematical model. This was possible because the explanatory variables in the past were characterized by regular changes that could be described using time functions. The direction of the trend and changes in the analyzed phenomena were assumed to be constant. It was also assumed that random fluctuations did not significantly affect the nature of the analyzed phenomenon. However, these assumptions were supplemented by using scenarios in which it was assumed that possible changes in the explanatory variables may ultimately affect the explained variable.

Forecasting using the MLR model was carried out in the following steps:

Specification—visual analysis of time series, determining the nature of regularities occurring in the time series of explanatory variables and the explained variable.
Estimation of the parameters of the MLR function. When selecting explanatory variables for the MLR model, general guidelines for econometric models were used.

Variables whose coefficient of variation is greater than 0.1 were selected. For the variables finally entered into the model, the coefficient of variation averaged 0.8.

The explanatory variables should be strongly correlated with the explained variable. For most variables, the correlation coefficient ranged from 0.7 to 0.99. Only primary energy consumption did not meet this condition. However, its parameter turned out to be statistically significant, and such guidelines can be fully met only for experimental data. Because of the importance of this variable for the research being conducted, it was retained in the model.

Verification—examination of compliance of designated models with empirical data, verification of information criteria, ex post errors, examination of model residuals in terms of normality of distribution, autocorrelation, and homoscedasticity.
Prediction—a proven and accepted model was used to build a forecast in the selected horizon.
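The two selection guidelines used in the estimation step can be sketched as a simple screening function. The 0.1 threshold for the coefficient of variation comes from the text; the 0.7 correlation threshold, the use of the sample standard deviation, the function names, and the data are assumptions made for this example. As the case of primary energy consumption shows, such a screen is a guideline and not a hard rule.

```python
import math


def coefficient_of_variation(x):
    """Sample standard deviation divided by the absolute value of the mean."""
    mean = sum(x) / len(x)
    std = math.sqrt(sum((v - mean) ** 2 for v in x) / (len(x) - 1))
    return std / abs(mean)


def pearson(x, y):
    """Pearson correlation coefficient between two equally long series."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def screen(candidates, target, min_cv=0.1, min_abs_r=0.7):
    """Keep candidates that vary enough and correlate strongly with the target."""
    return [
        name
        for name, series in candidates.items()
        if coefficient_of_variation(series) > min_cv
        and abs(pearson(series, target)) >= min_abs_r
    ]


# Hypothetical candidate variables and explained variable.
target = [2.0, 4.0, 6.0, 8.0, 10.0]
candidates = {
    "varying": [1.0, 2.0, 3.0, 4.0, 5.0],         # high variation, high correlation
    "nearly_flat": [10.0, 10.1, 9.9, 10.0, 10.0],  # coefficient of variation below 0.1
}
print(screen(candidates, target))
```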

Since electricity generated using solar panels has been produced in Poland only since 2011, and the authors wanted to make a long-term forecast that required annual data, the analyzed time series is relatively short. It was therefore impossible to introduce more than eight explanatory variables into the models because of the insufficient number of degrees of freedom. For each of the analyzed categories, the authors therefore selected the variables that were considered the most representative. For economic factors, these were the LCOE (levelized cost of electricity) and real GDP (gross domestic product) per capita; for ecological factors, the amount of CO₂ emissions; and for energy factors, installed capacity, gross available energy, final energy consumption, and primary energy consumption. The raw materials category considered the share of the natural raw materials necessary to produce the MW installed in Poland during the year in the annual production/import of a given raw material in Poland. The raw materials taken into account are critical raw materials, such as copper (Cu), gallium (Ga), germanium (Ge), and silicon (Si).

To determine the demand for critical raw materials in accordance with the expected level of installed power, the WEKR 2.0 computer program written by research authors was used. The data necessary to perform the appropriate calculations were taken from Ref. [55]. Figure 1 presents the algorithm according to which the program calculates the value of explanatory variables from the category of raw materials. First, the program determines how many MW will be produced in Poland annually by 2025. The next step determines the number of kilograms of each critical raw material needed to generate the installed capacity planned for a given year.
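The two calculation steps described above can be sketched as follows. This is not the WEKR 2.0 code: the function names, the material intensities in kg per MW, the capacity additions, and the supply figure are hypothetical placeholders chosen only to show the arithmetic (capacity added in a year multiplied by the material intensity, and the resulting demand expressed as a share of annual supply).

```python
def raw_material_demand(added_mw_by_year, intensity_kg_per_mw):
    """Kilograms of each raw material needed for the capacity added in each year."""
    return {
        year: {
            material: added_mw * kg_per_mw
            for material, kg_per_mw in intensity_kg_per_mw.items()
        }
        for year, added_mw in added_mw_by_year.items()
    }


def share_of_supply(demand_kg, annual_supply_kg):
    """Share of annual production/import consumed by the new PV capacity."""
    return demand_kg / annual_supply_kg


# Hypothetical inputs (placeholders, not values from Ref. [55]).
added_mw_by_year = {2024: 100.0, 2025: 150.0}
intensity_kg_per_mw = {"Cu": 5.0, "Si": 4.0}

demand = raw_material_demand(added_mw_by_year, intensity_kg_per_mw)
print(demand[2024]["Cu"])                            # kg of Cu for 2024
print(share_of_supply(demand[2024]["Cu"], 10000.0))  # share of a 10,000 kg supply
```

The share computed in the last step corresponds to the kind of raw materials explanatory variable described in the preceding paragraph.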

When selecting the forecast model, the authors took into account the ARMAX and MLR models. The MAPE error was chosen as the model comparison indicator so that the accuracy of the MLR model could be compared with models found in the literature and presented in this publication. The constructed ARMAX models were characterized by a minimum MAPE error of 3%, while the MAPE of the MLR model was 0.87%. This is an excellent result, also compared with models such as ANN, SVM, or FL, the development of which is much more complicated and requires more time and resources. Therefore, the MLR model was finally selected for analysis. This made it possible to consider all explanatory variables when building a model of annual PV power production in Poland until 2025. The explained variable in the built model was the volume of PV power production. Because not all explanatory variables could be introduced into the model at the same time, the forward stepwise method was used, which involves introducing individual factors into the model one by one and stopping when adding another factor does not significantly improve the prediction. The combination of explanatory variables retained in the model was one in which all variables were statistically significant. Since the ADF test indicated that the analyzed time series is nonstationary (p = 0.99), an additional time variable t was introduced into the model; otherwise, the model being built could give erroneous results. Table 3 shows the explanatory variables that were finally selected to build the model. A statistically significant impact on the volume of PV power generation was found for CO₂ emissions, LCOE, copper consumption, primary energy consumption, the number of reported patents related to PV technology, GDP, and installed capacity.

The parameter value column contains the value of the regression coefficient β for individual explanatory variables. The probability column contains the value of statistical significance p. Statistical significance was verified with the Student’s t-test. Two hypotheses were presented:

H0—the variable is not statistically significant,

H1—the variable is statistically significant.

If p is less than the significance level α, hypothesis H0 is rejected in favor of hypothesis H1. If p is greater than the significance level, hypothesis H0 is maintained. Only variables for which p was lower than α = 0.01 (***) or α = 0.05 (**) were entered into the model.

The last column of Table 2 contains information on the nature of the explanatory variables. LCOE, CO₂ emissions, and copper consumption are destimulants. This means that an increase in these explanatory variables has a negative impact on the volume of PV power generation. In the case of the LCOE, it is obvious that a decrease in the cost of generating PV power stimulates further investment. The decrease in the volume of photovoltaic power production will, in turn, require filling the gap with the energy produced based on fossil fuels, which will translate into an increase in CO₂ emissions. The intense (exponential) increase in copper demand that has been taking place since 2018 may lead to rapid consumption of this raw material, which may be a factor limiting the further development of PV farms in the future. The remaining explanatory variables are stimulants, which means that their increase stimulates the increase in PV power generation.

The adopted model was verified in terms of the information criteria values, as well as ex post errors, which are presented in Table 4. The MAPE error is used to compare potential models. The model selected for further analysis is characterized by a very low MAPE error value, below 1%. This means that the model can be considered highly reliable and accurate. The coefficient of determination (R²) is 0.99, which means that the model explains 99% of the variability of the explained variable.

The model was used to determine the forecast and ultimately the residuals of the model. The residuals were analyzed for heteroskedasticity, normal distribution, and autocorrelation.

The Breusch–Pagan test for heteroskedasticity showed that there is no heteroskedasticity phenomenon in the model. The obtained p-value was greater than the assumed significance level alpha = 0.01.

Test statistics: LM = 8.237945,
with p value = P (Chi-square (8) > 8.237945) = 0.410579

Because the model is characterized by the homoscedasticity of the random component variance, it can be used for correct statistical inference.

The residuals of the model were also verified for normal distribution. For this purpose, the Doornik–Hansen test was used. The test gave no grounds to reject the null hypothesis that the residuals are normally distributed, since p = 0.38654 > α = 0.05.

Chi-square (2) = 1.901 with p-value = 0.38654
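
For two degrees of freedom the upper tail of the chi-square distribution reduces to a single exponential, so this p-value can be checked by hand. A short worked check, using only the statistic quoted above:

```latex
p = P\left(\chi^2_{2} > 1.901\right) = e^{-1.901/2} = e^{-0.9505} \approx 0.3865
```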

The residuals of a properly constructed econometric model should not show autocorrelation, that is, the dependence of current values of the random component on past values. To check whether the residuals of the adopted model constitute white noise, the Durbin–Watson test was used. It gave no grounds to reject the H0 hypothesis of no autocorrelation of the residuals. The probability value in this case was p = 0.365725 > α = 0.05.
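
The Durbin–Watson statistic itself is simple to compute from the residuals. The sketch below is a minimal illustration with made-up residuals (the model's actual residuals are not listed in the text); values near 2 suggest no first-order autocorrelation, values near 0 suggest positive autocorrelation, and values near 4 suggest negative autocorrelation.

```python
def durbin_watson(residuals):
    """Durbin-Watson statistic: squared first differences over squared residuals."""
    numerator = sum(
        (residuals[t] - residuals[t - 1]) ** 2 for t in range(1, len(residuals))
    )
    denominator = sum(e * e for e in residuals)
    return numerator / denominator


# Hypothetical residuals, for illustration only.
example_residuals = [0.5, -0.3, 0.2, -0.4, 0.1, 0.3, -0.2]
print(round(durbin_watson(example_residuals), 3))
```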

Because the model was successfully validated and met all the conditions for linear regression models, it was used to build a forecast of the PV power production volume until 2025. The empirical data, the forecast, and the confidence interval for the forecast are presented in Figure 2.

It should be noted that the visual analysis of the time series of empirical and theoretical data also indicates the high accuracy of the multiple regression model. If the explanatory variables introduced into the model continue to be subject to the same trends shaping them, a dynamic increase in PV power generation should be expected until 2025. The built model indicates that the goal of the Energy Policy of Poland until 2040 (PEP2040) of expanding the installed capacity to 16 GW by 2040 could be achieved as early as 2025. To determine this value, a conversion factor was used, obtained as the average ratio of photovoltaic power generation to installed power in 2011–2020. The volume of PV power production in 2025 would be, according to the forecast, 8921 GWh, which is more than four times the last known observation from 2020.
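
The conversion step can be sketched as follows. The yearly figures in the example are hypothetical placeholders, since the 2011–2020 series is not reproduced here, and the function names are introduced only for illustration; the sketch shows the arithmetic of an average generation-to-capacity ratio, not the authors' exact calculation.

```python
def conversion_factor(generation_gwh, capacity_mw):
    """Average ratio of annual generation (GWh) to installed capacity (MW)."""
    ratios = [g / c for g, c in zip(generation_gwh, capacity_mw)]
    return sum(ratios) / len(ratios)


def implied_capacity_mw(forecast_gwh, factor):
    """Installed capacity (MW) implied by a generation forecast and the factor."""
    return forecast_gwh / factor


# Hypothetical history, for illustration only (not the 2011-2020 data).
generation = [50.0, 120.0, 300.0]   # GWh
capacity = [100.0, 200.0, 600.0]    # MW
factor = conversion_factor(generation, capacity)

# 8921 GWh is the 2025 forecast stated in the abstract.
print(implied_capacity_mw(8921.0, factor))
```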

The year 2022 was the period of the most intensive development of photovoltaic technology in Poland. The unprecedented interest in this solution was driven primarily by the war in Ukraine, rising electricity prices, and concerns about the lack of access to energy resources. Moreover, the state’s encouragement in connection with the implementation of PEP2040 and the National Reconstruction Plan, in the form of tax reliefs and co-financing programs, translated into an increase in the number of photovoltaic installations in Poland. In 2022, the number of prosumer installations increased by 40% compared to the previous year. The increase in interest in photovoltaic installations was also influenced by the decrease in energy generation, the need to reduce greenhouse gas emissions, the increasing demand for electricity with the simultaneous threat of lack of access to energy (energy raw materials), and the economic development of the country. According to the Energy Regulatory Office [56], 74% of the total installed capacity was in private installations. However, April 2022 brought a change to the settlement system for prosumers, which is now based on net-billing. Ahead of this change, investors interested in PV installations sought to connect to the energy system by 31 March 2022. After the change, interest in photovoltaic installations in Poland decreased; in the first quarter of 2023, it fell by approximately 30% [57]. The abolition of net-metering (transferring surplus energy to the grid and collecting it in times of increased demand) in favor of net-billing, which assumes fees for energy consumed from the grid, has discouraged potential new prosumers. The coefficient of the model time variable t is also negative. This means that time has a braking effect on the amount of PV power production. This confirms that the development of solar energy in Poland was largely based on prosumers.
PV power will continue to be produced in the near future by momentum, but PV installations have a limited lifespan. If they are not modernized, the production volume of existing installations may decline.

The change in the billing method was probably intended to discourage prosumers from oversizing their installations to secure a free electricity supply during the fall and winter period, a practice that placed a heavy burden on the already overloaded energy network. Because 2022 was an exception in the entire history of photovoltaic installations in Poland, it was omitted from the time series of empirical data so that it would not distort the forecast results.

The trends shaping the explanatory variables were determined on the basis of historical data. Since there is no certainty that these variables will continue to develop in the same way in the future, it was necessary to build a confidence interval for the forecast. This is the range within which the forecast value of PV generation can be expected to lie with a probability of 95%, which leaves a 5% probability of error for this assumption.
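
The text does not state how the interval was computed. As a point of reference only, the textbook prediction interval for a new observation in an ordinary least squares model is shown below; the symbols are assumptions introduced here, with n observations, k estimated parameters, residual standard error s, design matrix X, and regressor vector x_0 for the forecast period.

```latex
\hat{y}_0 \;\pm\; t_{1-\alpha/2,\; n-k}\; s\,\sqrt{1 + x_0^{\top}\left(X^{\top}X\right)^{-1}x_0}
```

With α = 0.05 this gives the 95% band whose lower and upper limits define the pessimistic and optimistic scenarios.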

The built model allowed for the determination of three scenarios for the size of PV power generation until 2025:

The most likely scenario—forecast;
Pessimistic scenario—determined by the lower range of the 95% confidence interval of the model;
The optimistic scenario—plotted by the upper range of the 95% confidence interval.

The confidence coefficient can be interpreted as the probability that the interval contains the actual value of PV power generation. It expresses the uncertainty related to the forecast [58]. The confidence interval is the interval for which the condition P = 1 − α is met. Typically, this probability is assumed to be 0.90, 0.95, or 0.99. The confidence interval showed that PV power generation over the forecast horizon can be, on average, up to 11% higher or 14% lower than the forecast value.
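
To give a sense of scale, the average relative bounds can be applied to a single forecast value, as in the sketch below. Applying the horizon-average percentages to one particular year is an assumption made here for illustration, so the printed figures are indicative only and are not results reported by the authors; the function name `scenario_band` is hypothetical.

```python
def scenario_band(forecast, upper_pct=11.0, lower_pct=14.0):
    """Return (pessimistic, most_likely, optimistic) from relative bounds in percent."""
    pessimistic = forecast * (1.0 - lower_pct / 100.0)
    optimistic = forecast * (1.0 + upper_pct / 100.0)
    return pessimistic, forecast, optimistic


# 8921 GWh is the 2025 forecast stated in the abstract.
low, mid, high = scenario_band(8921.0)
print(f"pessimistic {low:.0f} GWh, forecast {mid:.0f} GWh, optimistic {high:.0f} GWh")
```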

Because the recent regulatory changes may affect the demand for photovoltaic installations in Poland for a long time, an additional scenario was prepared. This time, data on installed capacity reduced by 30% were introduced into the model. The forecast showed that by 2025, the volume of PV power generation would be on average 40% lower than in the original forecast, as presented in Figure 3. In this reduced-demand scenario, the PEP2040 target for 2040 would be achieved in 2034, which would still be a good result.
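
The direction of this effect follows from the linear form of the model. The sketch below shows only the partial effect of the installed-capacity regressor, using its reported coefficient of 0.72 and holding every other variable fixed; the capacity value is hypothetical, the units (MW for capacity, GWh for generation) are assumed, and the authors' scenario re-ran the full model, so this does not reproduce their 40% figure.

```python
BETA_INSTALLED_CAPACITY = 0.72  # coefficient reported for installed capacity


def generation_change(capacity_mw, reduction_share, beta=BETA_INSTALLED_CAPACITY):
    """Change in fitted generation when capacity falls by reduction_share, all else fixed."""
    reduced_capacity = capacity_mw * (1.0 - reduction_share)
    return beta * (reduced_capacity - capacity_mw)


# Hypothetical capacity of 10,000 MW reduced by 30%.
print(generation_change(10_000.0, 0.30))
```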

Renewable energy sources are a basic solution in light of the need to carry out energy transformation. The costs of building a photovoltaic installation in Poland are decreasing every year, but they are still relatively high for the average citizen. The cost of building an installation may consume an entire annual income, even without taking into account the costs of energy storage. Therefore, financial support from the state and stable laws shaping the development of photovoltaic installations in Poland will certainly be necessary. Additionally, the energy system in Poland, which is currently unable to absorb the energy produced by existing photovoltaic installations on sunny days, will require modernization. Scientists are constantly working on the development of photovoltaic technology [59,60], thanks to which installations will have an increasingly longer lifespan, the LCOE will be reduced, and panels will become more aesthetically appealing, which may convince additional investors.

6. Conclusions

Solar energy in Poland currently covers about 5% of the country’s electricity demand. The pace of development of photovoltaic installations exceeded previous expectations and forecasts included in PEP2040. Most of these installations belong to prosumers who, in the face of rising electricity prices, the threat of lack of access to energy supplies, and the amendment to the RES Act coming into force on 1 April 2022, accelerated the decision to implement their investments before the law changed. The future development of photovoltaic installations will depend on economic, ecological, energy, and technological factors, as well as on access to critical raw materials. Furthermore, the legal factor, i.e., promoting and supporting the development of solar energy by the state, certainly influences the level of investor interest in photovoltaic installations. Legal factors were omitted from the presented analysis because they would require qualitative analysis, whereas the analysis carried out for the purposes of this research was quantitative. In further research, the authors intend to take the legal factor into account and conduct a qualitative analysis of the factors that influence the development of photovoltaic installations in Poland.

An important element of the analysis was the ability to indicate the nature of the explanatory variables introduced into the multiple regression model. Only variables whose significance was confirmed by a statistical test, and which were significant at the level of α = 0.05 or α = 0.01, were left in the model. Particular attention should be paid to those independent variables that have been identified as destimulants, because an increase in their value will signal an upcoming decline in PV power generation volume. Tracking the so-called weak signals (the course of the explanatory variables' time series) can provide advance information about changes in PV energy generation. The use of scenario planning is helpful in this respect, an example of which is also presented in this publication. The forecast, despite the omission of 2022 from the input data, showed that intensive increases in PV energy production in Poland can be expected by 2025. To make this possible, it is necessary to maintain the favorable trend of the explanatory variables; otherwise, the share of photovoltaic power in the total energy production in Poland may decrease. This would be unfavorable considering the need to change the country’s energy mix. Legislative changes may significantly slow the pace of development of renewable energy, as was the case in Poland with respect to wind energy. The change in the electricity billing system in April 2022 also resulted in a decline in interest in photovoltaics. Time will tell whether this method of settlement will be less beneficial for prosumers, but the change itself discouraged investors from building new installations. Energy companies building photovoltaic farms have also begun to signal that new zoning regulations may inhibit planned investments. Ultimately, the Ministry of Development and Technology guidelines were modified and relaxed, but legislative instability itself can affect future investment decisions of both consumers and energy companies.

The authors verified the dependence of the development of installed PV power capacity on critical raw materials. Models were built taking into account individual critical raw materials such as Cu, Si, Ge, and Ga. Each of them showed statistical significance, which means that access to critical raw materials in the future will have a significant impact on the further development of photovoltaic installations. Currently, most of the panels used in Poland are produced in China. Since Poland is not the only country that is taking intensive steps to modify the energy mix, and thus, increase the share of solar energy in the energy generation structure, the access to raw materials necessary to produce photovoltaic technology may be limited in the future.

The energy security of EU countries, including Poland, will in the near future depend on the ability and efficiency of the country to carry out the energy transition. The European Green Deal, in line with PEP2040, assumes that this transformation will be based on renewable energy sources. These include primarily water, wind, and solar energy. In recent years, technologies that have been developing very dynamically in Poland include wind turbines and photovoltaic installations. The share of solar energy in total electricity production in Poland has increased from 0% to approximately 5% over the last 10 years. According to the forecast obtained by the authors, if the volume of electricity production does not change, this share will increase to 11% by 2030. This is in line with the Institute for Renewable Energy forecasts. According to both, in 2025 the installed capacity in Poland may amount to approximately 20 GW.

Together with energy obtained from other renewable sources and assuming that clean coal combustion will provide stabilizing support to the energy system during the transition period, Poland’s energy security should be maintained. Furthermore, valuable elements obtained from coal combustion byproducts, such as REE, can support the energy transition. REEs are essential to build wind turbines and batteries necessary to store renewable energy.

Author Contributions

Conceptualization, A.R. (Aurelia Rybak); methodology, A.R. (Aurelia Rybak) and A.R. (Aleksandra Rybak); software, A.R. (Aurelia Rybak); validation, A.R. (Aurelia Rybak); formal analysis, A.R. (Aurelia Rybak) and S.D.K.; investigation, A.R. (Aurelia Rybak) and A.R. (Aleksandra Rybak); resources, A.R. (Aleksandra Rybak); data curation, A.R. (Aurelia Rybak); writing—original draft preparation, A.R. (Aurelia Rybak) and A.R. (Aleksandra Rybak); writing—review and editing, A.R. (Aurelia Rybak), A.R. (Aleksandra Rybak), and S.D.K.; visualization, A.R. (Aurelia Rybak); supervision, A.R. (Aurelia Rybak); project administration, A.R. (Aurelia Rybak); funding acquisition, A.R. (Aleksandra Rybak). All authors have read and agreed to the published version of the manuscript.

Funding

The research leading to these results has received funding from the Norway Grants 2014–2021 via the National Centre for Research and Development. Grant number NOR/SGS/MOHMARER/0284/2020-00. Publication supported by the rector’s pro-quality grant, Silesian University of Technology, grant number 06/010/RGJ23/0057.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. BP Statistical Review of World Energy. Available online: https://www.bp.com/en/global/corporate/energy-economics/statistical-review-of-world-energy.html (accessed on 15 September 2023).
2. Rybak, A.; Rybak, A.; Kolev, S.D. A synthetic measure of energy security taking into account the influence of rare earth metals. The case of Poland. Energy Rep. 2023, 10, 1474–1484.
3. PEP 2040. The Energy Policy of Poland until 2040. Available online: https://www.gov.pl/web/klimat/polityka-energetyczna-polski (accessed on 1 August 2023).
4. Kushwaha, V.; Pindoriya, N.M. A SARIMA-RVFL hybrid model assisted by wavelet decomposition for very short-term solar PV power generation forecast. Renew. Energy 2019, 140, 124–139.
5. Yang, M.; Meng, L. Short-term photovoltaic power dynamic weighted combination forecasting based on least squares method. IEEEJ Trans. Electr. Electron. Eng. 2019, 14, 1739–1746.
6. Alanazi, M.; Alanazi, A.; Khodaei, A. Long-term solar generation forecasting. In Proceedings of the IEEE/PES Transmission and Distribution Conference and Exposition (T&D), Dallas, TX, USA, 3–5 May 2016.
7. Meng, M.; Song, C. Daily Photovoltaic Power Generation Forecasting Model Based on Random Forest Algorithm for North China in Winter. Sustainability 2020, 12, 2247.
8. Zhang, J.; Chi, Y.; Xiao, L. Solar power generation forecast based on LSTM. In Proceedings of the 2018 IEEE 9th International Conference on Software Engineering and Service Science (ICSESS), Beijing, China, 23–25 November 2018; pp. 869–872.
9. Abdel-Nasser, M.; Mahmoud, K. Accurate photovoltaic power forecasting models using deep LSTM-RNN. Neural Comput. Appl. 2019, 31, 2727–2740.
10. Zeng, J.; Qiao, W. Short-term solar power prediction using a support vector machine. Renew. Energy 2013, 52, 118–127.
11. Garud, K.S.; Jayaraj, S.; Lee, M.Y. A review on modeling of solar photovoltaic systems using artificial neural networks, fuzzy logic, genetic algorithm and hybrid models. Int. J. Energy Res. 2021, 45, 6–35.
12. Liu, S.; Yang, Y.; Forrest, J.Y.L. Grey Models for Decision-Making. In Grey Systems Analysis: Methods, Models and Applications; Springer Nature: Singapore, 2022; pp. 247–275.
13. Chang, C.J.; Li, D.C.; Dai, W.L.; Chen, C.C. Utilizing an adaptive grey model for short-term time series forecasting: A case study of wafer-level packaging. Math. Probl. Eng. 2013, 2013, 526806.
14. Wang, Z.X.; Wang, Z.W.; Li, Q. Forecasting the industrial solar energy consumption using a novel seasonal GM (1, 1) model with dynamic seasonal adjustment factors. Energy 2020, 200, 117460.
15. Gillespie, D.T. Markov Processes: An Introduction for Physical Scientists; Elsevier: Amsterdam, The Netherlands, 1991.
16. Blaga, R.; Sabadus, A.; Stefu, N.; Dughir, C.; Paulescu, M.; Badescu, V. A current perspective on the accuracy of incoming solar energy forecasting. Prog. Energy Combust. Sci. 2019, 70, 119–144.
17. Na, Z.; Ma, D.; Ma, X. Short-term electric power demand forecasting using a hybrid model of SARIMA and SVR. In Proceedings of the IOP Conference Series: Earth and Environmental Science, Changchun, China, 21–23 August 2020; Volume 619, p. 012035.
18. Tiwari, A.K. A structural VAR analysis of renewable energy consumption, real GDP and CO₂ emissions: Evidence from India. Econ. Bull. 2011, 31, 1793–1806.
19. De Leone, R.; Pietrini, M.; Giovannelli, A. Photovoltaic energy production forecast using support vector regression. Neural Comput. Appl. 2015, 26, 1955–1962.
20. He, J.; Zelikovsky, A. MLR-tagging: Informative SNP selection for unphased genotypes based on multiple linear regression. Bioinformatics 2006, 22, 2558–2561.
21. Abuella, M.; Chowdhury, B. Solar power probabilistic forecasting by using multiple linear regression analysis. In Proceedings of the Southeast Conference, Fort Lauderdale, FL, USA, 9–12 April 2015; pp. 1–5.
22. Tu, J.V. Advantages and disadvantages of using artificial neural networks versus logistic regression for predicting medical outcomes. J. Clin. Epidemiol. 1996, 49, 1225–1231.
23. Xu, A.; Chang, H.; Xu, Y.; Li, R.; Li, X.; Zhao, Y. Applying artificial neural networks (ANNs) to solve solid waste-related issues: A critical review. Waste Manag. 2021, 124, 385–402.
24. Jumaat, S.A.; Crocker, F.; Abd Wahab, M.H.; Radzi, N.H.M.; Othman, M.F. Prediction of Photovoltaic (PV) output using artificial neutral network (ANN) based on ambient factors. J. Phys. Conf. Ser. 2018, 1049, 012088.
25. Auria, L.; Moro, R.A. Support Vector Machines (SVM) as a Technique for Solvency Analysis. DIW Berlin, 1433-0210. 2008. Available online: https://www.econstor.eu/bitstream/10419/27334/1/576821438.PDF (accessed on 10 August 2023).
26. Ahmad, A.S.; Hassan, M.Y.; Abdullah, M.P.; Rahman, H.A.; Hussin, F.; Abdullah, H.; Saidur, R. A review on applications of ANN and SVM for building electrical energy consumption forecasting. Renew. Sustain. Energy Rev. 2014, 33, 102–109.
27. Deo, R.C.; Wen, X.; Qi, F. A wavelet-coupled support vector machine model for forecasting global incident solar radiation using limited meteorological dataset. Appl. Energy 2016, 168, 568–593.
28. Lazarevska, E.; Trpovski, J. A neuro-fuzzy model of the solar diffuse radiation with relevance vector machine. In Proceedings of the 11th International Conference on Electrical Power Quality and Utilisation, Lisbon, Portugal, 17–19 October 2011; pp. 1–6.
29. Donaj, Ł. Teoria szarych systemów a prognozowanie w naukach społecznych. Przyczynek do dyskusji. Przegląd Strateg. 2017, 7, 43–52.
30. Ding, S.; Li, R.; Tao, Z. A novel adaptive discrete grey model with time-varying parameters for long-term photovoltaic power generation forecasting. Energy Convers. Manag. 2021, 227, 113644.
31. Liu, L.; Wu, L. Forecasting the renewable energy consumption of the European countries by an adjacent non-homogeneous grey model. Appl. Math. Model. 2021, 89, 1932–1948.
32. Carta, A.; Conversano, C. On the use of Markov models in pharmacoeconomics: Pros and cons and implications for policy makers. Front. Public Health 2020, 8, 569500.
33. Bhardwaj, S.; Sharma, V.; Srivastava, S.; Sastry, O.S.; Bandyopadhyay, B.; Chandel, S.S.; Gupta, J.R.P. Estimation of solar radiation using a combination of Hidden Markov Model and generalized Fuzzy model. Sol. Energy 2013, 93, 43–54.
34. Kotłowski, J. Metody wygładzania szeregów czasowych za pomocą modeli klasy ARIMA. Proc. Mater. Inst. Rozw. Gospod. SGH 2002, 73, 69–84.
35. Huang, R.; Huang, T.; Gadh, R.; Li, N. Solar generation prediction using the ARMA model in a laboratory-level micro-grid. In Proceedings of the 2012 IEEE Third International Conference on Smart Grid Communications (SmartGridComm), Tainan, Taiwan, 5–8 November 2012; pp. 528–533.
36. Chodakowska, E.; Nazarko, J.; Nazarko, Ł.; Rabayah, H.S.; Abendeh, R.M.; Alawneh, R. ARIMA Models in Solar Radiation Forecasting in Different Geographic Locations. Energies 2023, 16, 5029.
37. Antonopoulos, V.Z.; Papamichail, D.M.; Aschonitis, V.G.; Antonopoulos, A.V. Solar radiation estimation methods using ANN and empirical models. Comput. Electron. Agric. 2019, 160, 160–167.
38. Li, Y.; He, Y.; Su, Y.; Shu, L. Forecasting the daily power output of a grid-connected photovoltaic system based on multivariate adaptive regression splines. Appl. Energy 2016, 180, 392–401.
39. Berger, D.E. Introduction to Multiple Regression; Claremont Graduate University: Claremont, CA, USA, 2003.
40. Mikołajczyk, K.; Wyrobek, J. Possibilities of Using the Vector Autoregression Method in Monetary Policy; Zeszyty Naukowe/Kraków University of Economics: Kraków, Poland, 2006; pp. 63–87.
41. Aslam, M. On testing autocorrelation in metrology data under indeterminacy. Mapan 2021, 36, 515–519.
42. Domański, C.; Szczepocki, P. Comparison of selected tests for univariate normality based on measures of moments. Stat. Transit. New Ser. 2020, 21, 151–178.
43. Dougherty, C. Introduction to Econometrics; Oxford University Press: Oxford, UK, 2011; pp. 280–299.
44. Piłatowska, M. Information criteria in the selection of an econometric model. Stud. Work. Univ. Econ. Krakow 2010, 10, 25–37.
45. Chai, T.; Draxler, R.R. Root mean square error (RMSE) or mean absolute error (MAE)?—Arguments against avoiding RMSE in the literature. Geosci. Model Dev. 2014, 7, 1247–1250.
46. Almorox, J.; Benito, M.; Hontoria, C. Estimation of monthly Angström–Prescott equation coefficients from measured daily data in Toledo, Spain. Renew. Energy 2005, 30, 931–936.
47. Safi, S.; Abdelouhab, Z. Prediction of global daily solar radiation using higher order statistics. Renew. Energy 2002, 27, 647–666.
48. Bliemel, F. Theil’s Forecast Accuracy Coefficient: A Clarification. J. Mark. Res. 1973, 10, 444–446.
49. Farnum, N.R.; Stanton, W. Quantitative Forecasting Methods; PWS-Kent Publishing Company: Boston, MA, USA, 1989.
50. Kufel, T. Econometrics. Solving Problems Using the Gretl Program; PWN: Warszawa, Poland, 2004.
51. Eurostat. Available online: https://ec.europa.eu/eurostat/data/database (accessed on 10 August 2023).
52. IRENA. Available online: https://www.irena.org/ (accessed on 11 August 2023).
53. GSM. Rocznik-GSM.pdf. Available online: min-pan.krakow.pl (accessed on 5 August 2023).
54. KGHM. Available online: https://kghm.com/pl/wstepne-wyniki-produkte-i-sprzedazowe-grupy-kghm-polska-miedz-sa-za-grudzien-2021-r (accessed on 1 September 2023).
55. Carrara, S.; Alves Dias, P.; Plazzotta, B.; Pavel, C. Raw Materials Demand for Wind and Solar PV Technologies in the Transition Towards a Decarbonized Energy System. 2020, Volume 10, p. 160859. Available online: https://core.ac.uk/download/pdf/322747915.pdf (accessed on 10 September 2023).
56. URE. Available online: https://www.ure.gov.pl/pl/oze/potencjal-krajowy-oze (accessed on 5 September 2023).
57. Barometr Zawodów. Available online: https://barometrzadow.pl/ (accessed on 10 August 2023).
58. Gilliland, D.; Vince, M. A note on confidence interval estimation and margin of error. J. Stat. Educ. 2010, 18, 9474.
59. Ragb, O.; Mohamed, M.; Matbuly, M.S.; Civalek, O. Nonlinear Analysis of Organic Polymer Solar Cells Using Differential Quadrature Technique with Distinct and Unique Shape Function. CMES-Comput. Model. Eng. Sci. 2023, 137, 8992.
60. Ragb, O.; Mohamed, M.; Matbuly, M.S.; Civalek, O. Sinc and discrete singular convolution for analysis of three-layer composite of perovskite solar cell. Int. J. Energy Res. 2022, 46, 4279–4300.

Figure 1. WEKR 2.0 program algorithm for Cu.

Figure 2. Forecast of the volume of photovoltaic power production by 2025.

Figure 3. PV power production scenarios until 2025.

Table 1. Features of the selected forecast models.

Model	MAPE Error, %	Model Features
ANN	5–8	not enough observations in the time series
SVM	5–16	does not provide a probabilistic interpretation of the decision boundary
FL	13.87–20.22	MAPE >10% makes the method unacceptable in the analyzed case
Grey	3–7	does not meet the conditions set for the residuals of the econometric model
Markov chains	7	may be subjective—requires experience, which affects the accuracy of the model

Table 2. Factors taken into account during the presented research.

Index	Source
Installed power, MW	Eurostat [51]
Wind energy production, GWh	Eurostat
Primary energy consumption, GJ/capita	Eurostat
CO₂ emission, mil Mg	Eurostat
Real GDP per capita, EUR/capita	Eurostat
Number of patents	IRENA [52]
LCOE for onshore technology, USD/kWh	IRENA
Cu consumption/consumption, %	IGSMiE PAN, KGHM [53,54]
Si consumption/consumption, %	IGSMiE PAN
Ge consumption/consumption, %	IGSMiE PAN
Ga consumption/consumption, %	IGSMiE PAN

Source: own elaboration.

Table 3. MLR model parameters.

Model Parameter	Parameter Value	Standard Error	p-Value	Character of the Variable
Time t	−267.66	26.61	0.01 **
LCOE	−5469.80	550.36	0.01 ***	destimulant
CO₂ emission	−6.11	1.35	0.045 *	destimulant
Cu	−60,244.20	11,966.70	0.04 **	destimulant
Primary energy consumption	36.05	6.29	0.029 **	stimulant
Patents	0.68	0.08	0.012 **	stimulant
GDP	0.05	0.01	0.033 **	stimulant
Installed capacity	0.72	0.05	0.0045 ***	stimulant

(***)—α = 0.01, (**)—α = 0.05, (*)—α = 0.1.

Table 4. MLR model ex post errors value.

Index	Index Value
Bayesian information criterion	46.6
Hannan–Quinn information criterion	41.5
Akaike information criterion	44.1
MAE	0.79
RMSE	1.11
MAPE, %	0.87
R²	0.99
Theil’s coefficient, %	0.01

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Rybak, A.; Rybak, A.; Kolev, S.D. Modeling the Photovoltaic Power Generation in Poland in the Light of PEP2040: An Application of Multiple Regression. Energies 2023, 16, 7476. https://doi.org/10.3390/en16227476

