Energy Efficiency Forecast as an Inverse Stochastic Problem: A Cross-Entropy Econometrics Approach

Bwanakare, Second

doi:10.3390/en16237715

Open AccessArticle

Energy Efficiency Forecast as an Inverse Stochastic Problem: A Cross-Entropy Econometrics Approach^†

by

Second Bwanakare

^1,2

¹

Faculty of Social and Economic Sciences, Institute of Economics and Finance, Cardinal Stephan Wyszynski University in Warsaw, 01-815 Warsaw, Poland

²

Institute of Statistics in Rzeszów, 35-959 Rzeszów, Poland

^†

This paper is an extended version of our paper published in 2015 “Greenhouse Emission Forecast as an Inverse Stochastic Stochastic Problem: A Cross-Entropy Econometrics Approach”, Proceedings of the 7th Symposium FENS, Lublin, 14–17 May 2014, Acta physica Polonica A, Vol. 127, No. 3-A, 2015.

Energies 2023, 16(23), 7715; https://doi.org/10.3390/en16237715

Submission received: 30 September 2023 / Revised: 13 November 2023 / Accepted: 14 November 2023 / Published: 22 November 2023

(This article belongs to the Special Issue Sustainable and Low Carbon Development in the Energy Sector)

Download Versions Notes

Abstract

:

This paper forecasts the energy efficiency coefficients at the Polish province level (NUT-2), based on imperfect and contradictory knowledge. On the one hand, we have information on the aggregated national energy efficiency coefficients in the industrial, transport, household, and service sectors. On the other hand, we also have information on the energy intensity at the Polish province level. Since the two samples are of different natures and known with uncertainty, we are obviously dealing with an inverse stochastic problem whose solution requires particular statistical devices. The applied technique of non-extensive cross-entropy econometrics generalizes the Shannon-Kullback-Leibler approach based on the Gaussian assumptions. Its justification is explained throughout this paper from both methodological and empirical points of view. The model forecasts lead to the high-value energy efficiency estimates from quasi-unstructured sets of information. This constitutes the main contribution of this research. These outputs should provide energy policy units with valuable new devices for the optimization of the energy management processes on a disaggregated local level where, by contrast, different agents and households act decisively. On a global level, the proposed technique can be applied in different EU countries and elsewhere, in the context of experimental official statistics.

Keywords:

overall energy efficiency score; energetic intensity; non-extensive cross-entropy econometrics; stochastic inverse problems; regional innovation

1. Introduction

The oil shocks and market supply instabilities and uncertainties that have been occurring from 1973 until today have led to an increase in energy costs with negative consequences on the growth of different economies in different parts of the world. In addition, the 1997 Kyoto Protocol operationalized the United Nations Framework Convention on Climate Change by committing industrialized countries and economies in transition to limiting and reducing greenhouse gas (GHG) emissions in accordance with agreed-upon individual goals. It is therefore the occurrence of these two joint goals of the end of the last century that forced public decision-makers to initiate energy- and carbon-saving policies. Consequently, Europe defined the goals for 2030 [1], which are a 40% reduction in greenhouse gas emissions with respect to the 1990 values and an increment of energy saving of at least 32.5%.

Following IRENA [2], energy efficiency and renewable energy sources may provide more than 80% of the required emissions savings [3]. These saving-oriented objectives require a management system enabling the evaluation of progress with respect to these defined goals on different time and space scales. In fact, many authors are aware of the complexity of this problem. This is highlighted in recent publications either from a descriptive or prospective point of view (e.g., [4,5,6,7]) or from a methodological point of view (e.g., [8,9]). One of the issues is the lack of statistical data on a disaggregated scale, which contrasts with the pressing needs for information on a parallel level for such a strategic sector of the economy.

Therefore, this paper tries to address one of these issues. We will present and apply a recent statistical approach to recover statistical information in conditions when traditional mathematical or statistical techniques are unable to pass muster. This approach is related to the principle of maximum entropy, which is known to deal with ill-posed inverse problems such as the one to be solved in this article. The same approach has been applied in the recent past. The most noteworthy work is one that assesses the interregional distribution of greenhouse emissions in Poland by industry [10]. In this document, questions will arise when trying to recover the estimates from a dashboard of energy efficiency scores at a disaggregated national level where the statistical data does not exist. The case of Polish provinces will be methodologically illustrated for many similar countries, particularly those within the European Union. Many developed countries publish annually (e.g., [11]) sectoral energy efficiency scores aggregated at the national level. However, such aggregated information makes it very difficult to assess and plan the energy policies for which scope of operation happens to be at the sub-regional levels. To overcome this problem, statistical institutes instead calculate energy intensity scores, which are easier to estimate. The first question is whether the production of statistics on energy efficiency coefficients at a disaggregated level (in our case at NUT 2 level) deserves to be calculated alongside the energy intensity coefficients. The counter argument is that the energy intensity coefficients available should be sufficient to provide information on its efficiency. Nevertheless, the correlation between energy intensity and energy efficiency is generally far from perfect. For instance, a small service-based economy in a mild climate region will be characterized by a lower intensity than a large industry-based economy in a colder climate region, even though the latter uses energy more intensively and efficiently. In addition, other elements will also play a role in defining the efficiency levels and trends. Among these are regional economic structure (share of large energy-consuming industries), geographical characteristics (e.g., longer distances leading to higher demand for transport), and climatic and weather conditions (changing demand for heating or cooling) (e.g., [11]).

Based on what has just been presented above, let us now concretize the problem targeted by this paper as follows: we need to estimate the coefficients of energy efficiency for the sixteen province levels of Poland. The energy efficiency in question concerns the following four sectors: industry, transport, households, and services. The Odyssee-Mure project (e.g., [11]) publishes national statistics on the sectoral averages of these coefficients. Next, some institutes such as the Polish Institute of Statistics [12] publish the energy intensity coefficients at a disaggregated level, e.g., at the province level. The question posed in this article is, therefore, how to reconcile these two sources of information to forecast the sector ratios of energy efficiency at a province level. Mathematically, the problem as presented in a rectangular table (see Table 1) is ill-posed. On the one hand, we only have information on the averages of the energy intensity coefficients for each of the 16 provinces, while at the level of the four sectors, we have the averages of energy efficiency without, in both cases, any details at the individual interprovincial sectoral level. On the other hand, it follows that the total of the rows cannot correspond to the total of the columns. Finally, if we additionally take into consideration the fact that the systems generating these two sources of information are not known with precision, the problem to be solved turns out to be an ill-posed stochastic inverse problem for which the solution goes beyond traditional mathematical techniques.

The remaining part of this paper is organized as follows. Section 2 introduces the concepts and definitions related to optimal energy use and saving through recent literature on the subject. Section 3 present mathematical and statistical insights of the model in the context of the problem to be solved. In this context, the concept of inverse problem is defined, and this will be followed by a detailed presentation of non-extensive cross-entropy econometrics as the main technique applied by the model. Section 4 presents the model outputs and comments. At the end of this section, a sub-section is devoted to the main limitations of the proposed approach and future areas for further research. Section 5 draws conclusions and highlights potential outcomes related to the application of the presented method.

2. Energy Efficiency and Its Measurement

This section provides basic concepts to enable good understanding and interpretation of the computed outputs. There exists a vast literature on the energy efficiency definition and measurement (e.g., [13,14]). The reason may reside in the complexity of the energy efficiency technical evaluation [15]. The issues are the choice of the right aggregation level, the appropriate variables to construct a reference energy consumption trend, the energy units to be applied, and interaction between various effects. Uncertainty margins for the results lack in most presentations as well. Among a vast literature, we underscore some of these works that disserve a higher level of attention in the context of this paper. In particular, the authors [16] present two approaches to measuring energy efficiency. The bottom-up approach, to which the “Odex” index is linked, has been developed under the EU Odyssee-Mure programme. This approach will be developed in this article. The second approach is a top-down approach, which brings together the “Decomposition” methodologies as used by e.g., the Netherlands, Canada, and New Zealand. Likewise, the authors [17] propose the calculation of total factor energy efficiency (TFEE) using the concept of global production instead of domestic product, which excludes intermediate consumption. Another paper worthy to cite comes from the authors [18]. In that work, a literature review was carried out, and the authors found that the currently in use definitions of total-factor energy efficiency and total-factor carbon emissions efficiency are confusing and misleading.

Regarding institutional research, the Environmental and Energy Study Institute [19] uses a definition of energy efficiency, emphasizing using less energy to perform the same task, i.e., eliminating energy waste. The European Parliamentary Research Service (EPRS) definition [20] highlights the fact that energy efficiency should refer, in general terms, to the amount of output that can be produced with a given energy input. Most commonly, energy efficiency is measured as the amount of energy output for a given energy input. However, other kinds of output can also be used. The EU Energy Efficiency Directive uses a very broad definition: “energy efficiency” means the ratio of output of performance, service, goods, or energy to input of energy. Following the International Energy Agency [21], the energy efficiency ratio is a ratio between energy consumption (measured in energy units) and activity data (measured in physical units).

Finally, the UNDP, as an organ of the United Nations, has published a summary work that takes up different approaches to calculating energy efficiency. In fact, this institution, in the third chapter [22], begins by identifying the methodological challenges associated with defining and measuring energy efficiency. It then proposes a framework for understanding energy efficiency trends, integrating the current UNDP approach to energy efficiency developed by various international agencies and national institutions, and establishing a methodology to identify a starting point in relation to which future improvements in energy efficiency can be measured globally and at national levels.

It is worthwhile to pay attention to the related three concepts to be used in the next part of the paper. For an economy-wide measure, GDP is often compared to energy use, to give the energy intensity (measured, for example, in kilowatt-hours per euro). Next, energy savings are the reduction of energy use, without reference to output produced. Finally, as far as energy efficiency assessment is concerned, this can be done at different levels and according to different techniques. These levels may start from economy-wide and sectoral energy intensity to individual units of activity.

We now present some details regarding the ODEX energy efficiency index from which the data to be used in our model are extracted a priori. This index is published by the Odyssee-Mure project [23] to measure the energy efficiency progress of the main sectors (industry, transport, households, services) and for the whole economy (all final consumers). The ODEX composite indicator is calculated as a weighted average of sectoral indices.

For each sector, the index is calculated as a weighted average of sub-sectoral indices of energy efficiency progress. The sub-sectors stand for industrial branches, service sector branches, end-uses for households, or transport modes.

This project calculates the scores by including the following energy efficiency components:

the energy efficiency level,
the energy efficiency trends,
the energy efficiency policies, and
the overall energy efficiency.

These three first criteria are scored between 0 and 1 on the basis of a variety of indicators (extracted from the Odyssee Database) and of energy policies (extracted from the Mure Database). The overall energy efficiency score is obtained as an average of the three scores obtained for “energy efficiency level”, “energy efficiency progress”, and “energy efficiency policies” (i.e., one-third weighting). This work will use data representing the overall energy efficiency. Following [23], the energy efficiency scoring technique is based on the OECD Composite Indicator methodology. This method allows the countries or regions to be compared in a relevant range where minimum and maximum values indicators define the best and worst scores and countries or regions are ranked between these two extrema. The indicators are calculated and normalized so that they range between 0 and 1, following this formula:

Normalized score = [Indicator – Min indicator\((Max indicator − Min indicator) ∗ direction)] + 0.5 ∗ (1 − direction)

where

Indicator: The indicator value of the country/region.

Min indicator: The minimum indicator value across all countries/regions.

Max indicator: The maximum indicator value across all countries/regions.

Direction: The favored direction in the level of indicator; −1 if the decline is favored, 1 if the incline is favored.

In spite of a “one-third weighting”, the most influential score is the one of the energy efficiency level to which the two remaining scores should be related. Its scoring, according to the Odyssee-Mure practice, is done as follows:

Scoring is done separately for the four considered sectors (households, transport, industry, and services) and for all sectors together.
The score by sector is based on scores computed for statistically selected indicators of end uses in buildings or modes in transport. For the industry sector, an aggregate score is obtained from various industrial branch scores that account for the energy efficiency characteristics of each of them.
The score by sector is calculated as a weighted score of each indicator. The weights correspond to the average shares over the last 3 years of each end use or transport mode in the sector consumption.

Finally, for comparative reasons, sector score values are normalized into the interval 0–1 according to the next formula:

Normalized score = (Indicator − Min indicator)/(Max indicator − Min indicator)

where

Indicator: The indicator value of the sector.

Min indicator: The minimum indicator value across all sectors.

Max indicator: The maximum indicator value across all sectors.

To close this section, it is worthwhile to notice that the use of the above presented energy efficiency measurements will not apply in all cases. To illustrate this issue, one can mention difficulties of the energy efficiency estimation in the presence of the recent energy system known as the integrated energy system (IES). In the context of the energy crisis and environmental degradation, an integrated energy system (IES) based on the complementarity of multiple energy sources and the cascading use of energy is considered an effective way to mitigate these problems. Due to different forms of energy and different characteristics of IESs, the interrelationships between different forms of energy are complicated, which increases the difficulty of assessing the energy efficiency of IESs. A limited number of techniques exist. We can send the interested authors to the authors [15]. These authors have proposed a technique mixing energy use efficiency (EUE) and exergy efficiency (EXE) based on the first/second laws of thermodynamics [4].

3. Mathematical Problem Setting

(a): Inverse problem and the maximum entropy principal

In many real-world situations, theorists and empiricists observe at a given time two or more quantifiable multivariate stochastic systems and want to infer on an unknown cross-correlation between their random elements. To illustrate that, we implement a cross-entropy formalism to forecast an interprovincial sectoral energy efficiency score matrix based on imperfect and contradictory information from province or sector aggregates.

The basic model for (For example, by limiting itself to ordinary cases of signal or imaging, the basic equation (Equation (1)) can be extended to, for example, the impulse response of a measurement system.) dealing with poorly posed inverse problems is to solve the integral equation of the first type. We formulate this—in the context of a model that will be developed later—as follows:

G (ζ) = \int_{D} f (β) h (β, ζ) d β + b (ζ)

(1)

-: G is the amounts observed in rows or columns;
-: f is the unknown regional cross-sectoral energy efficiency coefficient matrix;
-: D defines the model Hilbert support space;
-: $h$ is the transformation kernel associating measures G and f; and
-: $b$ explains the random components.

The literature on various methodologies devoted to the recovery of ill-posed inverse problems [24] is expansive when dealing with empirical problems (e.g., [25,26,27]). In addition to the well-known Tikhonov regularization theory [28], the Gibbs–Shannon–Jaynes principle of maximal (minimum) entropy [29,30] and recent extensions [31,32], until recently, have remained the most commonly used techniques to solve this class of problems. The general rule that applies to both approaches is to link the linear or nonlinear problem of the least squares with the regularization rule (a priori or additional information) to get to a well-posed problem. Moreover, the Gibbs–Shannon–Jaynes principle of maximum (minimum) entropy formalism tries to search for global regularity—related to the second law of thermodynamics—while producing the smoothest reconstructions consistent with the data available in the Bayesian spirit. In this research, as is often the case in many empirical applications, the discrete form of equation 1 was implemented.

Focusing on social science research, a number of other techniques have been tried for this class of inverse problems. Examples include the pseudo-inverse Moore–Penrose problem approach or the bi-proportional RAS approach and its extension [33]. Although the latter technique requires an initial transaction matrix, it offers a worse solution when the model studied is stochastic. The authors [32] showed poorer performance of the Markov chain model compared to the generalized Gibbs–Shannon entropy for this class of inverse problems. Subsequently, the Bayesian approach showed its relative superiority, particularly that associated with the principle of maximal entropy. A neural class model can also be proposed. However, it is not based on a compact theory, its application takes time, and the results are not always guaranteed. Nevertheless, recent promising research studies have proposed new algorithms to solve multicriterial, dynamic inverse problems (e.g., [26,34,35]). The entropy model has been successfully applied to update and balance social accounting matrices [33]. However, on theoretical grounds, this assumes that entropy is a positive linear function of the number of possible states and therefore sets aside the possibility of interdependencies between states and their influence. In a recent paper, the authors [36] demonstrated the convergence of two standard regularization techniques to two special power law (PL)-related q Tsallis values. For q = 2, a Tikhonov regularization is obtained, and for q = 1, the classical formulation of the Boltzmann–Gibbs–Shannon entropy is obtained. The central point is that in addition to the well-known law of scale, PL exhibits a series of interesting characteristics related to its aggregation properties, in that it is preserved under minimum and maximum, addition, multiplication, and polynomial transformation ([37,38]).

Since we are dealing with a poorly conditioned inverse problem, we must meet all three conditions of regularity (existence, uniqueness, and stability of the solution) at the same time.

If, in general, the conditions of existence and uniqueness remain available due to regular a priori constraints, the stability of the optimal solution, by random or systematic errors, is much more difficult to reach. In short, the problem is, among an infinite number of distributions that meet all the restrictions imposed, to find the one that best replicates the data generation system (DGS). As for the formalism of maximum entropy, thanks to Jaynes’ contribution [5], the reasonable candidate should be the one that reduces uncertainty about the system the most. Similarly, according to Kullback-Leibler and information divergence metrics [29,39], the best candidate should be those posteriors who meet all the binding conditions and deviate the least from the priors.

(b): Non-extensive cross-entropy energy model and confidence interval area

Kullback-Leibler information divergence metrics are best known when they apply to the Gaussian attractor phenomena for parameter estimation, while dealing with an insufficient sample. In the case of PL phenomena—of which the Gaussian law is a particular case, the divergence metric applied is the q-Generalized Kullback-Leibler relative entropy ([40,41]). Formally, this metric constitutes a junction of both the power law and the Kullback-Leibler relative entropy formalisms. The first is known to analytically solve non-linear—and/or fractal—systems [42], and the second is known to find discriminative information from two or more hypotheses in the context of insufficient information. Accordingly, let us present below the main conditions following which emerges competing superiority of the q-generalized Kullback-Leibler relative entropy with respect to traditional techniques. Following (e.g., [32,43,44]), insufficient information means that we are trying to solve an ill-posed problem that is likely to arise in the following cases:

-: sample statistics are linear or collinear for various reasons;
-: non-stationary or non-cointegrating variables result from poor model specification;
-: data from the sampling plan are insufficient and/or incomplete due to technical or financial constraints—official statistics on small areas could illustrate this situation;
-: The Gaussian properties of random disturbances are questioned, among others, due to systematic errors resulting from the research process;
-: the model is not linear, and the last option is an approximate linearization; and
-: observations of aggregated data (in time or space) may hide a very complex system represented, for example, by a PL distribution, and there may be multifractal properties of the system.

Thus, it results from the above that the q-Generalized Kullback-Leibler relative entropy will be conceptually free from the traditional hypotheses, mainly those related to the least squares method. Among these, we can list the sphericity or collinearity attributes of the model.

Regarding the model specification, we estimate, using the proposed cross-entropy technique, the random parameters of the model from the cross-sectional data [45] of two distinct periods (years 2020 and 2021). In the context of that technique, authors [32] proposed the solutions to panel models based on various statistical hypotheses. The outputs of those models revealed higher precision in comparison with traditional techniques. Nevertheless, since we deal with a stochastic inverse problem, treating the problem as related to a panel structure could lead to important theoretical and computational problems related to the hypothesis of the non-extensivity of entropy.

The model related to that metric has been extensively presented in different publications (e.g., [44]) in the context of macroeconomic analysis. Since this work deals with energy management, it is worthwhile to reformulate the model in the energy management context to enable interpretation of the outputs.

We implement (the generalized Bregman Kullback-Leibler may be an alternative version of this model) the usual (the generalized Bregman Kullback-Leibler may be the alternative version of this model) discrete form of the q-Generalized Kullback-Leibler relative entropy (Equation (5)). With the cross-entropy new constraining data, the model updates the initial information (priors in Table 1) and provides new outputs (posteriors).

It is necessary to redefine the parameterization of the generalized linear model (Equation (2)), which plays the role of constraints. The inside table elements to be forecasted can be meaningfully presented by columns as the discrete Bayesian joint probabilities explaining each region’s average cross-sector weight or probabilities corresponding to individual energy efficiency ratio. The ratio total per column will sum up to unit. We recall that energy efficiency coefficients to be forecasted are in the form of normalized indicators, thus belonging within the space zero and one. In this case, the parameter processing space coincides with the probability space. Under these conditions, the accuracy of the estimated parameters is greater, since a priori there is no loss of information from these data [46]. In any case, let us briefly present the general procedure of reparameterization in the case of a general linear inverse model:

Y = X^{'} \cdot β + ε,

(2)

where the values of unknown parameters

β

are not necessarily bounded between

0

and

1

, indicating the need for reparameterization. This

ε

term is an unobservable random term for perturbations, plausibly with finite variance exhibiting observation errors from empirical measurement or random shocks that may be driven by PL. The variable

Y

consists of data observed—with errors—of an unknown data-generating system of energy efficiency coefficients by sector, and

X

may represent known average regional energy intensity coefficients—with uncertainty—through the relational parameter matrix

β

and the unobservable disturbance

ε

to be estimated through the observable error components

e

. Unlike classical econometric models, no binding assumptions are required—for example, regarding the distribution of random errors. In particular, as we deal with an ill-behaved inverse problem, the number of parameters to be estimated should be greater than the observed data points, and the quality of the informative data collected should be low. The process of the true system recovery requires the entropy objective function to include all of the interacting constraining consistency moments. Thus, referring to the properties of the relative entropy principle, each new piece of constraining information will reduce the entropy level of the system in accordance with the degree of data consistency with the system. For this multi-dimensional space inverse problem, among an unlimited number of model solution candidates, the best solution will result from identifying the one that—in terms of probability—best simulates the data-generating system. By taking each of

β_{k l}

(k = 1 \dots K, l = 1 . . . L)

as a discrete and random variable with a compact support (e.g., [32]) and

2 < M < \infty

possible outcomes, it can be estimated using

B_{k l}

, i.e.,

B_{k l} = \sum_{m = 1}^{M} p_{k l m} v_{k l m},

(3)

where

p_{k l m}

is the probability of the outcome

v_{k l m}

, and the probabilities must be non-negative and added to one. Similarly, by treating each element

e_{\cdot l}

(which affects the total uncertainty of the sector efficiency) as an

e

finite, discrete random variable with a compact support and

2 < J < \infty

possible results centred at zero, we can express

e_{\cdot l}

as

e_{\cdot l} = \sum_{j = 1 . . J} r_{\cdot l j} z_{\cdot l j},

(4)

As mentioned, it can be assumed that each previous entire row of errors has been evaluated,

ω_{k \cdot}

and a similar support space should be constructed as follows:

ω_{k \cdot} = {\sum_{s = 1}^{S} μ_{k \cdot s} υ}_{k \cdot s}

where

r_{\cdot l j}

and

μ_{k \cdot s}

are the outcome probabilities in the support spaces, respectively.

z_{k l j}

v_{l s}

j = 1 . . . J

s = 1 . . . S

. Therefore,

k = 1 . . . K

and

l = 1 . . . L

represent, respectively, the indexes of the number of rows and columns whose coefficient sums were estimated with errors. Moreover,

e

the term error

ω

is empirically set around the empirical standard error of the stated variables and a priori represents the Bayesian hypothesis. The choice of error limits, of course, depends on their own properties. In this study, their sets were determined by Chebyshev’s inequality [47], with the boundaries of the support space ranging from −3 to +3. Notice that in spite of this Gaussian property of the priors, posterior probabilities in the support space may represent a class of a non-Gaussian distribution, in particular a PL.

The element

v_{k l m}

constitutes a priori information provided by the researcher, while

p_{k l m}

is an unknown probability generating the true parameter

β_{k l}

, the value of which must be determined by solving a non-extensive cross-entropy econometrics problem. In matrix notation, let us rewrite

β = V \cdot P

with

p_{k l m} \geq 0

and

\sum_{k = 1}^{K} \sum_{j > 2 \dots J} p_{k l m} = 1

. Also, let

e = r \cdot z

with

r_{\cdot l j} \geq 0

and

\sum_{j > 2 \dots J} r_{\cdot l j} = 1

for K and L, the number of rows and columns, and

J

, the number of data points over the support space for the error terms inside the regional cross-sector matrix. The same conditions of normality can be easily formulated for any vector of column sums. Next, the cross-entropy econometric estimator of the Tsallis entropy can be presented as follows:

\begin{matrix} M i n H_{q} (p / / p^{0}, r / / r^{0}, μ / / μ^{0}) \\ \equiv α \sum p_{k l m} \frac{{[p_{k l m} / {p^{o}}_{k l m}]}^{q - 1} - 1}{q - 1} \\ + β \sum r_{\cdot l j} \frac{{[r_{\cdot l j} / {r^{o}}_{\cdot l j}]}^{q - 1} - 1}{q - 1} + \dots + δ \sum μ_{k \cdot s} \frac{{[μ_{k \cdot s} / {μ^{o}}_{k \cdot s}]}^{q - 1} - 1}{q - 1} \end{matrix}

(5)

Subject to

Y_{\cdot l} = c c j . l \sum_{k} ({[Y_{\cdot l} P_{k l}]}^{'} + e_{\cdot l}) = [c c j . l {\sum_{k}^{K} [(\sum_{m ≻ 2}^{M} {Y_{\cdot l}}^{'} v_{k l m} ({p_{k l m}}^{q})]}^{'} + \sum_{j = 1 . . J} {r^{q}}_{\cdot l j} z_{\cdot l j})]

(6)

H_{k \cdot} = C_{k \cdot} ({X_{k}}_{\cdot} + ω_{k \cdot}) = C_{k \cdot} (X_{k \cdot} + {\sum_{s = 1}^{S} {μ^{q}}_{k \cdot s} υ}_{k \cdot s})

(7)

\sum_{k . . K} H_{k \cdot} = \sum_{l . . L} Y_{\cdot l}

(8)

\sum_{k = 1}^{K} \sum_{j > 2 \dots J} p_{k l j} = 1

(9)

\sum_{j > 2 \dots J}^{J} r_{\cdot l j} = 1

(10)

\sum_{s > 2 \dots S}^{S} μ_{k \cdot s} = 1

where

Y_{\cdot l}

: indicates each sum per column

l

(values observed by energy sector

l

, including unknown errors);

H_{k \cdot}

: each row total (observed values per province

k

) corrected for

C_{k \cdot}

;

X_{k \cdot}

: total energy intensity indicators by region affected by unknown errors; and

p_{k l}

: probabilistic structure of energy efficiency ratios by sector and region.

C_{k \cdot}

is a positive scaling factor related to province climatic factor (average annual temperature by province) to match the totals of energy intensity and energy efficiency indicator levels,

{C C}_{\cdot j}

is the arbitrary additional scaling factor representing other random variables, but

C_{k \cdot}

to balance row and column sums “

\cdot

” refers to a variable bound to a row or column total, depending on the context.

Non-extensive statistics use a number of binding forms in which expectations can be set. The above model uses Curado-Tsallis (C-T) constraints [40,48], the general form of which is as follows:

⟨y_{q}⟩ = \sum_{i} {p_{i}}^{q} y_{i}

The parameter

q

, as already mentioned, represents the Tsallis parameter. Theoretically, the values of this parameter vary between 0 and 3 [49]. Once again, its value equal to 1 corresponds to the Gaussian attractor. In the real world, the values of this parameter should evolve between 1 and 5/3, thus including spontaneous or man-made stable fractal structures such as in most financial or economic markets ([50,51]) indices. As Table 1 suggests, the

C_{k \cdot}

remains a scaling factor related to province climatic factor contributing to match the totals of energy intensity and energy efficiency indicator levels that are known with uncertainty. Unlike the factor

C_{k \cdot}

, whose role has just been expressed above, the factor

{C C}_{\cdot j}

is the arbitrary additional scaling factor representing other random variables but

C_{k \cdot}

to complementarily contribute to balancing row and column sums. This is because the

C_{k \cdot}

, which explains that climatic differences within provinces could not play the balancing role alone since other factors may exist in explaining the difference between energy intensity and energy efficiency. Among these, as already said in the introductory section, are the regional economic structure (e.g., share of large energy-consuming industries) and the geographical characteristics of provinces (e.g., longer distances leading to higher demand for transport). Still, the sum per column (i.e.,per sector) of the posterior probabilities is constrained to unity, given

{C C}_{\cdot j}

. In the above model (Equations (5)–(10)), their values will play the role of the new Bayesian data discriminating in favor of the new inferential evidence. We must recall that when no new data are included in the cross-entropy model, the results correspond to those from formulating a maximum entropy principle, without additional conditions, except those of normality.

The above

H_{q} (p / / p^{0}, r / / r^{0}, μ / / μ^{0})

is nonlinear and measures entropy in the model. The relative entropies of the three independent terms (respectively the three posteriors

p

,

r

,

μ

and corresponding priors

p^{0}

r^{0}

μ^{0}

) are then added together using the weights. These are the real positives that amount to unity within the mentioned constraints. The first term known as the one of “parameter precision” takes into account the discrepancies between the estimated parameters

α, β, δ

and the prior parameters (usually defined in the support space). The second and third terms “ex-post predictions” include the empirical error term as the difference between the predicted and observed data values (see the last row and column in Table 1) in the model. The first component of the criterion function can relate to the structure of the table parameters, the second component to the errors in the row totals, and the last component to the errors in the column totals.

It should be noted that the estimates of the model and their variances should be influenced not only by the length of the support space but also by the spatial scale effect, i.e., the number of affected point values [32]. The greater the number of these points, the better the prior information, i.e., nonlinear starting points, about the system.

Next, the random errors

ω_{i}

(see Equation (7)) explain the errors in data collection and processing and are not necessarily related to the Gaussian distribution being itself a particular case of a PL. Traditionally, regarding Bayesian formulations and relative entropy, it should be noted that both models will lead to similar results if and only if the real expected errors associated with the data generation system are zero in the symmetric support space around zero (see, e.g., [44]). Similarly, the results of the Gibbs-Shannon cross-entropy and the Tsallis’ non-extensive cross-entropy will match when errors included in the model are not correlated, and the system distribution will then evolve towards the Gaussian attractor (e.g., [52,53,54]).

With respect to the confidence interval of the parameters, Equation (11) shows the non-additivity of the Tsallis entropy for two—probable—independent systems, one related to the probability distribution of the parameters and the other to the probability distribution of error perturbations:

S (\hat{P} r) = [S (\hat{p} + \hat{r})] = \{[S (\hat{p}) + S (\hat{r})] + (1 - q) \cdot S (\hat{p}) \cdot S (\hat{r})\}

(11)

where

S (\hat{p}) = - [1 - \sum_{k} \sum_{m} {(p_{k l m})}^{q}] / [K \cdot (M^{1 - q} - 1)]

S (\hat{r}) = - [(1 - \sum_{l} \sum_{j} {r^{q}}_{\cdot l j})] / [L \cdot (J^{1 - q} - 1)]

S (\hat{P} r)

is then the sum of the normalized entropy associated with the parameters of the

S (\hat{p})

model and the term of perturbation

S (\hat{r})

. Similarly, the last value is

S (\hat{r})

obtained for all observations

l

, with

J

the number of data points above the support of the estimated probabilities

r

related to the time length of errors.

The values of these normalized entropy indices range from zero to one. The values close to one indicate a weak information variable, while lower values indicate a parameter that is estimated to be more informative

{\hat{β}}_{k l}

in the model.

4. Outputs and Comment

We will illustrate the empirical basis of the theoretical model developed in the above sections by applying the Polish energy efficiency coefficients to the case. Table 1—whose row and column symbols correspond to Equation (6)—is presented to illustrate the extent to which a researcher can have limited information about quantity and quality before solving an inverse and ill-posed inverse problem such as the one in this article.

We have two aggregated energy indicators. each from the Polish energy demand sectors per province. The problem in hands stands as [29] a discrete problem in a multidimensional space, leading to (

k

− 1) × (

l

− 1) degree of freedom that would illustrate the case of a standard inverse problem. In this problem, as illustrated in Table 1 ([55,56]), we have 3 × 15 = 45 degrees of freedom related to the number of energy efficiency indicators per sector and province. Moreover, as alluded to before, the row total and the column total are different, probably as a result of the two separate data sources with different nature and scale.

As the above pages made us aware of, this kind of problem corresponds the best to the philosophy naturally contained in the principle of maximum entropy that we have implemented to forecast the energy efficiency ratios displayed in Table 2.

Let us now comment on the model outputs presenting the final solution about the cross “energy efficiency ratios“ (column 5 of Table 2) by sector and province. As presented in the theoretical model (Equations (5)–(10)), in addition to the classical normality conditions and moment consistency, the earlier explained random factors

{C C}_{\cdot j}

and

C_{k \cdot}

allowed the system to balance. The first factor is related to various variables differentiating energy intensity from energy efficiency except the climatical factor represented by

C_{k \cdot}

. As already said, these calculated ratios in the above table are not deterministic, since the priors are not known with certainty. Therefore, we treat them as random variables, from which the posteriors result from the optimization process. It is worthwhile to recall that the presented energy efficiency scores stand for the overall energy efficiency scores, i.e., a combination of the three already presented components: the energy efficiency level, the energy efficiency progress (i.e., energy efficiency trends), and the energy efficiency policies. The prior matrix of energy efficiency ratios, which is not presented in this paper, has been initiated on the basis of knowledge of the province energy intensity index and the sector energy efficiency ratio averages. Next, using proportions of the energy intensity index (the last column of Table 1) of different provinces, we computed the initial energy efficiency ratios by sector to sum up to the total of the energy efficiency ratios—known with error—available from the last row of the same Table 1. In probabilistic terms, we have assumed the energy efficiency ratio to be a uniform distribution across the provinces, and thus the last column of the energy intensity is a marginal probability. The assumption behind this procedure is that the province with a lower or a higher average energy intensity index will have a lower or a higher average energy efficiency ratio, respectively, irrespective of the considered sector. Doing so, we enabled the nonlinear mathematical system to start the search of the model global optimal solution from the best starting points leading to a quick convergence. The post-entropy posterior ratios in Table 2 are normalised, and the higher the value, the lower the energy efficiency level for a given sector and/or province. The obtained forecasts are empirically close to real world expectations of energy efficiency ranking within Polish provinces. We notice that in the industry sector, the provinces Mazowieckie (including Warsaw) and Wielkopolski (including Poznan) have the lowest energy efficiency ratios, respectively, around 0.320 and 0.333. The highest ratio is shown in the provinces Swietokrzyskie and Opole, with ratios around to 0.445 and 0.482, respectively. Globally, we notice that Mazowieckie displays the highest efficiency in all sectors, while Opole displays the lowest efficiency. We notice too that the industry sector globally remains the most efficient among all sectors.

Table 3 presents the level at which the model has discriminated the prior in favor of the post cross-entropy solution given the new model consistency moments and normality conditions. Consequently, we notice in that table the highest post cross-entropy discriminating values through all sectors in the case of Opole in comparison with the remaining sectors. It is worthy to recall that, as described in the precedent paragraphs, the prior values presented the same structure for different sectors. The cross-entropy formalism has discounted the non-valuable information to just retain the information fitting to corresponding sectors and provinces given the initial information on the energy intensity indicator and the climatic factor.

Following the formulation of Equation (11), the confidence interval of the model is a normalized value ranging between zero to one. In the present case, the comparable to the corrected classical coefficient of determination

R^{2}

is equal to 0.011, much closer to zero (lowest entropy) than one (highest entropy). Then, the proposed model system has discriminated optimally from the priors, given all the constraints encountering the energy efficiency system and defined by the equations. Nevertheless, it is important to mention that the cross-entropy information metric does not conceptually conform to the property of triangular inequality inherent in Euclidean distance.

Limitations of the Study and Prospective Research Area

Let us start with the strong side of the model and then discuss its limitations. Next, we will point out the specificity of the approach of the relative non-extensive entropy econometrics. We talk about it in the context of solving a random inverse problem consisting of the estimation of energy efficiency coefficients by sector and province in Poland. As explained in the article, this approach inherits its relative strength and originality from the combination of the following three attributes:

-: The power law distribution generalizes most of the known statistical laws and has proven to be designed for analytically resolving non-stationary functions. Its application leads to the analytical closed form outputs of the model.
-: The Kullback-Leibler relative entropy, being a combination between the properties of entropy and Bayesian formalism, stands for a strong information metric, particularly in the case of inverse problem modelling. Thanks to this formalism, several related hypotheses using the least squares method become obsolete.
-: While these two scientific sub-disciplines are based on solid hypotheses, joining traditional econometrics to them leads to the model proposed in this paper.

As a matter of fact, this approach will combine the advantages listed above: a minimum of hypotheses, the generalization of the normal law, and high precision of estimated parameters.

Now let us talk about the limitations. The major limitation of the approach is owing to computational complications that may arise while solving nonlinear systems. We used GAMS (General Algebraic Modelling System) software 43.4.1) to perform computations. Based on our own experience, the solution to this problem was to use one of the recent programs (connected to GAMS) designed to calculate the global optimum of nonlinear systems as well as to find the most informative initial points of the solution (the priors). In this case, we used “Knitro”. The point concerning the choice of the appropriate starting solution was explained when we presented the model.

In recent years, the non-extensive entropy approach has been extended to two parameters (in addition to q), and this opens up greater ease of modelling as well as faster convergence towards the global optimum from any point space and with a minimum of additional information about the system. This line of research could make this approach easier to apply to different models hitherto considered insoluble or soluble due to non-realistic or unverifiable hypotheses.

The next point of improvement of the model should be to analyze by simulation techniques the individual impact of the main factors impacting the energy efficiency coefficients. Once again, it is about climatic conditions, the regional economic structure, and the geographical characteristics of provinces. The model presented in this paper is limited to just estimating the sectoral coefficients by province. Therefore, certain factors have been grouped into the same variable “cc.j”.

Next, it would be worthwhile for the future to compare the relative entropy approach presented here with other powerful forecasting techniques for the same specific case. We could cite here, among others, the grey systems ([8,57]) based on solid theoretical foundations, the approach of neural networks successfully used in many fields of sciences, or the Fuzzy Time Series technique [58] plus its different versions, known for its relatively good precision in terms of predictions. The comparison of all these techniques requires a good understanding of the theoretical background of each of them and then the conceptually targeted area of their applications.

Finally, thanks to this approach, this problem could also be solved in the case of further disaggregation, for example, to the NUTS 3 level. Once again, this is of capital importance because it is at the more disaggregated levels that economic actors act.

5. Conclusions

The objective of this article was to carry out a forecast of energy efficiency ratios in conditions where statistical data are not only missing but also from contradictory sources. In empirical research, this case is rather a rule than an exception. In most situations, this will lead to a nonlinear ill-posed inverse problem for which traditional statistics fail to find an analytical solution unless commodity additional hypotheses are made, as is often the case in empirical econometrics.

In the present paper, we have analysed this problem at the level of the 16 Polish regions for four sectors of energy consumption. We started with sparse information concerning the annual sectoral energy efficiency ratio averages aggregated at the national level. We disposed as well of the annual averages of energy intensity aggregated at the provincial level. The problem, therefore, consisted of bringing together these two aggregates of different natures to produce reliable information on interprovincial sectoral energy efficiency coefficients.

The forecasts obtained seem to be in line with the expectations of those who know the realities behind the Polish energy sector.

This work more or less described in detail the generalized cross-entropy technique and illustrated its applicability in the field of the energy experimental statistics.

Depending on the countries studied, the energy efficiency factors presented in this paper could differently impact the deviation between the energy intensity and energy efficiency indicators. Next, a comparison of information on the energy efficiency across sectors and regions will be of significant benefit to target grey areas and redefine more appropriate energy policies fitting to realistic social parameters inside a disaggregated locality. In particular, the outputs of the model could allow public energy institutions to monitor the relative progress of each locality in their endeavour to optimize energy use given their own specifical conditions. Taking things on a province scale, the provinces like Opole, Swietokrzyskie or Lubuskie with the lowest efficiency in all sectors should try to allocate higher investment budgets to increasing parallelly to energy intensity and efficiency through new technology and province industry geographical reorganisation. This may take time but will pay off in the future. Finally, as it results from the above comments, an extensive forecast on the European or larger scale of these indicators through experimental official statistics could allow for better management of energy saving as targeted by the European or worldwide climate change mitigation institutions or projects.

Funding

This research is supported by the Cardinal Stephan Wyszynski University of Warsaw.

Data Availability Statement

The data is available and explained in this article; details of calculations may be available on request.

Conflicts of Interest

The author declares no conflict of interest.

References

Tsemekidi Tzeiranaki, S.; Bertoldi, P.; Paci, D.; Castellazzi, L.; Serrenho, T.; Economidou, M.; Zangheri, P. Energy Consumption and Energy Efficiency trends in the EU-28, 2000–2018; Office of the European Union: Luxembourg, 2020. [Google Scholar]
IRENA. Global Energy Transformation: A Roadmap to 2050 (2019 Edition); International Renewable Energy Agency: Abu Dhabi, United Arab Emirates, 2019; ISBN 978-92-9260-121-8. [Google Scholar]
IRENA. Reaching Zero with Renewables: Eliminating CO2 Emissions from Industry and Transport in Line with the 1.5 Degree, Climate Goal; International Renewable Energy Agency: Abu Dhabi, United Arab Emirates, 2020; ISBN 978-92-9260-269-7. [Google Scholar]
Jakimowicz, A. The Material Entropy and the Fourth Law of Thermodynamics in the Evaluation of Energy Technologies of the Future. Energies 2023, 16, 3861. [Google Scholar] [CrossRef]
Jakimowicz, A. The future of the energy sector and the global economy: Prosumer capitalism and what comes next. Energies 2022, 15, 9120. [Google Scholar] [CrossRef]
Foster, J. Energy, aesthetics and knowledge in complex economic systems. J. Econ. Behav. Organ. 2011, 80, 88–100. [Google Scholar] [CrossRef]
Batrancea, L.; Pop, M.C.; Rathnaswamy, M.M.; Batrancea, I.; Rus, M.I. An empirical investigation on the transition process toward a green economy. Sustainability 2021, 13, 13151. [Google Scholar] [CrossRef]
Wang, Q.; Li, S.; Li, R. Forecasting energy demand in China and India: Using single-linear, hybrid-linear, and non-linear time series forecast techniques. Energy 2018, 161, 821–831. [Google Scholar] [CrossRef]
Wang, Q.; Li, S.; Li, R.; Jiang, F. Underestimated impact of the COVID-19 on carbon emission reduction in developing countries–a novel assessment based on scenario analysis. Environ. Res. 2022, 204, 111990. [Google Scholar] [CrossRef]
Bwanakare, S. Greenhouse Emission Forecast as an Inverse Stochastic Stochastic Problem: A Cross-Entropy Econometrics Approach. Acta Phisica Pol. A 2015, 127, A-13–A-20. [Google Scholar] [CrossRef]
Odyssee-Mure. Indicator Tools. Available online: https://www.odyssee-mure.eu/ (accessed on 24 April 2023).
Polish Institute of Statistics. Regional Statistics. 2020–2021. Available online: https://bdl.stat.gov.pl/bdl/pomoc/stanzasilenia?active=0# (accessed on 16 November 2022).
Proskuryakova, L.; Kovalev, A. Measuring energy efficiency: Is energy intensity a good evidence base? Appl. Energy 2015, 138, 450–459. [Google Scholar] [CrossRef]
Office of Energy Efficiency & Renewable. Energy Efficiency vs. Energy Intensity Analysis; Office of Energy Efficiency & Renewable: Waszyngton, DC, USA, 2023.
Su, H.; Huang, Q.; Wang, Z. An energy efficiency index formation and analysis of integrated energy system based on exergy efficiency. Front. Energy Res. 2021, 9, 723647. [Google Scholar] [CrossRef]
Reuter, M.; Narula, K.; Patel, M.K.; Eichhammer, W. Linking energy efficiency indicators with policy evaluation—A combined top-down and bottom-up analysis of space heating consumption in residential buildings. Energy Build. 2021, 224, 110987. [Google Scholar] [CrossRef]
Li, S.; Diao, H.; Wang, L.; Li, C. Energy Efficiency Measurement: A VO TFEE Approach and Its Application. Sustainability 2021, 13, 1605. [Google Scholar] [CrossRef]
Li, S.; Wang, W.; Diao, H.; Wang, L. Measuring the efficiency of energy and carbon emissions: A review of definitions, models, and input-output variables. Energies 2022, 15, 962. [Google Scholar] [CrossRef]
EESI. Environmental and Energy Study Institute. Available online: https://www.eesi.org/ (accessed on 28 July 2023).
European Union. EPRS. EU. Available online: https://www.europarl.europa.eu/at-your-service/en/stay-informed/factsheets-on-the-european-union (accessed on 8 November 2023).
IEA. Energy Efficiency Indicators: Fundamentals on Statistics; IEA: Paris, France, 2014. [Google Scholar]
UNDP. Chapter 3: Energy Efficiency. In Global Tracking Framework; United Nations: San Francisco, CA, USA, 2017. [Google Scholar]
Odyssee-Mure. Methodology European Energy Efficiency Scoreboard; Odyssee-Mure: Luxemburg, 2020. [Google Scholar]
Kirsch, A. An Introduction to the Mathematical, Theory of Inverse Problems; Springer International Publishing: New York, NY, USA, 1996. [Google Scholar]
Zhang, Y.; Zhang, Y.; Shi, W.; Shang, R.; Cheng, R.; Wang, X. A new approach, based on the inverse problem and variation method, for solving building energy and environment problems: Preliminary study and illustrative examples. Build. Environ. 2015, 91, 204–218. [Google Scholar] [CrossRef]
Zio, E. Approaching the inverse problem of parameter estimation in groundwater models by means of artificial neural networks. Prog. Nucl. Energy 1997, 31, 303–315. [Google Scholar] [CrossRef]
Zhang, Y.; O’Neill, Z.; Dong, B.; Augenbroe, G. Comparisons of inverse modeling approaches for predicting building energy performance. Build. Environ. 2015, 86, 177–190. [Google Scholar] [CrossRef]
Tikhonov, A.N.; Arsenin, V.I. Solutions to Poorly Conditioned Problems; John Wiley & Sons: New York, NY, USA, 1977. [Google Scholar]
Jaynes, E.T. Probability Theory: The Logic Of Science; Washington University: Washington, DC, USA, 1944. [Google Scholar]
Shannon, C.E. A mathematical theory of communication. Bell Syst. Tech. J. 1948, 27, 379–423. [Google Scholar] [CrossRef]
Bwanakare, S. Econometrics of non-extended entropy: New statistical characteristics of constant elasticity of substitution-related models. Entropy 2014, 16, 2713–2728. [Google Scholar] [CrossRef]
Golan, A.; Juge, G.; Miller, D. Maximum Entropy Econometrics: Robust Estimation with Limited Data; Wiley: Chichester, UK, 1996. [Google Scholar]
Robinson, S.; Cattaneo, A.; El-Said, M. Updating and estimating a social accounting matrix using cross entropy methods. Econ. Syst. Res. 2001, 13, 47–64. [Google Scholar] [CrossRef]
Gee, S.B.; Tan, K.C.; Alippi, C. Solving multiobjective optimization problems in unknown dynamic environments: An inverse modeling approach. IEEE Trans. Cybern. 2017, 47, 4223–4234. [Google Scholar] [CrossRef]
Perera, R.; Fang, S.E.; Ruiz, A. Application of particle swarm optimization and genetic algorithms to multiobjective damage identification inverse problems with modelling errors. Meccanica 2010, 45, 723–734. [Google Scholar] [CrossRef]
Velho, F.; de Campos, H.; Elcio, H.; Shiguemori; Carvalho, F.M.; Ramos, J.C.A. Theory of Unified Regularization: The Principle of Non-Extensive Maximum Entropiy. Differential equation. Appl. Math. 2006, 25, 307–330. [Google Scholar]
Jessen, H.A.; Mikosch, T. Regularly varying functions. Publ. De L’institut Math. 2006, 80, 171–192. [Google Scholar] [CrossRef]
Gabaix, X. The Laws of Power in Economics and Finance; BER: Washington, DC, USA, 2008. [Google Scholar]
Kullback, S. Information Theory and Statistics; John Wiley and Sons: New York, NY, USA, 1959. [Google Scholar]
Plastino, A.R.; Venkatesan, R.C. Deformed Statistics K-L Divergence Minimization within a Scaled Bregman Framework. Phys. Lett. A 2011, 375, 4237–4243. [Google Scholar]
Bağci, G.B.; Arda, A.; Sever, R. On the problem of constraints in nonextensive formalism: A quantum mechanical treatment. Int. J. Mod. Phys. B 2006, 20, 2085–2092. [Google Scholar] [CrossRef]
Nielsen, F.; Nock, R. A closed-form expression for the Sharma–Mittal entropy of exponential families. J. Phys. A Math. Theor. 2012, 45, 32003. [Google Scholar] [CrossRef]
Jakimowicz, A. The role of entropy in the development of economics. Entropy 2020, 22, 452. [Google Scholar] [CrossRef]
Bwanakare, S. Non-Extensive Entropy Econometrics for Low Frequency Series: National Accounts-Based Inverse Problems; De Gruyter Open Poland: Warsaw, Poland, 2018. [Google Scholar]
Wooldridge, J.M. Econometric Analysis of Cross Section and Panel Data, 2nd ed.; MIT Press: Cambridge, MA, USA, 2010; ISBN 100262232588. [Google Scholar]
Shen, E.Z.; Perloff, J.M. Maximum entropy and Bayesian approaches to the ratio problem. J. Econom. 2001, 104, 289–313. [Google Scholar] [CrossRef]
Pukelsheim, F. The Three Sigma Rule; American Statistical Association: Alexandria, VA, USA, 1994; Volume 48, pp. 88–91. [Google Scholar]
Abe, S.; Bagci, G.B. Constraints and Relative Entropies in Nonextensive Statistical Mechanics. arXiv 2004, arXiv:cond-mat/0404253. [Google Scholar]
Gell-Mann, M.; Tsallis, C. Nonextensive Entropy, Interdisciplinary Applications; Oxford University Press: New York, NY, USA, 2004. [Google Scholar]
Rak, R.; Drożdż, S.; Kwapień, J. Nonextensive statistical features of the Polish stock market fluctuations. Phys. A Stat. Mech. Its Appl. 2007, 374, 315–324. [Google Scholar] [CrossRef]
Drăgulescu, A.; Yakovenko, V.M. Exponential and power-law probability distributions of wealth and income in the United Kingdom and the United States. Phys. A Stat. Mech. Its Appl. 2001, 299, 213–221. [Google Scholar] [CrossRef]
Tsallis, C. Introduction to Non-extensive Statistical Mechanics: Approaching a Complex World; Springer: Berlin, Germany, 2009. [Google Scholar]
Bwanakare, S. Solving a generalized constant elasticity of substitution function of production: A non ergodic maximum enropy principle. Acta Phys. Pol. 2013, 123, 502. [Google Scholar] [CrossRef]
Bwanakare, S. Non-extensive econometric model of entropy (NO): A case of labour demand in Podkarpackie Voivodeship. Acta Phys. Pol. 2010, 117, 647. [Google Scholar] [CrossRef]
EU. Eurostat. 2021. Available online: https://ec.europa.eu/eurostat/web/main/data/statistical-themes (accessed on 19 December 2022).
GUS. GUS. 2021. Available online: http://old.stat.gov.pl/gus/publikacje_a_z_PLK_HTML.htm (accessed on 30 June 2022).
Deng, J. Introduction to Grey System Theory. J. Grey Syst. 1989, 1, 1–24. [Google Scholar]
Singh, P.; Dhiman, G. A hybrid fuzzy time series forecasting model based on granular computing and bio-inspired optimization approaches. J. Comput. Sci. 2018, 27, 370–385. [Google Scholar] [CrossRef]

Table 1. Sector and province energy efficiency ratios (2021).

Polish Province	Industry	Transport	Households	Services	Average Energy Intensity Coefficients (GWh\mln zl Value Added)
	$Y_{\cdot 1} P_{11}$	$Y_{\cdot 2} P_{12}$	$Y_{\cdot l} P_{1 l}$	$Y_{\cdot L} P_{1 L}$	0.08
Mazowieckie	$Y_{\cdot 1} P_{21}$	..	$Y_{\cdot l} P_{2 l}$	$Y_{\cdot l} P_{2 L}$	0.05
Malopolskie	..	..	..	..	0.06
Silesian	..	..	..	..	0.09
Lublin	..	..	..	..	0.07
Podkarpackie	..	..	..	..	0.06
Podlaskie	..	..	..	..	0.06
Swietokrzyskie	..	..	..	..	0.09
Lubuskie	$Y_{\cdot 1} P_{k 1}$	..	$Y_{\cdot l} P_{k l}$	$Y_{\cdot L} P_{k L}$	0.07
Wielkopolska	..	..	..	..	0.05
Zachodnia pomors	..	..	..	..	0.07
Dolnoslaskie	..	..	..	..	0.07
Opole	..	..	..	..	0.11
Kujawska-pomorska	..	..	..	..	0.07
Pomorska	..	..	..	..	0.06
Warminsko-mazurskie	$Y_{\cdot 1} P_{K 1}$	..	$Y_{\cdot l} P_{K l}$	$Y_{\cdot L} P_{K L}$	0.06
Average energy efficiency ratio	0.372	0.612	0.621	0.689

Sources: Own based on Odyssey-Mure and Polish Institute of Statistics (GUS).

Table 2. Post-entropic 2021 energy efficiency ratio forecasts and efficiency progress in 2020/2021(%).

	Industry	Transport	Households	Services	Average Efficiency Ratio per Province	Efficiency Ratio Change_2021/2020
Lodz	0.4	0.667	0.677	0.754	0.625	−0.12
Slaskie	0.429	0.729	0.741	0.83	0.682	0.81
Mazowieckie	0.302	0.471	0.477	0.521	0.443	−3.16
Wielkopolska	0.333	0.526	0.533	0.583	0.494	7.49
Dolnoslaskie	0.369	0.602	0.61	0.676	0.564	−1.29
Opole	0.482	0.847	0.861	0.974	0.791	2.43
Malopolskie	0.336	0.536	0.543	0.596	0.503	−2.44
Swietokrzyskie	0.445	0.764	0.777	0.873	0.715	5.32
Zachodnia pomors	0.373	0.61	0.618	0.685	0.572	0.04
Kujawska-pomorska	0.37	0.604	0.613	0.678	0.566	−0.88
Pomorska	0.336	0.536	0.543	0.597	0.503	−2.44
Lublin	0.369	0.602	0.61	0.675	0.564	−1.29
Podkarpackie	0.336	0.536	0.543	0.597	0.503	−2.33
Lubuskie	0.386	0.636	0.645	0.716	0.596	4.07
Podlaskie	0.336	0.536	0.543	0.597	0.503	−2.39
Warminsko-mazurskie	0.339	0.541	0.548	0.602	0.507	−1.53
Sector average ratio	0.371	0.609	0.618	0.685
Progress_2021/2020(in %) ratio	0.067	0.369	0.385	0.548

Table 3. Model cross matrix increment (in %) between priors and posteriors (2021).

	Industry	Transport	Households	Services
Lodz	−4.901	−4.569	−4.566	−4.558
Slaskie	−9.481	−8.137	−8.098	−7.812
Mazowieckie	14.89	11.26	11.13	10.104
Wielkopolska	14.89	11.26	11.13	10.104
Dolnoslaskie	0.405	−0.418	−0.455	−0.76
Opole	−17.153	−14.151	−14.051	−13.297
Malopolskie	6.77	4.634	4.552	3.9
Swietokrzyskie	−9.481	−8.137	−8.098	−7.812
Zachodnia pomors	0.405	−0.418	−0.455	−0.76
Kujawska-pomorska	0.405	−0.418	−0.455	−0.76
Pomorska	6.77	4.634	4.552	3.9
Lublin	0.405	−0.418	−0.455	−0.76
Podkarpackie	6.77	4.634	4.552	3.9
Lubuskie	0.405	−0.418	−0.455	−0.76
Podlaskie	6.77	4.634	4.552	3.9
Warminsko-mazurskie	6.77	4.634	4.552	3.9
AEE ratio	−0.289	−0.908	−0.937	−1.178

Source: Own calculations.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Bwanakare, S. Energy Efficiency Forecast as an Inverse Stochastic Problem: A Cross-Entropy Econometrics Approach. Energies 2023, 16, 7715. https://doi.org/10.3390/en16237715

AMA Style

Bwanakare S. Energy Efficiency Forecast as an Inverse Stochastic Problem: A Cross-Entropy Econometrics Approach. Energies. 2023; 16(23):7715. https://doi.org/10.3390/en16237715

Chicago/Turabian Style

Bwanakare, Second. 2023. "Energy Efficiency Forecast as an Inverse Stochastic Problem: A Cross-Entropy Econometrics Approach" Energies 16, no. 23: 7715. https://doi.org/10.3390/en16237715

APA Style

Bwanakare, S. (2023). Energy Efficiency Forecast as an Inverse Stochastic Problem: A Cross-Entropy Econometrics Approach. Energies, 16(23), 7715. https://doi.org/10.3390/en16237715

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Energy Efficiency Forecast as an Inverse Stochastic Problem: A Cross-Entropy Econometrics Approach^†

Abstract

1. Introduction

2. Energy Efficiency and Its Measurement

3. Mathematical Problem Setting

4. Outputs and Comment

Limitations of the Study and Prospective Research Area

5. Conclusions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

Energy Efficiency Forecast as an Inverse Stochastic Problem: A Cross-Entropy Econometrics Approach †

Abstract

1. Introduction

2. Energy Efficiency and Its Measurement

3. Mathematical Problem Setting

4. Outputs and Comment

Limitations of the Study and Prospective Research Area

5. Conclusions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Energy Efficiency Forecast as an Inverse Stochastic Problem: A Cross-Entropy Econometrics Approach^†