1. Introduction
The by-production of undesirable, or so-called “bad”, outputs is an inherent attribute of many production processes. Electric power generation is a classical example of such a process, where the production of electricity (desirable output) is accompanied by the emission of pollutant gases (undesirable outputs). It is therefore imperative to account for undesirable outputs when estimating the productivity growth for such processes (e.g., see [
1,
2]).
The estimation of productivity (and, potentially, its components) naturally requires the estimation of the firm’s production process, the modeling of which in the presence of undesirable outputs is however not a clear-cut issue. A standard approach is to condition the conventional transformation (production) function on undesirable outputs (e.g., [
3,
4,
5]) which, effectively, treats these outputs as inputs. Such a treatment of undesirable outputs has since been heavily criticized due the implied strong disposability of undesirable outputs [
6] and the violation of the “material balance condition” [
7]. A popular alternative approach to tackling undesirable outputs is to specify a (single) directional output distance function [
6,
8] which accommodates both the expansion in desirable outputs and a simultaneous contraction in undesirable outputs. Feng and Serletis [
9] have recently proposed a primal Divisia productivity index based on such a directional output distance function.
Both the directional output distance function and the productivity index based on the latter allow the identification of a “composite” measure of inefficiency and productivity (respectively) only. Specifically, when modeling the production technology via a (single) directional distance function (e.g., [
6]), the inefficiency is defined over the entire vector of outputs, both desirable and undesirable. This produces a single measure of inefficiency which is a weighted combination of the technical and environmental inefficiencies, where the “weighting” is done on the basis of the prespecified directional vector. Similarly, the directional-output-distance-function-based productivity index identifies the “composite” productivity growth only. Thus, modeling undesirable outputs via the standard directional functions precludes researchers from disentangling the
technical inefficiency/productivity, conventionally oriented along desirable outputs, from the
environmental, or so-called “green”, inefficiency/productivity, oriented along undesirable outputs.
1 Both can be of great interest from a policy perspective.
In this paper, we follow a different path to modeling the production process with undesirable outputs in the spirit of Fernández
et al. [
11,
12, Forsund [
13] and Murty
et al. [
7]. Specifically, we model the firm’s production process as a system of separate simultaneous production technologies for desirable and undesirable outputs. In this setup, desirable outputs are produced by transforming inputs via the conventional transformation function satisfying all standard assumptions. Consistent with the material balance condition, the by-production of undesirable outputs is however treated as the so-called “residual generation technology”. The above setup explicitly recognizes that the generation of undesirable outputs is not the intended production but rather the
by-production process. By separating the generation of undesirable outputs from that of desirable outputs, we ensure that the former are not modeled as inputs as well as take into account their “costly disposability” (see [
7]).
The by-production system approach that we employ in this paper permits us to not only distinguish between technical efficiency and (undesirable-output-specific) environmental efficiencies but to also differentiate between traditional technical productivity and environmental (“green”) productivity. Specifically, we derive a Solow [
14] type primal (Divisia) environmental productivity index which, unlike a conventional desirable-output-oriented productivity index, is defined as the
contraction rate in undesirable outputs unexplained by the contraction in desirable outputs. This allows us to credit the
ceteris paribus reduction in undesirable outputs. Our environmental productivity index also provides a meaningful way to decompose productivity into environmental technological change and environmental efficiency change.
We apply our system approach as well as the environmental productivity index to study the efficiency and productivity trends among coal-fired electric power generating plants in the U.S. during the 1985–1995 period. The production of (desirable) electric power by these utilities is accompanied by the (undesirable) emission of SO2 and NOx gases.
We estimate the model subject to theoretical regularity conditions using (numerically) efficient Bayesian MCMC technique, where we also allow for unobserved plant-specific heterogeneity in addition to time-varying inefficiencies. We impose monotonicity and curvature regularity restrictions (at every data point) in order to ensure that our results are economically meaningful, as emphasized by Barnett
et al. [
15] and Barnett [
16]. Among many things, we find that electric utilities in our sample tend to suffer from higher levels of environmental inefficiency in the emission of SO
2 than in the emission of NO
x gases. We also document a significant divergence between the electric-power-oriented technical productivity and the emission-oriented environmental productivity. Specifically, we find that, while the pooled posterior mean estimate of (annual) productivity growth is negative for electric power generation (–0.13%), it is non-negligibly positive for the SO
2 and NO
x emissions:
2 2.25% and 3.31% per annum, respectively. The cumulative eleven-year growth is 23.26% for the SO
2-oriented environmental productivity, 37.98% for the NO
x-oriented environmental producitivity and a mere 5.33% for the electric-power-oriented technical productivity.
The rest of the paper proceeds as follows.
Section 2 describes the by-production system approach to modeling production technology in the presence of undesirable outputs as well as provides the derivation of the environmental productivity index. Data are discussed in
Section 3. We explain our econometric strategy in
Section 4.
Section 5 presents the results, and
Section 6 concludes.
2. The By-Production Model
Building on Fernández
et al. [
11,
12], the undesirable-output-generating production system (𝕋) with
J inputs
,
M desirable outputs
and
P undesirable outputs
can be formalized as the intersection of the primary technology used in the production of desirable outputs (
) and
P individual undesirable-output residual generation technologies (
),
i.e.,
3Consider the case of
inputs,
desirable and
undesirable outputs (as in our empirical application). Allowing for technical inefficiency in the production of a desirable output and environmental inefficiency in the by-production of undesirable outputs, we rewrite system 𝕋 in terms of the stochastic transformation function and two separate (environmental) residual generation functions for each undesirable output,
i.e.,
where
and
are technical and environmental efficiencies, respectively; and
are the white noise terms. The transformation function
is assumed to satisfy all standard properties such as continuity, positive (negative) monotonicity in
Y (
X), linear homogeneity in
Y and concavity in
X and
Y. Similarly, the residual generation function
is continuous, positively (negatively) monotonic in
(
Y), linearly homogeneous in
and convex in
Y and
.
Thus, the production system (2) permits the identification of
both the technical and environmental efficiencies:
θ and
. The latter is feasible due to the separability of the primary desirable-output generating production technology (2a) from the undesirable-output residual generating technologies (2b), which is motivated by the by-production approach satisfying the material balance condition. For instance, one would generally be unable to disentangle technical and environmental efficiencies (the way the above system approach allows us to) if following a popular alternative to the estimation of production processes in the presence of undesirable outputs based on the directional distance function [
8].
Specifically, when modeling the production technology via a (single) directional distance function (e.g., [
6]), the inefficiency is defined over the entire vector of outputs, both desirable and undesirable, using an
a priori specified directional vector. The latter precludes researchers from disentangling the technical inefficiency conventionally oriented along desirable outputs from the environmental inefficiency oriented along undesirable outputs. The directional distance function rather produces a “composite” measure of inefficiency which is a weighted combination of the two, where the “weighting” is done on the basis of the prespecified direction. Further, unlike a system in (2), the directional distance function yields an additive, not a proportional, measure of inefficiency.
Technical and Environmental Productivity
The production system 𝕋 that we consider in this paper permits us to not only distinguish between technical efficiency θ (conventionally defined over the desirable output) and undesirable-output-specific environmental efficiencies but to also differentiate between traditional technical productivity and environmental, or the so-called “green”, productivity.
Letting time
t enter the transformation and residual generation functions
and
explicitly and making use of their linear homogeneity properties, system (2) can be rewritten in the log form as
where, for convenience, we define
and
; and
and
are technical and environmental
inefficiencies, respectively.
Total differentiation of (3) with respect to
t yields
where we have made use of
since
are the
i.i.d. white noise. After some rearranging, from (4a) we get the following Solow [
14] type (Divisia)
technical productivity index:
along with the similarly defined
environmental productivity index from (4b):
The negative monotonicity of
and
in inputs and desirable outputs, respectively, imply that
and
.
4Unlike
which is conventionally defined as the
expansion rate in a desirable output
unexplained by the growth in inputs, the environmental productivity index
is defined as the
contraction rate in an undesirable output
unexplained by the contraction in desirable outputs.
5 This allows crediting the
ceteris paribus reduction in undesirable outputs.
Further, Equations (5) and (6) provide a meaningful way to decompose respective productivity indices into technical/technological change and efficiency change. The conventional technical productivity index equals the sum of the technical change , which measures the temporal shift in the production frontier, and technical efficiency change , which measures the movement toward (away from) the frontier. Similarly, the -oriented environmental productivity index is decomposed into similarly interpreted environmental technological change and environmental efficiency change .
Note the conceptual difference between the definition of a “technological progress” for desirable outputs and that for undesirable outputs. Namely, for a desirable output Y the technological progress corresponds to the case of , i.e., an outward shift in the production frontier over time, whereas for an undesirable output the technological progress corresponds to , i.e., an inward shift in the residual generating frontier over time. Thus, the residual generating frontier is defined as the minimum quantity of undesirable output generated when producing a given quantity of desirable outputs subject to the material balance condition.
We emphasize that the primary advantage of employing a system approach to model the production process with undesirable outputs, which we consider in this paper, is the opportunity to disentangle technical and environmental productivities. For instance, as in the case of inefficiency, one generally cannot do that when using the productivity index based on the directional distance function [
9].
4. Econometric Strategy
Under the assumption of the translog functional form of
and
, from system (3) we get the following system consisting of the production function for a desirable output
:
complemented by the (environmental) residual generation technologies for undesirable outputs
:
where a lower-case variable denotes the log of its upper-case counterpart, and
denotes the time dummy. For greater flexibility, we also allow for unobserved firm-specific heterogeneity which we model via “true” random effects
. The presence of these random effects (in addition to inefficiencies) captures additional technological heterogeneity among firms.
Since
appears on the right-hand side of equations for undesirable outputs
and
, it is imperative that all three equations in (7)–(8) be estimated as a system (of simultaneous equations) in order to control for the endogeneity of outputs. We estimate this production system subject to symmetry (
) as well as monotonicity and curvature restrictions. In this paper, we thus concur with Barnett [
15] and Barnett [
16] on the importance of maintaining the latter theoretical regularity conditions when modeling technology (especially, if allowing for inefficiency) in order to ensure that the results are economically meaningful.
Specifically, the monotonicity conditions are:
The curvature is imposed using restrictions on the eigenvalues of the Hessian matrices in levels (see [
18]). We employ the following stochastic specification for system (7)–(8):
where
denotes the (multivariate) half-normal distribution;
6 Σ and
are the covariance matrices;
where
is an identity matrix of dimension
κ and
; and
is a set of
unknown parameters. The location parameters of each inefficiency term
is given by
. Thus, for greater flexibility in modeling time effects, we allow inefficiency to be time-varying (
i.e., a function of the time dummies). The error components
are assumed to be orthogonal as well as independent of
. Further, the random effects
are assumed to be identically, independently distributed from the error components
as well as independent of
:
where
.
4.1. Priors
For the parameters in system (7)–(8), which we collectively denote by ϑ, we assume a non-informative prior that imposes the regularity restrictions so that , where denotes the set of acceptable parameters. For scale parameters , we assume , where and . For τ, we assume a proper but relatively non-informative prior of the form with . For Σ and , we assume proper but relatively non-informative priors in the Wishart family. The results are not sensitive to c, or unless c becomes approximately less than 0.1, in which case it approaches the domain of a dogmatic prior.
One may inquire if it would be possible to select objective priors such as in the case of Jeffreys’ prior. One way to proceed with objective priors would be along the lines of Berger and Mortera [
20] and Mulder
et al. [
21]. For instance, the use of a constrained posterior prior along the lines of Berger and Mortera [
20] is an option. The Jeffreys’ prior cannot be obtained analytically but can be computed using numerical or analytic derivatives. This computation is certainly heavy. Furthermore, the Jeffreys’ prior is not used as much in the present literature, and the emphasis is rather placed on the so-called intrinsic Bayes factor (see [
22]). We leave the issue for future research, but we do not expect much change since our results were not sensitive to important aspects of the prior.
4.2. Posterior Distribution
For convenience, we let
and
. The kernel posterior distribution of all parameters denoted by
(a superset of
ϑ), if conditioned on the latent inefficiencies and random effects, is given by
where
and
. Also:
where
and
denote regressors in (7) and (8), respectively (some of which are endogenous);
denotes vectors of parameters in the three equations of the system and
; and
denotes the entire available data. Further,
(via the Cholesky decomposition) and
denotes the
k-variate normal probability integral evaluated at some vector
.
The first term in the third line of (12) owes to the constraint
. Specifically, our stochastic assumptions about
imply the density
which requires the evaluation of a tri-variate normal integral that can be performed using standard numerical algorithms. Before proceeding with MCMC methods for inference, note that the posterior is given by
While the multivariate integration can be performed in the closed form with respect to inefficiencies u, the induced nonlinearity however precludes analytical integration with respect to random effects
α. We are not aware of any
efficient MCMC scheme that draws these random effects
as a block from the posterior, especially when
n is relatively large.
The posterior conditional distribution of latent inefficiencies is
where
. Draws from the above conditional distribution can be easily obtained. The same is true for the posterior conditional distribution of the random effects if we write
from (13) as
, where
is defined in an obvious way. We can then draw the random effects as a block for observation
i as follows
where
If it were not for the regularity constraints and the non-standard form of the posterior conditional distribution of τ (due do the term involving the multivariate normal integral), we could easily derive the posterior conditional distribution of parameters of interest ϑ.
Collecting data for all observations over
and
, we rewrite our model in an obvious matrix notation:
where we assume
We rewrite the system of equations (19) compactly as
where 𝕐 is an
vector of “data” appearing on the left-hand side of equalities in (19),
,
and
is a conformable vector of parameters.
System (21) takes the form of a multivariate regression model with
The GLS estimator of Θ is given by
with the corresponding covariance matrix:
We note that the above approximation however ignores that ϑ included in ϖ needs to satisfy (the regularity conditions).
Let us define a multivariate normal distribution centered at of which the covariance is for some constant . We denote this distribution by , where κ is the dimensionality of ϖ, i.e., the number of parameters in the extended system (19). We use the GLS quantities to form a proposal density for generating candidate parameter draws as we describe below.
Next, we describe how to realize draws from the conditional posterior distributions of
σ, Σ and
. Except for
,
σ and Σ can be drawn from standard statistical distributions. Specifically, for the elements of
σ we have:
where
,
and
. Here,
denotes all elements of the entire parameter vector
θ including all latent variables except the indicated subscripted parameter
.
Our priors are conditionally conjugate,
i.e.,
where
is a scalar prior parameter and
are prior matrices. In our empirical work we take
and
.
The posterior conditional of Σ is
where
for
.
The conditional posterior of
is
where
for
.
Clearly,
belongs to the Wishart family. The same would have been true for
if it were not for the second line of (28) which involves the Cholesky factor of this matrix. Therefore, we use the Wishart distribution to draw a candidate matrix and we retain the candidate with probability
where
denotes the candidate draw and
is the existing
sth draw (
).
4.3. Imposition of Restrictions
Imposing restrictions is not trivial in our application. Since the restrictions depend on the data, we adopt the following strategy. We draw from the proposal density described in the previous subsection subject to the constraints using a special form of rejection to improve the efficiency of “naive rejection” which would keep drawing parameters until all constraints are satisfied. Specifically, we first use acceptance at a limited number of points to facilitate acceptance and then we keep drawing from the proposal distribution until all regularity constraints hold at all data points.
We first impose the restrictions at the means of variables (normalized to zero) and then at points around the mean. We choose , and the restrictions hold without much trouble in the positive direction. In the negative direction, the restrictions are first tested for and then tested at the remaining points. This yields considerable improvement in the efficiency (i.e., timing) of acceptance rates from a density which we describe next.
Based on a current draw
such that
, a new candidate
is generated until it satisfies the regularity restrictions. The candidate is accepted and we set
with the Metropolis-Hastings probability
otherwise we repeat the current draw, that is
. We adjust the scaling constant
h so that the acceptance rate of the Metropolis-Hastings algorithm is between 20% and 30%. The Metropolis-Hastings algorithm also takes care of nonlinearity of the posterior in
τ.
We generate the covariance matrix Σ and scale parameters σ from their respective posterior conditional distributions which are all in standard form (inverted Wishart and inverted Gamma). The latter however is not the case for (i.e., this matrix cannot be drawn using an inverted Wishart). We therefore take an extra Metropolis-Hastings step to accommodate the presence of the Cholesky factor of (i.e., ) in the posterior inside the multivariate normal integral. Acceptance rates using a simple Metropolis-Hastings step were quite high (over 90%), and simple scaling has brought it down to the range of 20%–25%.
Our MCMC uses
preliminary or transient passes until we obtain convergence using Geweke’s [
23] relative numerical efficiency (RNE) diagnostic. Once convergence is achieved, we take another 100,000 passes. We do not use thinning. Instead, we report posterior standard deviations based on Newey-West HAC covariance matrices using 10 lags. For details, see
Table 1.
Table 1.
Computational Experience with the Data.
Table 1.
Computational Experience with the Data.
| (1) | (2) |
---|
median RNE | 0.113 | 0.615 |
median NSE | 0.0010 | 0.0012 |
draws to convergence | 150,000 | 70,000 |
median ACF at lag 50 | 0.977 | 0.312 |
4.4. Improving Performance of MCMC
We can explicitly integrate
u out of (12) to obtain a kernel posterior of the following form:
Further, we can also derive the closed-form conditional posterior of random effects .
We can achieve a significant improvement of MCMC performance by recognizing that the random effects
α can be explicitly integrated out of the posterior, when the parameters
ϑ are drawn. Specifically, similar to (19), we consider the following system
which we can rewrite in compact notation as
where
. Collecting all (time) observations for a given firm
i together, we obtain:
where
,
and
are defined in an obvious way. Clearly:
where
is a
matrix of which all elements are equal to one.
Therefore, we can redefine the GLS quantities that are used to obtain a good proposal distribution for
ϑ using the following as the covariance matrix
V,
i.e.,
Using the modified proposal density, we effectively
marginalize out the random firm-specific effects from the posterior and thus can draw latent inefficiencies
marginally on these effects hoping to reduce overall autocorrelation in MCMC due to the correlation between
and
.
7 This requires a trivial modification in the way we draw latent inefficiencies.
Since model (32) may be written as
after collecting all (time) observations for a given firm
i, in obvious notation, we have
where
and
are naturally redefined to account for a change of sign in the first equation of (37) in order to accommodate a uniform sign in front of inefficiencies
. Also recall that
and its stochastic properties have been defined before. Now we can draw
inefficiencies
as a block, after we couple system (38) with the following specification:
subject to
and
.
Since
we can draw latent inefficiencies, marginally on random effects, using the following multivariate truncated normal distributions,
i.e.,
the first two moments of which are
We have found that drawing blocks of latent inefficiencies marginally on random effects
(and conditionally on various covariance matrices and
) results in vast improvements in terms of computational efficiency.
Table 1 summarizes our computational experience with the data.
4.5. Random Effects
Based on our discussion above, we note that given the way the variance parameters enter the covariance matrix Ω, in principle, there is no problem in treating the random effects as jointly normally distributed, i.e., , independently over as well as independent from all other random variables and regressors in the model. All our derivations, including the conditional posterior distribution of , hold true. The only difference is that, in a more general setting (when random effects are allowed to be correlated across equations) one has to draw Ω from its conditional posterior distribution as a general positive definite matrix, whereas, when the random effects are a priori independent, the problem boils down to drawing variances only.
A general case of correlated random effects has an empirical implication of firm-specific effects in the production function being correlated with those in residual generation functions for undesirable outputs. The latter possibility is testable and is of interest on its own. Should Ω be found not to be diagonal, one should naturally focus on the sign of the correlation between the random effects.
Given a proper prior
on the different elements of Ω and the marginal posterior
, the Verdinelli and Wasserman [
25] approach to computing the Bayes factor in favor of diagonality is given by
which, in the general case, involves testing
zero restrictions, where
k is the dimension of Ω (in our case,
). By “
”, we mean the zero restrictions
, where
.
While the denominator of
is easy to compute, the numerator is computed in a standard fashion as
where
is the
sth (of
S) MCMC draw of all parameters
θ except those in Ω. Note that
does include the diagonal elements of Ω in this computation.
It remains to show how draws from the conditional posterior distribution may be realized. Our prior is conditionally conjugate and has the following form:
The conditional posterior distribution is given by
where
and
as before. We define the baseline prior using
and
with
. Clearly, Ω belongs to the Wishart family.
The Bayes factor
using the baseline prior is
with the corresponding range of
, which suggests that the diagonality of Ω can be definitely rejected. In order to compute the range of
, we generate 1000 alternative priors and implement the approximated MCMC using the sampling-iterative-resampling (SIR) algorithm which re-weights the original MCMC sample without recomputing MCMC samples for a new prior. For each one of the SIR re-weighted samples, we implement the Verdinelli and Wasserman [
25] approach, and the range is taken as the 95% confidence interval of the computed approximate Bayes factors in favor of diagonality. The 1000 alternative priors are generated by uniformly varying
in the interval
and
c in the interval
without restricting them to integer values.
Table
Table 2 reports the posterior means and standard deviations of the correlation coefficients between random effects
derived from Ω. We find that unobserved firm-specific effects are all positively correlated.
Table 2.
Correlation between Random Effects.
Table 2.
Correlation between Random Effects.
| | |
---|
| 0.831 (0.0011) | 0.630 (0.0130) |
| | 0.601 (0.0102) |
5. Results
Before proceeding to the discussion of technical and environmental inefficiencies as well as productivity and its components, we first focus on elasticities of the production process by electric utilities in our data sample.
Table 3 reports the summary of posterior estimates of these elasticities, including input elasticities of the primary production function (
i.e., electric power generation) as well as elasticities of SO
2 and NO
x emissions (undesirable outputs) with respect to the net generated electric power (desirable output).
8 In particular, the reported input elasticity estimates imply a posterior mean estimate of the returns to scale, defined as the sum of input elasticities, of 0.90, which suggests that, on average, electric utilities operated at decreasing returns to scale during our sample period.
The estimates of are of particular interest since they capture the cost of expanding the production of electric power in terms of the associated increase in the generation of the SO2 and NOx emissions. It is intuitive to interpret these estimates as “shadow prices” (in the elasticity form) of the power generation. The posterior mean estimates of the two shadow prices are 1.09 and 1.13. The latter implies that, on average, an increase in the net power generation by 1% requires a simultaneous increase in the SO2 and NOx emissions by at least 1.09% and 1.13%, respectively. Note that emissions may increase by even more if the firm is not on the residual generating frontier, i.e., environmentally inefficient.
We next proceed to the discussion of technical and environmental inefficiencies exhibited by the utilities in our sample.
Figure 1 plots kernel densities of the posterior estimates of the three types of inefficiency. In order to construct the figure, we use a Gaussian kernel with the cross-validated bandwidth parameters. We find apparent differences between the distributions of technical and environmental inefficiencies. Specifically, while technical inefficiency is relatively symmetrically distributed around its mean of 0.09, the distribution of the NO
x-oriented environmental inefficiency is noticeably skewed to the right and the distribution of the SO
2-oriented environmental inefficiency exhibits apparent bi-modality. There may be many reasons for such a stark difference between the levels of technical and environmental inefficiencies across utilities. One plausible explanation is that technical inefficiency may also be capturing declines in the desirable output due to unforeseen fluctuations in the demand for electric power. Since inputs often cannot be immediately adjusted/reallocated and electric power is not easily storable, electric plants may be forced to under-utilize their facilities and labor, which our model would detect and classify as technical underperformance (inefficiency) relative to the frontier. However, such a demand uncertainty would not apply to the by-production of undesirable SO
2 and NO
x gases given the exact physical relationship between the power generation and the associated emission of pollutant gases. The latter is also capable of at least partly explaining why environmental inefficiency (in the emission of both SO
2 and NO
x gases) appears to be relatively more stable over time unlike the electric-power-oriented technical inefficiency, which we discuss in more detail later in the paper.
Table 3.
Summary of Posterior Estimates.
Table 3.
Summary of Posterior Estimates.
| Mean | Median | St.Dev. | 95% Credible Interval |
---|
Elasticity |
Capital Elasticity | 0.2985 | 0.2984 | 0.0505 | (0.1992; 0.3959) |
Labor Elasticity | 0.4032 | 0.4043 | 0.0482 | (0.3076; 0.4935) |
Energy Elasticity | 0.2002 | 0.1998 | 0.0103 | (0.1801; 0.2205) |
RTS | 0.9018 | 0.9032 | 0.0726 | (0.7608; 1.0370) |
SO2 Shadow Price | 1.0873 | 1.0664 | 0.1437 | (0.8524; 1.4334) |
NOx Shadow Price | 1.1275 | 1.1163 | 0.1161 | (0.9366; 1.3988) |
Inefficiency |
Tech. Ineff. | 0.0905 | 0.0915 | 0.0257 | (0.0361; 0.1390) |
SO2 Env. Ineff. | 0.0870 | 0.0875 | 0.0351 | (0.0254; 0.1504) |
NOx Env. Ineff. | 0.0458 | 0.0438 | 0.0156 | (0.0186; 0.0798) |
Efficiency Change |
TEC | 0.0029 | 0.0022 | 0.0308 | (–0.0575; 0.0647) |
SO2 EEC | –0.0000 | –0.0004 | 0.0099 | (–0.0133; 0.0207) |
NOx EEC | 0.0000 | –0.0004 | 0.0038 | (–0.0050; 0.0107) |
Technological Change |
TTC | –0.0042 | –0.0027 | 0.0104 | (–0.0272; 0.0117) |
SO2 ETC | 0.0225 | 0.0224 | 0.0110 | (0.0024; 0.0446) |
NOx ETC | 0.0331 | 0.0332 | 0.0052 | (0.0229; 0.0431) |
Productivity Growth |
TPG | –0.0013 | 0.0009 | 0.0323 | (–0.0663; 0.0626) |
SO2 EPG | 0.0225 | 0.0220 | 0.0149 | (–0.0021; 0.0495) |
NOx EPG | 0.0331 | 0.0330 | 0.0064 | (0.0211; 0.0453) |
Figure 1.
Kernel Densities of Technical and Environmental Inefficiency Estimates.
Figure 1.
Kernel Densities of Technical and Environmental Inefficiency Estimates.
Further, we find that electric utilities tend to suffer from higher levels of inefficiency in the emission of SO
2 than in the emission of NO
x gases. These differences may be reflective of varying degree of strictness of environmental regulations (or the degree of their enforceability) for different pollutants across states. For instance, loose regulations for the SO
2 emissions could potentially explain why the SO
2-oriented environmental inefficiency is considerably higher on average and is more dispersedly distributed than the NO
x-oriented inefficiency. Lastly, we document little correlation between the two environmental inefficiencies as well as between the environmental inefficiencies and the technical inefficiency.
Table 4 reports such Spearman rank correlation coefficients for the posterior inefficiency estimates. For instance, there appears to be virtually no relationship between the electric-power-oriented technical inefficiency and the SO
2-oriented environmental inefficiency exhibited by utilities.
Table 4.
Rank Correlation Coefficients.
Table 4.
Rank Correlation Coefficients.
Inefficiency | Efficiency Change |
Tech. Ineff | 1.000 | | | TEC | 1.000 | | |
SO2 Env. Ineff | 0.080 | 1.000 | | SO2 EEC | –0.059 | 1.000 | |
NOx Env. Ineff | 0.112 | 0.227 | 1.000 | NOx EEC | 0.080 | 0.139 | 1.000 |
Technological Change | Productivity Growth |
TTC | 1.000 | | | TPG | 1.000 | | |
SO2 ETC | –0.001 | 1.000 | | SO2 EPG | –0.024 | 1.000 | |
NOx ETC | –0.020 | 0.045 | 1.000 | NOx EPG | –0.049 | 0.065 | 1.000 |
We next proceed to the posterior estimates of the productivity growth components—technological change and efficiency change—as defined in (5) and (6). From
Table 3, we see that the levels of both environmental efficiencies appear to be stable over time: posterior mean estimates of the SO
2 and the NO
x are virtually zero. The left panel of
Figure 2, which depicts box-and-whiskers plots of the distributions of technical and environmental efficiency change estimates for each year in our sample, confirms the relative stability of environmental efficiency levels (see the left panel of the figure). The electric-power-oriented (
i.e., desirable-output-oriented) technical efficiency is however less stable over the course of the years. The mean estimates of
are predominantly positive across utilities in 1986, 1988 and 1995, whereas a significant decrease in efficiency is documented for 1987 and 1994. However, the mean posterior estimate of
pooled over the entire sample is a mere 0.29% (also see
Table 3).
We document several regularities in the estimates of the technological change. First, the technical change
is primarily negative and is close to zero in the production of electric power (a desirable output). The posterior mean estimate of annual
is –0.42%. This finding is in sharp contrast with what we observe for the environmental technological change
. The posterior mean estimates of
for the SO
2 and NO
x emissions (undesirable outputs) are staggering 2.25% and 3.31% per annum, respectively. Second, we find that technological change is fairly stable across the years in all dimensions, be it the intended production of electric power or undesirable by-production of emission gases. The right panel of
Figure 2 confirms this observation: the distributions of
and the two
do not change much over the course of the years.
The discussed differences in the temporal dynamics between technical and environmental efficiency change and technological change result in dramatic differences in the measures of productivity across desirable and undesirable outputs. We find that, while the pooled posterior mean estimate of (annual) productivity growth is negative for electric power generation (–0.13%), it is substantially positive for the SO
2 and NO
x emissions: annual 2.25% and 3.31%, respectively. In other words, keeping input quantities constant, the net electric power generation, on average, fell by 0.13% per year during our sample period. Utilities however did a significantly better job in terms of cutting the emission of SO
2 and NO
x gases for any fixed quantity of the net electric power generated: on average, emissions fell by respective 2.25% and 3.31% per year
ceteris paribus. This disconnect between technical and environmental productivities of electric plants in our sample is also confirmed by virtually zero rank correlation coefficients between
and
for the SO
2 and NO
x emissions (see
Table 4).
Figure 3 vividly illustrates the differences between technical and environmental productivities. It plots the productivity indices that are normalized to 100 in the year 1985 and are constructed using the (respective)-output-weighted average annual productivity growth rates (over all utilities in the sample). The figure shows that the industry enjoyed stable positive rates of
during the 1985–1995 period, whereas
was more sporadic and included periods of both positive and negative growth. The cumulative eleven-year growth is 23.26% for the SO
2-oriented
, 37.98% for the NO
x-oriented
and a mere 5.33% for the electric-power-oriented
.
Figure 2.
Technical and Environmental Efficiency Change and Technological Change.
Figure 2.
Technical and Environmental Efficiency Change and Technological Change.
Figure 3.
Technical and Environmental PG Indices.
Figure 3.
Technical and Environmental PG Indices.
6. Conclusions
The prevalent approaches to modeling the firm’s production process in the presence of undesirable outputs either treat these outputs as inputs, which questionably implies their strong disposability as well as violates the “material balance condition”, or employ the directional distance function that, despite a sound theoretical foundation, allows the identification of a “composite” measure of inefficiency only. Similarly, the directional-output-distance-function-based productivity index identifies only the composite productivity which is a weighted combination of the technical and environmental productivities. This precludes researchers from disentangling the technical inefficiency/productivity, conventionally defined over desirable outputs, from the environmental (“green”) inefficiency/productivity, defined over undesirable outputs.
In this paper, we follow a different path to modeling the production process with undesirable outputs in the spirit of Fernández
et al. [
11,
12, Forsund [
13] and Murty
et al. [
7]. Specifically, we model the productive operations of the firm as a system of simultaneous production technologies for desirable and undesirable outputs. In this setup, desirable outputs are produced by transforming inputs via the conventional transformation function satisfying all standard assumptions. Consistent with the material balance condition, the by-production of undesirable outputs is however treated as the so-called “residual generation technology”. The above setup explicitly recognizes that the generation of undesirable outputs is not the intended production but rather the
by-production process. By separating the generation of undesirable outputs from that of desirable outputs, we ensure that the former are not modeled as inputs as well as take into account their “costly disposability”.
The by-production system approach that we employ in this paper permits us to not only distinguish between technical efficiency and (undesirable-output-specific) environmental efficiencies but to also differentiate between traditional technical productivity and environmental productivity. Specifically, we derive a Solow [
14] type primal (Divisia) environmental productivity index which, unlike a conventional desirable-output-oriented productivity index, is defined as the
contraction rate in undesirable outputs unexplained by the contraction in desirable outputs. This allows us to credit the
ceteris paribus reduction in undesirable outputs. Our environmental productivity index also provides a meaningful way to decompose productivity into environmental technological change and environmental efficiency change.