Next Article in Journal
A New Approach to Model Verification, Falsification and Selection
Previous Article in Journal
Strategic Interaction Model with Censored Strategies
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Bayesian Approach to Disentangling Technical and Environmental Productivity

by
Emir Malikov
1,*,
Subal C. Kumbhakar
2,3 and
Efthymios G. Tsionas
4
1
Department of Economics, St. Lawrence University, Canton, NY 13617, USA
2
Department of Economics, State University of New York at Binghamton, Binghamton, NY 13902, USA
3
Norwegian Agricultural Economics Research Institute, NO-0030 Oslo, Norway
4
Department of Economics, Lancaster University Management School, Lancaster LA1 4YX, UK
*
Author to whom correspondence should be addressed.
Econometrics 2015, 3(2), 443-465; https://doi.org/10.3390/econometrics3020443
Submission received: 16 April 2015 / Accepted: 8 June 2015 / Published: 16 June 2015

Abstract

:
This paper models the firm’s production process as a system of simultaneous technologies for desirable and undesirable outputs. Desirable outputs are produced by transforming inputs via the conventional transformation function, whereas (consistent with the material balance condition) undesirable outputs are by-produced via the so-called “residual generation technology”. By separating the production of undesirable outputs from that of desirable outputs, not only do we ensure that undesirable outputs are not modeled as inputs and thus satisfy costly disposability, but we are also able to differentiate between the traditional (desirable-output-oriented) technical productivity and the undesirable-output-oriented environmental, or so-called “green”, productivity. To measure the latter, we derive a Solow-type Divisia environmental productivity index which, unlike conventional productivity indices, allows crediting the ceteris paribus reduction in undesirable outputs. Our index also provides a meaningful way to decompose environmental productivity into environmental technological and efficiency changes.
JEL classifications:
C11; C30; C43; D24

1. Introduction

The by-production of undesirable, or so-called “bad”, outputs is an inherent attribute of many production processes. Electric power generation is a classical example of such a process, where the production of electricity (desirable output) is accompanied by the emission of pollutant gases (undesirable outputs). It is therefore imperative to account for undesirable outputs when estimating the productivity growth for such processes (e.g., see [1,2]).
The estimation of productivity (and, potentially, its components) naturally requires the estimation of the firm’s production process, the modeling of which in the presence of undesirable outputs is however not a clear-cut issue. A standard approach is to condition the conventional transformation (production) function on undesirable outputs (e.g., [3,4,5]) which, effectively, treats these outputs as inputs. Such a treatment of undesirable outputs has since been heavily criticized due the implied strong disposability of undesirable outputs [6] and the violation of the “material balance condition” [7]. A popular alternative approach to tackling undesirable outputs is to specify a (single) directional output distance function [6,8] which accommodates both the expansion in desirable outputs and a simultaneous contraction in undesirable outputs. Feng and Serletis [9] have recently proposed a primal Divisia productivity index based on such a directional output distance function.
Both the directional output distance function and the productivity index based on the latter allow the identification of a “composite” measure of inefficiency and productivity (respectively) only. Specifically, when modeling the production technology via a (single) directional distance function (e.g., [6]), the inefficiency is defined over the entire vector of outputs, both desirable and undesirable. This produces a single measure of inefficiency which is a weighted combination of the technical and environmental inefficiencies, where the “weighting” is done on the basis of the prespecified directional vector. Similarly, the directional-output-distance-function-based productivity index identifies the “composite” productivity growth only. Thus, modeling undesirable outputs via the standard directional functions precludes researchers from disentangling the technical inefficiency/productivity, conventionally oriented along desirable outputs, from the environmental, or so-called “green”, inefficiency/productivity, oriented along undesirable outputs.1 Both can be of great interest from a policy perspective.
In this paper, we follow a different path to modeling the production process with undesirable outputs in the spirit of Fernández et al. [11,12, Forsund [13] and Murty et al. [7]. Specifically, we model the firm’s production process as a system of separate simultaneous production technologies for desirable and undesirable outputs. In this setup, desirable outputs are produced by transforming inputs via the conventional transformation function satisfying all standard assumptions. Consistent with the material balance condition, the by-production of undesirable outputs is however treated as the so-called “residual generation technology”. The above setup explicitly recognizes that the generation of undesirable outputs is not the intended production but rather the by-production process. By separating the generation of undesirable outputs from that of desirable outputs, we ensure that the former are not modeled as inputs as well as take into account their “costly disposability” (see [7]).
The by-production system approach that we employ in this paper permits us to not only distinguish between technical efficiency and (undesirable-output-specific) environmental efficiencies but to also differentiate between traditional technical productivity and environmental (“green”) productivity. Specifically, we derive a Solow [14] type primal (Divisia) environmental productivity index which, unlike a conventional desirable-output-oriented productivity index, is defined as the contraction rate in undesirable outputs unexplained by the contraction in desirable outputs. This allows us to credit the ceteris paribus reduction in undesirable outputs. Our environmental productivity index also provides a meaningful way to decompose productivity into environmental technological change and environmental efficiency change.
We apply our system approach as well as the environmental productivity index to study the efficiency and productivity trends among coal-fired electric power generating plants in the U.S. during the 1985–1995 period. The production of (desirable) electric power by these utilities is accompanied by the (undesirable) emission of SO2 and NOx gases.
We estimate the model subject to theoretical regularity conditions using (numerically) efficient Bayesian MCMC technique, where we also allow for unobserved plant-specific heterogeneity in addition to time-varying inefficiencies. We impose monotonicity and curvature regularity restrictions (at every data point) in order to ensure that our results are economically meaningful, as emphasized by Barnett et al. [15] and Barnett [16]. Among many things, we find that electric utilities in our sample tend to suffer from higher levels of environmental inefficiency in the emission of SO2 than in the emission of NOx gases. We also document a significant divergence between the electric-power-oriented technical productivity and the emission-oriented environmental productivity. Specifically, we find that, while the pooled posterior mean estimate of (annual) productivity growth is negative for electric power generation (–0.13%), it is non-negligibly positive for the SO2 and NOx emissions:2 2.25% and 3.31% per annum, respectively. The cumulative eleven-year growth is 23.26% for the SO2-oriented environmental productivity, 37.98% for the NOx-oriented environmental producitivity and a mere 5.33% for the electric-power-oriented technical productivity.
The rest of the paper proceeds as follows. Section 2 describes the by-production system approach to modeling production technology in the presence of undesirable outputs as well as provides the derivation of the environmental productivity index. Data are discussed in Section 3. We explain our econometric strategy in Section 4. Section 5 presents the results, and Section 6 concludes.

2. The By-Production Model

Building on Fernández et al. [11,12], the undesirable-output-generating production system (𝕋) with J inputs X R + J , M desirable outputs Y R + M and P undesirable outputs B R + P can be formalized as the intersection of the primary technology used in the production of desirable outputs ( T 0 ) and P individual undesirable-output residual generation technologies ( T p , p = 1 , , P ), i.e.,3
  T = T 0 T 1 T p T P , where T 0 = def { ( X , Y ) : X can produce Y } T p = def { ( B p , Y ) : Y generates B p } p = 1 , , P .
Consider the case of J = 3 inputs, M = 1 desirable and P = 2 undesirable outputs (as in our empirical application). Allowing for technical inefficiency in the production of a desirable output and environmental inefficiency in the by-production of undesirable outputs, we rewrite system 𝕋 in terms of the stochastic transformation function and two separate (environmental) residual generation functions for each undesirable output, i.e.,
F ( X , θ 1 Y ) = exp { v 0 }
H p Y , λ p B p = exp { v p } p = 1 , 2 ,
where θ 1 and λ p 1 are technical and environmental efficiencies, respectively; and ( v 0 , v p ) are the white noise terms. The transformation function F ( · ) is assumed to satisfy all standard properties such as continuity, positive (negative) monotonicity in Y (X), linear homogeneity in Y and concavity in X and Y. Similarly, the residual generation function H p ( · ) is continuous, positively (negatively) monotonic in B p (Y), linearly homogeneous in B p and convex in Y and B p .
Thus, the production system (2) permits the identification of both the technical and environmental efficiencies: θ and λ p ( p = 1 , , P ) . The latter is feasible due to the separability of the primary desirable-output generating production technology (2a) from the undesirable-output residual generating technologies (2b), which is motivated by the by-production approach satisfying the material balance condition. For instance, one would generally be unable to disentangle technical and environmental efficiencies (the way the above system approach allows us to) if following a popular alternative to the estimation of production processes in the presence of undesirable outputs based on the directional distance function [8].
Specifically, when modeling the production technology via a (single) directional distance function (e.g., [6]), the inefficiency is defined over the entire vector of outputs, both desirable and undesirable, using an a priori specified directional vector. The latter precludes researchers from disentangling the technical inefficiency conventionally oriented along desirable outputs from the environmental inefficiency oriented along undesirable outputs. The directional distance function rather produces a “composite” measure of inefficiency which is a weighted combination of the two, where the “weighting” is done on the basis of the prespecified direction. Further, unlike a system in (2), the directional distance function yields an additive, not a proportional, measure of inefficiency.

Technical and Environmental Productivity

The production system 𝕋 that we consider in this paper permits us to not only distinguish between technical efficiency θ (conventionally defined over the desirable output) and undesirable-output-specific environmental efficiencies { λ p ; p = 1 , , P } but to also differentiate between traditional technical productivity and environmental, or the so-called “green”, productivity.
Letting time t enter the transformation and residual generation functions F ( · ) and H p ( · ) explicitly and making use of their linear homogeneity properties, system (2) can be rewritten in the log form as
ln Y t = ln f ( X t , t ) u 0 , t + v 0 , t
ln B p , t = ln h p ( Y t , t ) + u p , t + v p , t p = 1 , 2 ,
where, for convenience, we define f ( · ) = def [ F ( · , 1 ) ] 1 and h p ( · ) = def [ H p ( · , 1 ) ] 1 ; and u 0 , t = def ln θ t 0 and u p , t = def ln λ p , t 0 ( p = 1 , 2 ) are technical and environmental inefficiencies, respectively.
Total differentiation of (3) with respect to t yields
d ln Y t d t = j = 1 J ln f ( X t , t ) ln X j , t ln X j , t t + ln f ( X t , t ) t u 0 , t t
d ln B p , t d t = ln h p ( Y t , t ) ln Y t ln Y t t + ln h p ( Y , t ) t + u p , t t p = 1 , 2 ,
where we have made use of v 0 , t / t = v p , t / t = 0 since ( v 0 , v p ) are the i.i.d. white noise. After some rearranging, from (4a) we get the following Solow [14] type (Divisia) technical productivity index:
T P G = def d ln Y t d t j = 1 J ln f ( X t , t ) ln X j , t ln X j , t t = ln f ( X t , t ) t T T C u 0 , t t T E C ,
along with the similarly defined environmental productivity index from (4b):
E P G p = def d ln B p , t d t ln h p ( Y t , t ) ln Y t ln Y t t = ln h p ( Y , t ) t E T C p u p , t t E E C p p = 1 , 2 .
The negative monotonicity of F ( · ) and H p ( · ) in inputs and desirable outputs, respectively, imply that ln f ( X t , t ) / ln X j , t 0 and ln h p ( Y t , t ) / ln Y t 0 .4
Unlike T P G which is conventionally defined as the expansion rate in a desirable output unexplained by the growth in inputs, the environmental productivity index E P G is defined as the contraction rate in an undesirable output unexplained by the contraction in desirable outputs.5 This allows crediting the ceteris paribus reduction in undesirable outputs.
Further, Equations (5) and (6) provide a meaningful way to decompose respective productivity indices into technical/technological change and efficiency change. The conventional technical productivity index T P G equals the sum of the technical change T T C = ln f ( X t , t ) / t , which measures the temporal shift in the production frontier, and technical efficiency change T E C = u 0 , t / t , which measures the movement toward (away from) the frontier. Similarly, the B p -oriented environmental productivity index E P G p is decomposed into similarly interpreted environmental technological change E T C p = ln h p ( Y t , t ) / t and environmental efficiency change E E C p = u p , t / t .
Note the conceptual difference between the definition of a “technological progress” for desirable outputs and that for undesirable outputs. Namely, for a desirable output Y the technological progress corresponds to the case of T T C > 0 , i.e., an outward shift in the production frontier over time, whereas for an undesirable output B p the technological progress corresponds to E T C p < 0 , i.e., an inward shift in the residual generating frontier over time. Thus, the residual generating frontier H p ( · ) ( p = 1 , , P ) is defined as the minimum quantity of undesirable output generated when producing a given quantity of desirable outputs subject to the material balance condition.
We emphasize that the primary advantage of employing a system approach to model the production process with undesirable outputs, which we consider in this paper, is the opportunity to disentangle technical and environmental productivities. For instance, as in the case of inefficiency, one generally cannot do that when using the productivity index based on the directional distance function [9].

3. Data

The data we use come from Pasurka [17] and Murty et al. [7]. A balanced panel consists of 92 coal-fired electric power generating plants operating in the U.S. over the period from 1985 to 1995. We focus on coal-fired plants only in order to minimize heterogeneity among units. More specifically, we focus on utilities of which at least 95% of total fuel consumption (measured in Btu) come from coal. We also exclude utilities whose consumption of fuels other than coal, oil and natural gas exceeds 10 4 % of total fuel consumption.
The specification of outputs and inputs is as follows. The desirable output is the net electric power generation Y, measured in kWh. The two undesirable outputs are (i) the SO2 (sulfur dioxide) gas emissions B 1 and (ii) the NOx (nitrogen oxides) gas emissions B 2 , both measured in short-tons. The three inputs to the production are (i) the real stock of physical capital X 1 , constructed from historical cost of plant data and deflated to constant dollars using the Handy-Whitman Index; (ii) labor X 2 , measured in the number of employees; and (iii) energy X 3 , i.e., the heat content of coal, oil and natural gas consumption, measured in Btu.
The data on the cost of plants and equipment (used in the construction of the capital stock) and the number of employees come from the U.S. Federal Energy Regulatory Commission Form 1 survey. The data on fuel consumption, net power generation and pollutant gas emissions come from the U.S. Department of Energy Form EIA-767 survey. For more details on the data, see Pasurka [17].

4. Econometric Strategy

Under the assumption of the translog functional form of ln f ( · ) and ln h b ( · ) , from system (3) we get the following system consisting of the production function for a desirable output Y i t :
y i t = α 0 , i + j = 1 J α j x j , i t + 1 2 h = 1 J j = 1 J α h j x h , i t x j , i t + t = 1 T β t D i t + t = 1 T j = 1 J β t j D i t x j , i t + v 0 , i t u 0 , i t , i = 1 , , n ; t = 1 , , T ,
complemented by the (environmental) residual generation technologies for undesirable outputs ( B 1 , i t , B 2 , i t ) :
b 1 , i t = γ 0 , i + γ 1 y i t + 1 2 γ 11 y i t 2 + t = 1 T φ t D i t + t = 1 T φ t 1 D i t y i t + v 1 , i t + u 1 , i t
b 2 , i t = δ 0 , i + δ 1 y i t + 1 2 δ 11 y i t 2 + t = 1 T ψ t D i t + t = 1 T ψ t 1 D i t y i t + v 2 , i t + u 2 , i t ,
where a lower-case variable denotes the log of its upper-case counterpart, and D i t denotes the time dummy. For greater flexibility, we also allow for unobserved firm-specific heterogeneity which we model via “true” random effects { ( α 0 , i , γ 0 , i , δ 0 , i ) ; i = 1 , , n } . The presence of these random effects (in addition to inefficiencies) captures additional technological heterogeneity among firms.
Since y i t appears on the right-hand side of equations for undesirable outputs b 1 , i t and b 2 , i t , it is imperative that all three equations in (7)–(8) be estimated as a system (of simultaneous equations) in order to control for the endogeneity of outputs. We estimate this production system subject to symmetry ( α h j = α j h ) as well as monotonicity and curvature restrictions. In this paper, we thus concur with Barnett [15] and Barnett [16] on the importance of maintaining the latter theoretical regularity conditions when modeling technology (especially, if allowing for inefficiency) in order to ensure that the results are economically meaningful.
Specifically, the monotonicity conditions are:
y i t x j , i t = α j + h = 1 J α h j x h , i t + t = 1 T β t j D i t 0 j = 1 , , J b 1 , i t y i t = γ 1 + γ 11 y i t + t = 1 T φ t 1 D i t 0 b 2 , i t y i t = δ 1 + δ 11 y i t + t = 1 T ψ t 1 D i t 0 .
The curvature is imposed using restrictions on the eigenvalues of the Hessian matrices in levels (see [18]). We employ the following stochastic specification for system (7)–(8):
v i t = [ v 0 , i t , v 1 , i t , v 2 , i t ] i.i.d. 𝒩 ( 0 , Σ ) u i t = [ u 0 , i t , u 1 , i t , u 2 , i t ] 𝒩 + ( Z i t τ , Σ u ) ,
where 𝒩 + denotes the (multivariate) half-normal distribution;6 Σ and Σ u are the covariance matrices; Z i t = I 3 D where I κ is an identity matrix of dimension κ and D = [ D i 1 , , D i T ] ; and τ = vec τ k t ; k = 0 , 1 , 2 ; t = 1 , , T is a set of 3 T unknown parameters. The location parameters of each inefficiency term u k , i t ( k = 0 , 1 , 2 ) is given by t = 1 T τ k t D i t . Thus, for greater flexibility in modeling time effects, we allow inefficiency to be time-varying (i.e., a function of the time dummies). The error components ( v i t , u i t ) are assumed to be orthogonal as well as independent of x j , i t ( j = 1 , , J ) . Further, the random effects ( α 0 , i , γ 0 , i , δ 0 , i ) are assumed to be identically, independently distributed from the error components ( v i t , u i t ) as well as independent of x j , i t ( j = 1 , , J ) :
[ α 0 , i , γ 0 , i , δ 0 , i ] i.i.d. 𝒩 0 , Ω ,
where Ω = diag { σ α 2 , σ γ 2 , σ δ 2 } .

4.1. Priors

For the parameters in system (7)–(8), which we collectively denote by ϑ, we assume a non-informative prior that imposes the regularity restrictions so that p ( ϑ ) I ( ϑ ) , where denotes the set of acceptable parameters. For scale parameters σ α 2 , σ γ 2 , σ δ 2 , we assume p ( σ k ) σ ( N + 1 ) exp Q / ( 2 σ k 2 ) k { α , γ , δ } , where N = 1 and Q = 10 4 . For τ, we assume a proper but relatively non-informative prior of the form τ 𝒩 ( 0 , c I 3 T ) with c = 10 4 . For Σ and Σ u , we assume proper but relatively non-informative priors in the Wishart family. The results are not sensitive to c, N or Q unless c becomes approximately less than 0.1, in which case it approaches the domain of a dogmatic prior.
One may inquire if it would be possible to select objective priors such as in the case of Jeffreys’ prior. One way to proceed with objective priors would be along the lines of Berger and Mortera [20] and Mulder et al. [21]. For instance, the use of a constrained posterior prior along the lines of Berger and Mortera [20] is an option. The Jeffreys’ prior cannot be obtained analytically but can be computed using numerical or analytic derivatives. This computation is certainly heavy. Furthermore, the Jeffreys’ prior is not used as much in the present literature, and the emphasis is rather placed on the so-called intrinsic Bayes factor (see [22]). We leave the issue for future research, but we do not expect much change since our results were not sensitive to important aspects of the prior.

4.2. Posterior Distribution

For convenience, we let σ 2 = def σ α 2 , σ γ 2 , σ δ 2 and α i = def ( α 0 , i , γ 0 , i , δ 0 , i ) . The kernel posterior distribution of all parameters denoted by θ R d (a superset of ϑ), if conditioned on the latent inefficiencies and random effects, is given by
p θ | Ξ , α , u | Σ | n T / 2 exp 1 2 i = 1 n t = 1 T ( r i t u i t ) Σ 1 ( r i t u i t ) × exp 1 2 i = 1 n t = 1 T ( u i t Z i t τ ) Σ u 1 ( u i t Z i t τ ) × i = 1 n t = 1 T Φ 3 1 C u ( Z i t τ ) × p ( ϑ , τ , Σ , Σ u , σ ) ,
where α = { α i ; i = 1 , , n } and u = { u i t ; i = 1 , , n ; t = 1 , , T ) . Also:
r i t = y i t + α 0 , i + Ξ 0 , i t ϑ 0 b 1 , i t γ 0 , i Ξ 1 , i t ϑ 1 b 2 , i t δ 0 , i Ξ 1 , i t ϑ 2 ,
where Ξ 0 , i t and Ξ 1 , i t denote regressors in (7) and (8), respectively (some of which are endogenous); { ϑ k ; k = 0 , 1 , 2 } denotes vectors of parameters in the three equations of the system and ϑ = ϑ 0 , ϑ 1 , ϑ 2 ; and Ξ = { Ξ 0 , i t , Ξ 1 , i t , y i t , b 1 , i t , b 2 , i t ; i = 1 , , n ; t = 1 , , T } denotes the entire available data. Further, Σ u 1 = C u C u (via the Cholesky decomposition) and Φ k ( w ) denotes the k-variate normal probability integral evaluated at some vector w R k .
The first term in the third line of (12) owes to the constraint u i t 0 . Specifically, our stochastic assumptions about u i t imply the density
p ( u i t | Z i t , τ , Σ u ) = ( 2 π ) 3 / 2 | Σ u | 1 / 2 Φ 3 1 C u ( Z i t τ ) × exp 1 2 ( u i t Z i t τ ) Σ u 1 ( u i t Z i t τ ) ,
which requires the evaluation of a tri-variate normal integral that can be performed using standard numerical algorithms. Before proceeding with MCMC methods for inference, note that the posterior is given by
p θ | Ξ R n R + n T p θ | Ξ , α , u d u d α .
While the multivariate integration can be performed in the closed form with respect to inefficiencies u, the induced nonlinearity however precludes analytical integration with respect to random effects α. We are not aware of any efficient MCMC scheme that draws these random effects as a block from the posterior, especially when n is relatively large.
The posterior conditional distribution of latent inefficiencies is
u i t | · 𝒩 + V ( Σ 1 r i t + Σ u 1 Z i t τ ) , V ,
where V = Σ 1 + Σ u 1 1 . Draws from the above conditional distribution can be easily obtained. The same is true for the posterior conditional distribution of the random effects if we write r i t from (13) as r i t α i R i t , where R i t is defined in an obvious way. We can then draw the random effects as a block for observation i as follows
α i | · 𝒩 α ¯ i , V α ,
where
α ¯ i = T Σ + Ω 1 1 × T Σ 1 R ¯ i , R ¯ i = T 1 t = 1 T R i t V α = T Σ + Ω 1 1 .
If it were not for the regularity constraints and the non-standard form of the posterior conditional distribution of τ (due do the term involving the multivariate normal integral), we could easily derive the posterior conditional distribution of parameters of interest ϑ.
Collecting data for all observations over i = 1 , , n and t = 1 , , T , we rewrite our model in an obvious matrix notation:
y + u 0 = Ξ 0 ϑ 0 + v 0 b 1 u 1 = Ξ 1 ϑ 1 + v 1 b 2 u 2 = Ξ 1 ϑ 2 + v 2 u 0 = Z τ 0 + ζ 0 u 1 = Z τ 1 + ζ 1 u 2 = Z τ 2 + ζ 2 ,
where we assume
ζ = ζ 0 , ζ 1 , ζ 2 𝒩 ( 0 , Σ u ) .
We rewrite the system of equations (19) compactly as
Y = X ϖ + E ,
where 𝕐 is an n T × ( 2 × 3 ) vector of “data” appearing on the left-hand side of equalities in (19), X = diag { Ξ 0 , Ξ 1 , Ξ 2 , Z , Z , Z } , E = [ v 0 , v 1 , v 2 , ζ 0 , ζ 1 , ζ 2 ] and ϖ = [ ϑ , τ ] is a conformable vector of parameters.
System (21) takes the form of a multivariate regression model with
cov { E } = V = Φ I n T , where Φ = Σ 0 0 Σ u .
The GLS estimator of Θ is given by
ϖ ^ = X V 1 X 1 X V 1 Y ,
with the corresponding covariance matrix:
cov { ϖ ^ } = X V 1 X 1 .
We note that the above approximation however ignores that ϑ included in ϖ needs to satisfy ϑ (the regularity conditions).
Let us define a multivariate normal distribution centered at ϖ ^ of which the covariance is h × cov { ϖ ^ } for some constant h > 0 . We denote this distribution by f 𝒩 , κ ϖ ; ϖ ^ , h × cov { ϖ ^ } , where κ is the dimensionality of ϖ, i.e., the number of parameters in the extended system (19). We use the GLS quantities to form a proposal density for generating candidate parameter draws as we describe below.
Next, we describe how to realize draws from the conditional posterior distributions of σ, Σ and Σ u . Except for Σ u , σ and Σ can be drawn from standard statistical distributions. Specifically, for the elements of σ we have:
Q + Q α σ α 2 | θ α , Ξ χ n + N 2 Q + Q γ σ γ 2 | θ γ , Ξ χ n + N 2 Q + Q δ σ δ 2 | θ δ , Ξ χ n + N 2 ,
where Q α = i = 1 n α 0 , i 2 , Q γ = i = 1 n γ 0 , i 2 and Q δ = i = 1 n δ 0 , i 2 . Here, θ k denotes all elements of the entire parameter vector θ including all latent variables except the indicated subscripted parameter k { α , γ , δ } .
Our priors are conditionally conjugate, i.e.,
p Σ | Σ 1 | N ¯ + n T ( 3 + 1 ) / 2 exp 1 2 A ¯ Σ 1 p Σ u | Σ u 1 | N ¯ ( 3 + 1 ) / 2 exp 1 2 A ¯ u Σ u 1 ,
where N ¯ is a scalar prior parameter and A ¯ , A ¯ u are prior matrices. In our empirical work we take N ¯ = 10 and A ¯ = A ¯ u = 10 3 × I 3 .
The posterior conditional of Σ is
p Σ 1 | θ Σ , Ξ | Σ 1 | N ¯ + n T ( 3 + 1 ) / 2 exp 1 2 tr { A ¯ + A } Σ 1 ,
where A = Y k X k ϑ k Y k X k ϑ k for k , k = 0 , 1 , 2 .
The conditional posterior of Σ u is
p Σ u 1 | θ Σ u , Ξ | Σ u 1 | N ¯ + n T ( 3 + 1 ) / 2 exp 1 2 tr { A ¯ u + A u } Σ u 1 × i = 1 n t = 1 T Φ 3 1 C u ( Z i t τ ) ,
where A u = u m D τ m u m D τ m for k , k = 0 , 1 , 2 .
Clearly, Σ 1 belongs to the Wishart family. The same would have been true for Σ u if it were not for the second line of (28) which involves the Cholesky factor of this matrix. Therefore, we use the Wishart distribution to draw a candidate matrix and we retain the candidate with probability
min 1 , i = 1 n t = 1 T Φ 3 1 C u ( c ) ( Z i t τ ) i = 1 n t = 1 T Φ 3 1 C u ( s ) ( Z i t τ ) ,
where C u ( c ) denotes the candidate draw and C u ( s ) is the existing sth draw ( s = 1 , . . . , S ).

4.3. Imposition of Restrictions

Imposing restrictions is not trivial in our application. Since the restrictions depend on the data, we adopt the following strategy. We draw from the proposal density described in the previous subsection subject to the constraints ϑ using a special form of rejection to improve the efficiency of “naive rejection” which would keep drawing parameters until all constraints are satisfied. Specifically, we first use acceptance at a limited number of points to facilitate acceptance and then we keep drawing from the proposal distribution until all regularity constraints hold at all data points.
We first impose the restrictions at the means of variables (normalized to zero) and then at points ± r around the mean. We choose r = { 0 . 5 , 1 , 2 , 3 } , and the restrictions hold without much trouble in the positive direction. In the negative direction, the restrictions are first tested for r = 0 . 1 , 0 . 2 , . . . , 2 and then tested at the remaining points. This yields considerable improvement in the efficiency (i.e., timing) of acceptance rates from a density which we describe next.
Based on a current draw ϖ ( s ) such that ϑ ( s ) , a new candidate ϖ ( c ) 𝒩 ( ϖ ^ , h × cov { ϖ ^ } ) × I ( ϑ ( c ) ) is generated until it satisfies the regularity restrictions. The candidate is accepted and we set ϖ ( s + 1 ) = ϖ ( c ) with the Metropolis-Hastings probability
min 1 , p ( ϖ ( c ) | α , u , Ξ ) / f 𝒩 , κ ϖ ( c ) ; ϖ ^ , h × cov { ϖ ^ } p ( ϖ ( s ) | α , u , Ξ ) / f 𝒩 , κ ϖ ( s ) ; ϖ ^ , h × cov { ϖ ^ } ,
otherwise we repeat the current draw, that is ϖ ( s + 1 ) = ϖ ( s ) ; s = 1 , , S . We adjust the scaling constant h so that the acceptance rate of the Metropolis-Hastings algorithm is between 20% and 30%. The Metropolis-Hastings algorithm also takes care of nonlinearity of the posterior in τ.
We generate the covariance matrix Σ and scale parameters σ from their respective posterior conditional distributions which are all in standard form (inverted Wishart and inverted Gamma). The latter however is not the case for Σ u (i.e., this matrix cannot be drawn using an inverted Wishart). We therefore take an extra Metropolis-Hastings step to accommodate the presence of the Cholesky factor of Σ u (i.e., C u ) in the posterior inside the multivariate normal integral. Acceptance rates using a simple Metropolis-Hastings step were quite high (over 90%), and simple scaling has brought it down to the range of 20%–25%.
Our MCMC uses S preliminary or transient passes until we obtain convergence using Geweke’s [23] relative numerical efficiency (RNE) diagnostic. Once convergence is achieved, we take another 100,000 passes. We do not use thinning. Instead, we report posterior standard deviations based on Newey-West HAC covariance matrices using 10 lags. For details, see Table 1.
Table 1. Computational Experience with the Data.
Table 1. Computational Experience with the Data.
(1)(2)
median RNE0.1130.615
median NSE0.00100.0012
draws to convergence150,00070,000
median ACF at lag 500.9770.312
NOTES: (1) denotes MCMC using full MCMC, drawing random effects and inefficiencies through regular Gibbs sampling. In (2), MCMC is applied by marginalizing the random effects to draw inefficiencies. RNE—relative numerical efficiency; NSE—numerical standard error; ACF—autocorrelation function.

4.4. Improving Performance of MCMC

We can explicitly integrate u out of (12) to obtain a kernel posterior of the following form:
p θ | Ξ , α R + n T p θ | Ξ , α , u d u .
Further, we can also derive the closed-form conditional posterior of random effects p ( α | Ξ , θ α ) .
We can achieve a significant improvement of MCMC performance by recognizing that the random effects α can be explicitly integrated out of the posterior, when the parameters ϑ are drawn. Specifically, similar to (19), we consider the following system
y i t + u 0 , i t = α 0 , i + Ξ 0 , i t ϑ 0 + v 0 , i t b 1 , i t u 1 , i t = γ 0 , i + Ξ 1 , i t ϑ 1 + v 1 , i t b 2 , i t u 2 , i t = δ 0 , i + Ξ 1 , i t ϑ 2 + v 2 , i t ,
which we can rewrite in compact notation as
Y i t = α i + X i t ϑ + v i t ,
where X i t = diag { Ξ 0 , i t , Ξ 1 , i t , Ξ 2 , i t } . Collecting all (time) observations for a given firm i together, we obtain:
Y i = X i ϑ + V i ,
where Y i , X i and V i are defined in an obvious way. Clearly:
V i 𝒩 0 , Ω J T + Σ I T i = 1 , , n ,
where J T is a T × T matrix of which all elements are equal to one.
Therefore, we can redefine the GLS quantities that are used to obtain a good proposal distribution for ϑ using the following as the covariance matrix V, i.e.,
V = Ω J T + Σ I T I n .
Using the modified proposal density, we effectively marginalize out the random firm-specific effects from the posterior and thus can draw latent inefficiencies u i t marginally on these effects hoping to reduce overall autocorrelation in MCMC due to the correlation between α i and u i t .7 This requires a trivial modification in the way we draw latent inefficiencies.
Since model (32) may be written as
y i t = α 0 , i Ξ 0 , i t ϑ 0 v 0 , i t + u 0 , i t b 1 , i t = γ 0 , i + Ξ 1 , i t ϑ 1 + v 1 , i t + u 1 , i t b 2 , i t = δ 0 , i + Ξ 1 , i t ϑ 2 + v 2 , i t + u 2 , i t ,
after collecting all (time) observations for a given firm i, in obvious notation, we have
Ψ i = X i ϑ + V i + U i ,
where X i and V i are naturally redefined to account for a change of sign in the first equation of (37) in order to accommodate a uniform sign in front of inefficiencies U i . Also recall that V i and its stochastic properties have been defined before. Now we can draw 3 T × 1 inefficiencies U i as a block, after we couple system (38) with the following specification:
U i = I T Z i τ + ζ i
subject to U i 0 and ζ i 𝒩 ( 0 , Σ u I T ) ; i = 1 , , n .
Since
cov { U i , ζ i } = Ω J T + Σ I T 0 0 Σ u I T = def H 0 0 M ,
we can draw latent inefficiencies, marginally on random effects, using the following multivariate truncated normal distributions, i.e.,
U i 𝒩 + U ˜ i , W ,
the first two moments of which are
U ˜ i = H 1 + M 1 1 H 1 Ψ i X i ϑ + M 1 I T Z i τ W = H 1 + M 1 1 .
We have found that drawing blocks of latent inefficiencies marginally on random effects α i (and conditionally on various covariance matrices and ( ϑ , τ ) ) results in vast improvements in terms of computational efficiency. Table 1 summarizes our computational experience with the data.

4.5. Random Effects

Based on our discussion above, we note that given the way the variance parameters σ 2 enter the covariance matrix Ω, in principle, there is no problem in treating the random effects α i as jointly normally distributed, i.e., α i 𝒩 0 , Ω , independently over i = 1 , . . . , n as well as independent from all other random variables and regressors in the model. All our derivations, including the conditional posterior distribution of α i , hold true. The only difference is that, in a more general setting (when random effects are allowed to be correlated across equations) one has to draw Ω from its conditional posterior distribution as a general positive definite matrix, whereas, when the random effects are a priori independent, the problem boils down to drawing variances σ 2 only.
A general case of correlated random effects has an empirical implication of firm-specific effects in the production function being correlated with those in residual generation functions for undesirable outputs. The latter possibility is testable and is of interest on its own. Should Ω be found not to be diagonal, one should naturally focus on the sign of the correlation between the random effects.
Given a proper prior p Ω on the different elements of Ω and the marginal posterior p Ω | Ξ , the Verdinelli and Wasserman [25] approach to computing the Bayes factor in favor of diagonality is given by
B F diag = p Ω = diagonal | Ξ p Ω = diagonal ,
which, in the general case, involves testing k k 1 / 2 zero restrictions, where k is the dimension of Ω (in our case, k = 3 ). By “ Ω = diagonal ”, we mean the zero restrictions Ω i j = 0 , i > j , i , j = 1 , . . . , 3 , where Ω Ω i j .
While the denominator of B F diag is easy to compute, the numerator is computed in a standard fashion as
p Ω = diagonal | Ξ = S 1 s = 1 S p Ω = diagonal , θ Ω ( s ) | Ξ
where θ Ω ( s ) is the sth (of S) MCMC draw of all parameters θ except those in Ω. Note that θ Ω ( s ) does include the diagonal elements of Ω in this computation.
It remains to show how draws from the conditional posterior distribution may be realized. Our prior is conditionally conjugate and has the following form:
p Ω 1 Ω 1 N ¯ o ( 3 + 1 ) / 2 exp 1 2 tr { A o Ω 1 } .
The conditional posterior distribution is given by
p Ω 1 | θ Ω , Ξ Ω 1 N ¯ o + n T ( 3 + 1 ) / 2 exp 1 2 tr { A o + A Ω } Ω 1 ,
where A Ω = i = 1 n R i R i and R i = def α i r i t as before. We define the baseline prior using N ¯ o = 10 and A o = c × I 3 with c = 10 3 . Clearly, Ω belongs to the Wishart family.
The Bayes factor B F diag using the baseline prior is 2 . 402 × 10 3 with the corresponding range of ( 1 . 015 × 10 5 ; 0 . 0893 ) , which suggests that the diagonality of Ω can be definitely rejected. In order to compute the range of B F diag , we generate 1000 alternative priors and implement the approximated MCMC using the sampling-iterative-resampling (SIR) algorithm which re-weights the original MCMC sample without recomputing MCMC samples for a new prior. For each one of the SIR re-weighted samples, we implement the Verdinelli and Wasserman [25] approach, and the range is taken as the 95% confidence interval of the computed approximate Bayes factors in favor of diagonality. The 1000 alternative priors are generated by uniformly varying N ¯ o in the interval [ 1 ; 100 ] and c in the interval [ 10 7 ; 10 ] without restricting them to integer values.
Table Table 2 reports the posterior means and standard deviations of the correlation coefficients between random effects α i derived from Ω. We find that unobserved firm-specific effects are all positively correlated.
Table 2. Correlation between Random Effects.
Table 2. Correlation between Random Effects.
γ 0 , i δ 0 , i
α 0 , i 0.831 (0.0011)0.630 (0.0130)
γ 0 , i 0.601 (0.0102)
NOTES: The random effects ( α 0 , i , γ 0 , i , δ 0 , i ) are for ( y , b 1 , b 2 ) , respectively. Standard deviations are in parenthesis and are computed using a Newey-West HAC correction with 10 lags.

5. Results

Before proceeding to the discussion of technical and environmental inefficiencies as well as productivity and its components, we first focus on elasticities of the production process by electric utilities in our data sample. Table 3 reports the summary of posterior estimates of these elasticities, including input elasticities of the primary production function (i.e., electric power generation) as well as elasticities of SO2 and NOx emissions (undesirable outputs) with respect to the net generated electric power (desirable output).8 In particular, the reported input elasticity estimates imply a posterior mean estimate of the returns to scale, defined as the sum of input elasticities, of 0.90, which suggests that, on average, electric utilities operated at decreasing returns to scale during our sample period.
The estimates of b p , i t / y i t ( p = 1 , 2 ) are of particular interest since they capture the cost of expanding the production of electric power in terms of the associated increase in the generation of the SO2 and NOx emissions. It is intuitive to interpret these estimates as “shadow prices” (in the elasticity form) of the power generation. The posterior mean estimates of the two shadow prices are 1.09 and 1.13. The latter implies that, on average, an increase in the net power generation by 1% requires a simultaneous increase in the SO2 and NOx emissions by at least 1.09% and 1.13%, respectively. Note that emissions may increase by even more if the firm is not on the residual generating frontier, i.e., environmentally inefficient.
We next proceed to the discussion of technical and environmental inefficiencies exhibited by the utilities in our sample. Figure 1 plots kernel densities of the posterior estimates of the three types of inefficiency. In order to construct the figure, we use a Gaussian kernel with the cross-validated bandwidth parameters. We find apparent differences between the distributions of technical and environmental inefficiencies. Specifically, while technical inefficiency is relatively symmetrically distributed around its mean of 0.09, the distribution of the NOx-oriented environmental inefficiency is noticeably skewed to the right and the distribution of the SO2-oriented environmental inefficiency exhibits apparent bi-modality. There may be many reasons for such a stark difference between the levels of technical and environmental inefficiencies across utilities. One plausible explanation is that technical inefficiency may also be capturing declines in the desirable output due to unforeseen fluctuations in the demand for electric power. Since inputs often cannot be immediately adjusted/reallocated and electric power is not easily storable, electric plants may be forced to under-utilize their facilities and labor, which our model would detect and classify as technical underperformance (inefficiency) relative to the frontier. However, such a demand uncertainty would not apply to the by-production of undesirable SO2 and NOx gases given the exact physical relationship between the power generation and the associated emission of pollutant gases. The latter is also capable of at least partly explaining why environmental inefficiency (in the emission of both SO2 and NOx gases) appears to be relatively more stable over time unlike the electric-power-oriented technical inefficiency, which we discuss in more detail later in the paper.
Table 3. Summary of Posterior Estimates.
Table 3. Summary of Posterior Estimates.
MeanMedianSt.Dev.95% Credible Interval
Elasticity
Capital Elasticity0.29850.29840.0505(0.1992; 0.3959)
Labor Elasticity0.40320.40430.0482(0.3076; 0.4935)
Energy Elasticity0.20020.19980.0103(0.1801; 0.2205)
RTS0.90180.90320.0726(0.7608; 1.0370)
SO2 Shadow Price1.08731.06640.1437(0.8524; 1.4334)
NOx Shadow Price1.12751.11630.1161(0.9366; 1.3988)
Inefficiency
Tech. Ineff.0.09050.09150.0257(0.0361; 0.1390)
SO2 Env. Ineff.0.08700.08750.0351(0.0254; 0.1504)
NOx Env. Ineff.0.04580.04380.0156(0.0186; 0.0798)
Efficiency Change
TEC0.00290.00220.0308(–0.0575; 0.0647)
SO2 EEC–0.0000–0.00040.0099(–0.0133; 0.0207)
NOx EEC0.0000–0.00040.0038(–0.0050; 0.0107)
Technological Change
TTC–0.0042–0.00270.0104(–0.0272; 0.0117)
SO2 ETC0.02250.02240.0110(0.0024; 0.0446)
NOx ETC0.03310.03320.0052(0.0229; 0.0431)
Productivity Growth
TPG–0.00130.00090.0323(–0.0663; 0.0626)
SO2 EPG0.02250.02200.0149(–0.0021; 0.0495)
NOx EPG0.03310.03300.0064(0.0211; 0.0453)
Figure 1. Kernel Densities of Technical and Environmental Inefficiency Estimates.
Figure 1. Kernel Densities of Technical and Environmental Inefficiency Estimates.
Econometrics 03 00443 g001
Further, we find that electric utilities tend to suffer from higher levels of inefficiency in the emission of SO2 than in the emission of NOx gases. These differences may be reflective of varying degree of strictness of environmental regulations (or the degree of their enforceability) for different pollutants across states. For instance, loose regulations for the SO2 emissions could potentially explain why the SO2-oriented environmental inefficiency is considerably higher on average and is more dispersedly distributed than the NOx-oriented inefficiency. Lastly, we document little correlation between the two environmental inefficiencies as well as between the environmental inefficiencies and the technical inefficiency. Table 4 reports such Spearman rank correlation coefficients for the posterior inefficiency estimates. For instance, there appears to be virtually no relationship between the electric-power-oriented technical inefficiency and the SO2-oriented environmental inefficiency exhibited by utilities.
Table 4. Rank Correlation Coefficients.
Table 4. Rank Correlation Coefficients.
InefficiencyEfficiency Change
Tech. Ineff1.000 TEC1.000
SO2 Env. Ineff0.0801.000 SO2 EEC–0.0591.000
NOx Env. Ineff0.1120.2271.000NOx EEC0.0800.1391.000
Technological ChangeProductivity Growth
TTC1.000 TPG1.000
SO2 ETC–0.0011.000 SO2 EPG–0.0241.000
NOx ETC–0.0200.0451.000NOx EPG–0.0490.0651.000
We next proceed to the posterior estimates of the productivity growth components—technological change and efficiency change—as defined in (5) and (6). From Table 3, we see that the levels of both environmental efficiencies appear to be stable over time: posterior mean estimates of the SO2 E E C and the NOx E E C are virtually zero. The left panel of Figure 2, which depicts box-and-whiskers plots of the distributions of technical and environmental efficiency change estimates for each year in our sample, confirms the relative stability of environmental efficiency levels (see the left panel of the figure). The electric-power-oriented (i.e., desirable-output-oriented) technical efficiency is however less stable over the course of the years. The mean estimates of T E C are predominantly positive across utilities in 1986, 1988 and 1995, whereas a significant decrease in efficiency is documented for 1987 and 1994. However, the mean posterior estimate of T E C pooled over the entire sample is a mere 0.29% (also see Table 3).
We document several regularities in the estimates of the technological change. First, the technical change T T C is primarily negative and is close to zero in the production of electric power (a desirable output). The posterior mean estimate of annual T T C is –0.42%. This finding is in sharp contrast with what we observe for the environmental technological change E T C . The posterior mean estimates of E T C for the SO2 and NOx emissions (undesirable outputs) are staggering 2.25% and 3.31% per annum, respectively. Second, we find that technological change is fairly stable across the years in all dimensions, be it the intended production of electric power or undesirable by-production of emission gases. The right panel of Figure 2 confirms this observation: the distributions of T T C and the two E T C do not change much over the course of the years.
The discussed differences in the temporal dynamics between technical and environmental efficiency change and technological change result in dramatic differences in the measures of productivity across desirable and undesirable outputs. We find that, while the pooled posterior mean estimate of (annual) productivity growth is negative for electric power generation (–0.13%), it is substantially positive for the SO2 and NOx emissions: annual 2.25% and 3.31%, respectively. In other words, keeping input quantities constant, the net electric power generation, on average, fell by 0.13% per year during our sample period. Utilities however did a significantly better job in terms of cutting the emission of SO2 and NOx gases for any fixed quantity of the net electric power generated: on average, emissions fell by respective 2.25% and 3.31% per year ceteris paribus. This disconnect between technical and environmental productivities of electric plants in our sample is also confirmed by virtually zero rank correlation coefficients between T P G and E P G for the SO2 and NOx emissions (see Table 4).
Figure 3 vividly illustrates the differences between technical and environmental productivities. It plots the productivity indices that are normalized to 100 in the year 1985 and are constructed using the (respective)-output-weighted average annual productivity growth rates (over all utilities in the sample). The figure shows that the industry enjoyed stable positive rates of E P G during the 1985–1995 period, whereas T P G was more sporadic and included periods of both positive and negative growth. The cumulative eleven-year growth is 23.26% for the SO2-oriented E P G , 37.98% for the NOx-oriented E P G and a mere 5.33% for the electric-power-oriented T P G .
Figure 2. Technical and Environmental Efficiency Change and Technological Change.
Figure 2. Technical and Environmental Efficiency Change and Technological Change.
Econometrics 03 00443 g002
Figure 3. Technical and Environmental PG Indices.
Figure 3. Technical and Environmental PG Indices.
Econometrics 03 00443 g003

6. Conclusions

The prevalent approaches to modeling the firm’s production process in the presence of undesirable outputs either treat these outputs as inputs, which questionably implies their strong disposability as well as violates the “material balance condition”, or employ the directional distance function that, despite a sound theoretical foundation, allows the identification of a “composite” measure of inefficiency only. Similarly, the directional-output-distance-function-based productivity index identifies only the composite productivity which is a weighted combination of the technical and environmental productivities. This precludes researchers from disentangling the technical inefficiency/productivity, conventionally defined over desirable outputs, from the environmental (“green”) inefficiency/productivity, defined over undesirable outputs.
In this paper, we follow a different path to modeling the production process with undesirable outputs in the spirit of Fernández et al. [11,12, Forsund [13] and Murty et al. [7]. Specifically, we model the productive operations of the firm as a system of simultaneous production technologies for desirable and undesirable outputs. In this setup, desirable outputs are produced by transforming inputs via the conventional transformation function satisfying all standard assumptions. Consistent with the material balance condition, the by-production of undesirable outputs is however treated as the so-called “residual generation technology”. The above setup explicitly recognizes that the generation of undesirable outputs is not the intended production but rather the by-production process. By separating the generation of undesirable outputs from that of desirable outputs, we ensure that the former are not modeled as inputs as well as take into account their “costly disposability”.
The by-production system approach that we employ in this paper permits us to not only distinguish between technical efficiency and (undesirable-output-specific) environmental efficiencies but to also differentiate between traditional technical productivity and environmental productivity. Specifically, we derive a Solow [14] type primal (Divisia) environmental productivity index which, unlike a conventional desirable-output-oriented productivity index, is defined as the contraction rate in undesirable outputs unexplained by the contraction in desirable outputs. This allows us to credit the ceteris paribus reduction in undesirable outputs. Our environmental productivity index also provides a meaningful way to decompose productivity into environmental technological change and environmental efficiency change.

Acknowledgments

We would like to thank two anonymous referees for many insightful comments and suggestions that helped improve this article. Any remaining errors are our own.

Author Contributions

All authors contributed equally to the project.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. S.E. Atkinson, and J.H. Dorfman. “Bayesian measurement of productivity and efficiency in the presence of undesirable outputs: Crediting electric utilities for reducing air polution.” J. Econ. 126 (2005): 445–468. [Google Scholar] [CrossRef]
  2. S.E. Atkinson, and E.G. Tsionas. “Directional distance functions: Optimal endogenous directions.” J. Econ., 2015, in press. [Google Scholar] [CrossRef]
  3. S. Reinhard, C.A.K. Lovell, and G. Thijssen. “Econometric estimation of technical and environmental efficiency: An application to Dutch dairy farms.” Am. J. Agric. Econ. 81 (1999): 44–60. [Google Scholar] [CrossRef]
  4. S. Reinhard, C.A.K. Lovell, and G. Thijssen. “Environmental efficiency with multiple environmentally detrimental variables; estimated with SFA and DEA.” Eur. J. Oper. Res. 121 (2000): 287–303. [Google Scholar] [CrossRef]
  5. A. Hailu, and T.S. Veeman. “Non-parametric productivity analysis with undesirable outputs: An application to the Canadian pulp and paper industry.” Am. J. Agric. Econ. 83 (2001): 605–616. [Google Scholar] [CrossRef]
  6. R. Färe, S. Grosskopf, D.W. Noh, and W. Weber. “Characteristics of a polluting technology: Theory and practice.” J. Econ. 126 (2005): 469–492. [Google Scholar] [CrossRef]
  7. S. Murty, R.R. Russell, and S.B. Levkoff. “On modeling pollution-generating technologies.” J. Environ. Econ. Manag. 64 (2012): 117–135. [Google Scholar] [CrossRef]
  8. Y. Chung, R. Färe, and S. Grosskopf. “Productivity and undesirable outputs: A directional distance function approach.” J. Environ. Manag. 51 (1997): 229–240. [Google Scholar] [CrossRef]
  9. G. Feng, and A. Serletis. “Undesirable outputs and a primal Divisia productivity index based on the directional output distance function.” J. Econ. 183 (2014): 135–146. [Google Scholar] [CrossRef]
  10. R. Färe, and S. Grosskopf. “Directional distance functions and slacks-based measures of efficiency.” Eur. J. Oper. Res. 200 (2010): 320–322. [Google Scholar] [CrossRef]
  11. C. Fernández, G. Koop, and M.F.J. Steel. “Multiple-output production with undesirable outputs: An application to Nitrogen Surplus in Agriculture.” J. Am. Stat. Assoc. 97 (2002): 432–442. [Google Scholar] [CrossRef] [Green Version]
  12. C. Fernández, G. Koop, and M.F.J. Steel. “Alternative efficiency measures for multiple-output production.” J. Econ. 126 (2005): 411–444. [Google Scholar] [CrossRef] [Green Version]
  13. F. Forsund. “Good modelling of bad outputs: Pollution and multiple-output production.” Int. Rev. Environ. Resour. Econ. 3 (2009): 1–38. [Google Scholar] [CrossRef]
  14. R.M. Solow. “Technical change and the aggregate production function.” Rev. Econ. Stat. 39 (1957): 312–320. [Google Scholar] [CrossRef]
  15. W.A. Barnett, J. Geweke, and M. Wolfe. “Seminonparametric Bayesian estimation of the Asymptotically Ideal Production Model.” J. Econ. 49 (1991): 5–50. [Google Scholar] [CrossRef]
  16. W.A. Barnett. “Tastes and technology: Curvature is not sufficient for regularity.” J. Econ. 108 (2002): 199–202. [Google Scholar] [CrossRef]
  17. C.A. Pasurka. “Decomposing electric power plan emissions within a joint production framework.” Energy Econ. 28 (2006): 26–43. [Google Scholar] [CrossRef]
  18. C.J. O’Donnell, and T.J. Coelli. “A Bayesian approach to imposing curvature on distance functions.” J. Econ. 126 (2005): 493–523. [Google Scholar] [CrossRef]
  19. G. Koop, and M. Steel. “Bayesian analysis of stochastic frontier models.” In A Companion to Theoretical Econometrics. Edited by B. Baltagi. Malden, MA, USA: Blackwell Publishing Ltd., 2001. [Google Scholar]
  20. J.O. Berger, and J. Mortera. “Default Bayes factors for nonnested hypothesis testing.” J. Am. Stat. Assoc. 94 (1999): 542–554. [Google Scholar] [CrossRef]
  21. J. Mulder, H. Hoijtink, and I. Klugkist. “Equality and inequality constrained multivariate linear models: Objective model selection using constrained posterior priors.” J. Stat. Plan. Inference 140 (2010): 887–906. [Google Scholar] [CrossRef]
  22. J.O. Berger, and L.R. Pericchi. “The intrinsic Bayes factor for model selection and prediction.” J. Am. Stat. Assoc. 91 (1996): 109–122. [Google Scholar] [CrossRef]
  23. J. Geweke. “Evaluating the accuracy of sampling-based approaches to calculating posterior moments.” In Bayesian Statistics 4. Edited by J.M. Bernardo, J.O. Berger, A.P. Dawid and A.F.M. Smith. Oxford, UK: Clarendon Press, 1992. [Google Scholar]
  24. E.G. Tsionas, and S.C. Kumbhakar. “Firm heterogeneity, persistent and transient technical inefficiency: A generalized true random-effects model.” J. Appl. Econ. 29 (2014): 110–132. [Google Scholar] [CrossRef] [Green Version]
  25. I. Verdinelli, and L. Wasserman. “Computing Bayes factors by using a generalization of the Savage-Dickey density ratio.” J. Am. Stat. Assoc. 90 (1995): 614–618. [Google Scholar] [CrossRef]
  • 1Färe and Grosskopf [10] have recently proposed the slacks-based directional distance function which allows inefficiency to be input- and output-specific. The estimation of such slacks-based inefficiencies however is feasible under the deterministic treatment of the production technology only. In this paper, we focus on the econometric estimation of stochastic production technologies that accommodate random disturbances.
  • 2Implying a ceteris paribus contraction in these emissions.
  • 3We differ from Fernández et al. [11,12] by formulating separate residual generation technologies for each undesirable output. The latter allows us to gauge B p -specific “green” productivity.
  • 4Recall that f ( · ) = [ F ( · , 1 ) ] 1 and h p ( · ) = [ H p ( · , 1 ) ] 1 . Hence, negative monotonicity of F ( · ) and H p ( · ) imply positive monotonicity of f ( · ) and h p ( · ) .
  • 5Recall that the quantity of undesirable outputs does down as desirable outputs decrease due to the complementarity of the two types of outputs.
  • 6For a similar stochastic formulation, e.g., see Koop and Steel [19].
  • 7For alternative ways to reduce this correlation in multiple random-effect models, see Tsionas and Kumbhkar [24].
  • 8We also reestimate our model with no theoretical regularity constraints imposed. Consistent with one’s expectations, the unconstrained metrics generally have larger credible intervals. However, since the unconstrained estimates violate regularity conditions dictated by economic theory and thus have no meaningful economic interpretation, we do not report them here.

Share and Cite

MDPI and ACS Style

Malikov, E.; Kumbhakar, S.C.; Tsionas, E.G. Bayesian Approach to Disentangling Technical and Environmental Productivity. Econometrics 2015, 3, 443-465. https://doi.org/10.3390/econometrics3020443

AMA Style

Malikov E, Kumbhakar SC, Tsionas EG. Bayesian Approach to Disentangling Technical and Environmental Productivity. Econometrics. 2015; 3(2):443-465. https://doi.org/10.3390/econometrics3020443

Chicago/Turabian Style

Malikov, Emir, Subal C. Kumbhakar, and Efthymios G. Tsionas. 2015. "Bayesian Approach to Disentangling Technical and Environmental Productivity" Econometrics 3, no. 2: 443-465. https://doi.org/10.3390/econometrics3020443

APA Style

Malikov, E., Kumbhakar, S. C., & Tsionas, E. G. (2015). Bayesian Approach to Disentangling Technical and Environmental Productivity. Econometrics, 3(2), 443-465. https://doi.org/10.3390/econometrics3020443

Article Metrics

Back to TopTop