Abstract
Climate change and global warming are significantly affected by carbon emissions that arise from the burning of fossil fuels, specifically coal, oil, and gas. Accurate prices are essential for the purposes of measuring, capturing, storing, and trading in carbon emissions at regional, national, and international levels, especially as carbon emissions can be taxed appropriately when the price is known and widely accepted. This paper uses a novel Capital (K), Labor (L), Energy (E) and Materials (M) (or KLEM) production function approach to calculate the latent carbon emission prices, where carbon emission is the output and capital (K), labor (L), energy (E) (or electricity), and materials (M) are the inputs for the production process. The variables K, L, and M are essentially fixed on a daily or monthly basis, whereas E can be changed more frequently, such as daily or monthly, so that changes in carbon emissions depend on changes in E. If prices are assumed to depend on the average cost pricing, the prices of carbon emissions and energy may be approximated by an energy production model with a constant factor of proportionality, so that carbon emission prices are a function of energy prices. Using this novel modeling approach, this paper estimates the carbon emission prices for Japan using seasonally adjusted and unadjusted monthly data on the volumes of carbon emissions and energy, as well as energy prices, from December 2008 to April 2018. The econometric models show that, as sources of electricity, the logarithms of coal and oil, though not Liquefied Natural Gas (LNG,) are statistically significant in explaining the logarithm of carbon emissions, with oil being more significant than coal. The models generally displayed a high power in predicting the latent prices of carbon emissions. The usefulness of the empirical findings suggest that the methodology can also be applied for other countries where carbon emission prices are latent.
Keywords:
latent carbon emission prices; fossil fuels; energy; KLEM production function; average cost pricing; monthly seasonally adjusted data; unadjusted data JEL:
C22; C32; C58; G12; Q35; Q48; Q54
1. Introduction
Climate change and global warming are two of the most important environmental issues presently facing the international community. Global warming is typically defined as the observed century-scale rise in the average temperature of the Earth’s climate system and its related effects, while climate change refers to change in the statistical distribution of weather patterns, specifically average weather conditions, over an extended period, such as a century. Neither climate change nor global warming refer to changing weather conditions on a daily basis (see [1] Allen and McAleer (2018)).
Both climate change and global warming are significantly affected by carbon emissions that arise from the burning of fossil fuels, specifically coal, oil, and gas. One of the major environmental challenges of the climate policy of the European Union Emissions Trading Scheme (or System) is the reduction and mitigation of greenhouse gas emissions by capturing and storing carbon emissions.
Although the clear leader in the battle to reduce and mitigate the effects of carbon emissions is the European Union, China has also started to mitigate the effects of greenhouse gases through the development of domestic regional carbon emission trading markets, with the goal of extending it to the national level in the near future. The creation of such markets that trade in carbon emissions is intended to establish prices that can be used to buy and sell carbon emissions, and to reduce the harmful effects of such emissions on the social and physical environment.
Accurate prices are essential for the purposes of measuring, capturing, storing, and trading in carbon emissions at the regional, national, and international levels, especially as carbon emissions can be taxed appropriately when the price is known and widely accepted. In the absence of an active trading market for carbon emissions, the prices are typically decided through arbitrary administrative deliberations, and hence are not efficient or even necessarily accurate. In this sense, carbon emission prices are latent.
The purpose of this paper is to price the latent carbon prices through a novel econometric approach that can be used to calculate realized latent prices of carbon emissions as a function of capital (K), labor (L), energy (E) (that is, electricity), and materials (M) inputs. The novelty is in extending the KLEM production function to the multivariate setting, modeling latent carbon emission prices, and providing a novel method of estimating monthly latent carbon emission prices that are based on the observed monthly prices of alternative forms of energy using innovative econometric time series models.
The inputs K, L, and M are typically fixed at a monthly observation interval, and are not readily available on any well-known website, whereas E can be changed daily, so that changes in carbon emissions will depend on changes in E. If prices are assumed to depend on the average cost pricing, the prices of carbon emissions and energy may be approximated by an energy production model with a constant factor of proportionality, so that carbon emission prices are a function of energy prices.
Given the relationship between the prices of carbon emissions and energy, econometric models of the latent carbon emission prices are specified, estimated, evaluated, and used for forecasting carbon emissions prices for Japan, based on monthly data from December 2008 to April 2018. The econometric models will show that, as sources of electricity, the logarithms of coal and oil, though not LNG, as well as a deterministic time trend and a tax change dummy variable, are statistically significant in explaining the logarithm of carbon emissions, with oil being more significant than coal.
The plan of the remainder of the paper is as follows. A review of the literature on pricing carbon emissions for the European Union, and at the regional and national levels for China, is given in Section 2. Section 3 discusses the KLEM production function that obtains prices for the output of carbon emissions as a function of the inputs of capital, labor, energy (or electricity), and materials. The formal models of latent carbon emission prices are discussed in Section 4, estimation of the models is presented in Section 5, the monthly data and diagnostic checks are analyzed in Section 6, the empirical estimates and analysis are evaluated in Section 7, and some concluding remarks are given in Section 8.
Proof of the derivation of the correct standard errors for valid statistical inference for realized latent carbon emissions and the ranking of important underlying factors, a discussion on the theoretical structure for latent endogenous and exogenous variables, and extensions to more complicated decision strategies, are given in the Appendix A.
2. Literature Review
Much of the research literature on pricing carbon emissions in recent years has concentrated on Europe and China. The European Union Emissions Trading Scheme established the world’s first greenhouse gas emissions trading scheme in 2005, and remains the largest and most influential organization mitigating the effects of global warming and climate change.
In the sparse literature for Europe, most of which has been based on the European Union Emissions Trading Scheme (EUETS), [2] Daskalakis and Mrkellos (2009) developed a framework for future pricing and hedging, and examined data from EEX, Nord Pool, and Powernext, to analyze whether electricity risk premia were affected by the prices of emission allowances. [3] Daskalakis et al. (2009) provided evidence from modeling the prices of CO2 emissions allowances and derivatives using European Trading Scheme data.
More recently, [4] Bushnell et al. (2013) examined profiting from regulation, based on European carbon market data. [5] Oestreich and Tsiakas (2015) analyzed carbon emissions and stock returns using EUETS data. Additionally, [6] Martin et al. (2016) examined the impacts of the EUETS on regulated firms for ten years from inception.
China’s market for national carbon emissions was established at a later stage than in Europe, and has been progressing steadily. The development of such markets since their inception in China in 2013 has not necessarily been based on competitive market principles. The National Government in Beijing has made it clear that the regional markets are a first step in the development of a national carbon emission pricing scheme. Just as the market for carbon emissions trading is presently at a formative stage in China, the academic theoretical and empirical research on the emerging regional and national carbon emissions pricing markets is also at a development stage.
A review of the admittedly sparse literature on the prices of carbon emissions in Europe and China, as well as a comprehensive analysis regarding the availability of price data at the domestic regional level, is analyzed in [7,8] Chang et al. (2018, 2019). Following earlier work, the authors discuss a number of developments in China since 2005. [9] Chen (2005) evaluated the costs associated with mitigating carbon emissions. [10] Gregg et al. (2008) analyzed the carbon emissions patterns associated with fossil fuel consumption and cement production. Moreover, [11] Li and Colombier (2009) evaluated carbon emissions and energy efficiency. Using a multivariate modeling approach, [12] Chang (2010) developed a multivariate causality test of various combinations of carbon dioxide emissions, energy consumption, and economic growth.
In more recent years, there have been analyses of interactions between China and one or more other regions. For example, [13] Nam et al. (2014) compared the synergy between China and the USA associated with pollution and carbon emissions controls. For China alone, [14] Zhang et al. (2014) considered the effects of emissions trading on progress and future prospects, [15] Zhang et al. (2014) used the Shapley value to analyze carbon emissions quotas across regions, [16] Liu et al. (2015) reviewed trading in carbon emissions and future challenges, [17] Tang et al. (2015) evaluated the trading scheme for carbon emissions using a multi-agent-based model, and [18] Zhang (2015) reformulated a strategy for low-carbon green growth. [19] Xiong et al. (2017) provided a comparison of China’s carbon trading allowance scheme and related schemes in the EU and California.
There seems to be even less research on pricing carbon emissions at an international level. [20] Zhang and Sun (2016) applied a Full BEKK multivariate conditional volatility model to analyze the time-varying covariances and the associated dynamic spillovers between the respective markets for Euro carbon and fossil fuel (for caveats regarding the Full BEKK model, particularly regarding the lack of mathematical regularity conditions, invertibility, the absence of a likelihood function, and hence the lack of valid asymptotic statistical properties, see [21] Chang and McAleer (2019)) [22]. Chang et al. (2017) and [21] Chang and McAleer (2019) analyzed the novel issues of volatility spillovers and Granger causality on the basis of daily data for the futures prices of EU carbon emissions, spot prices of US carbon emissions, and spot and future prices of oil and coal.
Interesting recent research has been undertaken on analyses of price and volatility from the perspective of derivative markets and alternative volatility models, such as the price impact, trading volume and volatilities associated with mission permits and the announcement of realized emissions in [23] Hitzemann, Uhrig-Homburg, and Ehrhart (2015); a Bayesian approach to the stochastic volatility of future prices of emission allowances in [24] Kim, Park, and Ryu (2017); and the empirical performance of reduced form models for emission permit prices in [25] Hitzemann and Uhrig-Homburg (2019).
As stated above, there seems to have been no published research on pricing carbon emissions in Japan. The existing published research seems to have primarily focused on Europe and, very recently, China. For these reasons, this paper makes a novel contribution to the pricing of carbon emissions for Japan. The usefulness of the empirical findings suggests that the methodology can also be applied for other countries where carbon emission prices are latent.
3. KLEMS Production Function for Carbon Emissions and Energy
Inputs are required to produce outputs, and the relationship is captured in a production function. A relatively broad specification is the KLEMS production function ([26] OECD, 2001), which uses capital (), labor (), energy (), materials (), and services (S) as the inputs for the production process. The function captures the most widely-used inputs for producing the output. Typically, the most widely observable input is energy, which is based on the use of electricity, whereas the services associated with a specific output are typically difficult to observe.
As a special case of KLEMS, consider the KLEM production function (see, for example, [27] Lecca et al., 2011) that is given as
where CE denotes the production of the output of carbon emissions; capital (), labor (), energy or electricity (), and materials () are the inputs; is a constant; for simplicity, the partial production parameters are assumed to satisfy constant returns to scale, that is, (as in the Cobb-Douglas production function); and the random error term is assumed to be
In practice, the output and input may be observed at daily or monthly data frequencies, whereas the inputs , , and are generally fixed at much lower data frequencies, such as one year. Consequently, when the data are observed at monthly frequencies, as in this paper, and may be assumed to be proportional, with the random factor of proportionality given by
It is worth noting that is not restricted in its range, such as (0, 1). As has a unit of measurement, can take any value, depending on the units of measure of the variables, such as
or
where is a scalar. If it is assumed that output prices are set according to an average cost pricing rule (see [28] Chang et al. (2018)), the prices of and may be approximated by a model with a constant factor of proportionality. Let this relationship be given as
where is the price of , and is the price of energy (such as electricity), such that k in Equation (4) is the constant factor of proportionality. In practice, k may be known in advance or may be determined empirically using appropriate modeling techniques. [28] Chang et al. (2018) considered estimating k through the use of Equations (3b) and (4). The estimation of through the use of econometric models, and acknowledging the presence of measurement errors, will be developed in the next section.
4. Modeling Latent Carbon Emission Prices
The definitional relationship for a random variable, Y, can be seen in Equation (5):
where Y is the measured value of the random variable, which is subject to measurement error, and Y* is the true value of the random variable, with no measurement error, for which the following four relationships are possible:
Y = Y* + measurement error,
- (i)
- ;
- (ii)
- ;
- (iii)
- ;
- (iv)
- .
Here, and denote the (possibly) observed prices of carbon emissions and their underlying true prices, respectively, and and denote the observed and underlying true prices of energy, respectively.
In cases (i) and (ii), the price of carbon emissions is not available, together with the true underlying prices of carbon emissions and energy, respectively. case (iii) is not of particular interest as it relates to the observed and true underlying prices of energy, respectively. In case (iv), the price of energy is available, but the true underlying price of carbon emissions is not. Therefore, this paper is concerned with case (iv).
The remainder of the paper uses a novel econometric model to calculate the latent prices of carbon emissions, as will be discussed below. The approach has been used in the literature to calculate latent variables, and this paper is the first to use the methodological approach to calculate latent carbon emission prices.
For the sake of argument, in case (iv), if the price of energy is stationary, can be regressed on a set of observable variables using Ordinary Least Squares (OLS), as given in Equation (6):
to yield estimated fitted values, as given in Equation (7):
where denotes the OLS estimates of the unknown parameters (i = 0, 1, 2, 3), and denotes the estimated fitted values of energy prices from Equation (6), which are equivalent to the estimated fitted values of carbon emissions, (), adjusted by the factor (1/k) (see Equation (4)), where k may be known in advance or may be determined empirically.
If energy prices are nonstationary rather than stationary, the price model for energy in Equation (6) can be estimated using a cointegrating relationship or a vector autoregressive distributed lag model to accommodate the nonstationarity. Alternatively, prices can be transformed into log differences in prices, which are equivalent to the rate of growth in prices, and then estimated by OLS (in empirical finance, log differences in prices are referred to as rates of return), which would be based on stationary processes.
5. Estimating Latent Carbon Emission Prices
This section describes a novel method of estimating monthly latent carbon emissions prices for Japan that are based on the observed monthly prices of alternative forms of energy and the models presented in the previous section.
Monthly prices (p) and volumes (quantities, q) of the following energy variables are typically available in a variety of energy databases, where almost ethanol refers to bio-ethanol:
- (1)
- Volume of carbon emissions;
- (2)
- Prices and volume of electricity;
- (3)
- Prices and volume of oil;
- (4)
- Prices and volume of crude coal;
- (5)
- Prices and volume of natural gas;
- (6)
- Prices and volume of nuclear energy;
- (7)
- Prices and volume of solar energy;
- (8)
- Prices and volume of wind energy;
- (9)
- Prices and volume of hydro energy;
- (10)
- Prices and volume of wave energy;
- (11)
- Prices and volume of bio-mass;
- (12)
- Prices and volume of ethanol;
- (13)
- Prices and volume of bio-ethanol.
In what follows, the models presented in the previous section will be estimated using the available price and volume data to obtain estimated fitted latent prices of carbon emissions. It is assumed that there is a known expected relationship between carbon emissions and the generation of electricity using the data in cases (2)–(12) above. This is based on applied engineering practice and structural form modeling, as presented in the previous section, as follows:
or equivalently as
where k is assumed to be a known factor of proportionality, and u is assumed to be a random measurement error, possibly independently and identically distributed, with a mean of 0 and constant variance.
Equation (8) relates the unknown price and known quantity of carbon emissions to the known price and known quantity of energy (or electricity), which confirms that the resulting price equation directly refers to carbon emissions. Equation (9) is the logarithmic equivalent of Equation (8).
Let (expenditure), with , without a loss of generality, so that Equation (8) becomes
It follows that
as , such that the estimated fitted value of Equation (10) is given as
A regression of on gives
for which the estimated fitted value of is given as
where are the OLS estimates of , respectively.
Recalling Equation (11), it follows that
which gives an “optimal” estimate of .
As and , the prices of carbon emissions can be “optimally” calculated as
Equation (16) gives the estimated fitted value, , as the “optimal” estimate of carbon emission prices as it uses valid statistical techniques that are efficient and optimal.
This novel method would seem to be the first to calculate carbon emissions prices in theory, as well as in practice, using monthly data for Japan.
Further to the above discussion, if data were available for , the price of energy (or electricity), as well as prices on some renewable (, say) and nonrenewable (, say) energy inputs, we could compare whether jointly was more or less significant than in explaining .
The Japanese Government imposes energy taxes on industry, in particular, as well as on households. The use of government data is crucial for consumers and businesses that use energy, as they are expected to pay carbon taxes as users of energy. This is not a financial product, but an application of optimal carbon tax policies.
6. Monthly Data and Diagnostic Checks
The empirical analysis uses a monthly sample period from December 2008 to April 2018 for Japan, where carbon emission prices have not yet been calculated by any government agency or research institute. The definitions of the variables are given in Table 1 as the volume of seasonally unadjusted and seasonally adjusted (using X12) values of coal, LNG, petroleum, electricity, carbon emissions, the spot electricity price, the total transaction volume of electricity, and the price of electricity that represents the price of energy. The sources of the data are the Trade Statistics of Japan, Ministry of Finance, Government of Japan; the Japan Metrological Agency; and the Japan Electricity Power eXchange.
Table 1.
Definition of variables.
Prior to econometric estimation, the seasonally unadjusted and adjusted volumes of the logarithms of electricity, coal, oil, and LNG were tested for unit roots, for both levels and first differences of the variables, using the non-parametric Phillips-Perron (PP) test of stationarity to ensure that asymptotic theory was valid for the purposes of drawing statistical inferences (see [29] Phillips and Perron (1979)). The test of the null hypothesis that a time series is integrated by an order of 1 is robust to unspecified autocorrelation and heteroscedasticity in the errors of the auxiliary test equation.
The PP tests of the logarithms of the levels and first differences of the seasonally unadjusted and adjusted volumes of electricity, coal, oil, and LNG in Table 2 were conducted using a constant term with a deterministic trend in order to provide alternative simulated critical values of the non-standard test. OLS was used to obtain the estimator of the residual spectrum at a frequency of zero. The PP test detects unit roots of the logarithmic levels of the variables at all reasonable levels of significance, but rejects the null hypothesis of unit roots in logarithmic first differences, so that the log differences of the data are stationary.
Table 2.
Phillips-Perron unit root tests.
The correlations in the logarithmic levels of the seasonally unadjusted and adjusted data are shown in Table 3(a), with the corresponding correlations in the logarithmic first differences of the seasonally unadjusted and adjusted data shown in Table 3(b). The highest correlations for the logarithmic levels of the seasonally unadjusted and adjusted data are between oil and LNG, with surprisingly close values of 0.73 and 0.737, respectively. The highest correlations for the logarithmic first differences of the seasonally unadjusted and adjusted data are quite different, with the highest correlation for the former being between oil and LNG at 0.445, and at a much lower 0.267 for coal and oil, as well as for coal and LNG.
Table 3.
(a) Correlation: Levels. (b) Correlation: First differences.
The [30] Johansen (1991) Trace and Maximum Eigenvalue tests of the number of cointegrating vectors for the seasonally unadjusted and adjusted values of coal, LNG, oil, and electricity are presented in Table 4. The null hypothesis of no cointegration (that is, R = 0) can be rejected at the 5% level of significance using both tests for the combination of logarithms of the seasonally unadjusted and adjusted volumes of coal, LNG, oil, and electricity.
Table 4.
Johansen Trace and Maximum Eigenvalue tests of cointegrating vectors.
7. Empirical Estimates and Analysis
The results of the PP unit root tests and Johansen tests of the number of cointegrating vectors in the previous section led to fully-modified OLS estimation ([31] Phillips and Hansen, 1990) of the cointegrating regressions, presented in Table 5.
Table 5.
Cointegrating regression (fully-modified OLS) for carbon emissions. Case 1: Seasonally unadjusted data. Case 2: Seasonally adjusted data.
Two cases are considered, as follows:
- Case 1: Seasonally unadjusted data, with a deterministic trend and dummy variable;
- Case 2: Seasonally adjusted data, with a deterministic trend and dummy variable.
The dummy variable, which was introduced to accommodate the influence of increases in electricity prices on May 2017 arising from tax changes in Japan, equals 0 before September 2017, and 1 after September 2017.
For both seasonally unadjusted and seasonally adjusted data, that is, Cases 1 and 2, the logarithms of coal and oil, as well as the deterministic time trend and the tax change dummy variable, are statistically significant in explaining the logarithm of carbon emissions, with oil being more significant than coal. The logarithm of LNG is not statistically significant at any conventional level of significance in explaining the logarithm of carbon emissions in either Case 1 or Case 2. The adjusted R2 values are reasonably high, at 0.943 and 0.958 for Cases 1 and 2, respectively, which suggests that the models are quite successful in explaining the logarithm of carbon emissions using both seasonally unadjusted and seasonally adjusted data.
The forecasts of for seasonally unadjusted and seasonally adjusted data are shown in Figure 1a,b, respectively, where two standard errors in each figure provide the 95% confidence intervals. In each case, there were noticeable falls in the forecasts of the logarithm of electricity from mid-2015 through to early-2016, and substantial rises toward the end of 2017.

Figure 1.
(a) Forecast of ( standard errors). (b). Forecast of ( standard errors).
The predicted latent carbon emission prices for seasonally unadjusted and adjusted data are depicted in Figure 2a,b respectively.

Figure 2.
(a) Predicted carbon emission prices. . . (b) Predicted carbon emission prices (SA). . .
The predicted values are based on the formulae for the logarithmic values presented in Equation (16), with the predicted carbon emissions prices having been obtained using the exponential function for the seasonally unadjusted data, as follows:
and for the seasonally adjusted data, as follows:
The patterns of the predicted carbon emissions prices in Figure 2a,b emulate the patterns described in Figure 1a,b with significant upward movements in both the seasonally unadjusted and seasonally adjusted data after mid-2017.
The correlation between and of 0.997 is given in Table 6, while the correlation between the seasonally unadjusted and seasonally adjusted predicted carbon emissions prices of 0.992 is given in Table 7. It is clear that the forecasts of carbon emission prices are very similar, regardless of whether seasonally unadjusted or seasonally adjusted data are used.
Table 6.
Correlation between and .
Table 7.
Correlation between predicted carbon emissions price and predicted carbon emissions price (SA).
8. Concluding Remarks
It is widely accepted in the international scientific and global communities that climate change and global warming are significantly affected by carbon emissions that arise from the burning of fossil fuels, specifically coal, oil, and gas. Although prices of fossil fuels are readily available, the prices of carbon emissions arising from competitive markets for the product are not yet commercially available.
As carbon emissions can be taxed appropriately when accurate commercial market prices are known and accepted at an international level, carbon emissions can be taxed appropriately for the purposes of measuring, capturing, storing, and trading in carbon emissions at regional, national, and international levels.
This paper used a novel KLEM production function approach to calculate the latent carbon emission prices, where carbon emissions are regarded as the output and capital (K), labor (L), energy (E) (or electricity), and materials (M), are the inputs for the production process. As the inputs K, L, and M are essentially fixed on a monthly basis, whereas E can be changed daily, it follows that changes in carbon emissions are functions of changes in E.
On the condition that prices depend on the average cost pricing, the prices of carbon emissions and energy may be approximated by an energy production model with a constant factor of proportionality, so that carbon emission prices will be a function of energy prices. Using this novel modeling approach, this paper estimated carbon emission prices for Japan using seasonally adjusted and unadjusted monthly data on the volumes of carbon emissions and energy, as well as energy prices, from December 2008 to April 2018.
The novel methods presented in the paper are straightforward to understand and implement, based on simple methods of estimation. The usefulness in establishing carbon emission prices goes to the heart of tackling global warming and climate change.
The econometric models showed that, as sources of electricity, the logarithms of coal and oil, though not LNG, are statistically significant in explaining the logarithm of carbon emissions, with oil being more significant than coal. The models generally displayed a high power in predicting the latent prices of carbon emissions. In particular, the forecasts of carbon emission prices were very similar, regardless of whether seasonally unadjusted or seasonally adjusted data were used. The usefulness of the empirical findings suggests that the methodology can also be applied for other countries where carbon emission prices are latent.
The theoretical approach and empirical application developed in the paper should be useful for purposes of public policy debate and decision making. Accurate predictions of the latent prices of carbon emissions, and the imposition of appropriate environmental taxes, are essential in order to mitigate the effect of carbon emissions on global warming and climate change.
Author Contributions
Conceptualization, C.-L.C. and M.M.; methodology, M.M.; software, C.-L.C.; validation, C.-L.C. and M.M.; formal analysis, C.-L.C. and M.M.; investigation, C.-L.C. and MM; resources, C.-L.C.; data curation, C.-L.C.; writing—original draft preparation, C.-L.C. and M.M.; writing—review and editing, M.M.; project administration, C.-L.C. and M.M.; funding acquisition, C.-L.C. and M.M.
Funding
This research received no external funding.
Acknowledgments
The authors wish to thank Shigeyuki Hamori and five reviewers for very helpful comments and suggestions, and Shigeyuki Hamori for kindly providing the data for the empirical analysis. For financial support, the first author wishes to acknowledge the Ministry of Science and Technology (MOST), Taiwan, and the second author is grateful to the Australian Research Council and the Ministry of Science and Technology (MOST), Taiwan.
Conflicts of Interest
The authors declare no conflict of interest.
Appendix A
Appendix A.1. Derivation of Correct Standard Errors for Realized Latent Carbon Emissions and Ranking of Important Underlying Factors
The novelty of the paper includes the development of innovative methods of analyzing latent exogenous and endogenous variables. There has been little, if any, research undertaken on latent variables, such as unobservable endogenous or exogenous variables, including rankings of the importance of various factors in explaining the latent prices of carbon emissions.
An analysis of latent variables is such that the choice of important and influential explanatory informational variables is typically arbitrary, that is, not based on any optimizing criterion. The final rankings may be based on averaging aggregated using Pythagorean arithmetic, geometric, or harmonic means. Consequently, any rankings based on arbitrary explanatory informational variables are themselves arbitrary.
The purpose of this paper is to determine and forecast latent endogenous variables, in particular, the latent prices for carbon emissions, and rankings using optimization methods, so that the rankings will also be optimal. The extensive list of references applies technical methods to show how the estimation and calculation of correct standard errors can be performed.
A wide range of alternative rankings include, but are not limited to the following:
- (1)
- Academic rankings of individuals, departments, faculties/schools/colleges, institutions, states, countries, and regions, which are of both academic and practical interest from individual students, parents, and institutions, to public and private policy advisors (for robust methods of ranking academic journals see, for example, [32,33,34,35,36,37] Chang and McAleer (2013, 2014a, 2014b, 2014c, 2015, 2016); [38,39,40,41,42] Chang, McAleer, and Oxley (2011a, 2011b, 2011c, 2011d, 2013); and [43] Chang, Maasoumi and McAleer (2016));
- (2)
- International rankings of universities, namely “Academic Ranking of World Universities (ARWU)” by Shanghai Ranking Consultancy (originally compiled and issued by Shanghai Jiao Tong University), “World University Rankings” by Times Higher Education (THE), and “QS World University Rankings” by Quacquarelli Symonds (QS), are based on arbitrary measures.
Appendix A.2. Theoretical Structure for Latent Endogenous and Exogenous Variables
The fundamental ideas and theoretical structure underlying a critical analysis of latent exogenous and latent endogenous variables can be presented in the following basic model, with extensions to models involving more complicated decision strategies to be presented subsequently.
Suppose the true linear relationship between a dependent variable, or an endogenous variable that is determined within the model, and a set of explanatory variables, that is, exogenous variables that are determined outside the model, can be represented as
where t = 1, 2, 3, … n observations; is the latent endogenous variable; is a kx1 vector of latent exogenous variables; the unknown parameter vector is kx1; and is an uncorrelated and homoskedastic (or possibly an independently and identically distributed) random error term. The parameter in Equation (A1) cannot be estimated as and are not observable.
A standard approach to estimating , calculating the in-sample estimates (that is, fitted values) of , and forecasting out-of-sample values of , is to use proxy variables for both and . However, the use of a proxy for will almost certainly lead to problems in estimation and sub-optimal statistical properties of the associated estimators.
The use of proxy variables is called the problem of measurement error, which is inevitable whenever latent variables are considered:
in which the proxy for in Equation (A2) is decomposed into two components, namely the latent component, which is the true measure of , and a measurement error that is the difference between and . Similarly, the proxy for in Equation (A3) is decomposed into two components, namely the latent component, which is the true measure of , and a measurement error that is the difference between and .
Such measurement error problems frequently arise in the social sciences, especially in economics and finance, where may represent latent expectational, desired, perceived, equilibrium, or long-run variables. Rankings across a wide range of academic disciplines are also typically based on using a proxy for that depends on arbitrary rather than optimal measures, so that the rankings are arbitrary.
There are two approaches to trying to accommodate the problem of measurement error, namely a primitive approach that includes the measurement errors, and an alternative approach that quarantines and improves upon the incorporation of measurement errors.
Appendix A.2.1. Primitive Approach
Substituting and in (A1) leads to
where is the composite error arising from Equations (A1)–(A3).
In Equation (A4), can be estimated by OLS, as both and are measurable, to obtain the in-sample estimated or fitted values, represented by “hat” (or circumflex) of , as follows:
where is the fitted value of both and , which follows from (A2), and is the OLS estimate of . Equation (A5) can be used to calculate the in-sample fitted values of or forecast out-of-sample values of .
It is well-known that the OLS estimate of is biased, inconsistent, and inefficient as in (A4) is correlated with in the composite error. However, this does not mean that out-of-sample forecasts based on will be inaccurate in comparison with forecasts obtained from using an alternative consistent estimator of . This is also well-known, but does not seem to have been applied to the issue of computing and forecasting latent endogenous variables, such as carbon emission prices and important ranking factors.
Appendix A.2.2. Generated Regressors and Realized Latents
The second approach is more sophisticated mathematically and statistically, and leads to consistent and efficient estimators of by reducing the impact of informational ignorance in the presence of latent exogenous and endogenous variables. Returning to Equation (A1), the substitution of for gives
in which it is reasonably assumed that is not correlated with or the random errors in and , respectively. The parameter cannot be estimated in Equation (A6) as is a latent exogenous variable, even though is observed.
Rather than simply replacing with a proxy variable such as , in economics, it is frequently assumed that the latent depends on a vector of observable variables , as follows:
where is gx1, and the unknown matrix of parameters C is gxk.
What is quite remarkable about Equation (A7) is that a latent exogenous variable, , which is unmeasurable for all n observations on k variables, is now given as a function of n measurable observations on g variables in and g unknown in C.
The assumption based on (A7) effectively reduces the ignorance and lack of informational content from n latent observations in to g unknown parameters in C.
Using the definition of in Equation (A3) and the assumption for in Equation (A7) gives
As both and are measurable, the OLS estimation of C in Equation (A8) leads to
which gives an estimate in Equation (A10) of the latent variable , , based on estimating in Equation (A8) and from Equation (A9):
Equation (A8) is the foundation of the “rational expectations” literature in economics as in Equation (A10) is used as a “rational” (or, more accurately, a model-consistent) estimate of the latent exogenous variable, . is called a “generated (or realized) regressor” as it can be used to replace the latent to estimate and . Returning to Equation (A6), the substitution of for gives
The substitution of from Equation (7), from Equation (9), and from Equation (10), in Equation (11), leads to
in which is an observable generated regressor.
As is uncorrelated with each of the components in the combined error in Equation (A12), namely , , and , it follows that the OLS estimation of by regressing on is unbiased.
Calculation of the correct standard errors for statistical inference to be valid is dependent on using the correct error structure in Equation (A12), as given in Equation (A13):
which is based on the composite error term in Equation (A4), rather than on alone.
Using the incorrect standard errors from the OLS estimation of Equation (A12) based on rather than on the composite error in Equation (A13) will lead to standard errors that are biased downward as the variance of
is positively definite. Therefore, the calculated t-ratios will be biased upward, thereby incorrectly leading to findings of spurious statistical significance.
An alternative approach to calculating the correct standard errors based on the OLS estimation of Equation (A12) when = 0, that is, when only the exogenous regressors are latent, is to incorporate the [44] Newey and West (1987) Heteroskedasticity and AutoCorrelation (HAC) consistent covariance matrix estimator. [45] Smith and McAleer (1994) have shown that the HAC covariance matrix estimator is reasonably accurate for purposes of drawing inferences in finite samples in the presence of generated regressors. The finite sample properties of the OLS estimation of Equation (A12) when = 0 does not hold, that is, when the endogenous variable is latent, remains a topic for future research.
Moreover, using the necessary and sufficient Rao–Zyskind condition (also sometimes referred to as Kruskal’s Theorem) for OLS to be efficient (see [46] McAleer, 1992), the OLS estimator of in Equation (A12) can be shown to be efficient, and hence to enable valid statistical inference (for further details see, for example, [47] Pagan (1984), [48,49,50,51] McAleer and McKenzie (1991, 1992, 1994, 1997), and [52] Fiebig et al. (1992)).
Using the OLS estimate of from Equation (A12) will provide in-sample fitted values of and out-of-sample forecasts of , and hence of . The fitted values of are , which are “realized latents” as they are the fitted or realized in-sample values of the latent variable in Equation (A1), and the realized out-of-sample forecasts of .
The rankings literature provides a novel example of how it is possible to estimate true latent rankings, , using realized latent rankings, , that are obtained from generated (or realized) regressors and observed rankings, , which are subject to measurement error. The realized latent rankings can also be used to obtain out-of-sample forecasts.
Appendix A.3. Extensions to More Complicated Decision Strategies
The presentation of the basic model concentrated on developing a theoretical approach and structure to generate univariate “realized latent rankings” for unobservable latent endogenous variables, and establishing the theoretical properties of the new measures, as well as of the associated parametric estimators.
Generated regressors and realized latents can be applied to obtain measurable and optimal rankings of any latent endogenous variables, with interesting and challenging extensions to a wide range of models arising from more complicated and sophisticated decision strategies.
The development of innovative methods of analyzing latent endogenous variables can be extended to the following topical and practical issues:
- (i)
- Extending “realized latent rankings” to multivariate unobserved latent endogenous variables, and establishing the theoretical properties of the new measures, as well as of the associated parametric estimators, using extensions of the technical developments in the basic model. These could include structural models, recursive models, and probabilistic models;
- (ii)
- OLS is the simplest technique that can be employed to obtain optimal weights through estimation, but it is possible to use Logistic regression to obtain optimal weights and the inherent associated probabilities;
- (iii)
- In the context of Cognitive Computing, it is widely argued that computers, computing facilities, machine hardware, mathematical algorithms, and computer software should be perceived as aids to learn dynamically, to reach managerial decisions, and to achieve strategic aims;
- (iv)
- As advanced machines can be programmed to learn through feedback, an important implication is that the outputs obtained from inputs and processing systems are not the same if learning is allowed because the outputs can differ. Consequently, outputs from such processing systems should be modeled and analyzed in a probabilistic context which, in turn, helps to make managerial decisions.
The cognitive approach is precisely related to realized latent rankings models, in which the probabilistic outputs, namely realized rankings, can change dynamically according to the generated (or realized) regressors and dynamic learning. Therefore, the probabilities would assist in strategic management decision making.
Conditional on the sets of features (such as the latent endogenous variable, and measurable or latent exogenous variables), the model can provide different realized latencies, such as latent carbon emission prices, which can assist management in making optimal decisions. It is assumed in latent probabilistic models that the exogenous regressors are measurable, such as
where y* in Equation (A15) is latent, but we observe a binary variable y that takes the value of 1 or 0, according to whether or not y* crosses a threshold, that is, y = 1 if y* > 0, and 0 otherwise.
The probabilistic model is given by
where F in Equation (A16) is typically a normal or logistic cumulative distribution function (CDF). Then, y* is given in Equation (A17) by
The model and associated analysis change drastically if the basic latent regression model is changed to incorporate X*, as in Equation (A18), rather than X in Equation (A15):
as an appropriate model would be required for X* to obtain generate regressors for X* and realized latencies for y*.
Such an extension of logistic probabilistic models to incorporate latent exogenous variables to estimate and forecast latent endogenous variables is a novel extension of the classical latent probabilistic model, and can be analyzed as follows:
- (v)
- Defining and measuring a wide range of latent variables, such as unknown carbon emission prices and academic quality; ranking individuals, departments, faculties, and institutions based on the new measure; and establishing the theoretical properties of the new measures, as well as the associated parametric estimators;
- (vi)
- Ranking individuals, departments, faculties and institutions based on non-academic measures, and establishing the theoretical properties of the new measures, as well as of the associated parametric estimators;
- (vii)
- Applying the approach based on “realized latent rankings” commercially to any decision making strategies in business, using structural models; multiple decision making based on recursive models, that is, sequential decision making; and strategic decision making using probabilistic models, among others.
For example, if and are latent endogenous and exogenous variables, respectively, in stage 1 of a decision making process, the model may capture sequential decision making in Equations (A19)–(A21), as follows:
where in Equation (A20) is obtained at the first stage from Equation (A19), in Equation (A21) is obtained at the second stage from Equation (A20), and so on.
It is clear that sequential models will be necessary to obtain “sequential generated regressors”, and subsequent “sequential realized latencies”, which would present challenging and novel theoretical and practical developments. The approach given above could also be extended to probabilistic models, which would be an entirely new and innovative development.
Each of the points stated above is novel from theoretical and practical perspectives, namely applications of generated regressors and realized latents to obtain measurable and optimal rankings of any latent endogenous variables, and extensions to a wide range of more complicated models and decision strategies. The techniques of the basic model presented above can be extended, and are portable to different problems and disciplines, such as latent carbon emission prices, rankings of important factors, and associated management decisions.
References
- Allen, D.E.; McAleer, M. Fake news and indifference to scientific fact: President Trump’s confused tweets on global warming, climate change and weather. Scientometrics 2018, 117, 625–629. [Google Scholar] [CrossRef]
- Daskalakis, G.; Markellos, R. Are electricity risk premia affected by emission allowance prices? Evidence from the EEX, Nord Pool and Powernext. Energy Policy 2009, 37, 2594–2604. [Google Scholar] [CrossRef]
- Daskalakis, G.; Psychoyios, D.; Markellos, R.N. Modeling CO2 emission allowance prices and derivatives: Evidence from the European Trading Scheme. J. Bank. Financ. 2009, 33, 1230–1241. [Google Scholar] [CrossRef]
- Bushnell, J.B.; Chong, H.; Mansur, E.T. Profiting from regulation: Evidence from the European Carbon Market. Am. Econ. J. Econ. Policy 2013, 5, 78–106. [Google Scholar] [CrossRef]
- Oestreich, A.M.; Tsiakas, I. Carbon emissions and stock returns: Evidence from the EU Emissions Trading Scheme. J. Bank. Financ. 2015, 58, 294–308. [Google Scholar] [CrossRef]
- Martin, R.; Muuls, M.; Wagner, U.J. The impact of the European Union Emissions Trading Scheme on regulated firms: What Is the evidence after ten years? Rev. Environ. Econ. Policy 2016, 10, 129–148. [Google Scholar] [CrossRef]
- Chang, C.-L.; Mai, T.-K.; McAleer, M. Pricing carbon emissions in China. Ann. Financ. Econ. 2018, 13, 1–37. [Google Scholar]
- Chang, C.-L.; Mai, T.-K.; McAleer, M. Establishing national carbon emission prices for China. Renew. Sustain. Energy Rev. 2019, 106, 1–16. [Google Scholar] [CrossRef]
- Chen, W. The costs of mitigating carbon emissions in China: Findings from China MARKAL-MACRO modeling. Energy Policy 2005, 33, 885–896. [Google Scholar] [CrossRef]
- Gregg, J.S.; Andres, R.J.; Marland, G. China: Emissions pattern of the world leader in CO2 emissions from fossil fuel consumption and cement production. Geophys. Res. Lett. 2008, 35. [Google Scholar] [CrossRef]
- Li, J.; Colombier, M. Managing carbon emissions in China through building energy efficiency. J. Environ. Manag. 2009, 90, 2436–2447. [Google Scholar] [CrossRef] [PubMed]
- Chang, C.-C. A multivariate causality test of carbon dioxide emissions, Energy consumption and economic growth in China. Appl. Energy 2010, 87, 3533–3537. [Google Scholar] [CrossRef]
- Nam, K.-M.; Waugh, C.J.; Paltsev, S.; Reilly, J.M.; Karplus, V.J. Synergy between pollution and carbon emissions control: Comparing China and the United States. Energy Econ. 2014, 46, 186–201. [Google Scholar] [CrossRef]
- Zhang, D.; Karplus, V.; Cassisa, C.; Zhang, X. Emissions trading in China: Progress and prospects. Energy Policy 2014, 75, 9–16. [Google Scholar] [CrossRef]
- Zhang, Y.-J.; Wang, A.D.; Da, Y.-B. Regional allocation of carbon emission quotas in China: Evidence from the Shapley value method. Energy Policy 2014, 74, 454–464. [Google Scholar] [CrossRef]
- Liu, L.; Chen, C.; Zhao, Y.; Zhao, E. China’s carbon-emissions trading: Overview, challenges and future. Renew. Sustain. Energy Rev. 2015, 49, 254–266. [Google Scholar] [CrossRef]
- Tang, L.; Wu, J.; Yu, L.; Bao, Q. Carbon emissions trading scheme exploration in China: A multi-agent-based model. Energy Policy 2015, 81, 152–169. [Google Scholar] [CrossRef]
- Zhang, Y. Reformulating the low-carbon green growth strategy in China. Clim. Policy 2015, 15, 40–59. [Google Scholar] [CrossRef]
- Xiong, L.; Shen, B.; Qi, S.; Price, L.; Ye, B. The allowance mechanism of China’s carbon trading pilots: A comparative analysis with schemes in EU and California. Appl. Energy 2017, 185, 1849–1859. [Google Scholar] [CrossRef]
- Zhang, Y.J.; Sun, Y.F. The dynamic volatility spillover between European carbon trading market and fossil energy market. J. Clean. Prod. 2016, 112, 2654–2663. [Google Scholar] [CrossRef]
- Chang, C.-L.; McAleer, M. The fiction of full BEKK: Pricing fossil fuels and carbon emissions. Financ. Res. Lett. 2019, 28, 11–19. [Google Scholar] [CrossRef]
- Chang, C.-L.; McAleer, M.; Zuo, G.D. Volatility spillovers and causality of carbon emissions, oil and coal spot and futures for the EU and USA. Sustainability 2017, 9, 1789. [Google Scholar] [CrossRef]
- Hitzemann, S.; Uhrig-Homburg, M.; Ehrhart, K. Emission permits and the announcement of realized emissions: Price impact, trading volume and volatilities. Energy Econ. 2015, 51, 560–569. [Google Scholar] [CrossRef]
- Kim, J.; Park, Y.; Ryu, D. Stochastic volatility of the futures prices of emission allowances: A Bayesian approach. Physica A 2017, 465, 714–724. [Google Scholar] [CrossRef]
- Hitzemann, S.; Uhrig-Homburg, M. Empirical performance of reduced-form models for emission permit prices. Rev. Deriv. Res. 2019, 22, 389–418. [Google Scholar] [CrossRef]
- OECD. OECD Productivity Manual: A Guide to the Measurement of Industry-Level and Aggregate Productivity Growth; Statistics Directorate, Directorate for Science, Technology and Industry: Paris, France, 2001. [Google Scholar]
- Lecca, P.; Swales, K.; Turner, K. An investigation of issues relating to where energy Should enter the production function. Econ. Model. 2011, 28, 2832–2841. [Google Scholar] [CrossRef]
- Chang, C.-L.; Hamori, S.; McAleer, M. Pricing carbon emissions based on energy production, unpublished paper, Department of Applied Economics and Department of Finance, National Chung Hsing University, Taiwan; Graduate School of Economics, Kobe University, Japan; Department of Finance, Asia University, Taiwan, 2018.
- Phillips, P.C.B.; Perron, P. Testing for a unit root in time series regression. Biometrika 1988, 75, 335–346. [Google Scholar] [CrossRef]
- Johansen, S. Estimation and hypothesis testing of cointegration vectors in Gaussian vector autoregressive models. Econometrica 1991, 59, 1551–1580. [Google Scholar] [CrossRef]
- Phillips, P.; Hansen, B. Statistical inference in instrumental variables regression with I(1) processes. Rev. Econ. Stud. 1990, 57, 99–125. [Google Scholar] [CrossRef]
- Chang, C.-L.; McAleer, M. Ranking leading econometric journals using citations data from ISI and RePEc. Econometrics 2013, 1, 217–235. [Google Scholar] [CrossRef]
- Chang, C.-L.; McAleer, M. How should journal quality be ranked? An application to agricultural, energy, environmental and resource economics. J. Rev. Glob. Econ. 2014, 3, 33–47. [Google Scholar]
- Chang, C.-L.; McAleer, M. Ranking economics and econometrics ISI journals by quality weighted citations. Rev. Econ. 2014, 65, 35–52. [Google Scholar] [CrossRef][Green Version]
- Chang, C.-L.; McAleer, M. Just how good are the top three journals in finance? An assessment based on quantity and quality citations. Ann. Financ. Econ. 2014, 9, 1450005. [Google Scholar] [CrossRef]
- Chang, C.-L.; McAleer, M. Bibliometric rankings of journals based on the Thomson Reuters citations database. J. Rev. Glob. Econ. 2015, 4, 120–125. [Google Scholar]
- Chang, C.-L.; McAleer, M. Quality weighted citations versus total citations in the sciences and social sciences, with an application to finance and accounting. Manag. Financ. 2016, 42, 324–337. [Google Scholar] [CrossRef]
- Chang, C.-L.; McAleer, M.; Oxley, L. What makes a great journal great in economics? The singer not the song. J. Econ. Surv. 2011, 25, 326–361. [Google Scholar] [CrossRef]
- Chang, C.-L.; McAleer, M.; Oxley, L. What makes a great journal great in the sciences? Which came first, the chicken or the egg? Scientometrics 2011, 87, 17–40. [Google Scholar] [CrossRef]
- Chang, C.-L.; McAleer, M.; Oxley, L. Great expectatrics: Great papers, great journals, great econometrics. Econom. Rev. 2011, 30, 583–619. [Google Scholar] [CrossRef]
- Chang, C.-L.; McAleer, M.; Oxley, L. How are journal impact, prestige and article influence related? An application to neuroscience. J. Appl. Stat. 2011, 38, 2563–2573. [Google Scholar] [CrossRef]
- Chang, C.-L.; McAleer, M.; Oxley, L. Coercive journal self citations, impact factor, journal influence and article influence. Math. Comput. Simul. 2013, 93, 190–197. [Google Scholar] [CrossRef]
- Chang, C.-L.; Maasoumi, E.; McAleer, M. Robust ranking of journal quality: An application to economics. Econom. Rev. 2016, 35, 50–97. [Google Scholar] [CrossRef]
- Newey, W.; West, K. A simple, positive semi-definite, heteroskedasticity and autocorrelation consistent covariance matrix. Econometrica 1987, 55, 703–708. [Google Scholar] [CrossRef]
- Smith, J.; McAleer, M. Newey-West covariance matrix estimates for models with generated regressors. Appl. Econ. 1994, 26, 635–640. [Google Scholar] [CrossRef]
- McAleer, M. The Rao-Zyskind condition, Kruskal’s theorem and ordinary least squares. Econ. Rec. 1992, 68, 65–72. [Google Scholar] [CrossRef]
- Pagan, A. Econometric issues in the analysis of regressions with generated regressors. Int. Econ. Rev. 1984, 25, 221–247. [Google Scholar] [CrossRef]
- McAleer, M.; McKenzie, C.R. When are two-step estimators efficient? Econom. Rev. 1991, 10, 235–252. [Google Scholar] [CrossRef]
- McAleer, M.; McKenzie, C.R. Recursive estimation and generated regressors. Econ. Lett. 1992, 39, 1–5. [Google Scholar] [CrossRef]
- McAleer, M.; McKenzie, C.R. On the effects of misspecification errors in models with generated regressors. Oxf. Bull. Econ. Stat. 1994, 56, 441–455. [Google Scholar]
- McAleer, M.; McKenzie, C.R. On efficient estimation and correct inference in models with generated regressors: A general approach. Jpn. Econ. Rev. 1997, 48, 368–389. [Google Scholar]
- Fiebig, D.G.; McAleer, M.; Bartels, R. Properties of ordinary least squares estimators in regression models with non-spherical disturbances. J. Econom. 1992, 54, 321–334. [Google Scholar] [CrossRef]
© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).