Next Article in Journal
Renovation of Public Lighting Systems in Cultural Landscapes: Lighting and Energy Performance and Their Impact on Nightscapes
Previous Article in Journal
Quasi-Z-Source Inverter-Based Photovoltaic Power System Modeling for Grid Stability Studies
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Analysis of Factors Influencing Energy Efficiency Based on Spatial Quantile Autoregression: Evidence from the Panel Data in China

1
School of Mathematics and Physics, North China Electric Power University, Beijing 102206, China
2
College of Applied Sciences, Beijing University of Technology, Beijing 100124, China
*
Author to whom correspondence should be addressed.
Energies 2021, 14(2), 504; https://doi.org/10.3390/en14020504
Submission received: 16 December 2020 / Revised: 12 January 2021 / Accepted: 12 January 2021 / Published: 19 January 2021
(This article belongs to the Section C: Energy Economics and Policy)

Abstract

:
This research mainly studies the factors influencing the efficiency of energy utilization. Firstly, by calculating M o r a n s I and local indicators of spatial association (LISA) of energy efficiency of regions in mainland China, we found that energy efficiency shows obvious spatial autocorrelation and spatial clustering phenomena. Secondly, we established the spatial quantile autoregression (SQAR) model, in which the energy efficiency is the response variable with seven influence factors. The seven factors include industrial structure, resource endowment, level of economic development etc. Based on the provincial panel data (1998–2016) of mainland China (data source: China Statistical Yearbook, Statistical Yearbook of provinces), the findings indicate that level of economic development and industrial structure have a significant role in promoting energy efficient. Resource endowment, government intervention and energy efficiency show a negative correlation. However, the negative effect of government intervention is weakened with the increase of energy efficiency. Lastly, we compare the results of SQAR with that of ordinary spatial autoregression (SAR). The empirical result shows that the SQAR model is superior to SAR model in influencing factors analysis of energy efficiency.

1. Introduction

With the rapid development of the global economy, energy demand and consumption are increasing extensively, which has also brought some problems, such as environmental pollution, global warming, excessive exploitation of resources etc. For sustainable economic development and social progress, it is very important to improve energy efficiency and formulate relevant policies. There are some studies focused on how to put the energy system onto a more sustainable path from a macro perspective. See for example [1,2,3,4,5] and references therein.
The reasonable calculation method of energy utilization efficiency and quantitative analysis of influence factors are necessary and important for policymakers. Tobler, an American geographer in [6] proposed that everything is related to everything else, but near things are more related to each other, which is also known as Tobler’s First Law of Geography. The energy efficiency of one region also can be influenced by neighboring regions. Some studies showed the spatial auto-correlation and clustering phenomenon of energy efficiency. For example, Shen and Liu [7] used the data envelopment analysis (DEA) model to measure the energy efficiency values of 30 provinces in China from 1992 to 2007. The result shows that there are obvious spatial aggregation phenomena in provincial energy efficiency.
Concerning research on influence factors of energy efficiency, there are some literatures, for example [7,8,9,10,11,12,13,14,15] etc. Shen and Liu [7] studied influence factors by using SAR and threshold regression models. Liu and Mao [10] studied the factors influencing energy efficiency by using a spatial lagged model (SLM, also called SAR in literature) and spatial error model (SEM) based on the data of 29 provinces and cities in China from 2000 to 2012. They did not consider the test of homoscedasticity of explanatory variables, which is required before building SLM and SEM. Makridou et al. (2016) [8] analyzed the energy efficiency of five energy-intensive industries in 23 European Union countries by using a two-level cross-classified model. Li and Shi (2014) in [9] proposed an improved super-SBM (slacks-based measure) model to analyze the energy efficiency of Chinese industrial sectors. Jiang et el. [12] analyzed energy efficiency of provinces in China by using the ordinary spatial Dubin error model for panel data. Li and Lin (2018) [13] studied the direct and indirect effects of different types of technological progress on China’s energy efficiency.
To the best of our knowledge, existing references about influence factors of energy efficiency used ordinary regression without considering different quantiles. However, the ordinary regression model characterizes the relationship between the expectation of the explained variable and explanatory variables, which is not robust to outliers. To this end, the quantile regression model is a good way to deal with outliers, which can describe more detail of data. Since Koenker and Bassett [16] proposed the quantile regression method in 1978, the quantile idea has been widely used and developed (see e.g., [17,18]).
The purpose of this work is to analyze the factors influencing the efficiency of energy utilization based on the spatial quantile autogression model, so as to provide some advice on how to improve energy efficiency for decision-makers. As far as we know, the SQAR model has not been employed in existing references related to this topic. Why do we use quantiles? Quantile regression is robust to outliers. By considering different quantiles, we can get different coefficients of factors, which can reveal the relationship between factors and energy efficiency more comprehensively. Based on the work [19], we combine spatial autoregression with quantiles to study factors influencing the efficiency of energy. Firstly we analyze the M o r a n s I ([20]), local indicators of spatial association (LISA) of provincial energy efficiencies of China. The data set for empirical analysis comes from China Statistical Yearbook, Statistical Yearbook of provinces and http://data.stats.gov.cn, http://www.tjcn.org (1998–2016). The M o r a n s I and LISA of energy efficiency show that energy efficiency possesses obvious spatial autocorrelation and spatial aggregation. Secondly, thanks to the spatial dependence of energy efficiency, we use the spatial quantile autoregressive model of panel data to analyze the influence of seven factors on energy efficiency at different quantile levels. The conclusion shows that the industrial structure, economic development level and technical level have a positive effect on the provincial energy efficiency in China. Resource endowment shows a negative correlation to energy efficiency. According to the empirical analysis based on the real data, the SQAR model is superior to the ordinary spatial autoregressive model in analyzing influence factors of energy efficiency.
The rest of the paper is organized as follows: Section 2 is about the methods and models including the measurement of efficiency of energy utilization, M o r a n s I , LISA and SQAR. In Section 3, we apply the models to the provincial data of China during the years of 1998–2016 to analyze the efficiency of energy utilization.

2. Methods and Models

2.1. Spatial Auto-Correlation Test Based on M o r a n s I Index

In this subsection, we introduce the measurement of energy efficiency, M o r a n s I and LISA which are employed to test the spatial autocorrelation of energy efficiency.
For the measurement of energy efficiency, we use the data envelopment analysis (DEA) by taking the so-called low-carbon GDP [19] as the output variable. For convenience, we briefly describe the method of data envelopment analysis here. Assume there are n regions or called decision making units (DMU) in some references. Each region has m input variables and k output variables. For a year in the j-th region, x j = ( x 1 j , x 2 j , , x m j ) and y j = ( y 1 j , , y k j ) ( j = 1 , 2 , , n ) represent input and output vectors respectively, whose corresponding weight vectors are denoted by μ = ( μ 1 , , μ m ) and ν = ( ν 1 , , ν k ) respectively. The CRS-DEA model [21] of the j-th region can be expressed as follows:
max μ , ν ( ν y j / μ x j )
s . t ν y i / μ x i 1 , , i = 1 , 2 , , n , μ 0 , ν 0 .
The dual problem of model (1) can be expressed as:
m i n θ j
s . t i = 1 n x s i λ i θ j x s j , s = 1 , , m i = 1 n y s i λ i y s j , s = 1 , , k λ i 0 , i = 1 , 2 , , n ,
where θ j ( 0 < θ j 1 ) is a scalar, the efficiency value of the j-th region. It embodies the region’s ability to effectively allocate resources and sustainable development capability. If θ j = 1 , it means that the j-th region is located at the production frontier, that is, the j-th region that obtains the maximum output under certain input is effective. λ is an n × 1 vector of constants. It serves to form a convex combination of observed inputs and outputs.
The global M o r a n s I describes the global spatial autocorrelation of all data, which is defined as follows [20]:
I = n i = 1 n j = 1 n w i j ( x i x ¯ ) ( x j x ¯ ) i = 1 n j = 1 n w i j j = 1 n ( x j x ¯ ) 2 = i = 1 n j = 1 n w i j ( x i x ¯ ) ( x j x ¯ ) S 2 i = 1 n j = 1 n w i j ,
where S 2 = 1 n i = 1 n ( x i x ¯ ) 2 , n the number of areas (spatial units), x i the energy efficiency value of the i-th region, w i j the element of the spatial weight matrix, which represents the adjacent relationship between the i-th region and the j-th region. It is defined as follows:
w i j = 1 I f a r e a i i s a d j a c e n t t o a r e a j 0 I f a r e a i i s n o t a d j a c e n t t o a r e a j .
The design of the spatial weight matrix in this paper follows the rook adjacency method. That is, if there is a common boundary between the two regions, they are considered to be adjacent. Using this method, a spatial weight matrix can be constructed.
The range of M o r a n s I is [ 1 , 1 ] . A positive M o r a n s I means that there is a positive spatial correlation between the economic behaviors of various regions. The negative M o r a n s I indicates that there is a spatial negative correlation between economic behaviors in various regions. If M o r a n s I = 0 , there is no spatial correlation, and the economic behavior of each region is independent of each other. The closer the absolute value of M o r a n s I is to 1, the greater the spatial correlation between regions.
By standardizing statistic M o r a n s I , a significance test can be performed. Let
Z ( I ) = I E ( I ) ( V a r ( I ) ) .
Under the null hypothesis H 0 : M o r a n s I = 0 , that is, the null hypothesis of no spatial autocorrelation, we have:
E ( I ) = 1 n 1 ,
V a r ( I ) = 1 w 0 ( n 2 1 ) ( n 2 w 1 m w 2 + 3 w 0 3 ) ( E ( I ) ) 2 ,
where w 0 = i = 1 n j = 1 n w i j ; w 1 = i = 1 n j = 1 n ( w i j + w j i ) 2 ; w 2 = i = 1 n ( w i + w i ) 2 ; w i = j = 1 n w i j , w i = j = 1 n w j i . When the null hypothesis holds, Z ( I ) follows a normal distribution.
Global spatial autocorrelation analysis yields only one statistic to summarize the whole study area. One statistic is not enough to describe the details of individual regions. One way is to study the local spatial autocorrelation of local regions. The local indicators of spatial association (LISA) to evaluate the clustering in those individual units by calculating Local M o r a n s I for each spatial unit and evaluating the statistical significance for the i-th region:
I i = j = 1 n w i j ( x j x ¯ ) ( x i x ¯ ) S 2 .

2.2. Spatial Quantile Autoregressive Model

Based on Su and Yang’s research [22] on the spatial quantile autoregressive model under cross-section data, we extend it to the panel data case. Combining the spatial autoregressive model with quantile regression method, we need to deal with endogenous issues. We use the instrumental variable method proposed by Chernozhukov and Hansen [23,24] to solve the variable endogenous problem, and give the process of parameter estimation of the spatial quantile autoregressive model.
In 1978, Koenker and Bassett [16] proposed the idea of quantile regression and extended the mean regression model to the quantile one.
Let ( Ω , F , P ) be a probability space, Y a random variable defined in Ω . For any real number y, the distribution function of the random variable Y is defined by F ( y ) = P ( Y y ) . For any 0 τ 1 , Q ( τ ) ( Y ) : = inf { y : F ( y ) τ } is called the τ -th quantile of Y. Q ( τ ) ( Y ) is abbreviated as Q ( τ ) . Q ( τ ) can be seen as a function of τ , called the quantile function of Y. Particularly, Q ( 0.5 ) is the median of Y.
The multivariate quantile regression model is defined as follows:
Y = X β ( τ ) + U ( τ ) ,
where Y = ( y 1 , , y n ) with dependent components. X = ( x i j ) n × m = ( X 1 , , X n ) is a non-random explanatory matrix, where the transpose ’′‘ follows the transpose rules of the block matrix. For each i = 1 , , n , X i = ( x i 1 , , x i m ) . β ( τ ) = ( β 1 ( τ ) , , β m ( τ ) ) is the parameter vector to be estimated. U ( τ ) = ( u 1 ( τ ) , u 2 ( τ ) , , u n ( τ ) ) is the error term. Assume each error component u i ( τ ) = y i X i β i ( τ ) satisfying the condition
P ( u i ( τ ) 0 | X i ) = τ , i = 1 , 2 , , n ,
the formulas (9) mean that the conditional τ -quantile of each error term is 0.
In 1973, Cliff and Ord [25] proposed a spatial autoregressive model. For the cross-sectional data, the spatial autoregressive model is expressed as:
y i = λ j = 1 n w i j y j + X i β + u i , i = 1 , 2 , , n ,
where the subscript i is the cross section unit. y i is the explained variable of the i-th cross section unit, and X i = ( x i 1 , x i 2 , , x i m ) is the explanatory vector. w i j is the weight, reflecting the relation between the i-th region and the j-th region. Denote Y = ( y 1 , , y n ) , W = ( w i j ) n × n , X = ( x i j ) n × m , β = ( β 1 , , β m ) . U = ( u 1 , , u n ) is the random error vector of independent and identically distributed with zero mean. The model (10) can be written in matrix form as follows:
Y = λ W Y + X β + U .
By applying the first-order moment constraint E ( u i | X i ) = 0 ( i = 1 , 2 , , n ) to Equation (10), then the conditional mean E ( y i λ d i | X ) = x i β , where d i = i = j n w i j y j . Similarly, applying quantile constraint P ( u i ( τ ) 0 | X ) = τ to Equation (10), the following Equation (12) can be obtained:
P ( y i λ ( τ ) d i X i β ( τ ) 0 | X i ) = τ .
Combining formula in (10) and (12), we have the spatial quantile autoregressive model for the cross-sectional data as follows:
y i = λ ( τ ) d i + X i β ( τ ) + u i ( τ ) P ( y i λ ( τ ) d i X i β ( τ ) 0 | X i ) = τ , i = 1 , 2 , , n .
We extend model (13) to the spatial quantile autoregressive model for panel data:
y i t = λ ( τ ) d i t + X i t β ( τ ) + u i t ( τ ) P ( y i t λ ( τ ) d i t X i t β ( τ ) 0 | X i t ) = τ i = 1 , 2 , , n , t = 1 , 2 , , T ( 1 ) ,
where X i t is an m-dimensional column vector.
Note: t can also be continuous time. Here we only consider the discrete case. We assume the time is divided into T periods. For each i and t, ( y i t , X i t ) R × R m ( R is the set of all real numbers, R m is the m-dimensional Euclidean space). d i t = j = 1 n w i j y j t , λ ( τ ) is the coefficient of the spatial lagged factor. β ( τ ) is an m × 1 vector of regression coefficients, u i t R is the error term with zero quantile, X i t = ( x i 1 t , x i 2 t , , x i m t ) .
Endogenous variables are important in economic modeling, which are determined by their relationship with other variables within the model and show whether a variable causes a particular effect. The spatial lagged factor d i t = j = 1 n w i j y j t in the spatial autoregressive model can be regarded as an endogenous variable. The spatial econometric model combined with quantile regression needs to deal with endogenous issues. Otherwise, it will affect the accuracy of parameter estimation. Therefore, in the next subsection, we will discuss the instrumental variable method to deal with endogenous problems and parameter estimation.

2.3. Instrument Variable Method for Endogenous Problem and Parameter Estimation

Instrumental variables are variables that do not belong to the original model and are related to endogenous variables (Pearl, 2000 [26]). Concerning instrumental variable(IV) method, we refer to the literatures of Chernozhukov and Hansen [23,24], Matthew and Carlos (2009) [27].
Let ( Z i ) T × m = ( Z i 1 , Z i 2 , , Z i T ) be a matrix of instrument variables, where Z i t = ( Z i 1 t , Z i 2 t , , Z i m t ) R m . Z i t is related to X i t but unrelated to u i t and satisfies
P ( y i t λ ( τ ) d i t + X i t β ( τ ) | X i t , Z i t ) = τ ,
Let W ˜ = I T W , where I T is the T × T identity matrix and ‘⊗’ represents the Kronecker product.
W ˜ n T × n T = I T W = 1 1 1 T × T w 11 w 12 w 1 n w 21 w 22 w 2 n w n 1 w n 2 w n n = 1 × W 1 × W 1 × W = W W W
Take Z i t = j = 1 n w ˜ i j t X j t , where X j t = ( X j 1 t , X j 2 t , , X j m t ) , w ˜ i j t = w i j for t = 1 , , T , which implies the spatial weights are invariable in different periods. Then Z i t is strictly exogenous and related to endogenous variables. The IV method will be simpler if y i t , d i t , X i , Z i are independently and identically distributed. Then (15) is equivalent to find ( λ ( τ ) , β ( τ ) ) such that zero is a solution to the ordinary quantile regression:
0 a r g m i n g G E [ ρ τ ( y i t λ ( τ ) d i t X i t β ( τ ) g ( X i t , Z i t ) ) ] ,
where ρ τ ( ϕ ) = [ τ χ ( ϕ 0 ) ] ϕ . χ ( · ) is the indicator function. G is a class of measurable functions of ( X i t , Z i t ) . Let D = I T ( I n λ ( τ ) W ) , there is G = ( I T W ) D 1 . Here we assume G is defined as follows:
G = { g ( X i t , Z i t ) = γ Z i t , γ Γ } ,
where γ is a column vector and Γ R m . In order to solve the problem (17), we construct the weighted quantile regressive objective function as follows:
Q τ ( λ , β , γ ) = 1 n T i = 1 n t = 1 T ρ τ ( y i t λ d i t X i t β γ Z i t ) .
By the results in [23], the principle to solve problem (17) is to find λ ^ , β ^ such that the instrument variable coefficient γ ^ is driven as close to zero as possible. Here we use the two-stage least squares on the objective function (18).
Assumptions:
(1)
P ( u i t 0 ) = τ , i = 1 , 2 , , n , t = 1 , 2 , , T .
(2)
For any t, s u p n 1 m a x 1 i n E | u i t | u ¯ .
(3)
Given a value of u i t , the conditional distribution function F ( · | u ¯ i t ) has a bounded continuous conditional probability density function f ( · | u ¯ i t ) . Where u ¯ i t = k i n t = 1 T g i k t u k t , g i k t is the element of G.
(4)
X is non-random. Its absolute value is uniformly bounded and contains the intercept term.
(5)
The instrument variable Z is non-random with full column rank.
Let ξ i t = ( X i t , Z i t ) . The algorithm of solving parameter estimation of the SQAR model are summarized as follows:
Step 1: for a given τ , to perform a ordinary quantile regression on y i t λ d i t based on ξ i t as follows:
( β ^ ( τ ) ( λ ) , γ ^ ( τ ) ( λ ) ) = a r g m i n ( β , γ ) Q τ ( λ , β , γ ) .
Step 2: minimize the value of γ ^ ( τ ) ( λ ) , and then get the estimator of λ ( τ ) :
λ ^ ( τ ) = a r g m i n λ γ ^ ( τ ) ( λ ) A ^ ( γ ^ ( τ ) ( λ ) ) ,
where A ^ = A + o p ( 1 ) , o p ( 1 ) = s u p λ | n S ( τ ) ( λ ) | , S ( τ ) ( λ ) = I n λ ( τ ) W , A is a symmetric positive definite matrix. In fact, in order to find the estimator of λ ( τ ) , the coefficient γ ^ ( τ ) ( λ ) of the instrument variable should be driven as close to zero as possible. For simplicity, referring to the study of Chernozhukov and Hansen [23,24], here we take A ^ to be an identity matrix.
Step 3: performing weighted quantile regression on y i t λ ^ ( τ ) d i t based on ξ i t we can get an estimator of β ( τ ) :
β ^ ( τ ) = β ^ ( τ ) ( λ ^ ( τ ) ) .

3. Empirical Analysis and Results

In this section, we apply the above models to analyze the spatial autocorrelation and influence factors of energy efficiency of China during the years 1998–2016. Here we only consider 30 provinces and cities of China (Tibet, Hongkong, Macao and Taiwan were not studied in this paper since the incomplete data). All data come from China statistical Yearbook and local statistical Yearbooks (http://www.tjcn.org, http://data.stats.gov.cn). In the web page http://data.stats.gov.cn, one can find the navigation links to the annual reports of provinces. We downloaded and checked the data carefully. We assure the accuracy of input data in our models.

3.1. Spatial Autocorrelation Test Based on Global and Local M o r a n s I

In [19], Lu and Zhang calculated energy efficiency according to (2) based on the low-carbon GDP output, where n = 30 , m = 3 and k = 1 . Based on the results in [19], here we obtain yearly M o r a n s I and LISA during the study period 1998–2016. The spatial weight matrix is designed following the rook adjacency rule. Hainan province’s geographical location in China is special, an island across the sea from the mainland. There will be no neighbors under the rook adjacency method. However, since the reform and opening up, Hainan province has become increasingly close to the outside world. So we consider Hainan province to be adjacent to Guangdong province.
In statistics, there are p-value and Z-value or Z-score. The p-value is the probability that you have falsely rejected the null hypothesis. The Z-value is the measure of standard deviation. Very high or a very low (negative) Z-value, associated with very small p-values, are found in the tails of the normal distribution. It can be seen from Table 1 and Figure 1 that values of M o r a n s I of China’s provincial energy efficiency are positive. All M o r a n s I are statistically significant at the given significance level 1 % . Therefore, it rejects the null hypothesis that there is no spatial autocorrelation in China’s provincial energy efficiency.
For each year, we obtained the scatter plots and clustering graphs of LISA. Here we only list the yearly results of 2008 and 2016 for limitation of pages.
It can be seen from Table 1 and Figure 1, the values of M o r a n s I are positive and fluctuate around 0.366 from 1998 to 2016, with little change year-on-year, which implies a positive correlation. On the whole, the energy efficiency M o r a n s I has a slight upward trend, and the spatial autocorrelation of energy efficiency is gradually increasing although the future trend is unclear.
The scatter plots of M o r a n s I (Figure 2 and Figure 3) and clustering graphs (Figure 4 and Figure 5) of LISA indicate that the spatial agglomeration of energy efficiency in China’s provinces tends to be stable. We can see that the energy efficiency of Chinese provinces is not randomly distributed. The energy efficiencies of regions with similar properties tend to gather together (High value of energy efficiency is adjacent to the high value of energy efficiency, low value of energy efficiency is adjacent to a low value of energy efficiency). The map is divided into four quadrants: The first quadrant (High-High) represents a space unit with a high energy efficiency value. Its neighboring provinces and cities are also high-efficiency units; The second quadrant (Low–High) represents the region with a low energy efficiency value. The energy efficiency values of its neighboring regions are at high level; the third quadrant (Low–Low) represents a low-energy efficiency space unit, and its neighboring provinces and cities are also low-efficiency units; The fourth quadrant (High–Low) represents a region with high energy efficiency value, but its neighboring region efficiency value is low.
Figure 1, Figure 2, Figure 3, Figure 4 and Figure 5 show that regions located in the first quadrant (High-High) are mainly the provinces and cities with higher economic development in the eastern coastal areas, such as Guangdong, Fujian, Zhejiang and Shanghai etc. The economic development of these provinces and cities is rapid. They have a positive influence on each other’s development. Provinces and cities in the third quadrant (Low–Low) are mainly distributed in northwest China such as Qinghai, Ningxia, Xinjiang, Gansu etc., where economic development is relatively backward with lower energy efficiency. The energy efficiency value of surrounding provinces and cities is also relatively low. Most provinces and cities in central China are located in the second quadrant (Low–High). In the fourth quadrant (High–Low) the energy efficiency values of various provinces and cities are interleaved, with little dependence on each other. Briefly, China’s provincial energy efficiency has obvious spatial autocorrelation. The energy efficiency value of a province is largely affected by the energy efficiency values of its neighboring provinces.

3.2. Influence Factors of Energy Efficiency

Base on results above, there are obvious spatial autocorrelation and spatial clustering phenomena in China’s provincial energy efficiency. The energy efficiency value of each province and city is affected by the neighboring regions’, which makes it reasonable to describe the data by a spatial autoregressive model. We take resource endowment (End), level of economic development (Eco), degree of government intervention (Gov), industrial structure (Ind), degree of opening to the outside world (Open), technical level (Tech) and energy prices (Price) as the main factors that affect energy efficiency. We use the spatial quantile autoregressive model (14) to analyze the influence of seven factors on energy efficiency values at the quantiles of τ = 0.1 , 0.25 , 0.5 , 0.75 , 0.9 . Before build models, at first we preprocess the data set. For linear regression analysis, hypothesis testing on the data is necessary. Here we need to consider the multicollinearity of all explanatory variables. We also consider the heteroscedasticity of data. From Figure 6, we found that all explanatory variables have no multicollinaerity. The hypothesis test result in Figure 7 shows that the data are of heteroscedasticity, which implies that ordinary linear regression is invalid while quantile regression is a better tool to deal with the data with heteroscedasticity.
As a comparison, we also give the estimators based on ordinary spatial autoregression (10). The values of total-factor energy efficiency(TFEE) for application analysis come from the study of Lu and Zhang [19]. The variable description and data processing methods are given in Table 2:
We establish the SQAR model as follows (23):
T F E E i t = λ ( τ ) d i t + β 0 ( τ ) + β 1 ( τ ) E n d i t + β 2 ( τ ) E c o i t + β 3 ( τ ) G o v i t + β 4 ( τ ) I n d i t + β 5 ( τ ) O p e n i t + β 6 ( τ ) T e c h i t + β 7 ( τ ) P r i c e i t + u i t ( τ ) ,
where the subscripts i ( = 1 , , 30 ) and t ( = 1 , , 19 ) denote regions and years respectively. β l ( τ ) ( l = 1 , 2 , , 7 ) are the parameters to be estimated. u i t is the random error term. λ ( τ ) is a coefficient of the spatial lagged factor d i t = j = 1 n w i j y j t . We treat d i t = j = 1 n w i j y j t as an endogenous variable and select Z i t that is defined in Section 2.3 as an instrumental variable. The parameters are estimated by using SQAR model at the following quantiles: τ = 0.1 , 0.25 , 0.5 , 0.75 , 0.9 .
We use the software of Stata 12.0 to estimate parameters of the model SQAR based on the instrumental variables method given in Section 2.3. The formulas (20)–(22) are the algorithm about the parameter estimation of the model (23). For the sake of convenience to repeat the solving process based on our data, we give the operation process as follows:
  • Computing the spatial effect W Y , where W is the spatial weight matrix. The spatial weight matrix is designed following the rook adjacency rule. Y is the response variable (TFEE).
  • Computing the instrumental variable W X , where W is the same weight matrix as the above and X is the matrix of explanatory variables.
  • Open Stata version 12.0 and input: ivqreg EE eco gov ind tech end price(WY=WX),q( τ )
    For a given quantile τ , we then can get the result of SQAR at the quantile.
In this work, we consider quantile τ = 0.1 , 0.25 , 0.5 , 0.75 , 0.9 . The results are shown in Table 3, Table 4, Table 5, Table 6 and Table 7.
From Table 3, Table 4, Table 5, Table 6 and Table 7, it can be seen that the spatial autoregressive coefficient λ is significantly negative at τ 0.75 and significantly positive for τ = 0.9 with the significant level 1%. This shows that at lower value of energy efficiency, the neighboring provinces and cities of a region have a negative influence on it, resulting in a low-low clustering case. Regions with higher energy efficiency tend to have a positive influence on the surrounding provinces and cities, which contributes to a high-high clustering distribution pattern. At the same time, this is also consistent with the conclusion of M o r a n s I > 0 . While in the SAR model (Table 8), the spatial effect coefficient λ = 0.3262 is significantly positive, and much bigger than that at quantile 0.9 of SQAR model. Obviously, the estimate of spatial effect in SAR is rougher and somewhat unreasonable. The heteroscedasticity of data may explain why this happens.
Considering the influence factors, we obtain the following results:
  • The coefficient of variable E n d is negative at all quantiles. Except for at the quantile 0.5, the coefficient of variable E n d is statistically significant at other quantiles. It shows that there is a negative correlation between the level of resource endowment and energy efficiency, that is, the situation of energy utilization efficiency in regions with more sufficient resource reserves is worse. The same result also can be obtained in the spatial autoregressive (SAR) model (See Table 8) but the p-value is 0.426, which means the negative influence of resource endowment level on energy efficiency is not statistically significant according to the result of SAR model. This shows that the regression results of the SAR model are not ideal.
  • The estimates of the parameter of the variable E c o are positive for all quantiles and the estimate is smaller at the upper quantile. This indicates that the level of economic development obviously promotes the improvement of energy efficiency, and with the increase of energy efficiency, the positive role of promotion is steadily weakening.
  • The estimates ( β 3 ^ ) of the parameter of variable G o v are significantly negative in SQAR model at all quantiles and in SAR model. The absolute values of estimates are smaller at the upper quantile. For example: τ = 0.1 , β 3 ^ = 1.6151 ; τ = 0.25 , β 3 ^ = 1.2024 ; τ = 0.5 , β 3 ^ = 1.0103 . This shows that the degree of government intervention has a negative correlation with energy efficiency. However, this negative influence gradually weakens as energy efficiency increases.
  • The influence of industrial structure on energy efficiency is quite similar to the influence of economic development level on energy efficiency. The influence of industrial structure on energy efficiency is greater.
  • When τ 0.75 , the degree of opening to the outside world has a positive effect on energy efficiency. while τ = 0.9 , the degree of opening to the outside has a negative effect on energy efficiency. The results of the SAR model show that there is a positive correlation between the degree of opening to the outside world and energy efficiency. Compared with the SQAR model, the conclusion shown by the SAR model is somewhat limited. It can not describe the full information. This also exposes the disadvantages of ordinary mean regression.
  • At lower quantile of energy efficiency ( τ 0.5 ) , there is a negative correlation between the level of technological development and energy efficiency, and the negative correlation at quantile 0.5 is most significant. At a higher quantile of energy efficiency, the effect of the technical level is positive and statistically significant at the 10% level. However, the SAR model shows that the technical level has a negative influence on energy efficiency. Obviously, the SAR model hides the tail information of the data. So the result of the SAR model can not fully describe the true distribution of data.
  • When τ 0.25 , the estimates of the parameter of P r i c e are significantly negative, indicating that at a lower level of energy efficiency, the higher the energy price the lower the energy efficiency value. When τ 0.5 , energy prices have a positive effect on energy efficiency.

3.3. Conclusions

Through the empirical analysis to the energy efficiency of Chinese 30 provinces and cities at different quantiles, we draw conclusion as follows.
The level of economic development and industrial structure have a significant role in promoting energy efficiency. There is a negative correlation between resource endowment and energy efficiency. The degree of opening to the outside world has no significant influence on energy efficiency. Only at a high level of energy efficiency can the technical level play a certain role in promoting. At a lower energy efficiency level, energy price has a significant negative influence on energy efficiency. By comparing with an ordinary spatial autoregressive model, this paper shows the advantages of spatial quantile autoregressive modes in the analysis process. SQAR model is more resistant to outliers and can describe the statistical distribution of variables in more detail.
Based on the above analysis, We offer a few suggestions that will help to promote energy efficiency.
  • Improve the economic level of each province and optimize the industrial structure. The government at all levels should take measures to allocate resources reasonably. China’s resources are concentrated in the central and western regions. These regions have unique resource conditions but the economic level is generally low. The government’s fiscal expenditure cannot play an active role. Therefore, the government needs to properly adjust the intensity of government intervention and allocate resources to promote the improvement of energy efficiency.
  • Provinces and cities with lower energy efficiency should put economic development and optimization of industrial structure in the first place. Regions with higher energy efficiency may continue to accelerate technological development. Under the lower efficiency level, both the technical level and the energy price have a negative influence on energy efficiency. At this time, rushing to raise the technical level and lowering the energy price may not help. So only by solving the most basic problem—economic development—can the development of other aspects be driven. For provinces and cities with higher energy efficiency, they have strong economic strength and a reasonable industrial structure. So it is better to develop and innovate science and technology constantly, making science and technology a powerful driving force for improving energy efficiency.

4. Discussion

The practical world is complex. In order to model the real world by using quantitative methods such as statistics tools, we have to make some assumptions in advance. For example, for ordinary linear regression, normal distribution and homoscedasticity are required. While, if the data does not follow a normal distribution and has heteroscedasticity, quantile regression is a good option. Compared with the existing references related to the analysis of factors influencing energy efficiency, the novelty of our model (23) is that it considers not only the spatial correlation of energy efficiency but also the heteroscedasticity of explanatory variables. Given a quantile τ and the input data of explanatory variables X in model (23), the output is the conditional quantile of efficiency under given X. It can characterize the distribution of energy efficiency. However, for ordinary regression, the output is the mean of energy efficiency, which is a rougher estimate. For example, given τ = 0 . 25 , input the data of Hubei province in 2016, we get the output 0.683 of model (23). That means when the values of explanatory variables change, the quantile at 0.25 of energy efficiency of Hubei is 0.683. Considering the energy of efficiency in the period of 1998–2016 of Hubei province, we get the 0.25 quantile is 0.687. The fitting effect is not bad from this single case. However, it does not mean our model is complete. The result of our model can not perfectly explain the practical world since it is based on some assumptions, although it is expected helpful to improve the energy efficiency for decision-makers for maintaining the sustainability of social development. It still needs to be improved. Here we only considered the spatial lagged effect and ignored the time-lagged effect. In our future work, based on the quantile method, we will consider both spatial lagged effect and time-lagged effect, i.e., consider the dynamic panel data.

Author Contributions

This research was designed and performed by all authors. The data were collected by Q.L., L.G. and X.W. contributed to statistical models and analysis, estimation of parameters. Q.L. contributed to writing—original draft. J.Z. contributed to writing—review and editing. All authors have read and agreed to the published version of the manuscript.

Funding

This work is supported by the Natural Science Foundation of Beijing Municipality (No.1192015).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data used in this study are available in China Statistical Yearbook, Statistical Yearbook of provinces and http://data.stats.gov.cn, http://www.tjcn.org.

Acknowledgments

The authors are grateful to the editor and referees for valuable comments which led to improvements in this work.

Conflicts of Interest

The authors declare no conflict of interest.

Sample Availability

Samples are available from the authors.

Abbreviations

The following abbreviations are used in this manuscript:
LISALocal indicators of spatial association
SAROrdinary spatial autogression
SQARSpatial quantile autoregression
DEAData envelopment analysis

References

  1. Ayres, R.U.; Turton, H.; Casten, T. Energy efficiency, sustainability and economic growth. Energy 2007, 32, 634–648. [Google Scholar]
  2. Mavi, N.K.; Mavi, R.K. Energy and environmental efficiency of OECD countries in the context of the circular economy: Common weight analysis for malmquist productivity. J. Environ. Manag. 2019, 247, 651–661. [Google Scholar]
  3. Miketa, A. Analysis of energy intensity developments in manufacturing sectors in industrialized and developing countries. Energy Policy 2001, 29, 769–775. [Google Scholar]
  4. Yu, X.; Moreno-Cruz, J.; Crittenden, J.C. Regional energy rebound effect:The impact of economy-wide and sector level energy efficiency improvement in Georgia, USA. Energy Policy 2015, 87, 250–259. [Google Scholar]
  5. Zheng, J.; Mi, Z.; Coffman, D.M.; Milcheva, S.; Shan, Y.; Guan, D.; Wang, S. Regional development and carbon emissions in China. Energy Econ. 2019, 81, 25–36. [Google Scholar]
  6. Tobler, W.R. A computer movie simulating urban growth in the detroit region. Econ. Geogr. 1970, 46, 234–240. [Google Scholar]
  7. Shen, N.; Liu, F. Spatial spillover, threshold characteristics and the economic growth effect of energy efficiency. China Popul. Resour. Environ. 2012, 5, 155–159. [Google Scholar]
  8. Makridou, G.; Andriosopoulos, K.; Doumpos, M.; Zopounidis, C. Measuring the efficiency of energy-intensive industries across European countries. Energy Policy 2016, 88, 573–583. [Google Scholar]
  9. Li, H.; Shi, J.F. Energy efficiency analysis on Chinese industrial sectors: An improved Super-SBM model with undesirable outputs. J. Clean. Prod. 2014, 65, 97–107. [Google Scholar]
  10. Liu, J.; Mao, J. Analysis of China’s provincial energy efficiency distribution and its influencing factors based on spatial measurement. J. Hunan Univ. Financ. Econ. 2014, 30, 133–140. [Google Scholar]
  11. Gao, Z.; Wang, Y. Regional division of energy productivity in China and analysis of influencing factors. J. Quant. Tech. Econ. 2006, 9, 46–57. [Google Scholar]
  12. Jiang, L.; Folmer, H.; Ji, M.; Tang, J. Energy efficiency in the Chinese provinces: A fixed effects stochastic frontier spatial Durbin error panel analysis. Ann. Reg. Sci. 2017, 58, 301–319. [Google Scholar]
  13. Li, K.; Lin, B. How to promote energy efficiency through technological progress in China? Energy 2018, 143, 812–821. [Google Scholar]
  14. Qiu, L.; Shen, Y.; Ren, W.; Yan, T. Regional differentiation and influencing factors analysis of energy efficiency in China. J. Nat. Resour. 2008, 23, 920–928. [Google Scholar]
  15. Zou, Y.; Lu, Y. Analysis of regional characteristics of energy efficiency in China based on spatial autoregressive model. Stat. Res. 2005, 10, 67–71. [Google Scholar]
  16. Koenker, R.; Bassett, G. Regression quantiles. Econometrica 1978, 46, 33–50. [Google Scholar]
  17. Koenker, R. Quantile regression for longitudinal data. J. Multivar. Anal. 2004, 91, 74–89. [Google Scholar]
  18. McMillen, D.P. Quantile Regression for Spatial Data; Springer: Berlin/Heidelberg, Germany, 2012. [Google Scholar]
  19. Lu, Q.; Zhang, J. Analysis of total-factor energy efficiency in China under low carbon constraint. In Proceedings of the 5th International Conference on Big Data and Information Analytics, Kunming, China, 8–10 July 2019. [Google Scholar]
  20. Moran, P.A.P. Notes on continuous stochastic phenomena. Biometrika 1950, 37, 17–23. [Google Scholar]
  21. Charnes, A.; Cooper, W.W.; Rhodes, E. Measuring the efficiency of decision making units. Eur. J. Oper. Res. 1978, 2, 429–444. [Google Scholar]
  22. Su, L.; Yang, Z. Research Collection School of Economics. Instrumental Variable Quantile Estimation of Spatial Autoregressive Models. Working paper. May 2011, pp. 1–35. Available online: https://ink.library.smu.edu.sg/soe_research/1074 (accessed on 12 January 2021).
  23. Chernozhukov, V.; Hansen, C. Instrumental quantile regression inference for structural and treatment effect models. J. Econom. 2006, 132, 491–525. [Google Scholar]
  24. Chernozhukov, V.; Hansen, C. Instrumental variable quantile regression: A robust inference approach. J. Econom. 2008, 142, 379–398. [Google Scholar]
  25. Paul Elhorst, J. Spatial Econometrics: From Cross-Sectional Data to Spatial Panels; Springer: Berlin/Heidelberg, Germany, 2014. [Google Scholar]
  26. Pearl, J. Causality: Models, Reasoning, and Inference; Cambridge University Press: Cambridge, UK, 2000. [Google Scholar]
  27. Matthew, H.; Carlos, L. A quantile regression approach for estimating panel data models using instrument variables. Econ. Lett. 2009, 104, 133–135. [Google Scholar]
Figure 1. Trend of M o r a n s I .
Figure 1. Trend of M o r a n s I .
Energies 14 00504 g001
Figure 2. Scatter plot of M o r a n s I of provincial energy efficiency, 2008.
Figure 2. Scatter plot of M o r a n s I of provincial energy efficiency, 2008.
Energies 14 00504 g002
Figure 3. Scatter plot of M o r a n s I of provincial energy efficiency, 2016.
Figure 3. Scatter plot of M o r a n s I of provincial energy efficiency, 2016.
Energies 14 00504 g003
Figure 4. Local indicators of spatial association (LISA) of provincial energy efficiency, 2008.
Figure 4. Local indicators of spatial association (LISA) of provincial energy efficiency, 2008.
Energies 14 00504 g004
Figure 5. LISA of provincial energy efficiency, 2016.
Figure 5. LISA of provincial energy efficiency, 2016.
Energies 14 00504 g005
Figure 6. Multicollinearity test.
Figure 6. Multicollinearity test.
Energies 14 00504 g006
Figure 7. Heteroscedasticity test.
Figure 7. Heteroscedasticity test.
Energies 14 00504 g007
Table 1. M o r a n s I value of China of provincial energy efficiency.
Table 1. M o r a n s I value of China of provincial energy efficiency.
YearsMoran’s Ip-ValueZ-Value
19980.3196830.005682.8493
19990.3107560.006112.7602
20000.2919090.008472.6051
20010.3042830.006802.7217
20020.3290980.004052.9311
20030.3913380.000863.8959
20040.3879400.000993.8566
20050.3705800.001333.6974
20060.3774010.001163.7608
20070.3707540.001323.7029
20080.3825700.001073.8162
20090.3467090.002193.4881
20100.3663660.001523.6782
20110.4008830.000724.0090
20120.3941390.000883.9527
20130.3709470.001423.7354
20140.3698000.001363.7229
20150.3568680.001623.5980
20160.3492470.001863.5197
Table 2. Variable description of the spatial quantile autoregression (SQAR) model.
Table 2. Variable description of the spatial quantile autoregression (SQAR) model.
Variable TypeVariableData Processing Method
Explained VariableTFEECalculated by DEA Model
E n d Energy production/Energy consumption
E c o Tertiary industry output/GDP × 100 %
G o v Fiscal expenditure/GDP × 100 %
Explanatory variables I n d Industrial output/GDP × 100 %
O p e n Export trade volume/GDP × 100 %
T e c h Foreign direct investment/GDP
P r i c e Price index of raw material, fuel and power(Base period is 1980)
Table 3. Estimation results of the SQAR model, τ = 0.1 .
Table 3. Estimation results of the SQAR model, τ = 0.1 .
ParameterEstimatep-ValueZ-ValueStandard Error 95 % Confidence Interval
λ −0.07920.000−55.570.0014[−0.0820,−0.0764]
β 0 0.22630.0062.720.0831[0.0634,0.3892]
β 1 −0.03230.000−7.020.0046[−0.0413,−0.0233]
β 2 0.96290.0006.630.1452[0.6783,1,2474]
β 3 −1.61510.000−10.560.1529[−1.9148,−1.3154]
β 4 1.11770.0007.670.1457[0.8322,1.4032]
β 5 0.15150.1851.330.1142[−0.0724,0.3754]
β 6 −0.00970.792−0.260.0367[−0.0816,0.0622]
β 7 −0.07880.016−2.420.0326[−0.1427,−0.0149]
Table 4. Estimation results of the SQAR model, τ = 0.25 .
Table 4. Estimation results of the SQAR model, τ = 0.25 .
ParameterEstimatep-ValueZ-ValueStandard Error 95 % Confidence Interval
λ −0.07840.000−82.990.0009[−0.0729,−0.0695]
β 0 0.48810.0008.870.0550[0.3803,0.5959]
β 1 −0.01090.000−3.590.0030[−0.0169,−0.0050]
β 2 0.69470.0007.230.0960[0.5065,0.8830]
β 3 −1.20240.000−11.890.1012[−1.4006,−1.0041]
β 4 1.01610.00010.540.0964[0.8272,1.2049]
β 5 0.10750.1551.420.0756[−0.0406,0.2556]
β 6 −0.03490.150−1.440.0243[−0.0825,0.0126]
β 7 −0.07220.001−3.350.0216[−0.1145,−0.0300]
Table 5. Estimation results of the SQAR model, τ = 0.5 .
Table 5. Estimation results of the SQAR model, τ = 0.5 .
ParameterEstimatep-ValueZ-ValueStandard Error 95 % Confidence Interval
λ −0.07120.000−55.570.0014[−0.0820,−0.0764]
β 0 0.76200.00015.210.0501[0.6639,0.8602]
β 1 −0.00240.393−0.850.0028[−0.0078,0.0031]
β 2 0.39920.0004.560.0875[0.2278,0.5707]
β 3 −1.01030.000v10.970.0921[−1.1908,−0.8297]
β 4 0.62820.0007.160.0878[0.4563,0.8002]
β 5 0.12720.0641.850.0688[−0.0077,0.2621]
β 6 −0.06110.006−2.760.0211[−0.1044,−0.0178]
β 7 0.00320.8690.170.0186[−0.0352,0.0417]
Table 6. Estimation results of the SQAR model, τ = 0.75 .
Table 6. Estimation results of the SQAR model, τ = 0.75 .
ParameterEstimatep-ValueZ-ValueStandard Error 95 % Confidence Interval
λ −0.04440.000−50.340.0009[−0.0462,−0.0427]
β 0 0.83370.00016.190.0515[0.7327,0.9346]
β 1 −0.00530.064−1.860.0028[−0.0109,0.0003]
β 2 0.36420.0004.050.0899[0.1879,0.5404]
β 3 −0.84540.000−8.920.0947[−1.0310,−0.6597]
β 4 0.39640.0004.390.0902[0.2196,0.5733]
β 5 0.04880.4900.690.0708[−0.0899,0.1875]
β 6 −0.02530.266−1.110.0227[−0.0698,0.0193]
β 7 0.00680.7380.330.0202[−0.0328,0.0463]
Table 7. Estimation results of the SQAR model, τ = 0.9 .
Table 7. Estimation results of the SQAR model, τ = 0.9 .
ParameterEstimatep-ValueZ-ValueStandard Error 95 % Confidence Interval
λ 0.00330.0023.080.0011[0.0012,0.0055]
β 0 0.84550.00013.390.0632[0.7217,0.9693]
β 1 −0.00960.006−2.740.0035[−0.0164,−0.0027]
β 2 0.23150.0362.100.1103[0.0153,0.4477]
β 3 −0.63570.000−5.470.1162[−0.8634,−0.4080]
β 4 0.23610.0332.130.1107[0.0192,0.4530]
β 5 −0.64210.459−0.740.0868[−0.2343,0.1059]
β 6 0.03340.0681.830.0279[−0.0212,0.0880]
β 7 0.00890.7200.360.0248[−0.0396,0.0574]
Table 8. Estimation results of the ordinary spatial autoregression (SAR) model.
Table 8. Estimation results of the ordinary spatial autoregression (SAR) model.
ParameterEstimatep-ValueZ-ValueStandard Error 95 % Confidence Interval
λ 0.32620.0003.540.0921[0.1457,0.5067]
β 0 0.36270.0003.830.0946[0.1773,0.5481]
β 1 −0.00130.426−0.800.0016[−0.0044,0.0018]
β 2 0.14230.0891.700.0836[−0.0216,0.3061]
β 3 −0.27220.014−2.460.1105[−0.4888,−0.0556]
β 4 0.25420.0601.880.1353[−0.0111,0.5194]
β 5 0.12100.2501.150.1051[−0.0850,0.3271]
β 6 −0.00810.500−0.670.0121[−0.0317,0.0155]
β 7 −0.00250.877−0.140.0177[−0.0371,0.0321]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Zhang, J.; Lu, Q.; Guan, L.; Wang, X. Analysis of Factors Influencing Energy Efficiency Based on Spatial Quantile Autoregression: Evidence from the Panel Data in China. Energies 2021, 14, 504. https://doi.org/10.3390/en14020504

AMA Style

Zhang J, Lu Q, Guan L, Wang X. Analysis of Factors Influencing Energy Efficiency Based on Spatial Quantile Autoregression: Evidence from the Panel Data in China. Energies. 2021; 14(2):504. https://doi.org/10.3390/en14020504

Chicago/Turabian Style

Zhang, Jinping, Qiuru Lu, Li Guan, and Xiaoying Wang. 2021. "Analysis of Factors Influencing Energy Efficiency Based on Spatial Quantile Autoregression: Evidence from the Panel Data in China" Energies 14, no. 2: 504. https://doi.org/10.3390/en14020504

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop