Next Article in Journal
The Safety of Transnational Imported Second-Hand Cars: A Case Study on Vehicle-to-Vehicle Crashes in Romania
Next Article in Special Issue
Organizational Sustainability Practices: A Study of the Firms Listed by the Corporate Sustainability Index
Previous Article in Journal
Dynamic Evaluation of the Energy Efficiency of Environments in Brazilian University Classrooms Using DEA
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Prediction of CO2 Emission in China’s Power Generation Industry with Gauss Optimized Cuckoo Search Algorithm and Wavelet Neural Network Based on STIRPAT model with Ridge Regression

School of Economics and Management, North China Electric Power University, Beijing 102206, China
*
Author to whom correspondence should be addressed.
Sustainability 2017, 9(12), 2377; https://doi.org/10.3390/su9122377
Submission received: 30 November 2017 / Revised: 14 December 2017 / Accepted: 18 December 2017 / Published: 20 December 2017
(This article belongs to the Special Issue Environmentally Sustainable Competitive Strategies)

Abstract

:
Power generation industry is the key industry of carbon dioxide (CO2) emission in China. Assessing its future CO2 emissions is of great significance to the formulation and implementation of energy saving and emission reduction policies. Based on the Stochastic Impacts by Regression on Population, Affluence and Technology model (STIRPAT), the influencing factors analysis model of CO2 emission of power generation industry is established. The ridge regression (RR) method is used to estimate the historical data. In addition, a wavelet neural network (WNN) prediction model based on Cuckoo Search algorithm optimized by Gauss (GCS) is put forward to predict the factors in the STIRPAT model. Then, the predicted values are substituted into the regression model, and the CO2 emission estimation values of the power generation industry in China are obtained. It’s concluded that population, per capita Gross Domestic Product (GDP), standard coal consumption and thermal power specific gravity are the key factors affecting the CO2 emission from the power generation industry. Besides, the GCS-WNN prediction model has higher prediction accuracy, comparing with other models. Moreover, with the development of science and technology in the future, the CO2 emission growth in the power generation industry will gradually slow down according to the prediction results.

1. Introduction

According to the International Energy Agency, even though the global economy continued to grow in 2016, carbon dioxide (CO2) emissions remained stable with no significant increase since 2015. This is closely related to the development of energy-saving technologies and the continuous development of clean energy. In 2016, the proportion of thermal power generation in China dropped to 74.4%. Besides, the proportion of low carbon power generation, such as hydropower, wind power, has gradually increased. According to the “13th Five-Year Plan” of power industry released by National Energy Administration of China, it is estimated that by 2020, clean energy power generation will further occupy space for thermal power generation and thus reduce CO2 emissions [1]. Electrical energy, as a kind of clean energy, do not produce any CO2 during use and transportation. But in the production and transmission and distribution facilities construction process, there will be CO2 emissions, and mainly concentrated in the production process [2].
In order to improve the accuracy of CO2 emission prediction in power generation industry and help to draw up more scientific carbon emission policy, it is necessary to establish a more precise CO2 emission prediction model for power generation industry. Some scholars predict the CO2 emission from the power generation side from the perspective of the whole life cycle [3]. They consider the CO2 emissions, not only from the power plants in the production process, but also from the construction of the power plants to the scrapped ones. In addition, the Stochastic Impacts by Regression on Population, Affluence and Technology (STIRPAT) model is applied to the analysis of the impact of CO2 emissions [4,5,6,7]. The STIRPAT model attributes the change in CO2 emissions to the combined effect of three factors: population size, economic level and level of technological development [8,9,10]. After the CO2 emission, population, per capita Gross Domestic Product (GDP), the corresponding level of technology data were regressed and fitted, the regression model of CO2 emission factors in this region can be obtained. This model can scientifically and quantitatively reflect the impact of each major factor on CO2 emissions [11]. However, in regression estimation of the model, the common least-squares regression is prone to multiple collinear defects. This leads to poor regression results and difficulties in achieving expectations. The use of ridge regression can reduce the effect of multicollinearity to a certain extent [12]. Many scholars have conducted research on the choice of prediction model. Yue Y et al. apply the Back Propagation (BP) neural network model to detect and monitor land degradation [13]. Kumar U et al. apply Grey-Markov model, Grey-Model with rolling mechanism, and singular spectrum analysis (SSA) to forecast the consumption of conventional energy in India [14]. The GM-ARIMA model, the GM (1, 1) model, and the ARIMA model are used to predict the energy demand of Shandong province [15]. A model to predict the strength of RSC is established using a least squares-support vector machine (LS-SVM) through grid search algorithm and Support Vector Machine (SVM) model [16]. However, the original algorithm of the above model cannot meet the precision demand of modern prediction. More and more optimization algorithms are also applied to the parameter optimization of prediction models [17,18,19,20]. The Wavelet Neural Network (WNN) model is proposed for the prediction of micro-grids [21]. On the basis of adding Cuckoo Search (CS) algorithm to optimize, it can further improve the prediction accuracy [22].
To sum up, the research scope of the existing research is relatively narrow. Additionally, the above models have not been well integrated into problem-solving. What is more, the existing CO2 emission prediction model cannot take many factors into account, and the precision is not high enough. This paper combines STIRPAT model with intelligent algorithm innovatively. At the meantime, influence factors of CO2 emission from power generation industry are analyzed based on STIRPAT model. On the basis of the established model, GCS-WNN model is used to predict the influence factors. And the future CO2 emission expectation on the power generation industry is got by plugging the forecasted result into the STIRPAT model.

2. Materials and Methods

2.1. STIRPAT Model

The IPAT model was first proposed by Professor Elich at Stanford University in the United States in the 1970s [23]. This model studies the factors that affect the environment. They believe that population size (P), per capita wealth (A) and technology level (T) are the most important factors affecting the environment (I), and the formula I = PAT is proposed [24].
Of course, there are many factors that affect the environment, but other factors except the above-mentioned are not the most important and direct. Those factors can indirectly affect the environment through direct factors such as population size, per capita wealth and technological level [25].
However, due to the limitation of the IPAT model, the variables in the model vary linearly with the proportions. In 1994, York et al. proposed the Stochastic Impacts by Regression on Population, Affluence and Technology (STIRPAT) model based on the IPAT [26,27]. The STIRPAT model preserves the multiplication structure of the IPAT model, taking population, wealth and technology as the key factors affecting the environment. The STIRPAT model equation is as follows:
I = a P b A c T d e
where I represents the impact on the environment, P means population size, A means wealth, T represents technical factors, and e is model error [28]. Before using the model, the logarithm of the two sides of the equal sign is obtained:
ln I = a + b ln P + c ln A + d ln T + e

2.2. Ridge Regression

Ridge regression (RR) is a biased estimation regression method that is used exclusively for collinearity data analysis [29]. In essence, it is an improved least square estimation method. RR is a more realistic and reliable regression method to get the regression coefficient by giving up unbiased characteristics, losing some information and reducing accuracy [30]. The fitting of RR to ill conditioned data is better than that of least squares [31]. For the multivariate linear regression model Y = X β + ε , the least-squares estimator of parameter β is:
β ^ = ( X X ) 1 X Y
However, when the multicollinearity exists, the least square parameter estimation is singular. Some characteristic roots are close to 0, and there is a large deviation between the estimated values and the observed ones. Thus, the traditional least squares estimation is not reliable enough. And RR adjusts the estimate to:
β ^ ( k ) = ( X X + k I ) 1 X Y
where I represents the unit matrix, k is the ridge parameter. When k = 0, the ridge estimate β ^ ( 0 ) is the least-squares estimation. When k varies between [ 0 , ) , β ^ ( k ) is a function of the ridge parameter k. Multiple function curves are plotted in a graph as a ridge plot. Finally, the k value which makes the β ^ ( k ) value stable is chosen as the ridge parameter.

2.3. Wavelet Neural Network Prediction Model

Wavelet Neural Network (WNN) is a new type of feed-forward network, which combines the wavelet transform with neural networks and uses the nonlinear wavelet base function as the transfer function in the hidden layer, which is able to realize signals forward-propagation and error back-propagation simultaneously [32,33]. The wavelet function is obtained by the translation and expansion of its mother wavelet, and the signal can be decomposed into a series of wavelet functions [34]. After shifting the base wavelet function ψ ( t ) by τ , multiply it with the target signal at different scales, namely:
f x ( a , τ ) = 1 a x (t) ψ ( t τ a ) d t , a > 0
where τ represents the parallel movement of the target, a refers to adjusting the distance from the target. The graph of WNN structure is illustrated in Figure 1.
Where x1, x2, …, xn represent the input value of WNN, y1, y2, …, ym represent the output value of WNN, weight values are wij and wjk. The output of the hidden layer is:
h ( j ) = h j ( i = 1 k w i j x i b j a j ) , j = 1 , 2 , , l
The output of the hidden layer node j is h ( j ) , wij is the weight of the input and the hidden layer, the translation factor of the wavelet basis function h j is expressed by b j , and the scaling factor is expressed by a j . The output layer is calculated as:
y ( k ) = j = 1 l w j k h ( j ) , k = 1 , 2 , , m
where wjk is the weight of the input layer and the hidden layer.
WNN is also able to keep the neural network close to the expected data output by gradient modifying parameters of the base wavelet function and the neural network weights [35]. Assume that the network error is e, the expected output is y n ( k ) , and the actual prediction value is y ( k ) , namely,
e = k = 1 m y n ( k ) y ( k )
Then the following equations can be obtained.
Δ w i + 1 = μ e w i
Δ b i + 1 = μ e b i
Δ a i + 1 = μ e a i
where μ represents the learning rate, and the revised parameters can be obtained by plugging μ into the following formulas.
w i + 1 = w i + Δ w i + 1
b i + 1 = b i + Δ b i + 1
a i + 1 = a i + Δ a i + 1

2.4. Gauss Optimized Cuckoo Search Algorithm

2.4.1. Cuckoo Search Algorithm

The Cuckoo Search (CS) algorithm can search the optimization much faster and more accurately by simulating the cuckoo searching for the random walking process of the egg laying hosts [36]. According to Yang et al.’s research, the Cuckoo Search algorithm has the following three rules [37]:
(1)
The number of eggs produced by a cuckoo per time is 1.
(2)
The host bird’s nest where high-quality eggs are located is the optimal solution and will be retained for the next generation.
(3)
The number of host nests is certain, and the probability that cuckoo eggs are found by nest owners is P a [ 0 , 1 ] .
During the search, cuckoo’s flight search path follows the Lévy distribution [38,39], namely:
x i ( t + 1 ) = x i ( t ) + α L ( λ ) , i = 1 , 2 , , n
In the formula, x i ( t + 1 ) and x i ( t ) are the bird’s nest positions of the (t + 1)-th and the t-th generation, n is the number of cuckoo, is the dot multiplication and L ( λ ) is the Lévy flight path. The relation between searching path and time is as follows:
L ( λ ) u = t λ ( 1 < λ 3 )

2.4.2. Gauss Optimization

In the traditional CS algorithm, the probability of finding cuckoo eggs and the step size α of position updating are fixed values, which leads to the problems of the weak global search ability, slow convergence speed and low precision of the algorithm [40]. Therefore, an improved cuckoo algorithm is proposed in this paper to update the values of P a and α dynamically. According to references, where the updating process is as follows:
P a ( t ) = P a max t ( P a max P a min ) N
α ( t ) = α max e ln ( α max α min ) × t N
In the formula, t and N are the number of current iterations and the total number of iterations, P a max and P a min are the maximum and minimum values of the detection probability, α max and α min are the maximum and minimum step coefficients, respectively.
However, due to the lack of search vitality and slow speed of CS algorithm, the optimization ability of CS algorithm can be improved effectively by adding Gauss perturbation. Assuming that the optimal location of the nest x i ( i ) , ( i = 1 , 2 , , n ), is obtained after the calculation of t times CS iterations. In order to prevent the next iteration of x i ( i ) and maintain the Gaussian disturbance, the next phase of x i ( i ) is searched. Supposed that the matrix p t = [ x 1 ( t ) , x 2 ( t ) , , x n ( t ) ] r is made up of the better position x i ( i ) of the bird’s nest, x i ( i ) is a d-dimensional vector, and the dimension of p t is d × n . Matrix p t combined with Gaussian perturbation is the basic step of GCS algorithm [41], namely:
p t = p t + a ε
where ε is a random matrix with the same order of p t , which follows N(0, 1) distribution and a is constant. In the search for a better nest position vitality at the same time, the position of the bird’s nest can be overextended easily because of the large random range of ε .
Therefore, the selection of suitable a is particularly important. After obtaining a reasonable set of p t and comparing it with each nest in p t , only a better nest position is reserved to obtain a better set of nest positions p t .

2.5. The Prediction Model of RR-STIRPAT-GCS-WNN

The emission of CO2 in power generation industry is mainly affected by population, per capita GDP, standard coal consumption, proportion of thermal power and other factors. The forecasting steps are as follow:
(1) Data collecting and processing
Collect the historical data of population, per capita GDP, standard coal consumption, proportion of thermal power from 1996–2016.
(2) Establish regression model
Use the logarithm of historical data to establish regression model between the influencing factors and the CO2 emission in power generation industry based on STIRPAT by ridge regression method.
(3) Prediction model training
Initialize parameters of prediction model. The input layer node is set as 2, while the hidden layer node is 2 × 2 + 1 = 5, and the output layer node is 1. The weight value, translation factor and expansion factor of the wavelet base are randomly generated. In this study, the learning rate is set as 0.1, while the number of iterations is 500, and the prediction accuracy is 0.99.
Then train the prediction model by plugging the historical data into the model and search the best parameter by using the Gauss Optimized Cuckoo Search Algorithm. Firstly, set the Nnest (number of birds’ nest) as 30, while Pa (probability of bird’s eggs by bird’s nest owner) is 0.2, and N (number of iterations) is 200. After that, randomly generate Nnest bird nest location W = (W1, W2, ..., Nnest) T. And each bird nest Wi has s parameters (s = the number of weights between input layer and hidden layer + the number of weights between hidden layer and output layer + the number of translation factors + the number of expansion factors). The predicted values of each bird’s nest were calculated, and the nest which has the smallest error in the 30 nests is found, marked as Wbest. Then Wbest retain to the next generation.
(4) Prediction of CO2 Emission
Use the RR-STIRPAT-GCS-WNN model to forecast the CO2 Emission in Power generation industry based on the historical data of population, per capita GDP, standard coal consumption and proportion of thermal power. The flow chart of this model is shown in the Figure 2.

3. Results

3.1. Analysis on Influencing Factors of CO2 Emission in Power Generation Industry

Based on the scholars’ research on the influencing factors of CO2 emission in power generation industry and the basic framework of STIRPAT model, 4 factors, such as population scale (million people), per capita GDP (yuan), standard coal consumption (g/kWh), and thermal power specific gravity (%), are considered as the key factors affecting the CO2 emission from the power generation industry. The above factors are introduced into the STIRPAT model:
ln I = a + b ln P + c ln A + d ln T 1 + e ln T 2 + f
where I represents CO2 annual emissions, P means population size, A is per capita GDP, T1 is power supply standard coal consumption, T2 is proportion of thermal power, b, c, d, e are various coefficients. According to the concept of elasticity, b, c, d and e represent the elastic coefficients of each index when both sides of the equation take partial derivatives at the same time. The raw data is shown in Table 1.
According to the data in the table, after taking the logarithm of the indicators, SPSS software is used to achieve the least squares regression model based on STIRPAT. The regression results are shown in Table 2, Table 3 and Table 4.
According to the regression results, the regression coefficient of the model is 0.992, the adjustment coefficient is 0.991, and the F value is 555.098. It shows that the equation passed the significance test and the fitting effect is good. However, the variance expansion factor VIF is greater than 10 except for ln T2, indicating the possibility of multicollinearity. Therefore, the collinearity tests were performed, and the results are shown in Table 5.
According to the result of collinearity diagnosis, 4 of the 5 eigenvalues are close to 0, the maximum condition number is 5382.577. It is shown that the equation has multicollinearity characteristics. In order to solve the problem of multicollinearity without reducing the number of variables, the ridge regression method is applied to solve the multiple collinearity considering the few variables of the model. The ridge trace is shown in Figure 3.
It can be seen from the above figure that when k = 0.4, the ridge parameter is basically stable, at this time R2 is still relatively large. Therefore, k = 0.4 is determined as ridge parameter for regression. Ridge regression results are shown in Table 6, Table 7 and Table 8.
As can be seen from the above table, R2 = 0.9607 and F = 103.9188 at k = 0.4, the equation passes the significance test and no longer exists multicollinearity. It shows that the model established by ridge regression is reasonable, and the regression equation is as follows:
ln I = 3.9242 ln P + 0.212 ln A 1.6506 ln T 1 0.1682 ln T 2 26.3976

3.2. Prediction of CO2 Emission in Power Generation Industry

3.2.1. Prediction of Influencing Factors Based on GCS-WNN

Based on the historical data of population, per capita GDP, standard coal consumption and proportion of thermal power from 1995 to 2016, historical data were trained and predicted by using GCS-WNN model, WNN model and GCS-SVM model respectively. The prediction results are shown in Figure 4.
By comparing the model predictions in Figure 4, it can be seen that the prediction accuracy of the GCS-WNN model is the highest among all the indicators, which is basically consistent with the original data. Therefore, the GCS-WNN model can be used to predict the future population, per capita GDP, standard coal consumption, proportion of thermal power and other indicators.
The prediction results of CO2 emission were obtained by plugging the predicted values of each influencing factor into the established RR-STIRPAT model. The prediction results are shown in the Figure 5.
According to the Figure 5 and Figure 6, the prediction accuracy of GCS-WNN model is the highest, which has best fitting effect with the original data. It shows that the RR-STIRPAT-GCS-WNN model is accurate and scientific in the prediction of CO2 emission.

3.2.2. Prediction of CO2 Emission Based on RR-STIRPAT Model

On the basis of historical data, GCS-WNN is used to predict 2018–2022 years population, per capita GDP, standard coal consumption, proportion of thermal power and other indicators. The results are shown in Table 9.
The prediction results of CO2 emissions in power generation industry in 2018–2022 years could be obtained by plugging the results of each factor into the RR-STIRPAT model. The prediction results are shown in Table 10 and Figure 7.
It can be seen from the Table 10 and Figure 7 that during the period of 2018–2022, the CO2 emissions from the power generation industry will still keep rising. However, the growth rate of CO2 emission will continue to decline due to the slowdown of population growth, the drop of standard coal consumption and the decrease of the proportion of thermal power in the future.

4. Discussion and Conclusions

Greenhouse gas emissions, represented by CO2, have led to a rise in global temperatures. Power generation industry is one of the key industries of CO2 emissions. To achieve the purpose of controlling CO2 emissions, it is necessary to scientifically and quantitatively predict the CO2 emissions of power generation industry.
(1)
The CO2 emission of power generation industry is affected by many factors. It is concluded that the key factors that directly affect CO2 emissions are population, per capita GDP, standard coal consumption and proportion of thermal power.
(2)
Based on the STIRPAT model, the CO2 emission factors analysis model of power generation industry is established. Besides, the collinearity test of the model shows that if the ordinary least squares algorithm is applied, the multicollinearity will be serious. However, the ridge regression method can solve this problem to a certain extent.
(3)
The WNN model is used to predict the influencing factors. In order to improve the convergence speed and prediction accuracy of the model, Gauss optimized cuckoo search algorithm is added into the model parameter optimization. Compared with other models, it is found that the optimized model has higher prediction accuracy.
(4)
The GCS-WNN model is used to predict the population, per capita GDP, standard coal consumption and the proportion of thermal power in the past 2018–2022 years. The predicted results of each factor are plugged into the RR-STIRPAT model, and finally the CO2 emission prediction value of 2018–2022 years power generation industry is obtained. It is predicted that the CO2 emission from the power generation industry will increase gradually in the next five years. However, with the slowdown of population growth and the development of power generation technology, CO2 emissions in power generation industry will grow slower and slower.
The advantages of the prediction model in this paper includes that the prediction precision is much higher and more scientific factors are taken into account. But limited to the availability of data, the factors considered in this study are not all-round, which also restricts the application of intelligent algorithms. In future research, we will continue to explore more accurate prediction methods and propose more scientific prediction models.

Supplementary Files

Supplementary File 1

Acknowledgments

This research was supported by the National Natural Science Fund (71471059).

Author Contributions

Weibo Zhao designed the research, established the forecasting model, completed the data arrangement and calculation, created the tables and figures, and finished the writing of the paper; Dongxiao Niu contributed valuable opinions during the manuscript writing. All authors read and approved the final manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. The Electric Power Development Planning “in 13th Five-Year” (2016–2020 (Full Text). Available online: http://news.bjx.com.cn/html/20161222/798873.shtml (accessed on 22 December 2016).
  2. Liu, L.; Zong, H.; Zhao, E.; Chen, C.; Wang, J. Can China realize its carbon emission reduction goal in 2020: From the perspective of thermal power development. Appl. Energy 2014, 124, 199–212. [Google Scholar] [CrossRef]
  3. Seo, Y.; Ide, K.; Kitahata, N.; Kuchitsu, K.; Dowaki, K. Environmental Impact and Nutritional Improvement of Elevated CO2 Treatment: A Case Study of Spinach Production. Sustainability 2017, 9, 1854. [Google Scholar] [CrossRef]
  4. Shahbaz, M.; Loganathan, N.; Muzaffar, A.T.; Ahmed, K.; Jabran, M.A. How urbanization affects CO2, emissions in Malaysia? The application of STIRPAT model. Renew. Sustain. Energy Rev. 2016, 57, 83–93. [Google Scholar] [CrossRef]
  5. Wang, C.; Wang, F.; Zhang, X.; Yang, Y.; Su, Y.; Ye, Y.; Zhang, H. Examining the driving factors of energy related carbon emissions using the extended STIRPAT model based on IPAT identity in Xinjiang. Renew. Sustain. Energy Rev. 2017, 67, 51–61. [Google Scholar] [CrossRef]
  6. Shahbaz, M.; Chaudhary, A.R.; Ozturk, I. Does urbanization cause increasing energy demand in Pakistan? Empirical evidence from STIRPAT model. Energy 2017, 122, 83–93. [Google Scholar] [CrossRef]
  7. Shuai, C.; Shen, L.; Jiao, L.; Wu, Y.; Tan, Y. Identifying key impact factors on carbon emission: Evidences from panel and time-series data of 125 countries from 1990 to 2011. Appl. Energy 2017, 187, 310–325. [Google Scholar] [CrossRef]
  8. Wang, P.; Wu, W.; Zhu, B.; Wei, Y. Examining the impact factors of energy-related CO2 emissions using the STIRPAT model in Guangdong Province, China. Appl. Energy 2013, 106, 65–71. [Google Scholar] [CrossRef]
  9. Li, W.; Sun, S. Air pollution driving factors analysis: Evidence from economically developed area in China. Environ. Prog. Sustain. Energy 2016, 35, 1231–1239. [Google Scholar] [CrossRef]
  10. Al-Mulali, U.; Fereidouni, H.G.; Lee, J.Y.M.; Normee Che Sab, C. Exploring the relationship between urbanization, energy consumption, and CO2 emission in MENA countries. Renew. Sustain. Energy Rev. 2013, 23, 107–112. [Google Scholar] [CrossRef]
  11. Zhang, C.; Liu, C. The impact of ICT industry on CO2 emissions: A regional analysis in China. Renew. Sustain. Energy Rev. 2015, 44, 12–19. [Google Scholar] [CrossRef]
  12. Xie, C.; Hawkes, A.D.; Lund, H. Estimation of inter-fuel substitution possibilities in China’s transport industry using ridge regression. Energy 2015, 88, 260–267. [Google Scholar] [CrossRef]
  13. Yue, Y.; Li, M.; Zhu, A.X.; Ye, X.; Mao, R.; Wan, J.; Dong, J. Land Degradation Monitoring in the Ordos Plateau of China Using an Expert Knowledge and BP-ANN-Based Approach. Sustainability 2016, 8, 1174. [Google Scholar] [CrossRef]
  14. Kumar, U.; Jain, V.K. Time series models (Grey-Markov, Grey Model with rolling mechanism and singular spectrum analysis) to forecast energy consumption in India. Energy 2010, 35, 1709–1716. [Google Scholar] [CrossRef]
  15. Li, S.; Li, R. Comparison of forecasting energy consumption in Shandong, China Using the ARIMA model, GM model, and ARIMA-GM model. Sustainability 2017, 9, 1181. [Google Scholar] [CrossRef]
  16. Xu, J.; Ren, Q.; Shen, Z. Prediction of the strength of concrete radiation shielding based on LS-SVM. Ann. Nucl. Energy 2015, 85, 296–300. [Google Scholar]
  17. Gao, Y.; Cheng, H.; Zhu, J.; Liang, H.; Li, P. The Optimal Dispatch of a Power System Containing Virtual Power Plants under Fog and Haze Weather. Sustainability 2016, 8, 71. [Google Scholar] [CrossRef]
  18. Nieto, P.J.G.; García-Gonzalo, E.; Fernández, J.R.A.; Díaz Muñizb, C. A hybrid PSO optimized SVM-based model for predicting a successful growth cycle of the Spirulina platensis, from raceway experiments data. Ecol. Eng. 2015, 81, 534–542. [Google Scholar] [CrossRef]
  19. Kim, H.; Baek, S.; Choi, K.; Kim, D.; Lee, S.; Kim, D.; Chang, H.J. Comparative Analysis of On- and Off-Grid Electrification: The Case of Two South Korean Islands. Sustainability 2016, 8, 350. [Google Scholar] [CrossRef]
  20. Zhang, F.; Dong, Y.; Zhang, K. A Novel Combined Model Based on an Artificial Intelligence Algorithm—A Case Study on Wind Speed Forecasting in Penglai, China. Sustainability 2016, 8, 555. [Google Scholar] [CrossRef]
  21. Gao, C.; Wang, K. Micro-grid power forecast based on MSC-WNN model. Electr. Meas. Instrum. 2015, 52, 68–73. [Google Scholar]
  22. Wang, Z.; Wang, C.; Wu, J. Wind Energy Potential Assessment and Forecasting Research Based on the Data Pre-Processing Technique and Swarm Intelligent Optimization Algorithms. Sustainability 2016, 8, 1191. [Google Scholar] [CrossRef]
  23. York, R.; Rosa, E.A.; Dietz, T. STIRPAT, IPAT and ImPACT: Analytic tools for unpacking the driving forces of environmental impacts. Ecol. Econ. 2003, 46, 351–365. [Google Scholar] [CrossRef]
  24. Yu, X.; Geng, Y.; Dong, H.; Ulgiati, S.; Liu, Z.; Liu, Z.; Ma, Z.; Tian, X.; Sun, L. Sustainability assessment of one industrial region: A combined method of emergy analysis and IPAT (Human Impact Population Affluence Technology). Energy 2016, 107, 818–830. [Google Scholar] [CrossRef]
  25. Dong, J.F.; Wang, Q.; Deng, C.; Wang, X.-M.; Zhang, X.-L. How to Move China toward a Green-Energy Economy: From a Sector Perspective. Sustainability 2016, 8, 337. [Google Scholar] [CrossRef]
  26. Zhang, Z.; Chen, X.; Heck, P. Emergy-Based Regional Socio-Economic Metabolism Analysis: An Application of Data Envelopment Analysis and Decomposition Analysis. Sustainability 2014, 6, 8618–8638. [Google Scholar] [CrossRef]
  27. Liu, Y.; Yang, Z.; Wu, W. Assessing the impact of population, income and technology on energy consumption and industrial pollutant emissions in China. Appl. Energy 2015, 155, 904–917. [Google Scholar] [CrossRef]
  28. Li, K.; Lin, B. Impacts of urbanization and industrialization on energy consumption/CO2, emissions: Does the level of development matter? Renew. Sustain. Energy Rev. 2015, 52, 1107–1122. [Google Scholar] [CrossRef]
  29. Marquardt, D.W.; Snee, R.D. Ridge Regression in Practice. Am. Stat. 1975, 29, 3–20. [Google Scholar]
  30. Hoerl, A.E.; Kennard, R.W. Ridge Regression: Applications to Nonorthogonal Problems. Technometrics 1970, 12, 69–82. [Google Scholar] [CrossRef]
  31. Hoerl, A.E.; Kennard, R.W. Ridge Regression: Biased Estimation for Nonorthogonal Problems. Technometrics 2000, 42, 80–86. [Google Scholar] [CrossRef]
  32. Feng, Y.; Cui, N.; Zhao, L.; Hu, X.; Gong, D. Comparison of ELM, GANN, WNN and empirical models for estimating reference evapotranspiration in humid region of Southwest China. J. Hydrol. 2016, 536, 376–383. [Google Scholar] [CrossRef]
  33. Ganjefar, S.; Tofighi, M. Single-hidden-layer fuzzy recurrent wavelet neural network: Applications to function approximation and system identification. Inf. Sci. 2015, 294, 269–285. [Google Scholar] [CrossRef]
  34. Wang, W.; Zhang, M.; Liu, X. Improved fruit fly optimization algorithm optimized wavelet neural network for statistical data modeling for industrial polypropylene melt index prediction. J. Chemometr. 2015, 29, 506–513. [Google Scholar] [CrossRef]
  35. Kasiviswanathan, K.S.; He, J.; Sudheer, K.P.; Tay, J.-H. Potential application of wavelet neural network ensemble to forecast streamflow for flood management. J. Hydrol. 2016, 536, 161–173. [Google Scholar] [CrossRef]
  36. Walton, S.; Hassan, O.; Morgan, K.; Brown, M.R. Modified cuckoo search: A new gradient free optimisation algorithm. Chaos Solitons Fract. 2011, 44, 710–718. [Google Scholar] [CrossRef]
  37. Ahmed, J.; Salam, Z. A Maximum Power Point Tracking (MPPT) for PV system using Cuckoo Search with partial shading capability. Appl. Energy 2014, 119, 118–130. [Google Scholar] [CrossRef]
  38. Nguyen, T.T.; Vo, D.N.; Truong, A.V. Cuckoo search algorithm for short-term hydrothermal scheduling. Appl. Energy 2014, 132, 276–287. [Google Scholar] [CrossRef]
  39. Li, X.T.; Yin, M.H. A hybrid cuckoo search via Lévy flights for the permutation flow shop scheduling problem. Int. J. Prod. Res. 2013, 51, 732–4754. [Google Scholar] [CrossRef]
  40. Zheng, H.; Zhou, Y. A novel Cuckoo Search optimization algorithm base on gauss distribution. J. Comput. Inf. Syst. 2012, 8, 4193–4200. [Google Scholar]
  41. Lai, J.; Liang, S.; Center, C. Application of GCS-SVM model in network traffic prediction. Comput. Eng. Appl. 2013, 49, 75–78. [Google Scholar]
Figure 1. Wavelet Neural Network (WNN) structure.
Figure 1. Wavelet Neural Network (WNN) structure.
Sustainability 09 02377 g001
Figure 2. The flow chart of ridge regression Stochastic Impacts by Regression on Population, Affluence and Technology Gauss Cuckoo Search Wavelet Neural Network (RR-STIRPAT-GCS-WNN) model.
Figure 2. The flow chart of ridge regression Stochastic Impacts by Regression on Population, Affluence and Technology Gauss Cuckoo Search Wavelet Neural Network (RR-STIRPAT-GCS-WNN) model.
Sustainability 09 02377 g002
Figure 3. Ridge Trace.
Figure 3. Ridge Trace.
Sustainability 09 02377 g003
Figure 4. Prediction results. (a) Prediction results of population; (b) Prediction results of per capita GDP; (c) Prediction results of standard coal consumption; (d) Prediction results of thermal power proportion.
Figure 4. Prediction results. (a) Prediction results of population; (b) Prediction results of per capita GDP; (c) Prediction results of standard coal consumption; (d) Prediction results of thermal power proportion.
Sustainability 09 02377 g004aSustainability 09 02377 g004b
Figure 5. Prediction results of CO2 emission.
Figure 5. Prediction results of CO2 emission.
Sustainability 09 02377 g005
Figure 6. Error boxplot of different models.
Figure 6. Error boxplot of different models.
Sustainability 09 02377 g006
Figure 7. Prediction of CO2 emission.
Figure 7. Prediction of CO2 emission.
Sustainability 09 02377 g007
Table 1. Raw data.
Table 1. Raw data.
YearCO2 Annual Emissions (million tons)Population (million people)Per Capita GDP (yuan)Standard Coal Consumption (g/kWh)Proportion of Thermal Power
1995915.361211.2150914120.795
19961005.781223.8958984100.813
19971022.691236.2664814080.815
19981021.781247.6168604040.795
19991053.641257.8672293990.798
20001082.861267.4379423920.822
20011145.201276.2787173850.799
20021300.081284.5395063830.809
20031542.091292.2710,6663800.827
20041822.601299.8812,4873760.815
20052036.691307.5614,3683700.819
20062321.181314.4816,7383670.827
20072538.771321.2920,5053560.830
20082625.221328.0224,1213450.805
20092793.571334.5026,2223400.803
20103099.391340.9130,8763330.792
20113537.781347.3536,4033290.813
20123600.121354.0440,0073260.781
20133749.211360.7243,8523210.789
20143904.661367.8247,2033180.752
20154042.311374.6250,2513180.737
20164162.961382.7153,8173120.716
Table 2. Model Summary.
Table 2. Model Summary.
ModelRR2Adjusted R2Std. Error of the Estimate
10.9960.9920.9910.05326
Table 3. ANOVA.
Table 3. ANOVA.
ModelSum of SquaresdfMean SquareFSignificance (Sig.)
1Regression6.29941.575555.0980.000
Residual0.048170.003
Total6.34721
Table 4. Coefficients.
Table 4. Coefficients.
ModelUnstandardized CoefficientsStandardized CoefficientstSig.Collinearity Statistics
BStd. ErrorBetaToleranceVIF
1(Constant)−16.67821.648 −0.7700.452
ln P−0.5471.529−0.039−0.3580.7250.03826.232
ln A1.2270.2051.7705.9950.0000.005194.998
ln T14.0201.6650.6952.4130.0270.005185.319
ln T21.1910.4240.0812.8130.0120.5331.878
Table 5. Collinearity diagnosis.
Table 5. Collinearity diagnosis.
ModelDimensionEigenvalueCondition IndexVariance Proportions
(Constant)ln Pln Aln T1ln T2
114.9761.0000.000.000.000.000.00
20.02015.7210.000.000.000.000.48
30.00436.9750.000.000.010.000.33
49.050 × 10−72344.9460.030.150.980.790.03
51.718 × 10−75382.5770.970.850.010.210.15
Table 6. Ridge Regression with k = 0.4.
Table 6. Ridge Regression with k = 0.4.
Mult RR2Adjusted R2SE
0.98015792720.96070956230.95146475340.1211139063
Table 7. ANOVA.
Table 7. ANOVA.
dfSSMSF ValueSig. F
Regress4.0006.0971.524103.91881270.0000000
Residual17.0000.2490.015
Table 8. Variables in the Equation.
Table 8. Variables in the Equation.
BSE(B)BetaB/SE(B)
ln P3.924236250.306900790.2779233312.78666051
ln A0.212042470.011698940.3058668118.12492455
ln T1−1.650552410.09696411−0.28521636−17.02230296
ln T2−0.168153110.51870654−0.01150223−0.32417772
Constant−26.397596733.479364560.00000000−7.58690166
Table 9. Prediction results of influencing factors.
Table 9. Prediction results of influencing factors.
YearPopulation (million people)Per Capita GDP (yuan)Standard Coal Consumption (g/kWh)Proportion of Thermal Power
20181379.1259,894307.47950.6879
20191380.1561,761304.96420.6698
20201381.5664,597301.46810.6514
20211383.6966,717299.14970.6498
20221385.1469,486297.34760.6389
Table 10. Prediction of CO2 emission.
Table 10. Prediction of CO2 emission.
YearCO2 (Unit: million tons)
20184347.48
20194468.63
20204638.16
20214760.55
20224883.71

Share and Cite

MDPI and ACS Style

Zhao, W.; Niu, D. Prediction of CO2 Emission in China’s Power Generation Industry with Gauss Optimized Cuckoo Search Algorithm and Wavelet Neural Network Based on STIRPAT model with Ridge Regression. Sustainability 2017, 9, 2377. https://doi.org/10.3390/su9122377

AMA Style

Zhao W, Niu D. Prediction of CO2 Emission in China’s Power Generation Industry with Gauss Optimized Cuckoo Search Algorithm and Wavelet Neural Network Based on STIRPAT model with Ridge Regression. Sustainability. 2017; 9(12):2377. https://doi.org/10.3390/su9122377

Chicago/Turabian Style

Zhao, Weibo, and Dongxiao Niu. 2017. "Prediction of CO2 Emission in China’s Power Generation Industry with Gauss Optimized Cuckoo Search Algorithm and Wavelet Neural Network Based on STIRPAT model with Ridge Regression" Sustainability 9, no. 12: 2377. https://doi.org/10.3390/su9122377

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop