3.3.1. Analysis and Screening of Factors Affecting Carbon Dioxide Emissions
Based on this paper’s selection of 26 influencing factors from the seven aspects of economy, industrial structure, energy, society, technology, policy, and environment (examples of which are the proportion of primary industry, TCI index, afforestation area, GDP, urbanization rate, foreign investment, energy intensity, energy structure, population, and sulfur dioxide emissions), the correlation coefficients between these influencing factors are calculated. Thus, the correlation coefficient matrix between them is obtained. This paper visualizes the correlation coefficient matrix and produces a thermodynamic diagram as shown in
Figure 3.
Figure 3 clearly shows the linear relationship between influencing factors. Traditional linear models cannot analyze factors with multicollinearity, but machine learning models can solve this problem. Therefore, this paper uses random forest and ridge regression models to identify key influencing factors and predict future carbon dioxide emissions trends.
- (1)
Random Forest Results
Through the random forest model, the VIM values that represent the importance of factors are output; the importance of influencing factors is shown in
Figure 4.
According to
Figure 4, the importance of influencing factors over 0.04 is selected to predict carbon dioxide emissions; these are population, GDP, nitric oxide emissions, research and development (R&D) funding, air freight volume, methane emissions, proportion of primary industry, patent authorization volume, energy consumption, urbanization rate, economic complexity, renewable energy generation, national education funding, total import and export volumes, proportion of real estate industry, and foreign investment. Among these, population and GDP have the greatest impact on carbon dioxide emissions. The larger the population size, the greater the energy demand, and the more carbon dioxide emissions are generated. Coupled with the dependence of economic growth on energy consumption, the trend of carbon dioxide emissions has been increasing. The screened influencing factors can more accurately predict the trend of carbon dioxide emissions.
- (2)
Ridge Regression Results
To test the reliability of the results of random forest screening, ridge regression was used to analyze various influencing factors. After the addition of the 26 factors of carbon dioxide emissions to the model, the regression coefficients between patent authorization volume and renewable energy generation were not significant; other significant factors match the results of random forest screening. Therefore, the non-significant factors mentioned above were excluded, and other significant factors are the same as those used in the random forest screening.
By incorporating the selected factors into the model for regression and choosing the appropriate K value, the impact of each factor on carbon dioxide emissions can be identified. The results are shown in
Table 4.
R2 is 0.994, the adjusted R2 is 0.989, the F-statistic is significant at the 0.01 level, and the overall fitting effect is good. These results indicate that the selected factors have high interpretability for carbon dioxide emissions.
As shown in
Table 4, the impact of economic complexity on carbon dioxide emissions is relatively large, and the improvement in policy stability promotes an increase in economic complexity [
43]. The division of labor between regions and industries is clear, thus promoting efficient economic development. In the short run, carbon dioxide emissions will increase, but in the long run, carbon dioxide emissions will inevitably be controlled. Because the chosen urbanization level reflects the population urbanization rate, and there is an inverted U-shaped relationship between urbanization and carbon emissions [
45], an effective urbanization process can be gradually adopted to suppress the increase in carbon emissions. Energy consumption should be reduced, the stable growth of GDP and population should be controlled, and high-energy-consuming industries should be upgraded. R&D funding has a positive impact on carbon dioxide emissions, which follows a non-linear relationship [
46]. An increase in R&D intensity can only reduce carbon dioxide emissions when R&D intensity reaches a certain level. Therefore, it is necessary to increase R&D investment and national education funding and to suppress carbon dioxide emissions through inflection points.
Although the relationship between influencing factors and carbon emissions has been obtained, the fitting effect of the model and its prediction accuracy are insufficient. The results of ridge regression prediction and true values are shown in
Figure 5.
Therefore, the ridge regression model is more suitable for analyzing the relationship between influencing factors and carbon dioxide emissions. This paper searches for a better prediction model to predict the dynamic evolution trend of carbon dioxide emissions.
3.3.2. Model Selection
Data from 1990 to 2015 were selected to train the model, and data from 2016 to 2019 were selected as the test set. Then, carbon dioxide emissions data were predicted using the SVR, SSA-SVR, LSTM, and SSA-LSTM models, where SSA-SVR and SSA-LSTM optimize the parameters of the SVR and LSTM models using SSA. The MAE, MSE, MAPE, and
R2 of the four models are shown in
Table 5.
Based on the four evaluation indicators shown in
Table 5, the MSE, MAE, and MAPE values of SSA-LSTM are the smallest, and its
R2 is the closest to 1, reaching 0.9708. The SSA-LSTM model achieves the best performance on all four evaluation indicators and achieves higher predictive accuracy. Therefore, this paper selects the SSA-LSTM model for prediction. The parameter optimization of the SSA-LSTM model and the model fitting effect from 1990 to 2019 are shown in
Figure 6,
Figure 7 and
Figure 8.
Figure 8 shows that the SSA-LSTM model achieves a good fitting effect in predicting carbon dioxide emissions and can more accurately capture the situation and values of the carbon peak, which meets the research needs of this paper.
3.3.3. Scenario Setting
The meaning of control indicators refers to their ability to control corresponding indicators through macro policy guidance, environmental regulations, and other policy measures. Non-controlling indicators are more influenced by past data and the market, and they will develop according to a certain level of inertia. Therefore, it is advisable to use appropriate models for their prediction. This paper uses the G (1,1) model to predict non-controlling indicators.
The setting of carbon peak scenario prediction parameters requires the consideration of multiple factors. The following presents several parameters that can be macro controlled by policies.
- (1)
GDP
The prediction of carbon peak scenarios needs to consider the economic development trend. As China is still a developing country, pursuing moderate economic growth for a considerable period of time is a necessary condition to achieve national prosperity and strength. Therefore, the dual carbon target cannot be achieved by reducing the economic growth rate, making it necessary to set a reasonable annual GDP growth rate. According to the “Research Report on Carbon peak in China before 2030” [
47], it has been estimated that China will achieve carbon peak in the 14th Five Year Plan and 15th Five Year Plan period. The average GDP growth rate exceeded 5% every year, and the GDP of 2035 was projected to double compared to the 2020 level; therefore, China’s average annual GDP growth rate was set to 5.5% from 2021 to 2030 and to 4.5% from 2030 to 2040. This setting is used as the benchmark scenario. According to the World and China Energy Outlook 2060 [
48], the average annual growth rate of China’s GDP before 2025 is about 4.8%, and from 2025 to 2060, it is 3.1%, which is considered a low-carbon scenario. The average annual growth rate of GDP from 2021 to 2030 was set to 6.89%, and from 2030 to 2040, it was set to 6.31% [
49], which is considered an extensive scenario.
- (2)
Population
The population growth rate also impacts carbon emissions; therefore, it is necessary to consider expected changes in the population growth rate. Chen et al. [
50] used the population growth rate from 2020 to 2030 to predict the total population in the future, predicting the total population of China to reach 1412 million in 2025, 1424 million in 2030, and 1440 million in 2040. This is considered the benchmark scenario. The China Petroleum Economic and Technological Research Institute [
48] predicted that the population of China will reach a peak of 1.43 billion people by 2030, followed by a slow decline, and a decrease to 1.3 billion people by 2060. In this paper, this is used as the low-carbon scenario. The analysis of designated scenarios was conducted according to the National Population Development Plan [
51]. In this paper, the population is set to reach 1448 million in 2025, 1488 million in 2030, 1529 million in 2035, and 1567 million in 2040, which represents an extensive scenario.
- (3)
Urbanization rate
Because of the inverted U-shaped relationship between urbanization and carbon emissions [
9], effective urbanization processes can be gradually adopted to suppress the increase in carbon emissions. According to the “fourteen” proposed by Ou et al. [
52], China’s urbanization rate will generally follow a stable and slowing trend during the Five Year Plan period. The average annual growth rate from 2020 to 2025 will remain at around 0.71%, and it is expected to reach around 72.2% in 2025, 73.5% in 2035, and 74.9% in 2040 [
49]. In this paper, this serves as the benchmark scenario. Under the new pattern of “dual circulation”, the urbanization rate continues to increase fueled by new urbanization policies. It is expected that the urbanization rate will reach around 70% in 2030 and further increase to around 80% in 2060, which is considered a low-carbon scenario. According to the benchmark scenario and current data statistics, the urbanization rate in 2020 was 63.9%, the average annual growth rate of urbanization remained at 1.32% from 2020 to 2025, and the urbanization rate will reach around 74% in 2030, 76.4% in 2035, and 78% in 2040. This serves as the extensive scenario.
- (4)
Energy consumption
Carbon emissions are closely related to energy consumption; therefore, to substantially control carbon dioxide emissions, it is necessary to control energy consumption. The 14th Five Year Plan requires a decrease of 13.5% in energy consumption per unit of GDP by 2025 and requires energy consumption to be at the same level as that of the 13th Five Year Plan. In the benchmark scenario, this paper sets an annual growth rate of −3.40% for energy intensity from 2020 to 2030 and a rate of −2.7% from 2030 to 2039. The low-carbon scenario is characterized by an average annual growth rate of −3.90% from 2020 to 2030 and a rate of −3.2% from 2030 to 2039. The average annual growth rate of energy intensity is −2.5% from 2020 to 2030 and −2% from 2020 to 2033, which is considered an extensive scenario. Energy consumption can be obtained by setting energy intensity and GDP and by increasing the efforts of low-carbon transformation in the energy industry. Scenario parameter settings are shown in
Table 6,
Table 7 and
Table 8 3.3.4. Prediction Results
In these three scenarios, this paper can obtain the carbon peak time and peak value under the prediction of the SSA-LSTM model. These are shown in
Figure 9 and
Figure 10.
The carbon peak time of the benchmark scenario is 2031, with a peak of 12.346 billion tons. The carbon peak time of the low-carbon scenario is 2030, with a peak of 11.962 billion tons. In the extensive scenario, the peak time is 2037, with a peak of 13.291 billion tons, which is 945 million tons higher than the peak of the benchmark scenario and 1329 million tons higher than the peak of the low-carbon scenario. The high peak will also have a considerable impact on the environment.
Figure 9 shows that under the three scenarios, the peak value is relatively high and the post peak change rate is small; the decrease in carbon dioxide emissions is relatively stable, and without strict constraints on the economy and energy, it will be challenging for China to achieve carbon neutrality by 2060. Moreover,
Figure 10 shows that the change rates before and after the peak value differ between the three scenarios. In the benchmark scenario, the average annual growth rate before the peak is 0.7%, and the average annual growth after the peak is −0.2%. In the low-carbon scenario, the average annual growth rate before the peak is 0.55%, and the average annual growth after the peak is −0.4%. In the extensive scenario, the average annual growth rate before the peak is 0.39%, and the average annual growth after the peak is −0.15%. The increasing rate before reaching the peak is higher than the rate of decrease after reaching the peak. By constraining factors such as the economy, energy consumption, population, and urbanization rate, the degree of carbon dioxide emissions reduction is limited; therefore, the level of technological innovation should be improved and carbon dioxide emissions should be reduced from the perspective of energy use efficiency.
Therefore, we consider the impact of GDP growth, population growth, energy consumption, and technological progress on carbon dioxide emissions, because the energy consumption in this paper is calculated from energy intensity, and energy consumption is linked to technological progress; so based on the baseline scenario, low-carbon scenario, and extensive scenario, this paper sets the scenarios of low economic development, low population growth, and technological progress, except for the GDP (population or energy intensity) in the low-carbon scenario, other factors are set as in the baseline scenario, so as to compare with the baseline scenario. The low-carbon scenario and the extensive scenario were compared to observe the impact of changes in different factors on carbon dioxide emissions. Six scenario settings and results are shown in
Table 9 and
Table 10.
According to
Figure 11, in the low economic development scenario, GDP is set according to the low-carbon scenario, while other parameters are set according to the baseline scenario. The peak and peak times were similar to the baseline scenario and did not change significantly. In the low population growth scenario, the population is set according to the low-carbon scenario, while the other parameters are set according to the baseline scenario. The peak time is in 2030, with a peak of 12.372 billion tons, similar to the baseline scenario, with no significant change in the peak. Under the technological progress scenario, energy intensity is set according to the low-carbon scenario, and other parameters are set according to the baseline scenario, with carbon dioxide emissions peaking at 11.98 billion tons in 2034, similar to the low-carbon scenario. It can be concluded that energy intensity is a key factor in reducing the peak, i.e., technological progress; therefore, the level of technological innovation should be increased from the perspective of energy efficiency to reduce carbon dioxide emissions.