An EG-Tree Model Incorporating Spatial Heterogeneity for Analyzing Multifactorial Coupling Effects on Carbon Emissions Across Industries and Regions in China
Abstract
:1. Introduction
2. Model Construction
2.1. Construction of EG-Tree Model
2.2. Impact Factor Importance Scoring Algorithm
2.3. Evaluation of Model Accuracy
3. Data Specification
3.1. Data Description
3.2. Calculation Method
4. Experiment and Result Discussion
4.1. Carbon Emission Intensity of Various Industries Considering Regional Heterogeneity
4.1.1. Comparison of Prediction Results from Different Models
4.1.2. Impact Factors of Carbon Emission Intensity in Different Regions Across Various Industries Quantified Using the EG-Tree Model
4.2. A Coupled Analysis of the Diverse Factors Contributing to Carbon Emissions in Various Industries, Taking into Account Regional Heterogeneity
4.3. Model Error Analysis
5. Conclusion and Policy Recommendations
5.1. Conclusions
- (1)
- The EG-Tree model constructed in this study combined multiple weak classifiers through the gradient boosting method and introduced a regularization term to prevent model overfitting based on the decision tree model. In the task of achieving associated feature analysis, compared with the decision tree model, this model can better alleviate the overfitting problem and accurately identify the main impact factors leading to carbon emissions. The average model fitting precision of the EG-Tree model, as well as the degree of closeness between its predicted values and the carbon emission intensity values calculated using the carbon emission measurement dataset, was 0.30;
- (2)
- There are spatial differences in the distribution of carbon emissions across different regions, mainly influenced by factors such as regional economic development characteristics, industrial structure, natural climate, regional development strategies, and energy types. The total carbon emissions in East China and North China are relatively large, while the total carbon emission intensity in Northeast China and Central China is relatively small;
- (3)
- Industrial sector and transportation and construction industry have relatively large carbon emissions, accounting for 36.58% and 20.17% of the total national carbon emissions, respectively, while the purchased electricity industry and heating and cooling industry have relatively smaller shares at 9.45% and 9.29%, respectively;
- (4)
- Both East China and North China have relatively large carbon emissions in the industrial sector and transportation and construction industry, accounting for 36.20%, 20.09% and 36.93%, 20.06% of the total carbon emissions of all industries in these regions, respectively; Northeast China and Central China have relatively small carbon emissions in the heating and cooling industry and purchased electricity industry, accounting for 8.75%, 9.58% and 8.87%, 9.53% of the total emissions of all industries in these regions, respectively.
- (5)
- North China, East China, and Northwest China have a relatively high degree of dependence on energy sources such as coal, coke, crude oil, gasoline, kerosene, diesel oil, and fuel oil, resulting in significant carbon emissions, with their average carbon emission intensity impact factors exceeding 5300. All regions exhibit a relatively low degree of dependence on energy sources such as natural gas and liquefied petroleum gas, leading to smaller carbon emissions, with average impact factors below 5100. The high-dependency energy sources vary across Northeast China, Central China, South China, and Southwest China;
- (6)
- Regional carbon emissions are strongly correlated with the energy structure, with dependence on traditional energy sources being the core issue. High carbon emissions in East China and North China stem from their reliance on coal and kerosene (the average carbon emission intensity impact factor for coal in North China was 7442). Northwest China rank second in total carbon emissions due to its dependence on fuel oil (the average carbon emission intensity impact factor for fuel oil in Northwest China was 8580). Although Southwest China has a relatively high proportion of renewable energy, its dependence on coke and coal still drive up carbon emissions in various industries;
- (7)
- In the southern region of Southwest China, due to the high proportion of hydropower, the intensity of power-related carbon emissions is the lowest. However, the overall average impact factor of carbon emission intensity in the purchased electricity industry remained relatively high at 7290, reflecting the complexity of cross-regional energy allocation and the correlation between regional energy structure and carbon emissions.
5.2. Policy Recommendations
- (1)
- Concentrate on high-emission industries and carry out precise carbon control strategies. In the industrial industry, it is necessary to accelerate the cleanliness of the energy structure, promote direct connections between high-energy-consuming enterprises and renewable energy power generation, and establish a linkage mechanism for electricity prices. Penalty electricity rates should be imposed on enterprises using high-emission power to compel the use of clean energy. The transportation and construction industry should promote electrification technologies and, in combination with the differences in regional power carbon emission factors, guide high-energy-consuming links to shift towards low-carbon power regions;
- (2)
- Differentiated energy transformation paths should be adopted to break the inertia of resource dependence. Regions dependent on coal (such as North China and Southwest China) need to promote the upgrading of clean coal technologies. North China should focus on developing coal clean reuse and carbon capture technologies, while Southwest China should introduce funds and technologies through ecological compensation mechanisms to reduce dependence on coke and coal. Fuel oil-dependent regions (such as South China and Northwest China) can develop alternative projects like offshore wind power and tidal energy to reduce fuel oil consumption;
- (3)
- Improve the data support and regulatory system to enhance the accuracy of carbon accounting. Establish a nationally unified energy digital intelligent platform to dynamically integrate power generation data from thermal power, wind power, photovoltaic power, etc., and optimize inter-provincial power dispatching in combination with regional power grid carbon emission factors;
- (4)
- Coordinate international experience with regional practice to promote policy innovation. By drawing on the European carbon market and green certificate trading mechanism, explore the “Chinese-style collaborative trading model of green certificates and carbon allowances” to enhance enterprises’ motivation for low-carbon transformation. Pilot demonstration projects for coal substitution and clean energy transformation should be first carried out in pilot regions such as Northwest China and Northeast China, so as to form replicable technical and managerial experiences.
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Nomenclature
Terminology and Symbol Names | meaning | Unit |
L | Loss function | The square of (10,000 tons of carbon dioxide) |
Fm−1(xi) | The model’s predicted values from the (m−1)-th iteration. | 10,000 tons of carbon dioxide |
xi | The feature vector of the i-th sample. | 10,000 tons |
gi | The first-order derivative (gradient) of the loss function with respect to the current prediction. | 10,000 tons of carbon dioxide |
hi | The second-order derivative (Hessian) of the loss function with respect to the current prediction. | The square of (10,000 tons of carbon dioxide) |
T | The number of leaf nodes in a decision tree | Dimensionless |
f | Regression function, prediction and structure of a single tree | Dimensionless |
wj | Regression coefficient of leaf node j | Dimensionless |
Regularization hyperparameter that controls the penalty strength for the number of leaf nodes | Dimensionless | |
V | Impact score | Dimensionless |
j | Node | Dimensionless |
i | Sample | Dimensionless |
I(j) | The set of samples mapped to leaf node j | Set of sample indices |
q(xi) | Node mapping function (decision path) for sample xi | node number |
Gj | The sum of the first-order gradients (partial derivatives of the loss function with respect to the predicted values) of the samples at node j | 10,000 tons of carbon dioxide |
Hj | The sum of second-order gradients (second-order partial derivatives of The loss function with respect to the predicted values) for the samples at node j | The square of (10,000 tons of carbon dioxide) |
The optimal weight for node j (obtained by minimizing the objective function) | Dimensionless | |
V* | Global aggregated score for the optimal weight of leaf nodes in the EG-Tree model | Dimensionless |
MAE | The mean absolute error | Dimensionless |
RMSE | Root mean squared error | Dimensionless |
MAPE | Mean absolute percentage error | Dimensionless |
The actual value | 10,000 tons of carbon dioxide | |
The predicted value | 10,000 tons of carbon dioxide | |
n | The total number of samples | |
CE | The amount of carbon dioxide emissions | 10,000 tons of carbon dioxide |
i | The i-th type of fossil fuel | |
Ei | The consumption of the i-th type of fossil fuel | 10,000 tons |
NCVi | The average low calorific value of the i-th type of fossil fuel | GJ/t orGJ/104Nm3 |
CEFi | The carbon emission coefficient of the i-th type of fossil fuel | Dimensionless |
References
- Zhu, T.; Li, C.; Zhang, X.; Yuan, B.; Wang, M.; Zhang, X.; Xu, X.; Sun, Q. Research Progress on Plasma-Assisted Catalytic Dry Reforming of Methane. Atmosphere 2025, 16, 376. [Google Scholar] [CrossRef]
- Li, Q.; Wang, P.; Wang, F.; Zhang, Y.; Wang, H.; Xu, Q.; Xu, M.; Bai, L. Low-Carbon Production in China’s Iron and Steel Industry: Technology Choices, Economic Assessment, and Policy. Atmosphere 2025, 16, 252. [Google Scholar] [CrossRef]
- International Energy Agency. Global Carbon Emissions Report 2023; National Science and Technology Library: Beijing, China, 2024.
- Fan, S.; Xu, G.; Zhao, Q.; Wu, J. Power system optimization with energy storage and carbon neutrality scenario analysis of China. China Environ. Sci. 2024, 44, 2833–2846. [Google Scholar]
- Wang, Z.; Gu, K.; Yu, H. The Internal Heterogeneity of Carbon Emissions in Megacities: A Case Study of Beijing, China. Atmosphere 2025, 16, 80. [Google Scholar] [CrossRef]
- Xu, G.; Schwarz, P.; Yang, H. Adjusting energy consumption structure to achieve China’s CO2 emissions peak. Renew. Sustain. Energy Rev. 2020, 122, 109737. [Google Scholar] [CrossRef]
- Wu, H.; Xu, L.; Ren, S.; Hao, Y.; Yan, G. How do energy consumption and environmental regulation affect carbon emissions in China? New evidence from a dynamic threshold panel model. Resour. Policy 2020, 67, 101678. [Google Scholar] [CrossRef]
- Li, Y.; Zhang, S. Spatio-temporal evolution of urban carbon emission intensity and spatiotemporal heterogeneity of influencing factors in China. China Environ. Sci. 2023, 43, 3244–3254. [Google Scholar]
- Xiao, M.; Peng, X. Decomposition of carbon emission influencing factors and research on emission reduction performance of energy consumption in China. Front. Environ. Sci. 2023, 10, 1096650. [Google Scholar] [CrossRef]
- Chen, X.; Shuai, C.; Wu, Y.; Zhang, Y. Analysis on the carbon emission peaks of China’s industrial, building, transport, and agricultural sectors. Sci. Total Environ. 2020, 709, 135768. [Google Scholar] [CrossRef]
- Zhang, X.; Shen, M.; Luan, Y.; Cui, W.; Lin, X. Spatial Evolutionary Characteristics and Influencing Factors of Urban Industrial Carbon Emission in China. Int. Environ. Res. Public Health 2022, 19, 11227. [Google Scholar] [CrossRef]
- Peng, B.; Zheng, H.; Zhu, L.; Hu, Q. Evolution characteristics and link prediction of China’s industrial carbon emission network structure. China Environ. Sci. 2024, 44, 1718–1731. [Google Scholar]
- Li, C.; Li, H.; Qin, X. Spatial Heterogeneity of Carbon Emissions and its influencing Factors in China: Evidence from 286 Prefecture-Level Cities. Int. Environ. Res. Public Health 2022, 19, 1226. [Google Scholar] [CrossRef] [PubMed]
- Wei, L.; Liu, Z. Spatial heterogeneity of demographic structure effects on urban carbon emissions. Environ. Impact Assess. Rev. 2022, 95, 106790. [Google Scholar] [CrossRef]
- Zhang, H.; Li, S. Carbon emissions’ spatial-temporal heterogenenity and identification from rural energy consumption in China. J. Environ. Manag. 2022, 304, 114286. [Google Scholar] [CrossRef]
- Han, H.; Zeng, X.; Wang, C. How does urbanization impact China’s carbon emissions: A regional heterogeneity perspective. Environ. Sci. Pollut. Res. Int. 2024, 31, 44920–44937. [Google Scholar] [CrossRef]
- Zhang, Y.; Yu, Z.; Zhang, J. Research on carbon emission differences decomposition and spatial heterogeneity pattern of China’s eight economic regions. Environ. Sci. Pollut. Res. Int. 2022, 29, 29976–29992. [Google Scholar] [CrossRef]
- Shao, Q.; Fu, J.; Huang, F.; Li, G.; Huang, H.; Tang, Z.; Zhang, Z. Low-Carbon Territorial Spatial Detailed Planning in the Context of Climate Change: A Case Study of the Wenzhou Garden Expo Park Area, China. Atmosphere 2024, 15, 1334. [Google Scholar] [CrossRef]
- Alpan, K.; Tuncal, K.; Ozkan, C.; Sekeroglu, B.; Ever, Y.K. Design and simulation of global model for carbon emission reduction using IoT and artificial intelligence. Procedia Comput. Sci. 2022, 204, 627–634. [Google Scholar] [CrossRef]
- Cheng, Q.; Wang, X.; Wang, S.; Li, Y.; Liu, H.; Li, Z.; Sun, W. Research on a Carbon Emission Prediction Method for Oil Field Transfer Stations Based on an Improved Genetic Algorithm-The Decision Tree Algorithm. Processes 2023, 11, 2738. [Google Scholar] [CrossRef]
- Li, J.; Zhang, Y.; Gai, R. Estimation of the column concentration of carbon dioxide using spaceborne shortwave infrared spectrometer. China Environ. Sci. 2023, 43, 1499–1509. [Google Scholar]
- O’Reilly, E. Statistical Advantages of Oblique Randomized Decision Trees and Forests. arXiv 2024, arXiv:2407.02458. [Google Scholar]
- Jin, Y.; Sharifi, A.; Li, Z.; Chen, S.; Zeng, S.; Zhao, S. Carbon emission prediction models: A review. Sci. Total Environ. 2024, 927, 172319. [Google Scholar] [CrossRef] [PubMed]
- Liu, H.; Liu, C.; Sun, Y. A study on the spatial correlation network structure of china’s energy consumption and its effect. China Ind. Econ. 2015, 5, 83–95. [Google Scholar]
- Wang, H. Research on the Development Index of China’s Megacities Incorporating Livelihood Improvement: Taking Beijing as an Example. Inq. Into Econ. Issues 2013, 1, 41–45. [Google Scholar]
- GB/T 2589-2008; General Principles for Calculation of the Comprehensive Energy Consumption. The Standardization Administration of the People’s Republic of China: Beijing, China, 2008.
- Cao, X.; Yan, T.; Chen, Z.; Zhen, F.; Xu, Z.; Cao, G. Comprehensive Survey of Geosciences in China: Urban Volume; Yang, Y., Yang, J., Ma, L., Huang, X., Xue, D., Wu, Q., Eds.; Shaanxi Normal University Press Group Co., Ltd.: Xi’an, China, 2018. [Google Scholar]
Province | Year | Coal Consumption (10,000 tons) | Coke Consumption (10,000 tons) | Crude Oil Consumption (10,000 tons) | Gasoline Consumption (10,000 tons) | Kerosene Consumption (10,000 tons) | Diesel Oil Consumption (10,000 tons) | Fuel Oil Consumption (10,000 tons) | Liquefied Petroleum Gas Consumption (10,000 tons) | Natural Gas Consumption (10,000 tons) |
---|---|---|---|---|---|---|---|---|---|---|
Beijing | 2020 | 134.98 | 0.01 | 781.77 | 423.06 | 457.88 | 112.55 | 0.27 | 19 | 189.12 |
2021 | 130.78 | 0 | 776.43 | 480.02 | 496.28 | 130.68 | 0.61 | 20.42 | 189.96 | |
2022 | 79.29 | 0 | 774.92 | 374.82 | 330.32 | 125.71 | 0.2 | 17.14 | 197.95 | |
Tianjin | 2020 | 3745.28 | 929.37 | 1394.16 | 284.73 | 85.32 | 325.05 | 52.61 | 16.56 | 117.03 |
2021 | 3428.78 | 752.06 | 1722.47 | 291.12 | 93.83 | 338.18 | 47.23 | 15.14 | 123.45 | |
2022 | 3314.72 | 704.72 | 1660.88 | 269.19 | 75.62 | 340.15 | 52.08 | 19.73 | 127.52 |
Province | Year | Transportation and Construction Industry (10,000 Tons of Carbon Dioxide) | Industrial Sector (10,000 Tons of Carbon Dioxide) | Agriculture and Forestry Industry (10,000 Tons of Carbon Dioxide) | Resource Recycling Industry (10,000 Tons of Carbon Dioxide) | Purchased Electricity Industry (10,000 Tons of Carbon Dioxide) | Heating and Cooling Industry (10,000 Tons of Carbon Dioxide) |
---|---|---|---|---|---|---|---|
Beijing | 2020 | 11635.97 | 20054.98 | 6335.69 | 3589.84 | 3749.30 | 7731.54 |
2021 | 8548.35 | 18585.80 | 4741.50 | 8272.53 | 3637.01 | 4151.85 | |
2022 | 8574.12 | 18346.06 | 7511.46 | 6035.23 | 4422.47 | 3477.19 | |
Tianjin | 2020 | 3088.76 | 7142.88 | 1755.09 | 3404.75 | 2038.59 | 2111.04 |
2021 | 3626.45 | 7657.18 | 1925.94 | 3600.56 | 2324.74 | 2505.79 | |
2022 | 4846.94 | 7131.17 | 2416.57 | 2024.48 | 2146.97 | 2557.58 |
Energy Sources | Average Low Heating Value (kJ/kg) | Standard Coal Coefficient (kgce/kg) | Carbon Content Per Unit of Heat (t-c/TJ) | Carbon Oxidation Rate (%) | Carbon Dioxide Emission Coefficient |
---|---|---|---|---|---|
Raw coal | 20,908 | 0.7143 | 26.37 | 0.94 | 1.9003 kg∙CO2/kg |
Coke | 28,435 | 0.9714 | 29.42 | 0.93 | 2.8604 kg∙CO2/kg |
Crude oil | 41,816 | 1.4286 | 20.08 | 0.98 | 3.0202 kg∙CO2/kg |
Gasoline | 43,070 | 1.4714 | 18.9 | 0.98 | 2.9251 kg∙CO2/kg |
Kerosene | 43,070 | 1.4714 | 19.6 | 0.98 | 3.0179 kg∙CO2/kg |
Diesel oil | 42,651 | 1.4571 | 20.2 | 0.98 | 3.0959 kg∙CO2/kg |
Fuel oil | 41,816 | 1.4286 | 21.1 | 0.98 | 3.1705 kg∙CO2/kg |
Liquefied petroleum gas | 50,179 | 1.7143 | 17.2 | 0.98 | 3.0119 kg∙CO2/kg |
Natural gas | 38,931 | 1.3300 | 15.32 | 0.99 | 2.1622 kg∙CO2/kg |
Eastern China | Performance evaluation | ||
MAE | RMSE | MAPE | |
DTR | 3425.9135 | 5005.8092 | 10.74% |
RFR | 3050.3819 | 3936.3953 | 9.29% |
GBDT | 2930.8671 | 3881.3712 | 8.94% |
Light-GBM | 2485.6903 | 3075.4195 | 7.68% |
EG-Tree | 2292.7858 | 3330.7504 | 6.47% |
Northeast China | Performance evaluation | ||
MAE | RMSE | MAPE | |
DTR | 2462.9956 | 2998.6893 | 10.18% |
RFR | 2353.5125 | 2782.0036 | 8.34% |
GBDT | 2473.5052 | 2917.3763 | 8.83% |
Light-GBM | 2456.0197 | 3081.107 | 7.94% |
EG-Tree | 2164.8509 | 2643.0878 | 7.90% |
Central China | Performance evaluation | ||
MAE | RMSE | MAPE | |
DTR | 4443.9273 | 5396.3621 | 17.08% |
RFR | 3329.2644 | 4987.0962 | 12.81% |
GBDT | 3660.2287 | 4957.0896 | 13.48% |
Light-GBM | 4216.9413 | 5654.2967 | 17.84% |
EG-Tree | 2879.325 | 6721.628 | 9.22% |
RMSE | Transportation and Construction Industry | Industrial Sector | Agriculture and Forestry Industry | Resource Recycling Industry | Purchased Electricity Industry | Heating and Cooling Industry |
---|---|---|---|---|---|---|
East China | 0.217 | 0.358 | 0.088 | 0.257 | 0.786 | 0.192 |
North China | 0.529 | 0.621 | 0.232 | 0.409 | 0.419 | 0.570 |
Central China | 0.404 | 0.448 | 0.118 | 0.223 | 0.226 | 0.102 |
Northwest China | 0.173 | 0.248 | 0.141 | 0.230 | 0.134 | 0.294 |
South China | 0.483 | 0.453 | 0.297 | 0.420 | 0.333 | 0.288 |
Northeast China | 0.408 | 0.351 | 0.290 | 0.572 | 0.283 | 0.224 |
Southwest China | 0.435 | 0.460 | 0.167 | 0.275 | 0.145 | 0.191 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Zang, J.; Hu, X.; Qie, K.; Zhang, Z.; Zhang, S. An EG-Tree Model Incorporating Spatial Heterogeneity for Analyzing Multifactorial Coupling Effects on Carbon Emissions Across Industries and Regions in China. Atmosphere 2025, 16, 663. https://doi.org/10.3390/atmos16060663
Zang J, Hu X, Qie K, Zhang Z, Zhang S. An EG-Tree Model Incorporating Spatial Heterogeneity for Analyzing Multifactorial Coupling Effects on Carbon Emissions Across Industries and Regions in China. Atmosphere. 2025; 16(6):663. https://doi.org/10.3390/atmos16060663
Chicago/Turabian StyleZang, Jinrui, Xin Hu, Kun Qie, Zian Zhang, and Shi Zhang. 2025. "An EG-Tree Model Incorporating Spatial Heterogeneity for Analyzing Multifactorial Coupling Effects on Carbon Emissions Across Industries and Regions in China" Atmosphere 16, no. 6: 663. https://doi.org/10.3390/atmos16060663
APA StyleZang, J., Hu, X., Qie, K., Zhang, Z., & Zhang, S. (2025). An EG-Tree Model Incorporating Spatial Heterogeneity for Analyzing Multifactorial Coupling Effects on Carbon Emissions Across Industries and Regions in China. Atmosphere, 16(6), 663. https://doi.org/10.3390/atmos16060663