Next Article in Journal
Methodology and Modeling of UAV Push-Broom Hyperspectral BRDF Observation Considering Illumination Correction
Previous Article in Journal
Integrating SAR, Optical, and Machine Learning for Enhanced Coastal Mangrove Monitoring in Guyana
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Calculation of CO2 Emissions from China at Regional Scales Using Remote Sensing Data

1
Key Laboratory of Karst Dynamics, MNR&GZAR, Institute of Karst Geology, CAGS, Guilin 541004, China
2
The National Key Laboratory of Water Disaster Prevention, Hohai University, Nanjing 210024, China
3
Yangtze Institute for Conservation and Development, Hohai University, Nanjing 210024, China
4
Dayu College, Hohai University, Nanjing 210024, China
5
Department of Civil Engineering, Monash University, Clayton, VIC 3800, Australia
*
Author to whom correspondence should be addressed.
Remote Sens. 2024, 16(3), 544; https://doi.org/10.3390/rs16030544
Submission received: 22 November 2023 / Revised: 20 January 2024 / Accepted: 27 January 2024 / Published: 31 January 2024
(This article belongs to the Section Urban Remote Sensing)

Abstract

:
Since industrialization, global carbon dioxide (CO2) emissions have been rising substantially, playing an increasingly important role in global warming and climate change. As the largest CO2 emitter, China has proposed an ambitious reduction plan of peaking before 2030 and achieving carbon neutrality by 2060. Calculation of CO2 emissions inventories at regional scales (e.g., city and county) has great significance in terms of China’s regional carbon policies as well as in achieving the national targets. However, most of the existing emissions data were calculated based on fossil fuel consumptions and were thus limited to the provinces in China, making it challenging to compare and analyze the CO2 emissions of different cities and counties within a province. Machine learning methods provided a promising alternative but were still suffering from the lack of availability of training samples at city or county scales. Accordingly, this study proposed to use the energy consumption per unit GDP (ECpGDP) and GDP to calculate the effective CO2 emissions, which are the CO2 emissions if all consumed energy was generated by standard coal. Random forest models were then trained to establish relationships between the remote sensing night-light data and effective CO2 emissions. A total of eight predictor variables were used, including the night-light data, the urbanization ratio, the population density, the type of sensors and administrative divisions, latitude, longitude, and the area of each city or county. Meanwhile, the mean value of the five-fold cross-validation model was used as the estimated effective CO2 emissions in order to avoid overfitting. The evaluation showed a root mean square error (RMSE) of 10.972 million tons and an overall Pearson’s correlation coefficient (R) of 0.952, with satisfactory spatial and temporal consistency. The effective CO2 emissions of 349 cities and 2843 counties in China during 1992–2021 were obtained, providing a promising dataset for CO2-emission-related applications.

1. Introduction

Carbon dioxide (CO2) emissions have played an increasingly important role in contributing to global warming and climate change over the past decades. The World Meteorological Organization suggested that the average global temperature for the period 2013–2022 increased by 1.14 °C when compared to the pre-industrial period of 1850–1900 [1], and emissions reduction is critical to limit global warming to 1.5 °C [2]. China is the largest CO2 emitter, with an ambitious reduction plan of peaking before 2030 and achieving carbon neutrality by 2060 [3]. To meet this goal, China has taken an active role in energy conservation and emission-reduction activities and formulated a series of emission-reduction policies, such as optimizing the industrial structure and developing and utilizing new energy sources [4]. Calculation of CO2 emissions inventories at regional scales (e.g., city and county) has great significance to China’s regional carbon policies as well as to achieving the national targets [5].
At present, many studies have conducted estimations of China’s CO2 emissions at the national and provincial scales, with two main categories. The energy consumption data are the basis of most existing methods to estimate CO2 emissions. Many studies [6,7] have adopted the regional carbon emissions factor accounting method of the Intergovernmental Panel on Climate Change (IPCC) to calculate CO2 emissions. Some studies [8,9] used this method to estimate the CO2 emissions of China and its 30 provinces during 1997–2017, with the provincial carbon emissions inventory, including energy-related emissions (17 fossil fuels in 47 industries) and process-related emissions (cement production). Similarly, using the definitions provided by the IPCC regional emissions method, several studies [10,11] have developed a method for constructing production-based inventories of CO2 emissions from Chinese cities, covering 47 socio-economic sectors, 20 energy types, and 9 primary-industry products. This method was further combined with the input–output model to calculate the consumption-based CO2 emissions inventories of 13 Chinese cities, being the basis of regional CO2 reduction policies [12]. Despite the success of these studies, the required energy consumption data are only available for provinces and a small number of central cities in China, covering a few selected years. Estimating CO2 emissions at city and county levels is still challenging.
Remote sensing provides a promising alternative in CO2 emissions estimation on regional scales as it can capture the spatial details and dynamics of the Earth’s surface at a low cost. Conventional remote sensing requires a clear physical relationship between the observed optical or microwave signal and the research target, and it is challenging to directly measure CO2 emissions inventories [13]. However, the night light sensor, originally intended for measuring cloud reflectance and nocturnal weather dynamics, was found to be highly correlated to human activities and economic indicators, e.g., GDP, population density, industrial activities, and the urbanization degree of a region [14]. It has also been used to indirectly predict the energy use and CO2 emissions level of a region using data-driven methods. Han et al. [15] developed a night-light-based method to simulate the urban CO2 emissions of 11 selected cities in the Yangtze River Delta of China from 2003 to 2013. Meng et al. [16] proposed a top-down carbon emissions estimation method using night-light data and energy statistics to estimate CO2 emissions in urban areas of China from 1995 to 2010, assuming that the estimation method is scale invariant. Similarly, Chen et al. [17] first adopted the method of machine learning (PSO-BP algorithm) to establish the relationship between provincial CO2 emissions and remote sensing night-light data. The trained relationship then was used to estimate the CO2 emissions of 2735 counties in China from 1997 to 2017. Apart from the main input variable of night-light data, the latitude, longitude and area of a county were also used for improved estimation. Yang et al. [18] trained an ensemble-structure-based neural network model to estimate the carbon emissions at the municipal level in three northeastern provinces from 1998 to 2013, based on the weighted coefficient strategy of stable nighttime-light data. These studies confirmed the feasibility of using remote sensing night light to estimate CO2 emissions and provided valuable CO2 emissions data from city to county scales. However, the true values of CO2 emissions used for training were all from energy consumption data, only covering the 34 provinces and a few central cities in China. The models trained for the provinces were commonly directly applied to estimate the values at regional scales, with the uncertainty caused by the scale-effect being unclear.
Different from the limited data of energy consumption, many cities and counties in China reported their energy consumption per unit GDP (ECpGDP) irregularly, providing an alternative data source for energy consumption. In this study, all the publicly available records of ECpGDP were collected and the corresponding effective energy consumption, in terms of standard coal and CO2 emissions, were then calculated, providing a few ground truths at the city and county scales. They were then used to train a random forest model for CO2 emissions, taking the night-light data as the main input. Different from the existing estimation methods [17,18], two extra explainable parameters of population density and urbanization ratio were included in the model inputs to provide a better estimation of CO2 emissions. The trained model was then used to estimate the effective CO2 emissions for cities and counties over a long period of 1992–2021. Since the effective CO2 emissions are based on society’s energy consumption, the estimated effective CO2 emissions are expected to deviate substantially from the fossil-fuel-related emissions. However, it depicts a worst-case scenario for society’s CO2 emissions, with a clear urgency for energy transition. At the same time, this can provide policymakers with more direct, regionally accurate data at city and county scales, avoiding the consideration of the CO2 emissions footprint.

2. Data and Preprocessing

Administrative zone map of China. China has a three-tier system of administrative divisions, being provinces, cities, and counties. A city consists of a few counties, with the number of counties varying substantially from east to west. The administrative divisions of cities and counties in China have changed slightly in the past four decades and the estimation was all based on the current administrative division system, including 370 cities and 2878 counties. Six focus areas were highlighted for further analysis in this study in view of the economy and urbanization, with a summary and the locations of these areas being listed in Table 1 and Figure 1, respectively. These areas were as follows: (A) the three northeastern provinces, being the traditional old industrial base but undergoing a recession in the past two decades; (B) the Beijing–Tianjin–Hebei region; (C) the Chengyu city cluster that is located in western China; (D) the central Yangtze River cluster, being the most dynamic city cluster in China; (E) the Yangtze delta, the largest city cluster in China, and (F) the Pearl delta, being the early economically development central cities. These focus areas are well-developed areas with much higher CO2 emissions than the rest of China. Additionally, the longitudes and latitudes of the geometric center of each city or county and the corresponding areas were calculated and used as input variables in the model.
Night-light data. The data collected by the visible and infrared (OLS) sensors aboard the Defense Meteorological Satellite Program (DMSP) and the Visible Infrared Imaging Radiometer Suite (VIIRS) aboard the Suomi National Polar-Orbiting Partnership were used in this study. The spatial resolution of OLS is 3000 m, with the pixel size of the nighttime light (NTL) products being produced to 1000 m. The OLS NTL data were collected by six DMSP satellites (F10, F12, F14–F16, and F18), operating from 1992 to 2013. The VIIRS sensors are the successors of OLS starting from 2011. VIIRS is able to acquire new NTL remote sensing images (day/night band, DNB) with an enhanced spatial resolution of 750 m. The pixel size of the VIIRS NTL products was preprocessed to 500 m.
Since the two kinds of remote sensing data are obtained by different satellite sensors, there are differences in sensor parameters, data quality, time span, spatial–temporal and radiometric resolution, etc., which makes the two sets of data incompatible and limits the long-term study of night-light data. Fortunately, the two kinds of satellites had an overlap of 2012–2013, making it possible to achieve a consistent dataset. At present, a large amount of research is undertaken to harmonize the two kinds of remote sensing data. For example, Li et al. [19] used the monthly averaged VIIRS and monthly OLS data to generate DMSP-OLS-like data using a power function model. Wu et al. [20] proposed a geographically weighted regression model to generate DMSP-like data, and a consistent NTL time series for 1996–2017 was obtained. Recently, the National Center for Earth System Science released a “DMSP-OLS-like” 1 km NTL remote sensing dataset (1992–2022) of China (available at http://geodata.nnu.edu.cn, accessed on 30 January 2023.). This dataset was produced by a pseudo-invariant pixel method [20]. This “DMSP-OLS-like” dataset was found to hold strong linear correlations with various economic and development statistics, with average R2 values of 0.931 and 0.654 at the national and provincial scales, respectively. In this study, this “DMSP-OLS-like” 1 km NTL remote sensing dataset was directly used, considering its relatively higher accuracy. The annual mean of the digital number (DN) values (i.e., pixel values) of each city and county of China were calculated for 1992–2021.
Energy consumption per unit of GDP (ECpGDP) data. ECpGDP is an indicator of energy intensity, reflecting the energy inefficiency of an economy. In China, ECpGDP was defined as the ton of coal equivalent (TCE) per CNY 10,000. This indicator was calculated using the energy consumption data that are not publicly available. The data covered a long period of 1997–2019, including 603 records of 101 cities and 905 records of 235 counties. All the ECpGDP records of cities and counties available on China’s economic and social big data research platform (https://data.cnki.net/home, accessed on 30 January 2023.) were acquired in this study. The corresponding records of GDP were released by the National Bureau of Statistics annually, being also obtained from the same data platform. Figure 1a,b show the spatial distributions of the available records of energy consumption per unit GDP at the city and county scales, respectively, while the number of records for each year is provided in Figure 2a. Most records were collected from 2005 to 2017 as they were considered to be a critical indicator of social and economy development since the 11th five-year plan of China (2006–2011); however, there was no clear spatial pattern of the number of records as it was not compulsory to be released to the public, and thus, none of the cities or counties reported it continuously.
Urbanization ratio of cities and counties. The European Space Agency (ESA) Climate Change Initiative (CCI) and Copernicus Climate Change Service (C3S) landcover products were used to calculate the urbanization ratio of each city and county (Figure 3). The ESA CCI landcover products were updated annually from 1992 to 2015 using the SPOT4 satellite images [21], while the C3S landcover products were produced for 2016–2020 using Sentinel-2 data [22]. Though SPOT4 and Sentinel-2 had different spatial–temporal and radiometric resolutions, consistent global landcover maps were produced for the whole period of 1992–2020 at a spatial resolution of 300 m. The Earth’s surface was classified into 22 landcover types, including the urban target type with a fixed pixel value of 190. The products were released in the format of netCDF. In this study, the number of urban pixels of each city and county were calculated first. The value was then divided by the number of total pixels of a city or county to obtain the urbanization ratio. Since the CCI and C3S landcover product only covered 1992–2020, the urbanization ratio of 2021 was interpolated using a linear interpolation method.
Population density. The National Bureau of Statistics of China has produced seven national censuses, in 1953, 1964, 1982, 1990, 2000, 2010, and 2020, with the population of each city and county being publicly available. These datasets were also acquired from the China’s economic and social big data research platform. In this study, the population of non-census years was interpolated from the seven censuses using a spline function of the first order. The annual population of each city or county was then divided by the corresponding area (excluding Hong Kong, Macao, and Taiwan), resulting in the population density per km2. The calculated population density ranged from 0.08 to 48,382/km2, with the small values being mainly located in western cities and counties (Figure 4).
Fossil-fuel-based CO2 emissions data. Since the estimated CO2 emissions in this study are effective values, the CO2 emissions values estimated by the IPCC method were used for inter-comparison. Specifically, the carbon emissions of provinces available in the carbon accounting database (CEADs) were used in this study. These carbon emissions data were calculated according to the carbon emissions nucleic acid method of IPCC, covering 47 industries and 17 fossil fuels related to energy emissions [8,9,10,23]. The data cover the period of 1997–2019 and 30 provinces excluding Taiwan, Hong Kong, Macao, and the Tibet Autonomous Region.

3. Methodology

3.1. Calculation of the Effective CO2 Emissions

The annual ECpGDP records were used to calculate the annual effective CO2 emissions, being treated as the truth of the proposed estimation model. As previously mentioned, the ECpGDP was defined as TCE/CNY 10,000, and the energy consumption of a city or county i (ECi) in TCE can thus be calculated by:
E C i = E C p G D P i × G D P i ,
The effective CO2 emissions (ei) of a city or county i with a unit of million tons can then be calculated using:
e i = τ γ 0 α 0 β 0 E C i × 10 6 ,
where τ is 44/12 (unitless), representing the mass ratio of carbon (C) to carbon dioxide (CO2), and γ0 is a unitless parameter of 0.9, indicating the carbon oxidation coefficient of standard coal. The values of α0 and β0 (unitless) are 29.27 and 24.74, respectively. The values of α0 and β0 represent the thermal conversion coefficient of standard coal and the carbon content of fuel [24], also known as the average carbon content of standard coal and the carbon emissions factor, respectively. The ECi is an equivalent energy consumption of various energy sources, including fossil fuels and various renewable energy sources. Since the data source and the method to calculate the ECpGDP are not publicly available, the contribution of fossil fuels cannot be determined. The effective CO2 emissions defined in Equation (2) can thus deviate significantly from the real CO2 emissions, especially for urban areas with little consumption of fossil fuels but substantial energy consumption. However, the estimated effective CO2 emissions are more straightforward in terms of the regional CO2 reduction policies and domestic carbon emissions trading. Figure 2b shows the calculated effective CO2 emissions of the 1508 samples, with the values of counties being mainly <20 million tons.

3.2. CO2 Emissions Model

The flowchart of building the CO2 emissions model and the application of the model to estimate the CO2 emissions at city and county scales are provided in Figure 5. The main inputs of the estimation model (Table 2) included the mean of the night-light data (mean DN value), the population density, and the urbanization ratio of a city or county. Moreover, the geographical coordinates (x, y) and the area (A) of each city and county were used as auxiliary input parameters to reflect the regional differences in CO2 emissions, with x and y being the values of the geographical center [17,18]. The commonly used normalization method of z-score standardization was applied to the whole dataset, so that the data of different orders of magnitude were uniformly converted into the same order of magnitude to ensure comparability of the data. Notably, the mean and standard deviation required in the z-score standardization were those of the whole dataset. The effective CO2 emissions were treated as the truth of the estimation model.
Random forest regression models were selected to estimate the effective CO2 emissions, considering the complex non-linearity between the effective CO2 emissions and the aforementioned explainable parameters. Random forest models have been widely used in landcover classification and surface parameter retrieval from remote sensing data [25,26,27]. In the training process of the random forest model, multiple small decision trees were built with different subsets and feature attributes, and they were then ensembled into a more powerful model [28]. Each subset is built with randomly selected samples and randomly selected feature attributes, and this randomization reduces the sensitivity of the decision tree to the training data, thereby preventing overfitting [29].
A total of 1508 samples were achieved, including 603 samples of 101 cities and 905 samples of 235 counties. This is a relatively large sample set compared to that of other studies focusing on the provincial scale [16,17,18,19], but is still small for a machine learning method [30]. Different from the common practice of splitting the sample set into training, validation, and testing sets, 5-fold cross-validation was adopted for the whole sample set, where the sample set was randomly split into 5 subsets. This resulted in 5 independent training and validation sessions, and 5 random forest models were trained. In each session, 4 of the 5 subsets were used for training with the out-of-bag subset (10% of the whole sample set) being used for validation. Since a few hyperparameters were required in the random forest model, an exhaustive grid search was carried out for the two most important parameters [31], i.e., the number of trees (10 to 70 with an interval of 10) and the maximum features for splitting a leaf node (‘auto’ and ‘sqrt’). It is somewhat considered as “cheating” in this step, as the hyperparameters were optimized for the validation set and the 5-fold validation accuracy represents the best results that can be achieved. In the estimating stage, all 5 models were used to estimate the CO2 emissions, with the average of the five values being the final output. Moreover, the maximum values, minimal values, and standard deviation values of the 5 outputs were calculated, partly reflecting the potential estimation uncertainty

3.3. Accuracy Metrics

Four commonly used accuracy metrics were used to reflect the accuracy of the trained models, namely, bias, Pearson’s correlation coefficient (R), root mean square error (RMSE) in million tons, and mean absolute percentage error (MAPE). The RMSE and MAPE provide the absolute and relative errors. Since the CO2 emissions estimated in this study were effective values, with the absolute values being less critical, MAPE was treated as the main indicator.

4. Results

4.1. Estimation Accuracy of the Proposed Model

The five-fold cross validation resulted in five models, and the predictions over the five validation subsets were compared with the truth, covering the whole dataset (Figure 6a). Satisfactory validation accuracy metrics were achieved with an overall R and RMSE of 0.952 and 10.972 million tons, respectively. A small overall bias of −0.086 mill tons was observed; however, the large values were substantially underestimated. This is generally consistent with the applications of machine learning methods in other studies, demonstrating that predicted values always regress towards the mean [32]. A further investigation of the range-specific bias and MAPE was carried out in Figure 6b, including four ranges of 0–70, 70–140, 140–210, and >210 million tons. A small positive bias of 1.07 million tons was observed, which became negative and substantially decreased as the true value increased, being −75.86 million tons for the range of >210 million tons. The validation accuracy at the county scale was thus slightly better than that of the city scale because the CO2 emissions of counties were relatively small (Figure 2b). However, the difference in MAPE was relatively small, with the largest MAPE of 28.87% being observed in the range of >210 million tons. The MAPE values of the cities or counties within the range of 70–140 million tons were relatively small, being 11.69%.
The MAPE values of each city and county were also calculated to show the spatial variations in the model performance (Figure 7). While the MAPEs of a few cities or counties have a large value of >50%, e.g., those in the northeast of China, 75 of the 100 cities and 183 of the 235 counties had a small MAPE of <10%, suggesting a satisfactory spatial consistency. Seven cities and sixteen counties were observed to have a large MAPE of >70%. They were mainly cities or counties with low effective CO2 emissions or 1–2 samples, and a small estimation bias can result in a large MAPE.
Figure 8 shows the MAPE values of each calendar year. In general, the MAPE values showed a fluctuating upward trend, suggesting that the performance of the estimation models gradually decreased. Relatively large fluctuations were observed before 2005, being mainly caused by the limited number of samples in this period (Figure 2a). The MAPEs of 2005 to 2016 were stable in the range of 12% to 22%, being partly related to the large number of samples in this period. A large MAPE of 43% was observed in 2017. This was caused by two samples having low effective CO2 emissions of <1 million tons. After removing these samples, the MAPE was ~25%, being consistent with other years around this time period.

4.2. Effective CO2 Emissions of China: Results at the City and County Scales

The effective CO2 emissions of each city from 1992 to 2021 were estimated using the trained models. As was previously mentioned, five models were built using five-fold cross-validation and the averaged estimations were treated as the effective CO2 emissions; the values of 370 cities of four selected years are presented in Figure 9. In general, the CO2 emissions of all cities have been on the rise over the past four decades. The largest city-level emissions increased from 159.19 million tons in 1992 to 297.21 million tons in 2021, with an average annual increase of 86.7%. Meanwhile, the smallest city-level emissions increased from 4.83 to 11.84 million tons.
The geographical pattern of China’s CO2 emissions has changed substantially in the past three decades. In 1992, most of the cities had relatively low effective CO2 emissions of <15 million tons and the eastern coastal areas had higher emissions compared to the whole country, especially in the two megacities of Shanghai and Beijing. The hotspots were mainly located in four focus areas of the Beijing–Tianjin–Hebei region (B), the Yangtze delta (E), the Chenyu city cluster (C), and the Pearl delta (F). This is mainly due to the relatively early economic development of these areas, and at the same time, it also drives the development of surrounding cities to a certain extent, so that the high-carbon-emissions areas show a trend of spatial expansion. While the effective CO2 emissions of all cites increased from 1992 to 2000, a similar spatial pattern was observed in 2000, and the hotspots within each focus area were enhanced or extended in space. Most cities in northwest China still had a low amount of effective CO2 emissions of <15 million tons, except for the capital city of Xinjiang (Urumqi). However, this changed substantially in 2010, and only the cities of the Qinghai–Tibet Plateau maintained relatively low effective CO2 emissions. A few hotspots were observed outside the focus areas, generally being the central cities of each region, e.g., the capital cities of the three northeastern provinces (Figure 9a). From 2010 to 2021, the emissions growth in most cities began to accelerate, with 95% of cities having CO2 emissions of >15 million tons in 2021. The imbalance between the eastern costal area, central China, and the northwest area was further reduced. However, the three northeastern provinces and the eastern part of the middle Yangtze cities had much smaller increases compared to other areas, being mainly caused by the economic stagflation of these areas in the past decade.
Figure 10 shows the effective CO2 emissions of each county in 1992, 2000, 2010, and 2021. Similarly, the emissions of most counties kept increasing from 1992 to 2021. While the Beijing–Tianjin–Hebei region, the Yangtze delta, the Chenyu city cluster, and the Pearl delta had relatively higher CO2 emissions, the spatial patterns at the county scale were substantially different from those observed at the city scale (Figure 9). Little difference was observed between the eastern and western counties in 1992 and 2000. This was mainly caused by the large difference in the administrative areas, with the eastern counties commonly having a much smaller area. For example, the areas of the largest and the smallest county are 220,000 km2 and <100 km2, respectively. The hotspots in 2010 and 2021 generally matched those of the city scale. A few counties of the three northeastern provinces had reduced CO2 emissions from 2010 to 2021.
The standard deviation of the five models was estimated as an indicator of the model uncertainty. For simplicity, only the standard deviations of 1992 and 2021 are presented in Figure 11, being the calendar years with the smallest and largest uncertainties, respectively. The spatial patterns of the standard deviation and the mean values were generally consistent. The areas and/or periods with low effective CO2 emissions commonly had a small degree of uncertainty. In other words, the uncertainties were greater over the areas and/or periods with higher CO2 emissions. For example, 99% of counties have an uncertainty of <2 million tons in 1992, while Beijing had an uncertainty of >25 million tons during the same year. Moreover, the western areas had greater uncertainties than other areas.

4.3. Inter-Comparison with the CO2 Emissions from the IPCC Accounting Method

An inter-comparison between the estimated effective CO2 emissions and the value calculated by the IPCC accounting method was carried out for the whole country. Since the accounting method was based on the consumption data of fossil fuels, emissions have been denoted as fuel-based emissions. Briefly, the fuel-based CO2 emissions of 30 provinces from 1997 to 2019 were acquired [8,9,11,23], and the national CO2 emissions were calculated excluding Hong Kong, Hainan, Tibet, and Taiwan. The temporal evolution of the national fuel-based CO2 emissions was compared with that of the effective CO2 emissions in Figure 12.
In general, both the fuel-based CO2 emissions and the estimated effective emissions values presented an upward trend from 1997 to 2019 and can be roughly divided into three stages. In the first stage of 1997–2001, the fuel-based CO2 emissions were at a relatively low value of ~3200 million tons and changed little over time, while the estimated effective values increased slightly from 6031 to 6801 million tons. As expected, the effective CO2 emissions were much higher than the fuel-based values. This was mainly due to the different estimation methods. The effective CO2 emissions values used in this study were calculated based on GDP and energy consumption per unit of GDP, including all kinds of energy consumptions. Notably, only nine samples were available before 2005 (Figure 2a); thus, the trained models can have large uncertainties in this period. This, together with the fact that models overestimated the small values (Figure 6b), may have resulted in an overestimation of effective CO2 emissions in these years. From 2001 to 2013, both estimations of CO2 emissions showed a steadily increasing trend, with an average increase of 636 million tons per year. This was mainly driven by the rapid industrialization process of the first decade of the 21st century, leading the growth of fuel-based energy demand and, thus, CO2 emissions. While the effective CO2 emissions were still much higher than the fuel-based values in this period, the ratio between the fuel-based and effective emissions increased from 48% to 79%, suggesting that the fuel energy was increasingly important in this stage. After 2013, the growth rate of the fuel-based emissions began to decline, while the growth rate of effective emissions kept on increasing, with the gap increasing from 2965 to 8683 million tons as a result of the transition from traditional fossil fuels such as coal and oil to renewable energy.

5. Discussion

Acceptable validation accuracy (RMSE: 10.972 million tons) was achieved in the cross-validation, but the models underestimated the low values and overestimated the high values. This resulted in overestimations in the periods and areas with low effective CO2 emissions, e.g., the national emissions of 1992–2001, and a potential overestimation in the values after 2015. However, the spatial pattern and its temporal evolution were captured, providing valuable information about CO2 emissions at the finer scale of the city and county levels. Despite the promising results, the data used in this study do not fully reflect the CO2 emissions of individual regions. For example, some cities in western China had low degree of social-economic development; thus, their night light intensities and population densities were low, but the CO2 emissions were actually at a high level. The proposed method can thus potentially underestimate the effective CO2 emissions, resulting in greater uncertainties in these areas (Figure 11). Notably, the CO2 emissions of a city do not precisely equal the total emissions of all its constituent counties, owing to the model’s inherent challenge in maintaining consistent accuracy across diverse scales. The proposed methods tended to have a lower accuracy for the recent years (Figure 8). This was also related to the underestimations at high values. In view of the main input data, night-light data may not fully reflect the social and economic activities, because the night light of a city can saturate earlier than the peaking of CO2 emissions. Accordingly, more publicly available data sources related to CO2 emissions should be included for improved accuracy. However, this is still challenging because most statistics cannot fully cover the whole period of 1992–2021. Moreover, the uncertainties of other data sources such as comprehensive industry reports, satellite imagery, and government environmental databases are generally unknown and the potential benefit is unclear.
The proposed method used all of the publicly available records of ECpGDP in the training and validation, but the samples (1508) are still insufficient to train an “optimal” model that fully captures the CO2 emissions at the city and county scales. While some more samples can be included in the future, they may be still limited. Accordingly, data augmentation techniques may be more promising to improve the generalizability of the models [33].
The effective CO2 emissions metric was derived under the assumption that all energy originates from standard coal, making it not directly comparable to actual CO2 emissions (Figure 12) and, consequently, less applicable to the climate change models [34]. However, the primary motivation of this study was to provide an alternative to an improved regional CO2-emission-reduction policy. The estimated effective CO2 emissions provide an analytical lens that serves to fill crucial gaps in the regional dataset. It is essential to note that this approach, while not perfectly aligning with real-world emissions, still exhibits a noteworthy correlation, as evidenced by the high R2 values. This correlation suggests a parallelism between the effective CO2 emissions model and actual emissions, highlighting the potential utility of the proposed method. The real CO2 emissions caused by human activities within a city or county can be different from the fuel-based CO2 emissions, especially for areas with cross-region energy transfer. For example, the real CO2 emissions in eastern China may be much larger than the value estimated by the IPCC accounting method as a result of the China’s west–east power transmission. In contrast, the ECpGDP and effective CO2 emissions consider all kinds of energy consumption, thus being more straightforward for policy makers, without requiring consideration of the footprint of CO2 emissions. Furthermore, the concept of effective CO2 provides a robust baseline for assessing progress in emission-reduction efforts. For a city or county, a lower effective CO2 definitely means lower CO2 emissions, with the reduction being attributed to operating in an energy-efficient way and/or using lower carbon energy sources. This indicator can be used together with the statistics of renewable energy, providing a complete picture of the national and regional efforts of emissions reduction. For example, the growing gap between effective CO2 emissions and fuel-based emissions during the period from 2009 to 2019 (Figure 12) reflects, to a certain extent, the growing proportion of clean energy utilization and the considerable achievements of energy consumption transformation measures.

6. Conclusions

The effective CO2 emissions of China at the city and county scales were estimated and evaluated in this study using random forest models and a five-fold cross-validation, with an overall R and RMSE of 0.952 and 10.972 million tons, respectively. The spatial and temporal patterns of effective CO2 emissions were captured, being consistent with those of social and economic development. This study successfully provided the first carbon emissions estimations for cities and counties in China, which provides a key reference for the formulation of regional carbon policies in China.

Author Contributions

All the authors participated in editing and reviewing the manuscript. Methodology, algorithms, data analysis, validation, Y.L.; data preprocessing, methodology, validation, Y.C. conceptualization, writing—review, Q.C. Conceptualization, supervision, L.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by “the spatial–temporal pattern evolution of carbon emissions in karst systems” project, supported by the Guangxi Key Science and Technology Innovation Base on Karst Dynamics (KDL&Guangxi 202210), the Natural Science Foundation of Jiangsu Province (BK20210377), and the Fundamental Research Funds for the Central Universities (B220201009).

Data Availability Statement

The administrative zone map, energy consumption per unit of GDP (ECpGDP) data, GDP data, and census data are available via China’s economic and social big data research platform (https://data.cnki.net/home, accessed on 30 January 2023.). The European Space Agency (ESA) Climate Change Initiative (CCI) and Copernicus Climate Change Service (C3S) landcover products are available at the Copernicus Climate Change Service (C3S) Climate Data Store (CDS). Night-light data are available via http://geodata.nnu.edu.cn, accessed on 30 January 2023.

Acknowledgments

The authors are grateful to the reviewers for their valuable comments, which helped to improve the quality of this paper.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Forster, P.M.; Smith, C.J.; Walsh, T.; Lamb, W.F.; Lamboll, R.; Hauser, M.; Ribes, A.; Rosen, D.; Gillett, N.; Palmer, M.D. Indicators of Global Climate Change 2022: Annual update of large-scale indicators of the state of the climate system and human influence. Earth Syst. Sci. Data 2023, 15, 2295–2327. [Google Scholar] [CrossRef]
  2. Change, P.C. Global Warming of 1.5 °C; World Meteorological Organization: Geneva, Switzerland, 2018. [Google Scholar]
  3. Agency, I.E. An Energy Sector Roadmap to Carbon Neutrality in China; OECD Publishing: Washington, DC, USA, 2021. [Google Scholar]
  4. Wang, J.; Li, L. Sustainable energy development scenario forecasting and energy saving policy analysis of China. Renew. Sustain. Energy Rev. 2016, 58, 718–724. [Google Scholar] [CrossRef]
  5. Feng, R.; Hu, L.; Hu, X.; Fang, X. Knowledge gaps are making it harder to formulate national climate policies. Proc. Natl. Acad. Sci. USA 2023, 120, e2218563120. [Google Scholar] [CrossRef]
  6. Ipcc, I. Guidelines for National Greenhouse Gas Inventories; Eggleston, H.S., Buendia, L., Miwa, K., Ngara, T., Tanabe, K., Eds.; Prepared by the National Greenhouse Gas Inventories Programme; IGES: Hayama, Japan, 2006. [Google Scholar]
  7. IPCC. Refinement to the 2006 IPCC Guidelines for National Greenhouse Gas Inventories. In Guidelines for National Greenhouse Gas Inventories; International Panel on Climate Change (IPCC): Geneva, Switzerland, 2019. [Google Scholar]
  8. Shan, Y.; Guan, D.; Zheng, H.; Ou, J.; Li, Y.; Meng, J.; Mi, Z.; Liu, Z.; Zhang, Q. China CO2 emission accounts 1997–2015. Sci. Data 2018, 5, 1–14. [Google Scholar] [CrossRef] [PubMed]
  9. Shan, Y.; Huang, Q.; Guan, D.; Hubacek, K. China CO2 emission accounts 2016–2017. Sci. Data 2020, 7, 1–9. [Google Scholar] [CrossRef] [PubMed]
  10. Shan, Y.; Liu, J.; Liu, Z.; Xu, X.; Shao, S.; Wang, P.; Guan, D. New provincial CO2 emission inventories in China based on apparent energy consumption data and updated emission factors. Appl. Energy 2016, 184, 742–750. [Google Scholar] [CrossRef]
  11. Mi, Z.-F.; Wei, Y.-M.; He, C.-Q.; Li, H.-N.; Yuan, X.-C.; Liao, H. Regional efforts to mitigate climate change in China: A multi-criteria assessment approach. Mitig. Adapt. Strateg. Glob. Chang. 2017, 22, 45–66. [Google Scholar] [CrossRef]
  12. Mi, Z.; Zhang, Y.; Guan, D.; Shan, Y.; Liu, Z.; Cong, R.; Yuan, X.-C.; Wei, Y.-M. Consumption-based emission accounting for Chinese cities. Appl. Energy 2016, 184, 1073–1081. [Google Scholar] [CrossRef]
  13. Zhu, L.; Yuan, S.; Liu, Y.; Chen, C.; Walker, J.P. Time series soil moisture retrieval from SAR data: Multi-temporal constraints and a global validation. Remote Sens. Environ. 2023, 287, 113466. [Google Scholar] [CrossRef]
  14. Levin, N.; Kyba, C.C.; Zhang, Q.; de Miguel, A.S.; Román, M.O.; Li, X.; Portnov, B.A.; Molthan, A.L.; Jechow, A.; Miller, S.D. Remote sensing of night lights: A review and an outlook for the future. Remote Sens. Environ. 2020, 237, 111443. [Google Scholar] [CrossRef]
  15. Han, J.; Meng, X.; Liang, H.; Cao, Z.; Dong, L.; Huang, C. An improved nightlight-based method for modeling urban CO2 emissions. Environ. Model. Softw. 2018, 107, 307–320. [Google Scholar] [CrossRef]
  16. Meng, L.; Graus, W.; Worrell, E.; Huang, B. Estimating CO2 (carbon dioxide) emissions at urban scales by DMSP/OLS (Defense Meteorological Satellite Program’s Operational Linescan System) nighttime light imagery: Methodological challenges and a case study for China. Energy 2014, 71, 468–478. [Google Scholar] [CrossRef]
  17. Chen, J.; Gao, M.; Cheng, S.; Hou, W.; Song, M.; Liu, X.; Liu, Y.; Shan, Y. County-level CO2 emissions and sequestration in China during 1997–2017. Sci. Data 2020, 7, 1–12. [Google Scholar] [CrossRef] [PubMed]
  18. Yang, D.; Luan, W.; Qiao, L.; Pratama, M. Modeling and spatio-temporal analysis of city-level carbon emissions based on nighttime light satellite imagery. Appl. Energy 2020, 268, 114696. [Google Scholar] [CrossRef]
  19. Li, X.; Li, D.; Xu, H.; Wu, C. Intercalibration between DMSP/OLS and VIIRS night-time light images to evaluate city light dynamics of Syria’s major human settlement during Syrian Civil War. In Remote Sensing of Night-Time Light; Routledge: London, UK, 2021; pp. 80–97. [Google Scholar]
  20. Wu, Y.; Shi, K.; Chen, Z.; Liu, S.; Chang, Z. Developing improved time-series DMSP-OLS-like data (1992–2019) in China by integrating DMSP-OLS and SNPP-VIIRS. IEEE Trans. Geosci. Remote Sens. 2021, 60, 1–14. [Google Scholar] [CrossRef]
  21. Defourny, P.; Kirches, G.; Brockmann, C.; Boettcher, M.; Peters, M.; Bontemps, S.; Lamarche, C.; Schlerf, M.; Santoro, M. Land cover CCI. Prod. User Guide Version 2012, 2, 10–16. [Google Scholar]
  22. Hersbach, H.; Bell, B.; Berrisford, P.; Biavati, G.; Horányi, A.; Muñoz Sabater, J. ERA5 Hourly Data on Single Levels from 1979 to Present. Copernicus Climate Change Service (C3S) Climate Data Store (CDS). Available online: https://cds.climate.copernicus.eu/cdsapp#!/dataset/reanalysis-era5-single-levels?tab=overview (accessed on 29 October 2021).
  23. Guan, Y.; Shan, Y.; Huang, Q.; Chen, H.; Wang, D.; Hubacek, K. Assessment to China’s recent emission pattern shifts. Earth’s Future 2021, 9, e2021EF002241. [Google Scholar] [CrossRef]
  24. Jia, N.S.; Han, Y.H.; Hu, B. Research on the Development of China’s Emission Reduction Based on low carbon economy. Adv. Mater. Res. 2014, 962, 2381–2385. [Google Scholar] [CrossRef]
  25. Jie-min, W.; Feng, G. Discussion on the Problems on Land SurfaceAlbedo Retrieval by Remote Sensing Data. Remote Sens. Technol. Appl. 2011, 19, 295–300. [Google Scholar]
  26. Sinha, S.; Sharma, L.K.; Nathawat, M.S. Improved land-use/land-cover classification of semi-arid deciduous forest landscape using thermal remote sensing. Egypt. J. Remote Sens. Space Sci. 2015, 18, 217–233. [Google Scholar] [CrossRef]
  27. Balenzano, A.; Satalino, G.; Belmonte, A.; D’Urso, G.; Capodici, F.; Iacobellis, V.; Gioia, A.; Rinaldi, M.; Ruggieri, S.; Mattia, F. On the use of multi-temporal series of cosmo-skymed data for Landcover classification and surface parameter retrieval over Agricultural sites. In Proceedings of the 2011 IEEE International Geoscience and Remote Sensing Symposium, Vancouver, BC, Canada, 24–29 July 2011; pp. 142–145. [Google Scholar]
  28. Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
  29. Ali, J.; Khan, R.; Ahmad, N.; Maqsood, I. Random forests and decision trees. Int. J. Comput. Sci. Issues (IJCSI) 2012, 9, 272. [Google Scholar]
  30. Zhu, L.; Webb, G.I.; Yebra, M.; Scortechini, G.; Miller, L.; Petitjean, F. Live fuel moisture content estimation from MODIS: A deep learning approach. ISPRS J. Photogramm. Remote Sens. 2021, 179, 81–91. [Google Scholar] [CrossRef]
  31. Lerman, P. Fitting segmented regression models by grid search. J. R. Stat. Soc. Ser. C Appl. Stat. 1980, 29, 77–84. [Google Scholar] [CrossRef]
  32. Stigler, S.M. Regression towards the mean, historically considered. Stat. Methods Med. Res. 1997, 6, 103–114. [Google Scholar] [CrossRef] [PubMed]
  33. Li, P.; Li, D.; Li, W.; Gong, S.; Fu, Y.; Hospedales, T.M. A simple feature augmentation for domain generalization. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Montreal, BC, Canada, 11–17 October 2021; pp. 8886–8895. [Google Scholar]
  34. Solomon, S.; Plattner, G.-K.; Knutti, R.; Friedlingstein, P. Irreversible climate change due to carbon dioxide emissions. Proc. Natl. Acad. Sci. USA 2009, 106, 1704–1709. [Google Scholar] [CrossRef] [PubMed]
Figure 1. The spatial distribution of available records of energy consumption per unit GDP at the city (a) and county (b) scale, with A–F being the three northeastern provinces, the Beijing–Tianjin–Hebei region, the Chengyu city cluster, the central Yangtze river cluster, the Yangtze delta, and the Pearl delta, respectively.
Figure 1. The spatial distribution of available records of energy consumption per unit GDP at the city (a) and county (b) scale, with A–F being the three northeastern provinces, the Beijing–Tianjin–Hebei region, the Chengyu city cluster, the central Yangtze river cluster, the Yangtze delta, and the Pearl delta, respectively.
Remotesensing 16 00544 g001
Figure 2. The number of ECpGDP records from 1998 to 2019 at the city and county scales (a), and the distributions of calculated effective CO2 emissions at the city and county scales (b).
Figure 2. The number of ECpGDP records from 1998 to 2019 at the city and county scales (a), and the distributions of calculated effective CO2 emissions at the city and county scales (b).
Remotesensing 16 00544 g002
Figure 3. The spatial distribution of urbanization ratio at the city (a) and county (b) scales.
Figure 3. The spatial distribution of urbanization ratio at the city (a) and county (b) scales.
Remotesensing 16 00544 g003
Figure 4. The spatial distribution of population density at the city (a) and county (b) scales, with the unit being /Km2.
Figure 4. The spatial distribution of population density at the city (a) and county (b) scales, with the unit being /Km2.
Remotesensing 16 00544 g004
Figure 5. The proposed random forest models for effective CO2 emissions estimation.
Figure 5. The proposed random forest models for effective CO2 emissions estimation.
Remotesensing 16 00544 g005
Figure 6. True versus predicted values of effective CO2 emissions (Mt) (a) and group–specific bias and MAPE (b).
Figure 6. True versus predicted values of effective CO2 emissions (Mt) (a) and group–specific bias and MAPE (b).
Remotesensing 16 00544 g006
Figure 7. The city– and county–specific MAPEs at the city (a) and county (b) scales.
Figure 7. The city– and county–specific MAPEs at the city (a) and county (b) scales.
Remotesensing 16 00544 g007
Figure 8. The year–specific MAPE values of 2005–2019.
Figure 8. The year–specific MAPE values of 2005–2019.
Remotesensing 16 00544 g008
Figure 9. The effective CO2 emissions of each city in 1992, 2000, 2010, and 2021, with the unit being million tons, with A–F being the three northeastern provinces, the Beijing–Tianjin–Hebei region, the Chengyu city cluster, the central Yangtze river cluster, the Yangtze delta, and the Pearl delta, respectively.
Figure 9. The effective CO2 emissions of each city in 1992, 2000, 2010, and 2021, with the unit being million tons, with A–F being the three northeastern provinces, the Beijing–Tianjin–Hebei region, the Chengyu city cluster, the central Yangtze river cluster, the Yangtze delta, and the Pearl delta, respectively.
Remotesensing 16 00544 g009
Figure 10. The effective CO2 emissions of each county in 1992, 2000, 2010, and 2021, with the unit being million tons, with A–F being the three northeastern provinces, the Beijing–Tianjin–Hebei region, the Chengyu city cluster, the central Yangtze river cluster, the Yangtze delta, and the Pearl delta, respectively.
Figure 10. The effective CO2 emissions of each county in 1992, 2000, 2010, and 2021, with the unit being million tons, with A–F being the three northeastern provinces, the Beijing–Tianjin–Hebei region, the Chengyu city cluster, the central Yangtze river cluster, the Yangtze delta, and the Pearl delta, respectively.
Remotesensing 16 00544 g010
Figure 11. The estimation uncertainty of 1992 and 2021 at city (a,b) and county (c,d) scales, being the standard deviation of the five estimations.
Figure 11. The estimation uncertainty of 1992 and 2021 at city (a,b) and county (c,d) scales, being the standard deviation of the five estimations.
Remotesensing 16 00544 g011
Figure 12. Inter-comparison of the estimated effective CO2 emissions (green line) with the fuel-based CO2 emissions, calculated by the IPCC (blue line) between 1997 and 2019. The light green area in the figure represents the upper and lower bounds of the five estimated values.
Figure 12. Inter-comparison of the estimated effective CO2 emissions (green line) with the fuel-based CO2 emissions, calculated by the IPCC (blue line) between 1997 and 2019. The light green area in the figure represents the upper and lower bounds of the five estimated values.
Remotesensing 16 00544 g012
Table 1. Six focus areas of China in view of economy and urbanization.
Table 1. Six focus areas of China in view of economy and urbanization.
IDFocus AreaDescription
AThe three northeastern provincesTraditional old industrial base including Heilongjiang, Jilin, and Liaoning provinces
BThe Beijing–Tianjin–Hebei regionThe capital economic circle including Beijing, Tianjin, and Hebei provinces
CThe Chengyu city clusterThe major economic cities in western China including Chongqing and Sichuan provinces
DThe central Yangtze River clusterThe most dynamic cities in China including Hubei, Hunan, and Jiangxi provinces
EThe Yangtze deltaThe leading cities of the Yangtze River economic area including Shanghai, Jiangsu, Zhejiang, and Anhui provinces
FThe Pearl deltaEarly economically developed central cities including Guangzhou, Shenzhen, and other nine cities
Table 2. The variables contained in a sample (x) of the random forest model and the maximum and minimum values of each variable used in data normalization.
Table 2. The variables contained in a sample (x) of the random forest model and the maximum and minimum values of each variable used in data normalization.
Type (Source)Variable NameMinimumMaximumVariables
DMSP/OLS NPP/VIIRSType of sensors011
Administrative zone map of ChinaType of administrative divisions011
Administrative zone map of ChinaX[°]75.94127.961
Y[°]21.8449.661
A[km2]31.5471,0001
CCI and C3S landcover productsUrbanization ratio0.000.841
National censusesPopulation density[/km2]2.8112,596.651
DMSP/OLS NPP/VIIRSMean of DN values0.0337.361
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Li, Y.; Chen, Y.; Cai, Q.; Zhu, L. Calculation of CO2 Emissions from China at Regional Scales Using Remote Sensing Data. Remote Sens. 2024, 16, 544. https://doi.org/10.3390/rs16030544

AMA Style

Li Y, Chen Y, Cai Q, Zhu L. Calculation of CO2 Emissions from China at Regional Scales Using Remote Sensing Data. Remote Sensing. 2024; 16(3):544. https://doi.org/10.3390/rs16030544

Chicago/Turabian Style

Li, Yaqian, Yile Chen, Qi Cai, and Liujun Zhu. 2024. "Calculation of CO2 Emissions from China at Regional Scales Using Remote Sensing Data" Remote Sensing 16, no. 3: 544. https://doi.org/10.3390/rs16030544

APA Style

Li, Y., Chen, Y., Cai, Q., & Zhu, L. (2024). Calculation of CO2 Emissions from China at Regional Scales Using Remote Sensing Data. Remote Sensing, 16(3), 544. https://doi.org/10.3390/rs16030544

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop