Next Article in Journal
Analysis of the Relation Between Solar Activity and Parameters of the Sporadic E Layer
Previous Article in Journal
Contrasting Impacts of North Pacific and North Atlantic SST Anomalies on Summer Persistent Extreme Heat Events in Eastern China
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Reconstructing of Satellite-Derived CO2 Using Multiple Environmental Variables—A Case Study in the Provinces of Huai River Basin, China

School of Geographic Science and Planning, Huaiyin Normal University, Huai’an 223300, China
*
Author to whom correspondence should be addressed.
Atmosphere 2025, 16(8), 903; https://doi.org/10.3390/atmos16080903
Submission received: 13 June 2025 / Revised: 20 July 2025 / Accepted: 22 July 2025 / Published: 24 July 2025
(This article belongs to the Section Air Quality)

Abstract

The introduction of the ”dual carbon” target has increased the need for products that can accurately measure carbon dioxide levels, reflecting the rising demand. Due to challenges in achieving the required spatiotemporal resolution, accuracy, and spatial continuity with current carbon dioxide concentration products, it is essential to explore methods for obtaining carbon dioxide concentration products with completeness in space and time. Based on the 2018 OCO-2 carbon dioxide products and environmental variables such as vegetation coverage (FVC, LAI), net primary productivity (NPP), relative humidity (RH), evapotranspiration (ET), temperature (T) and wind (U, V), this study constructed a multiple regression model to obtain the spatial continuous carbon dioxide concentration products in the provinces of Huai River Basin. Using indicators such as correlation coefficient, root mean square error (RMSE), local variance, and percentage of valid pixels, the performance of model was validated. The validation results are shown as follows: (1) Among the selected environmental variables, the primary factors affecting the spatiotemporal distribution of carbon dioxide concentration are ET, LAI, FVC, NPP, T, U, and RH. (2) Compared with the OCO-2 carbon dioxide products, the percentage of valid pixels of the reconstructed carbon dioxide concentration data increased from less than 1% to over 90%. (3) The local variance in reconstructed data was significantly larger than that of original OCO-2 C O 2 products. (4) The average monthly RMSE is 2.69. Therefore, according to the model developed in this study, we can obtain a carbon dioxide concentration dataset that is spatially complete, meets precision requirements, and is rich in local detail information, which can better reflect the spatial pattern of carbon dioxide concentration and can be used to examine the carbon cycle between the terrestrial environment, biosphere, and atmosphere.

1. Introduction

The concentration of carbon dioxide in the atmosphere, accounting for approximately 0.04% of the total atmospheric composition [1], poses a significant threat as a greenhouse gas, ranking second only to water vapor. The concentration of carbon dioxide is 50% higher than that in the pre-industrial era, trapping heat in the atmosphere. Because carbon dioxide has a long lifetime in the atmosphere, temperatures will continue to increase in the coming years [2]. Since the reform and opening-up, China’s economic development has accelerated, and the country has now become the world’s second-largest economy, a leader in green economy technology, and has expanding global influence. In 2020, based on the inherent requirements of promoting sustainable development and the responsibility of building a community with a shared future for mankind, China announced the goal of carbon peaking and carbon neutrality [3]. To achieve the goal of “double carbon”, it is necessary to accurately monitor and evaluate carbon sources and sinks.
Currently, carbon dioxide data are obtained from ground-based, space-based observations and model simulations. It is difficult to describe the spatial distribution characteristics of regional carbon dioxide concentrations due to the scarcity of global ground-based sites. The reliability of the model simulation concentration is uncertain. Satellite observations have become one of the most effective means of monitoring global greenhouse gas emissions with high spatial and temporal resolution [4], and have improved human insight into the global distribution of carbon dioxide [5]. However, the spatial resolution of satellite products is relatively low, with spatial resolutions of 30 km, 10 km, 7 km, 2.5 km, 1.5 km, and 2.5 km for SCIAMACHY (Scanning Imaging Absorption Spectrometer for Atmospheric CHartographY, European Space Agency (ESA), Paris, France), GOSAT (Greenhouse Gases Observing Satellite), GOSAT-2, GOSAT-GW (Greenhouse Gases and Water Cycle Observation satellite), OCO-2, and TanSAT (Global Carbon Dioxide Monitoring Science Experiment Satellite), respectively [6]. The number of valid pixels is relatively low. In our study area, the proportion of the monthly average valid pixels of OCO-2 in 2018 was only 0.49%. The different satellite C O 2   products lack physical consistency owing to the differences in inversion algorithm. For example, the Differential Optical Absorption Spectroscopy (DOAS) algorithm, the Weighting Function Modified DOAS (WFM-DOAS) algorithm, and the Band-Enhanced Sensitivity Differential (BESD) algorithm used by SCIAMACHY, the ACOS (Atmospheric CO2 Observations from Space) XCO2 inversion algorithm used by TANSAT, and the combination of high-resolution spectroscopic measurements and advanced inversion techniques used by OCO-2.
Many studies have reconstructed and simulated regional or global atmospheric C O 2   concentration by simulating the relationship between atmospheric C O 2 concentration observed by satellites and environmental variables. To solve the problem of missing values in single-source satellite C O 2 products, the current main methods are the kriging spatiotemporal interpolation method [7,8,9], regression method [10], artificial neural network method [11], and high-precision surface modeling method [12]. Both kriging spatiotemporal interpolation and high-precision surface modeling methods are susceptible to the influence of the number and distribution of effective observations. The more effective measurement quantities there are and the more evenly the distribution is, the better the simulation effect is. In areas with sparse effective observations, the interpolation uncertainty was large. The interpolated data have strong continuity, but the data are smooth and lack local details. The current method for the fusion of multi-source C O 2 data is relatively simple, for example, the multi-source data averaging method [13,14]. When multi-source data are averaged based on a certain spatial resolution, the global spatial coverage of the fused dataset can be improved by 20% and the temporal resolution can be improved by two to three times. For example, several months of GOSAT and SCIAMACHY data were integrated, but there were still missing pixels and spatial discontinuity [13,14]. In the future, more ground-based observations are needed to integrate multi-source satellite-derived data, especially data from China’s Carbon Satellite (TANSAT) [15], to study the fusion methods and theories for multi-source products.
The spatial distribution of C O 2 was found to be related to the vegetation index, net primary productivity, leaf area index, atmospheric temperature, and wind [5,10,11,16]. Research results have shown that land surface parameters can be used to simulate atmospheric C O 2 concentrations [5]. Environmental variables, especially spatial pollution data, are closely related to human health. By linking them through advanced modeling, the motivation for improved C O 2 reconstruction is reinforced [17].
In the past 100 years, changes in vegetation and C O 2 concentration have led to a trend of aridity in some areas of East Asia, especially in the Huai River Basin, Shandong Peninsula, and Yunnan. Therefore, to resolve the problem of missing values in single-source satellite C O 2 products, in this study, a multiple environmental variable regression analysis method was used to reconstruct spatially continuous atmospheric C O 2 concentration data in the provinces of Huai River Basin, China in 2018. First, several environmental variables, such as vegetation coverage, relative humidity, evapotranspiration, temperature, and wind, were selected. The correlation between the OCO-2 C O 2 concentration and each environmental variable was evaluated. Variables with high correlation were selected, and collinearity analysis was conducted to determine the regression variables. The OCO-2 C O 2 concentration and environmental variables were resampled to a fine resolution, a regression model was constructed, and the spatially complete C O 2 concentration in the study area was obtained.
This study aimed to construct a regression model based on multiple environmental variables and obtain a spatially continuous monthly carbon dioxide concentration dataset for the study area in 2018. The remainder of this paper is organized as follows. The details of the datasets used in this study and the procedures for datasets preprocessing are provided in Section II. In this section, we also describe the regression model based on multiple environmental variables that was developed in this study. Section III demonstrates the implementation of the methods developed for the OCO-2 C O 2 products. The final section summarizes the conclusions of our study.

2. Material and Methods

2.1. Study Area

The study area includes five provinces in the Huai River Basin in eastern China, as shown in Figure 1. It is an important economic and agricultural region in China. These provinces include Shandong, Anhui, Jiangsu Henan, and Hubei, with a total area of approximately 270,000 square kilometers. The terrain is higher in the west and lower in the east. The west is mainly composed of mountainous and hilly areas, whereas the east is a vast plain. The Huai River Basin has a warm temperate monsoon climate. Here, summer is the primary rainy season and the season with the highest temperatures. The average annual precipitation in the basin is 880 mm, and precipitation increases from west to east. The average annual temperature in the basin ranges from 13.2 °C to 15.7 °C, with a high temperature in the south and low temperature in the north. The annual relative humidity ranges from 66% to 81%. The main crops are wheat, rice, corn, potato, soybean, cotton, and canola (http://www.mwr.gov.cn/szs/hl/201612/t20161222_776385.html, accessed on 17 February 2025). As an important agricultural and industrial area in China, the Huai River Basin has a significant impact on the country’s economic development and ecological balance.

2.2. Data and Preprocessing

The data used in this article include OCO-2 C O 2 products, vegetation coverage (FVC), leaf area index (LAI), net primary productivity (NPP), evapotranspiration (ET), temperature (T), relative humidity (RH) and wind (U, V) (Table 1).

2.2.1. OCO-2 C O 2 Products

The NASA OCO-2 satellite was launched in July 2014 and was equipped with three high-resolution spectrometers. It monitors sunlight reflected from the Earth’s surface in the 0.76 μm O 2 A-band and the 1.61 μm and 2.06 μm C O 2 bands to generate the column-averaged dry air mole fraction of C O 2 ( X C O 2 ) [18,19,20]. The OCO-2 observation data in this article are sourced from https://oCO2.gesdisc.eosdis.nasa.gov/data/OCO2_DATA/ (accessed on 13 January 2025) and the spatial resolution is 2.25 × 1.29 km, with a temporal resolution of 16 days. There were 351 scenes in the study area in 2018.

2.2.2. Environmental Variables

The concentration of carbon dioxide in the atmosphere is influenced by meteorological conditions, vegetation, and moisture levels, resulting in diverse and complex variability [21]. In the selection of environmental variables affecting the concentration of C O 2 , eight influencing factors in three categories of vegetation structure, vegetation function, and meteorological conditions were considered, including Fractional Vegetation Cover (FVC), Leaf Area Index (LAI), Net Primary Productivity (NPP), Evapotranspiration (ET), Air Temperature (T), Relative Humidity (RH), and 10-m wind components (U, V). The FVC, LAI, NPP, and ET data were obtained from the GLASS data website (http://www.glass.umd.edu/Download.html, accessed on 13 January 2025). The spatial resolution for FVC, LAI, and NPP was 500 m, while that for ET was 1000 m. The temporal resolution of all these data was 8 days. The data of air temperature at 2 m, relative humidity, and the eastward and northward components of the 10 m wind were obtained from the ERA5 reanalysis dataset (https://cds.climate.copernicus.eu/, accessed on 13 January 2025). The data format was NetCDF with a monthly temporal resolution and a spatial resolution of 0.25 degrees.

2.3. Data Preprocessing

2.3.1. C O 2 Data Preprocessing

First, the quality control of the data products was performed. High-quality C O 2 data with QF = 0 [22] were selected. GLT correction was then carried out, and it was calibrated to WGS84 coordinates with a spatial resolution of 0.01°, then cropped to the Huai River Basin, and the monthly data were generated by the maximum value synthesis method (MVC).

2.3.2. Environmental Variables Preprocessing

The MODIS Reprojection Tool (MRT) was used to batch-process FVC, LAI, NPP, and ET data on a monthly basis and project them onto the WGS84 coordinate system to ensure their suitability for analysis. The spatial resolution was resampled to 0.01° and the data were cropped accordingly.
The technical process is illustrated in Figure 2. Some of the processed data are presented in Figure 3.

2.4. Statistical Methods

In this section, the methodology used in this study is described. It mainly includes two methods: univariate regression analysis, which is used to describe the correlation between carbon dioxide concentration and individual environmental variable factors and multivariate regression analysis, which is used to obtain spatially complete fine-scale carbon dioxide concentration data.

2.4.1. Correlation Analysis Method

The Pearson correlation coefficient, denoted as r, quantifies the extent of the linear association between two continuous variables (Formula (1)). In this study, the correlation coefficient was employed to screen environmental factors.
r x y = i = 1 n ( x i x ¯ ) ( y i y ¯ ) i = 1 n ( x i x ¯ ) 2 i = 1 n ( y i y ¯ ) 2
where r represents the correlation coefficient between the two variables, and its value ranges from −1 to 1. When r > 0 , it indicates a positive correlation, that is, the two variables are correlated in the same direction; when r < 0 , it indicates a negative correlation, that is, the two variables are correlated in different directions. The closer the absolute value of r is to 1, the closer the relationship between the two variables is. The closer r is to 0, the less closely related are the two variables. When the absolute value of the correlation coefficient r is (0 0.1], (0.1 0.4], (0.4 0.6], (0.6 0.9], and (0.9 1], the correlation degree is very weak, weak, moderate, strong, and extremely strong, respectively [23].

2.4.2. Multiple Linear Regression Analysis

The multiple linear regression model is a statistical method used to study the linear relationship between two or more independent variables (explanatory variables) and one dependent variable (response variable) (Equation (2)). Its purpose is to predict or explain the variation in the dependent variables from the independent variables. In this study, environmental variables that exhibited a strong linear relationship with the dependent variable, C O 2 concentration, were selected to construct a spatiotemporal model of the C O 2 concentration in the Huai River Basin for the 12 months of 2018.
Y = β 0 + β i X i + ε ,     i = 1 , 2 , 3 n
Here, Y represents the dependent variable, which in this study is the C O 2 concentration; X i ( i = 1 , 2 , 3 n ) represents the independent variables, which in this study are the environmental variables; β i ( i = 1 , 2 , 3 n ) represents the regression coefficients, which are estimated using the method of least squares; ε represents the residual, which follows a normal distribution [22].

3. Results and Discussion

Based on the method suggested in the Section Material and Methods, the correlation between the OCO-2 carbon dioxide concentration and a single environmental variable was analyzed, and a multiple regression model of carbon dioxide concentration based on multiple environmental variables was constructed. A fine-scale carbon dioxide concentration product with spatial and temporal integrity was obtained.

3.1. Exploratory Data Analysis

MATLAB R2020a was used to read the C O 2 image data, create a validity mask to identify all non-zero and non-empty pixel values as valid pixels, filter out the data of each environmental variable at the positions of valid C O 2 pixels using a logical index, and calculate the monthly mean of the valid data. Table 2 shows the monthly mean OCO-2 C O 2 and environmental variables. Figure 4 shows the changing trend between OCO-2 C O 2 and the individual environmental variables. The OCO-2 C O 2 and environmental variables maintained similar changing trends. From April to July and from September to December, the two changed in opposite directions, except for the U and V.

3.2. Correlation Analysis

A correlation analysis was performed between the C O 2 concentration and eight environmental variables to explore the relationship between the CO2 concentration and variables. The results are presented in Table 3. The results show a significant linear relationship between FVC, LAI, T, U, and the C O 2 concentration. The linear relationship between ET, NPP, RH, and the C O 2 concentration was not obvious, but there was no correlation between V and the C O 2 concentration.

3.3. Multiple Linear Regression Model Construction

Before selecting independent variables whose absolute value of correlation coefficient with the dependent variable C O 2 concentration was greater than 0.4 (i.e., ET, FVC, LAI, R, T, and U) to participate in the construction of the multiple linear regression model, a collinear analysis was performed. The results are presented in Table 4. As shown in Table 4, there is severe collinearity between the independent variables FVC and LAI. Therefore, the FVC with a higher correlation with the C O 2 was selected to participate in the construction of the regression model.
Multiple linear regression analysis was performed on the readjustment variables. The coefficients of the model are presented in Table 5. The regression equation was found to be significant with an adjusted R2 of 0.404, indicating that the selected five independent variables explained 40.4% of the variation in C O 2 concentration. The Variance Inflation Factor (VIF) values were significantly reduced, indicating that there was no severe multicollinearity among the independent variables, and the regression results were reliable.
According to the results in Table 5, the multiple regression equation for the C O 2 concentration can be obtained (Formula (3)):
Y = 457.106 + 0.001 X 1 0.016 X 2 0.025 X 3 + 0.032 X 4 0.172 X 5 + 2.531 X 6
where Y is the concentration of C O 2 , X 1 , X 2 , X 3 , X 4 , X 5 , and X 6 are ET, FVC, NPP, RH, T, and U, respectively.

3.4. Comparative Analysis of Original OCO-2 Satellite C O 2 and Reconstructed C O 2 Data

According to Formula 3, based on the spatially complete satellite environmental variables FVC, T, U, ET, and RH, the spatially complete C O 2 concentration data with 1 km spatial resolution for 12 months in 2018 in the study area were obtained. The spatial distribution of the reconstruction data is shown in Figure 5.
There was significant regional and temporal heterogeneity in the distribution of reconstructed atmospheric C O 2 concentration. For regional heterogeneity, from January to December, there is a gradual increase from the southwest to the northeast. For temporal heterogeneity, the atmospheric C O 2 concentration in January, February, November, and December were markedly higher than those in other months, which was mainly due to the more concentrated land use and population activities and more frequent carbon emissions. Atmospheric C O 2 concentrations were lower in July, August, and September, when terrestrial vegetation activity began with photosynthesis, which fixed carbon from the atmosphere, and vegetation behaved as a carbon sink. Atmospheric C O 2 concentration in March, April, May, and June exhibited transitional characteristics.

3.4.1. Assessment of Spatial Completeness

A comparative analysis was performed on the reconstructed C O 2 concentration data and OCO-2 C O 2 concentration products, focusing specifically on spatial completeness. Spatial completeness was evaluated using the percentage of valid pixels relative to the total number of pixels. The results are presented in Table 6.
According to the results in Table 6, the reconstructed C O 2 concentration data were spatially continuous, and the number of valid pixels mostly exceeded 90%. For the original OCO-2 C O 2 concentration product, the proportion of valid pixels was mostly below 1%, which cannot reflect the spatial pattern of C O 2 in this region. However, the reconstructed C O 2 datasets obtained by the model proposed in this study were spatially continuous and complete and could reflect the spatiotemporal pattern of C O 2 concentration in the study area.

3.4.2. Assessment of Accuracy

Owing to the lack of ground-based reference data, in this study, 20% of the data were randomly selected from the matched data pairs with 1 km spatial resolution during modeling. That is, among the valid data pairs composed of OCO-2 C O 2 and environmental variables, 80% were randomly selected for constructing the regression model, and the remaining 20% were used for validation. Among the 20% validation data pairs, OCO-2 C O 2 was used as the reference data and combined with the reconstructed data to form data pairs. The mean bias ( B ¯ ), error standard deviation ( S t d . D e v ), root mean square error ( R M S E ), and correlation coefficient ( R ) were selected to evaluate the accuracy of the reconstructed data. Figure 6 shows the scatter graphs of the reconstructed and OCO-2 C O 2 data.
As shown in Figure 6, there is an underestimation phenomenon in the low-value area, whereas there is an overestimation phenomenon in the high-value area. However, from the mean bias in Table 7, it is only larger in January, February, and December and relatively small in other months. These months represent the winter season in the Huai River Basin, where increased human activities such as heating combined with weakened carbon sink by vegetation have led to a significant rise in atmospheric C O 2 concentrations. Both the error standard deviation and root mean square error were relatively small. The above results indicate that the model estimation is distributed around the mean, with a low degree of dispersion, stable data, and low volatility.
In relevant research, ref. [5] used an extreme random tree (ERT) model based on OCO-2 C O 2 and environmental factors to obtain reconstructed atmospheric C O 2 data (0.01° and 8-day resolution) at the global scale. The accuracy of the ERT model for the validation of the ground stations of the global continental-scale atmospheric C O 2 reconstruction was assessed. The RMSE at all stations ranged from 0.9 to 3.12. Ref. [10] used a regression model with variables including GOSAT C O 2 data, temperature, vegetation cover, and productivity from MODIS products to assess C O 2 concentrations on a global scale. The model was verified using the GOSAT C O 2 concentration as the true value. The results showed that the accuracy throughout the world is between −2.56 and 3.14 ppm. [11] used artificial neural networks to assess the spatial distribution of atmospheric C O 2 concentrations over Iran during the growing season (April−September) in 2015. The modeling data included OCO-2 C O 2 data and eight environmental variables. The final ANN model with the highest performance (R2) and lowest error (RMSE) for each month was selected. The range of RMSE was from 1.11 to 1.39.
Therefore, the values of the R2 and RMSE indices in this study indicate the good performance of the multiple regression model for monthly models.

3.4.3. Assessment of Spatial Pattern

Using the local variance suggested by Li et al. [24], we evaluated the ability of the reconstruction data to maintain spatial patterns at the fine scale. Local variance is a scene texture statistic that characterizes the relationship between the spatial resolution and objects in the scene [25,26]. Therefore, local variance can express the pattern information in the scene and can be used to describe the information richness of the reconstruction data. A high local variance indicates that the data have a large variability and fine patterns. The larger the local variance is, the finer the spatial pattern lies in reconstructed images [24]. The local variance is defined as Equation (3).
L V a r = j 1 N ( 1 n 1 i = 1 n ( X i j X ̿ J ) 2 ) N
where L V a r is the local variance, X i j denotes the ith pixel in the jth window, X ̿ J denotes the mean value of the jth window, n denotes the number of valid C O 2 pixels in a moving window, and N denotes the number of windows.
We chose a 7 × 6 window and calculated the local variance. Figure 7, Table 8 and Table 9 present the monthly local variances in the reconstructed C O 2 and original OCO-2 C O 2 .
From Figure 6, Table 8 and Table 9, we can see that the monthly mean values of the reconstructed C O 2 data are similar to those of the original OCO-2 C O 2 data, showing strong consistency. The local variance in the reconstructed C O 2 data was significantly higher than that of the original OCO-2 C O 2 data, indicating that the reconstructed C O 2 data contained richer detailed information and maintained the spatial pattern well at a fine scale.

3.4.4. Uncertainty Analysis

In this study, we proposed a regression framework with multiple environmental variables for C O 2 retrieval in the provinces of the Huai River Basin by optimizing the selection of the input features. The results of the retrieved accuracy and reconstructed C O 2 data proved our method to be high-precision and credible in the study area. However, this study has some uncertainties and limitations. First, because the model parameters were trained using the samples in the provinces of the Huai River Basin, the retrieval accuracy may decrease with the absence of land cover features if the trained model is directly applied to other study areas. However, the proposed framework—including the selection of models, the strategy of model construction, and feature optimization—remains effective for obtaining spatially completed C O 2 in other study areas. The corresponding study area data can be used to retrain and refresh the model using the same framework. Second, further opportunities are available to consider land cover, quadratic polynomial simulation, and machine learning in future research. In this study, the land cover factor was not considered because if partitioned by land cover type, the number of effective OCO-2 C O 2 data in partial land cover types was too small to be sufficient for modeling. In the next stage of this research, we will expand the study area and adopt machine learning and multiple regression analysis methods for zonal modeling based on land cover type. A larger study area and in situ C O 2 measurements will be considered and collected to optimize the model, further improving its stability and applicability in different areas.

4. Conclusions

In this study, we constructed a multiple regression model based on the OCO-2 C O 2 concentration products and multiple environmental variables, namely LAI, FVC, NPP, T, U, ET, and RH, to reconstruct the spatially complete monthly C O 2 concentration data with fine resolution in the provinces in Huai River Basin in 2018. Among the multiple environmental variables, LAI, FVC, T, U, ET, NPP, and RH were strongly correlated with the C O 2 concentration, whereas V was weakly correlated. These enhanced data better reflect the spatiotemporal patterns of C O 2 in the study region. The proportion of valid pixels increased from less than 1% to over 90%, achieving full coverage of valid pixels in the study region. The reconstructed C O 2 concentration data of the Huai River Basin have better spatial completeness. In addition to the advantage of spatial coverage, it can be seen from the comparison results of local variances that the local variance in the reconstructed dataset is significantly higher than that of the original data, indicating that the reconstructed dataset has more local detailed information.
The reconstructed atmospheric C O 2 concentration showed an increasing trend from summer (June, July, and August) to winter (November, December, and January). Based on the proportion of valid pixels, local variance, and reference data, the spatial integrity, accuracy, and spatial structure of the reconstructed dataset were evaluated. The results show the value of using satellite-driven C O 2 observations by extending discrete satellite observation of atmospheric C O 2 concentrations to spatiotemporally continuous C O 2 datasets. The regional atmospheric C O 2 concentration map produced in this study can serve as a baseline map for studying regional climate change and the carbon cycle in terrestrial ecosystems. Future work will focus on using machine learning, artificial intelligence, and multisource remote sensing data to estimate a spatiotemporal complete C O 2 concentration dataset with a high spatiotemporal resolution.

Author Contributions

Conceptualization, Y.Z. (Yuxin Zhu); Methodology, Y.Z. (Ying Zhang); Validation, L.Z.; Writing—review & editing, J.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Innovation and entrepreneurship training program for college students of Jiangsu Province (NO. 202210323050Y) and the Natural Science Foundation of China (No. 41271347).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The data used in this paper can be obtained by sending an email to the first author.

Acknowledgments

This work was supported by the Innovation and entrepreneurship training program for college students of Jiangsu Province (202210323050Y) and the Natural Science Foundation of China (No. 41271347). We would like to express our sincere gratitude to the people who provided excellent comments and valuable suggestions in the preparation of this manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Brethomé, F.M.; Williams, N.J.; Seipp, C.A.; Kidder, M.K.; Custelcean, R. Direct air capture of CO2 via aqueous-phase absorption and crystalline-phase release using concentrated solar power. Nat. Energy 2018, 3, 553–559. [Google Scholar] [CrossRef]
  2. World Meteorological Organization. Climate Change Indicators Reached Record Levels in 2023; World Meteorological Organization: Geneva, Switzerland, 2024. [Google Scholar]
  3. Bai, Y.; Lu, N.; Li, S. Background, challenges, opportunities and implementation paths of the dual carbon goals. China Econ. Rev. 2021, 5, 10–13. [Google Scholar]
  4. Chiba, T.; Haga, Y.; Inoue, M.; Kiguchi, O.; Nagayoshi, T.; Madokoro, H.; Morino, I. Measuring Regional Atmospheric CO2 Concentrations in the Lower Troposphere with a Non-Dispersive Infrared Analyzer Mounted on a UAV, Ogata Village, Akita, Japan. Atmosphere 2019, 10, 487. [Google Scholar] [CrossRef]
  5. Li, J.; Jia, K.; Wei, X.; Xia, M.; Chen, Z.; Yao, Y.; Zhang, X.; Jiang, H.; Yuan, B.; Tao, G.; et al. High-spatiotemporal resolution mapping of spatiotemporally continuous atmospheric CO2 concentrations over the global continent. Int. J. Appl. Earth Obs. Geoinf. 2022, 108, 102743. [Google Scholar] [CrossRef]
  6. Hu, K.; Liu, Z.; Shao, P.; Ma, K.; Xu, Y.; Wang, S.; Wang, Y.; Wang, H.; Di, L.; Xia, M. A Review of Satellite-Based CO2 Data Reconstruction Studies: Methodologies, Challenges, and Advances. Remote Sens. 2024, 16, 3818. [Google Scholar] [CrossRef]
  7. Tomosada, M.; Kanefuji, K.; Matsumoto, Y.; Tsubaki, H. A prediction method of the global distribution map of CO2 column abundance retrieved from GOSAT observation derived from ordinary Kriging. In Proceedings of the ICCAS-SICE, Fukuoka, Japan, 18–21 August 2009; pp. 4869–4873. [Google Scholar]
  8. Hammerling, D.M.; Michalak, A.M.; Kawa, S.R. Mapping of CO2 at high spatiotemporal resolution using satellite observations: Global distributions from OCO-2. J. Geophys. Res. Atmos. 2012, 117, 10. [Google Scholar] [CrossRef]
  9. Hammerling, D.M.; Michalak, A.M.; O’Dell, C.; Kawa, S.R. Global CO2 distributions over land from the Greenhouse Gases Observing Satellite (GOSAT). Geophys. Res. Lett. 2012, 39, L08804. [Google Scholar] [CrossRef]
  10. Guo, M.; Wang, X.; Li, J.; Yi, K.; Zhong, G.; Tani, H. Assessment of Global Carbon Dioxide Concentration Using MODIS and GOSAT Data. Sensors 2012, 12, 16368–16389. [Google Scholar] [CrossRef]
  11. Siabi, Z.; Falahatkar, S.; Alavi, S.J. Spatial distribution of XCO2 using OCO-2 data in growing seasons. J. Environ. Manag. 2019, 244, 110–118. [Google Scholar] [CrossRef]
  12. Zhang, L.; Zhao, M.; Zhao, N.; Yue, T. Modeling the Spatial Distribution of XCO2 with High Accuracy Based on OCO-2’s Observations. J. Geo-Inf. Sci. 2018, 20, 1316–1326. [Google Scholar]
  13. Jing, Y.; Shi, J.; Wang, T. Toward accurate XCO2 level 2 measurements by combining different CO2 retrievals from gosat and sciamachy. In Proceedings of the International Workshop on Earth Observation & Remote Sensing Applications, Changsha, China, 11–14 June 2014; p. 5. [Google Scholar]
  14. Wang, T.; Shi, J.; Jing, Y.; Zhao, T.; Ji, D.; Xiong, C. Combining XCO2 Measurements Derived from SCIAMACHY and GOSAT for Potentially Generating Global CO2 Maps with High Spatiotemporal Resolution. PLoS ONE 2014, 9, e0148152. [Google Scholar] [CrossRef]
  15. Wang, J.; Liu, Y.; Yang, D. Exploring the distribution of carbon sinks in my country: Starting from atmospheric CO2 detection. Sci. Bull. 2021, 66, 709–710. [Google Scholar] [CrossRef]
  16. Guo, M.; Xu, J.; Wang, X.; He, H.; Li, J.; Wu, L. Estimating CO2 concentration during the growing season from MODIS and GOSAT in East Asia. Int. J. Remote Sens. 2015, 36, 4363–4383. [Google Scholar] [CrossRef]
  17. Lotrecchiano, N.; Montano, L.; Bonapace, I.M.; Giancarlo, T.; Trucillo, P.; Sofia, D. Comparison Process of Blood Heavy Metals Absorption Linked to Measured Air Quality Data in Areas with High and Low Environmental Impact. Processes 2022, 10, 1409. [Google Scholar] [CrossRef]
  18. Liang, A.; Gong, W.; Han, G.; Xiang, C. Comparison of Satellite-Observed XCO2 from GOSAT, OCO-2, and Ground-Based TCCON. Remote Sens. 2017, 9, 1033. [Google Scholar] [CrossRef]
  19. Crisp, D.; Atlas, R.M.; Breon, F.-M.; Brown, L.R.; Burrows, J.P.; Ciais, P.; Connor, B.J.; Doney, S.C.; Fung, I.Y.; Jacob, D.J.; et al. The Orbiting Carbon Observatory (OCO) mission. Adv. Space Res. 2004, 34, 700–709. [Google Scholar] [CrossRef]
  20. Hakkarainen, J.; Ialongo, I.; Tamminen, J. Direct space-based observations of anthropogenic CO2 emission areas from OCO-2. Geophys. Res. Lett. 2016, 43, 11,400–11,406. [Google Scholar] [CrossRef]
  21. Liu, Q.; Wu, S.; Lei, Y.; Li, S.; Li, L. Exploring spatial characteristics of city-level CO2 emissions in China and their influencing factors from global and local perspectives. Sci. Total Environ. 2021, 754, 142206. [Google Scholar] [CrossRef] [PubMed]
  22. Yu, H.; Su, Z.; Zhu, P.; Chen, Y.; Yang, Q.; Zhao, Z. Relationship between Cd contents in rice or wheat and soil: Insight from a simulation study. Geosci. Front. 2021, 28, 438–445. [Google Scholar] [CrossRef]
  23. Chen, Y.; Lv, P.; Cao, M.; Xia, Z.; Ma, F.; Yu, J. Microclimate characteristics of Tengger Desert lakes and its response to climate change. J. Desert Res. 2024, 44, 231–238. [Google Scholar]
  24. Li, A.; Bo, Y.; Zhu, Y.; Guo, P.; Bi, J.; He, Y. Blending multi-resolution satellite sea surface temperature (SST) products using Bayesian maximum entropy method. Remote Sens. Environ. 2013, 135, 52–63. [Google Scholar] [CrossRef]
  25. Coops, N.; Culvenor, D. Utilizing local variance of simulated high spatial resolution imagery to predict spatial pattern of forest stands. Remote Sens. Environ. 2000, 71, 248–260. [Google Scholar] [CrossRef]
  26. Woodcock, C.E.; Strahler, A.H. The factor of scale in remote sensing. Remote Sens. Environ. 1987, 21, 311–332. [Google Scholar] [CrossRef]
Figure 1. Location and spatial distribution of elevation, land cover of the study area ((a): location; (b): elevation range; (c): land cover).
Figure 1. Location and spatial distribution of elevation, land cover of the study area ((a): location; (b): elevation range; (c): land cover).
Atmosphere 16 00903 g001
Figure 2. Flowchart of the proposed downscaling framework for generating spatiotemporally complete fine-scale C O 2 .
Figure 2. Flowchart of the proposed downscaling framework for generating spatiotemporally complete fine-scale C O 2 .
Atmosphere 16 00903 g002
Figure 3. Spatial pattern of a portion of the processed OCO-2 C O 2 and environmental variables data ((a): OCO-CO2; (b): ET; (c): FVC; (d): LAI; (e): NPP; (f): RH; (g): T; (h): U; (i): V).
Figure 3. Spatial pattern of a portion of the processed OCO-2 C O 2 and environmental variables data ((a): OCO-CO2; (b): ET; (c): FVC; (d): LAI; (e): NPP; (f): RH; (g): T; (h): U; (i): V).
Atmosphere 16 00903 g003
Figure 4. Changing Trend between the OCO−2 C O 2 and environmental variables ((a): OCO−2 C O 2 and ET; (b): OCO−2 C O 2 and FVC; (c): OCO−2 C O 2 and LAI; (d): OCO−2 C O 2 and NPP; (e): OCO−2 C O 2 and RH; (f): OCO−2 C O 2 and T; (g): OCO−2 C O 2 and U; (h): OCO−2 C O 2 and V).
Figure 4. Changing Trend between the OCO−2 C O 2 and environmental variables ((a): OCO−2 C O 2 and ET; (b): OCO−2 C O 2 and FVC; (c): OCO−2 C O 2 and LAI; (d): OCO−2 C O 2 and NPP; (e): OCO−2 C O 2 and RH; (f): OCO−2 C O 2 and T; (g): OCO−2 C O 2 and U; (h): OCO−2 C O 2 and V).
Atmosphere 16 00903 g004
Figure 5. Spatial pattern of monthly reconstructed C O 2 concentration at fine-scale in 2018 ((a): January.; (b): February; (c): March; (d): April; (e): May; (f): June; (g): July; (h): August; (i): September; (j): October; (k): November; (l): December).
Figure 5. Spatial pattern of monthly reconstructed C O 2 concentration at fine-scale in 2018 ((a): January.; (b): February; (c): March; (d): April; (e): May; (f): June; (g): July; (h): August; (i): September; (j): October; (k): November; (l): December).
Atmosphere 16 00903 g005
Figure 6. Scatter plot between the original OCO-2 C O 2 and reconstructed data.
Figure 6. Scatter plot between the original OCO-2 C O 2 and reconstructed data.
Atmosphere 16 00903 g006
Figure 7. Comparison of mean and local variance between reconstructed C O 2 data and OCO-2 C O 2 .
Figure 7. Comparison of mean and local variance between reconstructed C O 2 data and OCO-2 C O 2 .
Atmosphere 16 00903 g007
Table 1. OCO-2  C O 2 and environmental variables used in this study.
Table 1. OCO-2  C O 2 and environmental variables used in this study.
DataSpatial ResolutionTime ResolutionRange or Standard Mosaic
(Horizontal, Vertical)
Amount
OCO-2 C O 2 2.25 × 1.29 km16 dglobal351
Vegetation coverage500 m8 dh27v05, h27v06, h28v05, h28v06184
Leaf area index500 m8 dh27v05, h27v06, h28v05, h28v06184
Net primary productivity500 m8 dh27v05, h27v06, h28v05, h28v06184
Transpiration1000 m8 dh27v05, h27v06, h28v05, h28v06184
Temperature0.25°30 d108~123° E, 29~39° N12
Relative humidity0.25°30 d107~124° E, 28~39° N12
Wind0.25°30 d108~123° E, 29~39° N12
Table 2. Monthly mean values of OCO-2 C O 2 concentration and environmental variables.
Table 2. Monthly mean values of OCO-2 C O 2 concentration and environmental variables.
Month X C O 2 (ppm)ET (mm)FVC (%)LAI (m2)NPP (gC/m2)RH (%)T (K)U (m/s)V (m/s)
1409.021508.335.445.6860.7359.94273.42−0.24−0.78
2410.543438.642.57.4880.8350.19276.35−0.08−0.03
3410.855940.976.9812.71159.4863.45283.96−0.780.38
4410.028487.4113.1615.63207.160.07290.54−0.280.29
5410.818022.279.9914.8178.0768.29294.52−0.250.51
6408.419652.592.5815.6216.7565.43299.09−0.581.26
7407.3811,080173.8328.05267.3478.02302.14−0.960.54
8403.4710,163166.0625.15255.8475.41300.9−1.36−0.55
9404.874931.178.8812.23252.662.15295.31−0.44−0.54
10408.253864.255.949.1105.2948.2289.11−0.34−0.18
11409.162314.561.487.0971.5269.02283.68−0.29−0.59
12412.011433.155.946.4427.6756.23275.3−0.47−1.13
Table 3. Correlation analysis results between C O 2 concentration and environmental variables.
Table 3. Correlation analysis results between C O 2 concentration and environmental variables.
C O 2 ETFVCLAINPPRHTUV
C O 2 1
ET−0.415 1
FVC−0.566 0.875 1
LAI−0.553 0.926 0.978 1
NPP−0.658 0.890 0.821 0.863 1
RH−0.446 0.659 0.774 0.764 0.629 1
T−0.641 0.889 0.815 0.855 0.918 0.650 1
U0.604 −0.623 −0.812 −0.790 −0.605 −0.683 −0.601 1
V0.102 0.707 0.331 0.451 0.528 0.240 0.550 −0.084 1
Table 4. Results of multiple linear regression and collinearity analyses.
Table 4. Results of multiple linear regression and collinearity analyses.
BβtpToleranceVIFR2Adjusted R2
Constant457.154 10.3520 0.7720.372
ET0.0011.4511.7020.1640.07912.734
FVC−0.005−0.095−0.0760.9430.03627.794
LAI−0.109−0.311−0.1860.8620.0248.918
RH−0.025−0.832−1.2450.2810.1287.817
T0.0340.120.3120.7710.3822.615
U−0.172−0.695−1.050.3530.137.682
Dependent variable: C O 2
Table 5. Multiple linear regression results after adjusting for variables.
Table 5. Multiple linear regression results after adjusting for variables.
BβtpToleranceVIFR2Adjusted R2
Constant457.106 11.5240 0.7700.493
ET0.0011.3572.20.0790.1218.253
FVC−0.016−0.284−0.4250.6890.1039.727
NPP−0.025−0.846−1.4190.2150.137.716
RH0.0320.1130.3280.7560.3862.589
T−0.172−0.693−1.1650.2970.137.679
U2.5310.3670.9420.390.3023.307
Dependent variable: C O 2
Table 6. Comparison analysis results of spatial integrity between OCO-2 C O 2 and reconstructed C O 2 data.
Table 6. Comparison analysis results of spatial integrity between OCO-2 C O 2 and reconstructed C O 2 data.
MonthTotal Number of PixelsOCO-2 C O 2 Reconstructed C O 2
Number of Valid Pixels Proportion of Valid PixelsNumber of Valid PixelsProportion of Valid Pixels
1250,161 11700.47%217,84287.08%
2250,161 24070.96%229,07591.57%
3250,161 15750.63%237,69795.02%
4250,161 6810.27%241,47196.53%
5250,161 2770.11%241,60296.58%
6250,161 4760.19%242,55196.96%
7250,161 3780.15%243,15897.20%
8250,161 8870.35%242,91697.10%
9250,161 9330.37%242,07896.77%
10250,161 38101.52%239,92895.91%
11250,161 15000.60%235,60094.18%
12250,161 6620.26%226,58090.57%
Table 7. Accuracy verification results.
Table 7. Accuracy verification results.
Month123456789101112
B ¯ −2.110−1.0201.8750.267−0.453−0.3700.573−2.585−0.031−0.1750.0651.960
S t d . D e v 2.0662.1832.2622.2972.3862.6643.0602.7502.0752.3791.9532.784
R M S E 2.9492.4072.9342.3042.4032.6743.0973.7682.0702.3841.9513.396
Table 8. Mean value and local variance of the original OCO-2 C O 2 data.
Table 8. Mean value and local variance of the original OCO-2 C O 2 data.
Month123456789101112
m e a n 409.01410.55410.85410.03410.8408.46407.3403.37404.82408.24409.18411.99
L V a r 0.460.360.460.440.350.270.460.20.380.560.470.44
Table 9. Mean value and local variance of the reconstructed C O 2 data.
Table 9. Mean value and local variance of the reconstructed C O 2 data.
Month123456789101112
m e a n 410.92411.03408.6409.06408.87408.19406.43405.71404.88408.38408.78409.99
L V a r 0.310.590.490.280.581.471.731.091.211.311.191.13
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zhu, Y.; Zhang, Y.; Zhu, L.; Zhang, J. Reconstructing of Satellite-Derived CO2 Using Multiple Environmental Variables—A Case Study in the Provinces of Huai River Basin, China. Atmosphere 2025, 16, 903. https://doi.org/10.3390/atmos16080903

AMA Style

Zhu Y, Zhang Y, Zhu L, Zhang J. Reconstructing of Satellite-Derived CO2 Using Multiple Environmental Variables—A Case Study in the Provinces of Huai River Basin, China. Atmosphere. 2025; 16(8):903. https://doi.org/10.3390/atmos16080903

Chicago/Turabian Style

Zhu, Yuxin, Ying Zhang, Linping Zhu, and Jinzong Zhang. 2025. "Reconstructing of Satellite-Derived CO2 Using Multiple Environmental Variables—A Case Study in the Provinces of Huai River Basin, China" Atmosphere 16, no. 8: 903. https://doi.org/10.3390/atmos16080903

APA Style

Zhu, Y., Zhang, Y., Zhu, L., & Zhang, J. (2025). Reconstructing of Satellite-Derived CO2 Using Multiple Environmental Variables—A Case Study in the Provinces of Huai River Basin, China. Atmosphere, 16(8), 903. https://doi.org/10.3390/atmos16080903

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop