Submit to this Journal Review for this Journal Propose a Special Issue

Article Menu

Share Help Cite Discuss in SciProfiles

Open AccessData Descriptor

Peer-Review Record

Terrestrial Carbon Storage Estimation in Guangdong Province (2000–2021)

Data 2025, 10(4), 41; https://doi.org/10.3390/data10040041

by Wei Wang^1,2,3, Yueming Hu⁴, Xiaoyun Mao^1,*, Ying Zhang⁵

, Liangbo Tang^2,3 and Junxing Cai^2,3

Reviewer 1: Anonymous

Reviewer 2: Anonymous

Data 2025, 10(4), 41; https://doi.org/10.3390/data10040041

Submission received: 8 February 2025 / Revised: 14 March 2025 / Accepted: 20 March 2025 / Published: 25 March 2025

(This article belongs to the Section Spatial Data Science for Environment and Earth)

Round 1

Reviewer 1 Report

Comments and Suggestions for Authors

Data that combine remote sensing technology with extensive field data to calculate carbon storage for a wide range of regions and long periods of time are very beneficial. However, carbon storage is a field that has been studied for a long time and is calculated by country through IPCC guidelines. Therefore, the data in this paper do not seem original. Also, it seems that the criteria for simulating carbon storage by vegetation type should be clear.

Comments for author File: Comments.pdf

Author Response

Comments 1：Which data (field sampling data) ?

Response 1: Field sampling data (Soil sampling data and vegetation field survey data).

Comments 2：Soil sampling was done at 0-30 cm, to what depth of soil did the carbon amount be calculated? Since the effective depth of soil varies depending on the vegetation type, applying the same depth will cause errors(Soil Samples: 2,316 sites).

Response 2: Soil Samples: 2,316 sites (Soil sampling depths were set at 50 cm for croplands and 30 cm for forest/grassland ecosystems, with soil organic carbon content calculated based on the corresponding sampling depths, organic carbon measured via dry combustion).

Comments 3: What does "abovegrown biomass data" mean? Was abovegrown carbon strorage calculated in consideration of the stem volume, wood density, and carbon concentration suggested in the IPCC guidelines (Vegetation Samples: 1,264 sites (aboveground biomass))?

Response 3: Vegetation Samples: 1,264 sites (Carbon storage was quantified through the integrative calculation of stem volume, wood density, and carbon concentration following the methodological protocols outlined in the IPCC Guidelines for National Greenhouse Gas Inventories).

Author Response File: Author Response.pdf

Reviewer 2 Report

Comments and Suggestions for Authors

Data Description:

The data are estimates of carbon storage in Guangdong Province, China, derived using a convolutional neural network (CNN) model. The data integrates remote sensing indices, meteorological, land use/cover, soil/vegetation types, and field sampling data. The sources for land use/cover, remote sensing indices, meteorological data, and soil/vegetation data are well-defined and described.

The methodologies for data collection are included. In general the processing steps for land use/cover data, remote sensing indices and meteorological data are described. Field sampling data were filtered using ArcGIS geostatistical tools to remove outliers. The preprocessing steps to homogenize the different scales (including spatial resolution) and format of the data are missing.

The document provides details on temporal and geographic coverage, data format, and data sources- therefore the metadata can accurately describe the data

Data Quality:

The dataset was generated using a CNN model, which is a deep learning architecture suitable for processing grid-structured data

Quality control measures were not employed at the final product except for analysis during preprocessing (i.e. outlier removal, Thiessen polygon analysis for spatial representativeness)

The latter analysis (i.e. “validation through Thiessen polygon) is not clear

The submisison does not explicitly describe possible sources of error and noise

Data Access, Archiving, and Metadata:

According to the authors the data will be available in GeoTIFF (raster), CSV (tabular), and Shapefile (vector) formats. The dataset license is specified as Creative Commons Attribution 4.0 International. The dataset has a DOI number (10.5281/zenodo.14835471)-I checked the link and it was not possible to access data

Comments on the Quality of English Language

The quality is fine. But the description of specific steps of the analysis (pls check comments) must be improved

Author Response

Comments 1:The preprocessing steps to homogenize the different scales (including spatial resolution) and format of the data are missing.

Response 1:All raster datasets, including LUC, VEG, SOIL, TEMP, PRE,RESI, NPP, NDVI, EVI were reprojected to WGS_1984_UTM_Zone_49N (EPSG:32649) coordinate system us-ing bilinear resampling. The spatial resolution was standardized to 500 meters through pixel aggregation, with all outputs exported in GeoTIFF format compliant with ISO 19115-2 geospatial metadata standards.

Comments 2：Quality control measures were not employed at the final product except for analysis during preprocessing (i.e. outlier removal, Thiessen polygon analysis for spatial representativeness)。

Response 2：We rigorously implemented a three-phase quality control protocol spanning preprocessing, model training, and post-processing stages, as detailed below:

Preprocessing Quality Assurance

Spatiotemporal Consistency：Coordinate unification: All datasets were reprojected to WGS_1984_UTM_Zone_49N (EPSG:32649) using ARCGIS.

Temporal harmonization: Carbon factors were cross-calibrated using land cover.

Data Sanitization.

Process	Parameters	Outcome
Outlier removal	±3σ threshold + Moran's I spatial autocorrelation filter	7.2% samples excluded
Spatial representativeness	Thiessen polygon analysis with 15km maximum void radius	92.3% coverage of ecological zones

Model Training Verification

Architectural Controls：Kernel regularization: L2 penalty (λ=0.001) applied to convolutional filters.Spectral normalization: Layer-wise gradient stabilization using SN-GANs technique.

Comments 3:The latter analysis (i.e. “validation through Thiessen polygon) is not clear。

Response 3:The spatial representativeness assessment comprised:

Voronoi Tessellation: Generated 2,357 polygons covering 179,800 km²

Coverage Validation:93% of polygons < 2×mean nearest neighbor distance (8.7 km)

Jaccard similarity index: 0.86 vs. ecological zoning map

Edge Buffer Treatment: 20km exclusion zone along provincial boundaries.

Comments 4:The submisison does not explicitly describe possible sources of error and noise

Response 4:Post-processing Uncertainty Quantification

Source	Contribution (%)	Mitigation
Input data noise	38.7	Gaussian Process regression imputation
Model structure bias	29.1	Ensemble learning with 3 CNN variants
Spatial extrapolation	32.2	Geographically weighted error correction field

Comments 5: The dataset has a DOI number (10.5281/zenodo.14835471)-I checked the link and it was not possible to access data

Response 5:The dataset has been comprehensively updated to Version 2.0 (

https://doi.org/10.5281/zenodo.15013461，2025-03-12)

Author Response File: Author Response.pdf

Round 2

Reviewer 2 Report

Comments and Suggestions for Authors

Thank for the detailed response provided. Nice contribution

Article Menu

Terrestrial Carbon Storage Estimation in Guangdong Province (2000–2021)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI