1. Introduction
Forests are a vital component of terrestrial ecosystems, playing a critical role in climate regulation, water conservation, and soil erosion control [
1]. In the context of global warming, forest ecosystems are essential for mitigating climate change through the sequestration of greenhouse gases such as carbon dioxide (CO
2), thereby contributing to the achievement of carbon neutrality [
2]. Reliable assessments of forest resources are fundamental to effective management, requiring accurate measurements of structural attributes to evaluate ecosystem status and dynamics [
3]. Conventional field-based survey methods are often time-consuming, spatially limited, and constrained in precision, making it challenging to generate consistent and reliable large-scale data [
4]. Such baseline information is indispensable for understanding forest ecosystem functions and services, supporting long-term monitoring and the sustainable utilization of forest carbon stocks—objectives of particular significance in advancing China’s dual carbon goals (carbon peak by 2030 and carbon neutrality by 2060). These efforts not only strengthen national climate action but also foster international collaboration and contribute to the global advancement of climate change mitigation strategies.
Thanks to advances in remote sensing—especially spaceborne lidar missions such as GEDI and ICESat-2—wall-to-wall, high-precision mapping of forest height is now achievable [
5]. Potapov et al. developed a global 30 m resolution forest height dataset by integrating GEDI canopy height measurements with Landsat-8 OLI data using bagged regression trees, achieving RMSE values ranging from 6.6 to 9.07 m [
6]. Wang et al. fused NASA’s GEDI sensor and the ICESat-2 footprint data with Sentinel-1/2 imagery, UAVSAR observations, and terrain variables, attaining RMSE values between 4.85 m and 5.13 m [
7]. Liu et al. calibrated ICESat-2 and GEDI height estimates using airborne LiDAR, then integrated them with Sentinel-1/2 data and Digital Elevation Modelderived variables into an AutoGluon-based ensemble model, achieving a validation accuracy of 3.72 m [
8]. Tong et al. combined PolSAR volume scattering parameters with GEDI data through a hybrid RVoG-based machine-learning framework, significantly improving estimation accuracy [
9]. Lang et al. employed a Bayesian deep convolutional neural network utilizing GEDI L1B waveform data to generate a global canopy height map with RMSE values of 2.7–4.4 m [
10]. A multimodal attention-based deep-learning framework named MARSNet was proposed by Chen et al [
11]. By combining GEDI, Sentinel-1/2, and ALOS-2 PALSAR-2 data, it generates a canopy height map for Jilin Province with 10 m resolution, yielding an RMSE of 3.76 m and an R
2 of 0.58 [
11]. Similarly, Cambrin et al. introduced the Depth Any Canopy (DAC) model, which fine-tunes a monocular depth estimation network to enable efficient canopy height prediction from multi-source remote sensing imagery [
12].
On a regional level, Lin and his associates produced a forest height map of Nanning City with 30 m resolution. This map was created by combining GEDI and MISR data, and the work achieved an RMSE of 3.52 m [
13]. Zhu et al. further extended this approach to produce a national-scale 30 m forest height map through the fusion of GEDI and ATLAS data, with an RMSE of 3.75 m [
14]. Fan et al. developed a PRFXception-based deep-learning model that integrates GEDI, ICESat-2, and Sentinel-2 data, generating the first 10 m resolution forest height map for parts of Asia and attaining a regional RMSE of 5.75 m [
15]. Collectively, these studies demonstrate the effectiveness of multi-source data fusion combined with deep-learning techniques in enabling high-precision forest height estimation.
Chaling County, situated in Zhuzhou City, Hunan Province, is characterized by hilly terrain, rich forest resources, and diverse vegetation types. Traditional ground-based surveys are often costly and inefficient, posing significant limitations for large-scale canopy height estimation. To address these challenges, this study proposes a canopy height inversion method based on multi-source data fusion and deep learning. By integrating Sentinel-1 SAR data, Sentinel-2 multispectral imagery, and GEDI measurements, a UNet++-based model was developed to generate a high-resolution (10 m) canopy height map for Chaling County. We adopted GEDI’s RH98 value—the elevation where 98% of the returned waveform energy is reached—as a proxy for canopy top height.
The main work and contributions of this paper are as follows: (1). Compared with all existing regional-to-global scale canopy height mapping studies, a novel canopy height estimation approach was developed using the UNet++ deep neural network architecture, which integrates GEDI, Sentinel-2, and Sentinel-1 datasets. (2). A Forest canopy elevation map with a spatial resolution of 10 m was generated for Chaling County. The accuracy of this model surpasses that of the previously employed random forest model. (3). The study further investigated the impact of terrain slope, vegetation coverage, RH ratio, and algorithmic parameters on model performance, revealing correlations between these factors and model accuracy.
  5. Discussion
  5.1. Ablation Experiments
Figure 8 and 
Figure 9 indicate only a minor difference between U-Net and UNet++ when evaluated on sparse GEDI footprints, whereas UNet++ shows a clear advantage when assessed against continuous CHM data. This disparity stems from the distinct architectural features of the two networks. U-Net’s simple encoder–decoder structure, connected via straightforward skip links, primarily reconstructs spatial information through up-sampling and moderate feature blending. As a result, its outputs tend to be smooth. This inherent smoothness is advantageous for isolated GEDI footprints, as it mitigates the influence of outlier points and keeps point-wise residuals low, producing consistent and visually coherent predictions.
 In contrast, UNet++ augments the same backbone with dense skip connections and enhanced multi-scale feature aggregation. This increased capacity allows the network to capture fine-grained height variations present in CHM tiles, reproducing subtle canopy textures and small topographic features more accurately. However, the richer receptive field also makes the network more sensitive to sparse labels, which can occasionally amplify noise around isolated GEDI footprints, leading to slightly higher errors at the point level compared to the smoother U-Net baseline. The main purpose of this study is to predict the canopy height of the region; hence, this paper selects the UNet++ network as the training model.
  5.2. Relative Height Factor
As shown in 
Table 1 in 
Section 4.1, these findings underscore the importance of selecting an appropriate relative height metric for tree height inversion using GEDI data. Therefore, RH98 was chosen as the optimal indicator for subsequent analyses in this study.
The selection of RH98 is a rational choice based on the physical trade-off within the high signal-to-noise ratio zone: this percentile denotes the height where 98% of the waveform energy becomes accumulated, located precisely in the transition region between the canopy-top return and background noise. It thus provides an optimal balance between suppressing ground-tail artifacts and preserving the upper-canopy signal, resulting in the strongest linear correlation with the reference canopy height model at the Chaling test site (
Table 1; highest R
2 and lowest RMSE for RH98). However, this preference is not universally applicable. Terrain–structure coupling effects lead to a systematic underestimation of RH98 by approximately −1.2 m on slopes exceeding 30°, due to the elongation of the elliptical footprint, with an associated uncertainty of ±15%. More fundamentally, as an empirical percentile, RH98 lacks a clear physical definition; its optimality varies with forest type, biome, and sub-pixel heterogeneity. Within a 25 m footprint containing both canopy gaps and dense crowns, the metric is biased toward taller vegetation strata, leading to a sub-pixel underestimation ranging from −8% to −12%. Therefore, RH98 is only locally optimal under the specific sensor configuration and environmental conditions of this study; any extrapolation to other regions must involve re-evaluation of percentile sensitivity and terrain-structure coupling errors to avoid conflating statistical performance with mechanistic generalizability.
  5.3. Algorithm Factor
Analysis of the results from different algorithms in 
Table 2 reveals significant variations in accuracy across algorithm configurations. Differences in threshold settings influenced the number of samples included in the validation, which in turn affected inversion accuracy [
28]. Therefore, data processed using algorithm group ‘a1’ were selected as the optimal configuration for the study area. After quality filtering, a total of 10,694 GEDI footprints were retained for subsequent analysis.
The designation of algorithm set a1 as “optimal” is based on the rigorous statistical ranking presented in 
Table 2: under identical quality filtering conditions, it achieves both the highest sample retention and the best validation accuracy (highest R
2, lowest RMSE) among the six tested configurations. By applying conservative waveform denoising thresholds and stringent signal-to-noise ratio cut-offs, configuration a1 maximizes the preservation of “clean” canopy-top returns in the undulating terrain and medium-to-high canopy closure conditions characteristic of Chaling County. This enhances the linear relationship between RH98 and the reference canopy height model, resulting in superior inversion performance from a statistical standpoint. However, this advantage is biome-specific: the threshold ensemble of a1 is calibrated for subtropical hilly mixed conifer–broadleaf forests; when applied to tropical rainforests or boreal coniferous stands, its overly aggressive gating may eliminate numerous low-energy yet valid laser pulses, potentially reducing accuracy.
  5.4. Vegetation Coverage Factor
Analysis of 
Figure 10 and 
Figure 11 reveals that as NDVI values decrease, indicating sparser vegetation cover, the model achieves higher R
2 values and lower RMSE, reflecting improved prediction accuracy. Conversely, under denser vegetation cover, prediction accuracy tends to decline. Experimental results indicate that vegetation cover significantly influences canopy height estimation, with most sample points concentrated in low- and moderate-density vegetation areas. This suggests that under high vegetation cover conditions, increased canopy structural complexity combined with limited laser penetration leads to reduced prediction accuracy [
29]. Furthermore, the amount of GEDI data used in the training and validation processes also affects predictive performance.
The primary reason lies in the pronounced spatial differentiation of vegetation along the topographic gradient in the study area. Low-elevation gentle slopes (<20°) are predominantly occupied by monoculture plantations of Cunninghamia lanceolata and Pinus massoniana, which exhibit homogeneous canopy structures, simple vertical stratification, and high understory light transmittance. In contrast, moderate to steep slopes (>25°) are mainly covered by natural secondary evergreen broadleaved mixed forests dominated by Schima superba and Cyclobalanopsis glauca. These forests feature dense, multilayer canopies with high Leaf Area Indices (LAI = 5–6) and extensive branch and foliage overlap. Such “high-density–high-complexity” canopy configurations enhance multiple scattering and waveform attenuation, making it difficult to distinguish between canopy and ground returns in GEDI waveforms. Consequently, waveform broadening and retrieval uncertainty increase, resulting in larger RH98 deviations in structurally complex forest stands.
Moreover, the distribution of sample counts across three NDVI classes, as depicted in 
Figure 12, initially rises and then falls with increasing NDVI values. However, a comparison with 
Figure 10 and 
Figure 11 reveals a consistent downward trend in both R
2 and RMSE. Additionally, 
Figure 16 shows that an increase in NDVI affects canopy height at the same slope levels. Consequently, it can be inferred that the physical factor of increased vegetation coverage complicates prediction, thereby affecting the accuracy of the forecasts.
  5.5. Slopes Factor
The analysis reveals that the majority of GEDI samples are located in areas with slopes below 30°. As the slope increases, the canopy height estimation accuracy declines significantly, as indicated by decreasing R
2 values and increasing RMSE. This finding is consistent with previous studies showing that GEDI waveform signals are more susceptible to topographic distortions in complex terrain, leading to greater uncertainty in canopy height retrieval [
30]. For instance, Li et al. [
31] reported that when the slope reaches 30°, GEDI height estimation error increases substantially, with RMSE rising to 10.18 m without geolocation correction but decreasing to 6.1 m after correction. Similarly, Kutchartt et al. [
32] found in the Alpine region of northern Italy that slope is the most influential factor affecting GEDI canopy height accuracy, followed by canopy cover.
The pronounced slope dependence observed in our study further supports these findings. Model accuracy exhibits a distinct non-linear relationship with slope, showing a sharp decline when slope exceeds approximately 20°. This degradation can be attributed to several slope-induced physical mechanisms affecting GEDI waveform retrieval. First, elliptical footprint elongation on sloping surfaces causes the slant range to be misinterpreted as vertical height, resulting in systematic RH98 overestimations of approximately 1.5–3 m. Second, temporal overlaps between terrain and multilayer canopy returns generate waveform trailing of 5–7 m, obscuring the true ground return and producing mixed canopy–terrain height estimates. Third, multiple scattering between dense vegetation and steep rock surfaces can generate false return peaks, introducing artificial height biases of 2–3 m.
In summary, slope exerts a substantial influence on the accuracy of GEDI canopy height inversion, with the magnitude of this effect increasing as slope steepness increases. These physical factors collectively explain the observed sharp rise in RMSE and concurrent decline in R2 in steep slope areas.
Furthermore, as shown in 
Figure 15, which categorizes the sample counts by slope, there is an initial increase, followed by a plateau, and then a decrease in sample numbers with increasing slope. Comparing this with 
Figure 13 and 
Figure 14, it is evident that despite similar sample counts in the 10–20° and 20–30° slope ranges, there is a significant decline in R
2 and RMSE. Additionally, 
Figure 16 indicates that an increase in slope does influence canopy height under the same NDVI values. Hence, it can be concluded that within these slope ranges, physical factors, specifically the increased difficulty in prediction due to greater slope, are impacting the accuracy of predictions.
  5.6. The Ecology and Application Significance of CHM
CHM represents the integrated outcome of numerous ecological processes. Its complex three-dimensional structure creates diverse habitats that support a wide range of species. Forests characterized by greater structural complexity and taller canopies tend to exhibit higher levels of biodiversity. As such, canopy height and its derived structural metrics serve as fundamental indicators for assessing forest ecosystem health. CHM provides critical data support for precision forestry and sustainable forest management. Key structural parameters, such as average stand height, dominant tree height, and timber volume, can be derived rapidly and with high accuracy, reducing reliance on labor-intensive field surveys. By identifying mature forest stands and tall trees that meet harvesting criteria, these maps facilitate selective logging practices that minimize ecological disruption to the broader ecosystem. Furthermore, based on biomass or volume distribution maps, wood yield can be estimated with greater precision, enabling optimized planning of logistics and processing operations.
Biomass estimation and carbon stock assessment rank among the most direct and impactful applications of canopy height maps. A strong correlation exists between tree biomass, particularly above-ground biomass, and structural metrics such as tree height and diameter at breast height. Traditional field-based approaches for establishing these relationships often require tree felling or direct weighing, which are not only time-consuming and labor-intensive but also ecologically disruptive. In contrast, canopy height maps enable a non-destructive alternative: once a “CHM–biomass” model is calibrated, large-scale canopy height data can be efficiently converted into spatially explicit biomass distribution maps. By applying a conversion factor of approximately 0.5 to the biomass estimates, researchers can derive carbon stock values, the key indicators in ecological and climate change studies. Compared to conventional methods, this approach supports large-scale, high-precision, rapid, and non-invasive assessments, significantly enhancing both accuracy and operational efficiency.
CHM has transformed forest research from a two-dimensional perspective into a three-dimensional paradigm, marking a significant advancement in the field. They serve as a central link between forest structure and ecological function, enabling deeper scientific understanding and practical applications. For ecologists, these maps are essential for studying biodiversity, ecosystem succession, and functional dynamics. For climate scientists and policymakers, CHM provides a robust foundation for accurately quantifying the global carbon cycle and assessing the effectiveness of climate change mitigation strategies. For forestry professionals, they offer actionable insights to support efficient, sustainable, and precision-based forest management. The continued development and broad application of CHM hold immense potential for addressing climate change, conserving biodiversity, and ensuring the sustainable management of forest resources.
  6. Conclusions
This study focuses on Chaling County, Zhuzhou, Hunan Province, China, where a systematic preprocessing of the GEDI Level 2A product was first performed to extract forest canopy height parameters and retain high-quality LiDAR footprints through rigorous filtering. Subsequently, Sentinel-1 imagery was selected based on ascending orbit information and temporal consistency, while Sentinel-2 images were filtered according to cloud cover and required spectral bands. Finally, the optimized GEDI, Sentinel-1, and Sentinel-2 datasets were jointly input into the developed UNet++ deep-learning network to establish a forest height extrapolation model covering the study area, enabling accurate estimation of canopy height in the entire Chaling County. The model has achieved relatively good performance in the Chaling County area.
The correlations between canopy height percentile metrics (RH96 to RH100) derived from GEDI data, as well as the reference value of canopy height measurements, are comprehensively examined. The results demonstrate that RH98 shows the strongest correlation with the reference data. Furthermore, the study revealed that different processing algorithms have a significant impact on the accuracy of GEDI-derived canopy heights. Through analysis of the correlation between RH98 values processed using six distinct algorithms and the reference measurements, it was found that GEDI data processed with the ‘a1’ algorithm exhibited the highest correlation. These findings suggest that the ‘a1’ algorithm achieves the most accurate canopy height inversion within the scope of this study.
The influences of vegetation coverage and terrain slope on the accuracy of canopy height model inversion are investigated. Vegetation coverage, quantified using NDVI, was categorized into three intervals: 0.25–0.5, 0.5–0.75, and 0.75–1.0. Among these, the intervals of 0.25–0.5 and 0.5–0.75 encompassed the majority of GEDI footprint centers and demonstrated higher canopy height prediction accuracy. This is attributed to the enhanced ability of GEDI signals to penetrate the canopy within these vegetation coverage ranges, thereby capturing more accurate height information. Furthermore, the impact of slope on prediction accuracy was quantitatively assessed by classifying the data into four slope categories. Most GEDI footprint centers were found in areas with slopes below 30°, where canopy height estimation accuracy was significantly better compared to areas with slopes exceeding 30°. These results indicate that terrain slope is a critical factor influencing the accuracy of GEDI-based canopy height inversion. In steep terrain, complex topographic variations disrupt the propagation and reception of GEDI LiDAR signals, thereby reducing the reliability of canopy height estimates.
High-resolution CHM can accurately characterize stand structural attributes and exhibit strong correlations with above-ground vegetation biomass, making them a critical input for regional forest carbon stock estimation and providing reliable data support for carbon cycle research and climate change monitoring. Fine-scale height information also offers practical benefits for forest management and ecological planning, including the design of sustainable harvesting strategies, assessment of forest growth potential, monitoring of post-disturbance (natural or anthropogenic) recovery, and guidance for ecological restoration and protected area governance. In future work, integrating multi-temporal remote sensing observations with topographic and climatic variables will enable dynamic monitoring of forest growth and long-term ecological change. Furthermore, incorporating predictive uncertainty analysis, physics-constrained loss functions, or advanced multimodal data fusion approaches is anticipated to further enhance the precision and robustness of canopy height retrieval. This proposed method not only provides a scientific basis for forest management and carbon stock assessment in Chaling County, but also offers a transferable technical framework for large-scale forest carbon cycle studies and ecosystem service evaluations.