To achieve regional-scale continuous mapping of AGB in mountainous forests, this study developed a hierarchical framework consisting of local reference construction, UAV–GEDI bridging, footprint-level modeling, and regional continuous extrapolation (
Figure 3). First, a local AGB reference product was generated within the UAV-covered area from field plots and UAV LiDAR data. Second, GEDI geolocation correction, quality control, and footprint-scale label extraction were integrated to construct UAV–GEDI bridging samples containing reference AGB labels, GEDI structural variables, and multi-source wall-to-wall features. Third, GEDI footprint-level AGB models were developed and compared under a unified RF–RFECV feature-selection framework to identify the optimal point-level model. Finally, the optimal model was applied to all quality-controlled GEDI footprints across the study area to generate point-level AGB predictions, which were further combined with wall-to-wall covariates using empirical Bayesian kriging regression prediction (EBKRP) to produce a continuous AGB map. The final mapping schemes were evaluated and compared using 61 independent field plots.
2.3.2. Construction of UAV–GEDI Bridging Samples
Because GEDI footprints lack directly corresponding field-measured or UAV-derived AGB reference values, a UAV–GEDI bridging procedure was introduced to construct supervised samples for footprint-level modeling. Each bridging sample was formed by spatially linking a GEDI footprint with the local AGB reference product within the UAV-covered area, and simultaneously associating it with the corresponding GEDI structural variables and multi-source wall-to-wall features.
Prior to AGB reference value extraction, GEDI footprint positions were corrected to reduce spatial mismatch caused by horizontal geolocation error. Orbit-scale systematic offsets were estimated by matching GEDI L1B waveforms against simulated waveforms derived from high-density UAV LiDAR data within a local search window of 10 m radius, which was chosen to balance the expected magnitude of GEDI horizontal error against the risk of false matching [
9,
10]. The mean offset estimated from valid matched footprints within each orbit track was then applied uniformly to correct all footprints along that track. Only orbit tracks with a sufficient number of valid matched footprints were used for offset estimation, and correction quality was assessed by comparing waveform similarity before and after correction.
After geolocation correction, a circular buffer with a radius of 12.5 m was established for each GEDI footprint to approximate its spatial support domain. The reference AGB label for each footprint was computed as the mean of all valid UAV pixels within the buffer, as shown in Equation (1) [
7,
12].
where
is the reference AGB label of the
i-th GEDI footprint,
is the AGB value of the
j-th valid UAV pixel within the footprint buffer, and
is the number of valid pixels.
Because insufficient UAV coverage or excessive local heterogeneity within a footprint buffer may compromise label reliability, quality assessment was applied before finalizing the bridging sample set. Four quality indicators were calculated for each footprint as follows.
where
denotes the total number of pixels within the clipped window of the
i-th footprint,
represents the buffer of the
i-th footprint,
denotes the valid UAV coverage area, and
represents area. Specifically,
(Equation (2)) is the valid-pixel ratio, defined as the proportion of valid UAV pixels within the footprint window;
(Equation (3)) is the geometric overlap ratio, measuring the fractional area of the footprint buffer covered by valid UAV data;
(Equation (4)) is the local standard deviation of UAV-derived AGB values within the buffer, characterizing internal AGB dispersion; and
(Equation (5)) is the coefficient of variation, reflecting the relative variability of AGB within the footprint. Only footprints simultaneously satisfying the thresholds for
and
were retained in the final bridging sample set [
7,
9]. Both thresholds were set to 0.5 based on preliminary sensitivity analysis and with reference to previous UAV–GEDI upscaling studies [
12,
13], and a minimum valid-pixel constraint was additionally imposed to avoid assigning reference values from only a few local pixels.
2.3.3. GEDI Footprint-Level AGB Modeling
After the bridging samples were established, GEDI structural variables and multi-source wall-to-wall remote sensing features were combined to form the candidate feature set for footprint-level AGB modeling. The candidate variables mainly included three groups: GEDI structural variables, Sentinel-1 radar features, and wall-to-wall optical features derived from Sentinel-2 and Landsat 8/9. To ensure consistency with the GEDI footprint scale, all wall-to-wall variables were summarized as mean values within the same 12.5 m radius footprint buffer used during bridging [
7,
30,
31,
32,
33], as described in Equation (6).
where
is the summarized feature value of the
i-th sample,
is the value of the
k-th valid pixel within the corresponding buffer, and
is the number of valid pixels.
Considering the limited number of bridging samples and the potential redundancy among candidate variables, a 10-fold cross-validation framework was adopted for model evaluation. In each outer split, nine folds were used for training and one fold for validation. Within each training fold, recursive feature elimination with cross-validation (RFECV) based on a random forest estimator was used to identify the optimal feature subset. Random forest was selected as the RFECV base estimator because it is insensitive to variable scale, can handle nonlinear relationships and heterogeneous inputs, and provides relatively stable feature-importance rankings [
34,
35]. This fold-wise feature-selection strategy allowed variable selection to be incorporated directly into model evaluation, thereby improving the robustness of model comparison and selection.
Following feature selection, five candidate models were compared to systematically evaluate the suitability of different modeling paradigms for GEDI footprint-level AGB estimation in complex mountainous forests. Multiple linear regression (MLR) was included as a linear parametric baseline to assess the linear explanatory power of the selected features. Random forest (RF) and extreme gradient boosting (XGBoost) were selected as representative tree-based nonlinear learners capable of handling high-dimensional inputs and complex variable interactions [
36,
37]. Support vector regression (SVR) was included for its strong generalization potential under small-sample, high-dimensional conditions through kernel-based implicit mapping [
38]. A two-level Stacking ensemble model was further constructed to integrate the complementary strengths of the above heterogeneous base learners, with the expectation that secondary fusion could reduce individual model bias and improve overall predictive stability [
19,
20]. The corresponding model formulations are given in Equations (7)–(11).
where Equation (7) represents MLR, a linear additive model in which
are regression coefficients and
is the number of selected features; Equation (8) represents RF, which reduces estimation variance by averaging predictions across
bootstrap-trained regression trees
; Equation (9) represents SVR, which minimizes a margin-based loss under penalty coefficient
with slack variables
,
; Equation (10) represents XGBoost, which iteratively adds regression trees
with an explicit regularization term
to control model complexity; and Equation (11) represents the Stacking model, where
is the Ridge meta-learner and
–
are out-of-fold predictions from the four base learners, For the Ridge meta-learner, the regularization parameter λ was tuned within each outer training fold, with
λ ∈ {0.01, 0.1, 1, 10, 100}.
In the Stacking model, MLR, RF, SVR, and XGBoost served as first-level learners, and Ridge regression was adopted as the second-level meta-learner. The model-evaluation procedure was implemented within a nested cross-validation framework. At the outer level, 10-fold cross-validation was used to split the bridging samples into training and validation folds, and the validation fold was withheld from all model-training procedures. Within each outer training fold, RFECV-based feature selection and hyperparameter tuning for RF, SVR, and XGBoost were performed using only the training subset. For Stacking, an inner 5-fold cross-validation within the outer training subset was used to generate out-of-fold predictions from the base learners as meta-features for training the Ridge meta-learner. The fitted base learners and meta-learner were then applied to the withheld outer validation fold. This design ensured separation between model tuning and outer-fold evaluation within the random cross-validation framework, thereby reducing target leakage during model comparison.
Key parameters included the number of trees and maximum depth for RF, the penalty coefficient and kernel width for SVR, and the learning rate, maximum depth, and number of estimators for XGBoost.
To compare the performance of the five candidate models, R
2, RMSE, MAE, and Bias were adopted as evaluation metrics, with formulas given in Equations (12)–(15).
where
is the reference value,
is the predicted value, and
is the mean of the reference values. Equations (12)–(15) quantify overall prediction error, explanatory power, average deviation, and systematic bias, respectively.
The average RMSE across the 10 folds was used as the primary basis for model selection, while R2, MAE, and Bias were considered jointly for comprehensive comparison. After the optimal model was identified, the same preprocessing and feature-selection procedure was reapplied to the full bridging sample set, and final model training was completed for subsequent study-area-wide GEDI point prediction.
2.3.4. Study-Area-Wide GEDI Point Prediction and Regional Continuous Mapping
EBKRP was selected because it can simultaneously incorporate wall-to-wall explanatory covariates and account for spatial autocorrelation in the residuals, making it well suited for extending discrete point predictions to regionally continuous raster surfaces [
39,
40]. Previous studies have demonstrated that combining regression-based prediction with kriging interpolation can improve continuous mapping accuracy compared with either approach applied independently, particularly in scenarios involving spatially sparse sample points and continuous background covariates [
21,
41].
To construct the EBKRP covariate schemes, wall-to-wall variables were selected based on their RFECV selection frequency, random forest importance ranking, and physical interpretability, with redundant or highly correlated variables excluded. A stepwise expansion strategy was then adopted, whereby variables were added sequentially by information-source type to examine the incremental contribution of different data sources to continuous mapping performance.
All EBKRP schemes were implemented in ArcGIS Pro 3.5.2 under a unified software and parameter framework. Internal cross-validation statistics provided by the software, including Mean Error, RMSE, average standard error (ASE), and root mean square standardized error (RMSSE), were used as supplementary references for comparing different schemes. To further evaluate external mapping performance, the 61 independent field plots were used as validation samples. These plots were not involved in local reference construction, UAV–GEDI bridging, or footprint-level model training, and therefore remained fully independent of the preceding modeling process. For each continuous AGB raster, predicted values were extracted at the plot locations and compared with the corresponding plot-based reference AGB values, with external performance evaluated using R2, RMSE, MAE, and Bias. Because some support-domain mismatch may still exist between field plots and raster cells, the external validation results were treated as relative comparisons among schemes rather than as absolute accuracy estimates of the final product.