The Potential of Fully Polarized ALOS-2 Data for Estimating Forest Above-Ground Biomass

: SAR data have a longer wavelength and stronger penetrating power compared with traditional optical remote sensing. Therefore, SAR data are more suitable for the estimation of the above-ground biomass (AGB) of forests. This study was aimed at evaluating the sensitivity of L-band full polarization data to AGB. L-band data were improved to estimate the saturation point produced by AGB, and were found to be suitable for estimating a wide range of AGB. This study extracted backscattering coefﬁcients, polarization decomposition variables, and terrain factors. New parameters were constructed from these variables, and their performance in predicting AGB was evaluated. Signiﬁcant variables found with AGB were added to the multivariate linear model. A statistical analysis showed the presence of multicollinearity between the variables. Therefore, ridge regression, random forest method (RF), and principal component analysis (PCA) were introduced to solve the problem of collinearity. In all the three methods, the saturation of the ridge regression model was low, reaching it at 150 t/ha. Better accuracy was obtained with the RF model. No obvious saturation incident was detected in the model established using the principal component analysis. This could be attributed to the low biomass levels observed in our study area. This model provided accurate results (adjusted r 2 = 0.90 rmse = 14.24 t/ha), indicating that L-band data have the potential to estimate AGB. Additionally, suitable variables and models were selected in this study, with the principal component analysis being more helpful in combining various SAR parameters. The achievement of these accurate results could be attributed to the synergy among variables.


Introduction
Carbon sequestration capacity is an important manifestation of forest functions. Forest above-ground biomass (AGB) is a consequential evaluation index of carbon sequestration capacity. Therefore, it is necessary to estimate AGB to understand the carbon sequestration capacity in a particular area [1]. Previous studies have shown that the use of the backscatter coefficient of airborne L-band SAR data could not significantly improve the ability to estimate AGB [2]. In large survey areas, the AGB root mean square error (RMSE) estimated from HH polarization has been found to be about 30% [3,4]. Similarly, the use of L-band data to predict the Indian tropical forest had a higher accuracy with RMSE = 16.06 t/ha [5]. Previous studies have used the random forest (RF) method to estimate the AGB with RMSE = 18.9 t/ha. Additionally, a regression model was built to estimate the boreal forest AGB with RMSE = 37.3 t/ha [6,7]. These studies obtained tree height by laminar analysis of SAR data and then calculated AGB with RMSE = 36.3 t/ha [8]. However, different AGB estimation methods have been found to provide discordant results even when the same SAR data are analyzed. Meanwhile, variability of forests has been found to be among the factors significantly related to AGB were combined. A model that was more suitable for the estimation of AGB was also selected.
Specifically, this study was aimed at: (1) Using the original channel backscatter coefficients to establish a univariate model to estimate AGB. The ratio of backscatter coefficients was calculated and a univariate model established. The impact of topographical factors on AGB was also analyzed. (2) Selecting the most suitable polarization decomposition method and polarization decomposition parameters. Polarization decomposition parameters were used to construct a stronger estimation ability for the new parameters, and a model was established with AGB. (3) Comparing the ability of ridge regression, RF and the PCA method to resolve a highdimensional variable set. The focus was on establishing a model, and predicting the AGB at the regional scale by using all the relevant parameters.

SAR Data
The SAR data used in this study were the ALOS-2 PALSAR full-polarization observation data obtained on 8 August 2020. The selected image covered northern China ( Figure 1). The image is from a 1.1-level L-band radar developed by the Japan Aerospace Exploration Agency (JAXA). The average zenith angle was 27.8 • , the radar center frequency was 1.27 × 10 3 MHz, the range resolution was 5.66 m, and the azimuth resolution was 2.86 m. The pixel size was16.19 m 2 . The overall observation area was 4494.62 km 2 , and the average height of the sensor from the Earth's surface was 634.24 km 2 .
Remote Sens. 2022, 13, x FOR PEER REVIEW 3 of 24 from SAR data to construct parameters that were more sensitive to AGB. All variables that were significantly related to AGB were combined. A model that was more suitable for the estimation of AGB was also selected. Specifically, this study was aimed at: (1) Using the original channel backscatter coefficients to establish a univariate model to estimate AGB. The ratio of backscatter coefficients was calculated and a univariate model established. The impact of topographical factors on AGB was also analyzed. (2) Selecting the most suitable polarization decomposition method and polarization decomposition parameters. Polarization decomposition parameters were used to construct a stronger estimation ability for the new parameters, and a model was established with AGB. (3) Comparing the ability of ridge regression, RF and the PCA method to resolve a highdimensional variable set. The focus was on establishing a model, and predicting the AGB at the regional scale by using all the relevant parameters.

SAR Data
The SAR data used in this study were the ALOS-2 PALSAR full-polarization observation data obtained on 8 August 2020. The selected image covered northern China (Figure 1). The image is from a 1.1-level L-band radar developed by the Japan Aerospace Exploration Agency (JAXA). The average zenith angle was 27.8°, the radar center frequency was 1.27 × 10 MHz, the range resolution was 5.66 m, and the azimuth resolution was 2.86 m. The pixel size was16.19 m 2 . The overall observation area was 4494.62 km 2 , and the average height of the sensor from the Earth's surface was 634.24 km 2 .

Field Data
The study area was a typical temperate forest in northern China. The forest is located in Hebei Province, North China (117E, 42N). This area is located in the transition zone from Yanshan Mountain to Inner Mongolia. Except for the mountain, the rest of the area consists of plains and cities. Altitude ranges from 1171 m to 1960 m asl. This area is characterized by a mixed forest of coniferous and broad-leaved trees, with North China larch (Larix principis-rupperchtii Mayr) and white birch (Betula platyphylla Suk) being the main species. A total of 38 fixed plots were used in the image. The field data used in this study were obtained through field surveys in 2020. In order to avoid interference, the measurements were carried out at a distance of more than 30 m from non-forest areas. Field surveys included measuring tree species composition, and measuring diameter at breast height (DBH) at a distance of 1.3 m from the ground. All trees with a DBH of less than 2.5 cm were eliminated. Tree height was measured using Vertex IV and Transponder T3. The

Field Data
The study area was a typical temperate forest in northern China. The forest is located in Hebei Province, North China (117E, 42N). This area is located in the transition zone from Yanshan Mountain to Inner Mongolia. Except for the mountain, the rest of the area consists of plains and cities. Altitude ranges from 1171 m to 1960 m asl. This area is characterized by a mixed forest of coniferous and broad-leaved trees, with North China larch (Larix principis-rupperchtii Mayr) and white birch (Betula platyphylla Suk) being the main species. A total of 38 fixed plots were used in the image. The field data used in this study were obtained through field surveys in 2020. In order to avoid interference, the measurements were carried out at a distance of more than 30 m from non-forest areas. Field surveys included measuring tree species composition, and measuring diameter at breast height (DBH) at a distance of 1.3 m from the ground. All trees with a DBH of less than 2.5 cm were eliminated. Tree height was measured using Vertex IV and Transponder T3. The coordinates of the center point of the plot were determined using the Unistreng RTK-G10.
We ensured that the center point coordinate error of the sample plot was within 10 cm. The individual allometry equation of local tree species was used to calculate the AGB of the forest for each sample [40,41]. Based on the measured results, the minimum biomass above the forest was 4.24 t/ha and the maximum was 185.08 t/ha. The plots were separated at equal intervals, and each plot had an area of 0.06 ha. The shape of each set of field data was a rhombus, with a diagonal length of 17.3 m. The area of the plot was 149.645 m 2 . The AGB level in this area was found to be more suitable for this study. Meanwhile, microwave remote sensing observation methods were more suitable for forest biomass estimation, considering the complex geological and climatic conditions in the area. The actual biomass is shown in Table 1.

Methods
The processing steps for field inventory and ALOS-2 PALSAR-2 data are shown in Figure 2. Decomposition parameter acquisition: Since the research object was a distributed target, it was found suitable for incoherent decomposition. Three polarization decomposition methods suitable for forests were selected [45,46]. These methods included Yamaguchi three-component decomposition, eigenvalue-based H/A/alpha decomposition, and eigenvector-based H/A/alpha decomposition [47][48][49][50][51][52][53]. The Yamaguchi three-component decomposition method decomposes the echo signal into three scattering mechanisms, and

SAR Data Processing
Calibration: Conversion of the amplitude data record in the original image into a backscatter coefficient was not affected by changes in surface parameters. In order to expand the dynamic range of the scattering coefficient, the amplitude data record was expressed in decibels, as follows: where σ 0 i,j is backscatter coefficients, DN i,j is the gray value of the pixel, and CF 1 is the calibration factor [42].
Filter denoising: Given that SAR is a coherent system, speckle noise becomes an inherent feature that interferes with image readings. The present experiment used a refined Lee filter [43]. At the same time, multi-look processing also had a noise suppressing effect. Multi-look processing improved the effectiveness of feature information extraction by averaging the pixels of the SAR image azimuth and distance. A 4 × 9 multi-look process was performed on the original image to ensure that the pixels closed to the square and matched the area in the plot. The above process was run in the Gamma software [44].
Decomposition parameter acquisition: Since the research object was a distributed target, it was found suitable for incoherent decomposition. Three polarization decomposition methods suitable for forests were selected [45,46]. These methods included Yamaguchi three-component decomposition, eigenvalue-based H/A/alpha decomposition, and eigenvector-based H/A/alpha decomposition [47][48][49][50][51][52][53]. The Yamaguchi three-component decomposition method decomposes the echo signal into three scattering mechanisms, and volume scattering in the layered random medium provides good results [51]. H/A/alpha decomposition contains information about the dominance relationship between scattering mechanisms. Among them, the scattering entropy (H) not only represents the specific gravity of different scattering mechanisms in the whole scattering process, but also describes the randomness of the scattering process. The degree of heterogeneity in different directions (A) characterizes the degree of influence of the other two scattering mechanisms, which do not dominate the result when H increases. Scattering angle (α) describes the degree of freedom inside the target [54][55][56]. The polarization decomposition parameters were obtained using PolSARpro 6.0.2 [57].
Geocoding: Since SAR is a side-view system, it causes nonlinear distortion in areas with large terrain undulations. Therefore, SAR images cannot transform into a reference coordinate system by polynomial correction or affine transformation. The present study combined the imaging characteristics of the sensor and the ground morphology. It exploited external DEM data (SRTM V2 30 m resolution) and used a strict-range Doppler to geocode SAR image data. This process was run using Gamma software.
Thirty-five original polarization decomposition parameters were obtained through three polarization decompositions shown in Table 2 [49,50].

Backscatter Coefficient and Its Combination
The correlations between backscatter coefficients (σ HH , σ HV , σ VH , σ VV ) and AGB were analyzed. The radar satisfied the reciprocity of a single station, thus the crosspolarization channels were averaged (σ X replaces σ (HV+VH)/2 ). Each variable was used to establish univariate linear models with AGB. The model was built using Matlab-2014b [58]. Meanwhile, different combinations of backscatter coefficients had different sensitivities to AGB [19]. We combined the backscatter coefficients to find the parameters with more significant correlations. A total of 26 different combinations were created using backscatter coefficients, and correlation analysis was performed for the 26 combinations. Significant variables were selected to establish univariate linear models with AGB.

Terrain Factors
Topography is an important factor that affects AGB [59][60][61]. This study obtained slope, aspect, and elevation using DEM, which were combined in Arcmap 10.7 as shown in Figure 3 [62]. The correlation between the extracted terrain factors and AGB was also analyzed.

Constructing New Parameters
Previous studies have shown that the proportional combination of volume scattering, secondary scattering, and surface scattering has a certain sensitivity to forest canopy structure [63]. They derived a relationship between the growing stock volume (GSV) and polarimetric decomposition powers. In addition, these studies have found that the volumetric scattering power and GSV in different samples were all positively correlated, while the

Constructing New Parameters
Previous studies have shown that the proportional combination of volume scattering, secondary scattering, and surface scattering has a certain sensitivity to forest canopy structure [63]. They derived a relationship between the growing stock volume (GSV) and polarimetric decomposition powers. In addition, these studies have found that the volumetric scattering power and GSV in different samples were all positively correlated, while the surface scattering and GSV were all negatively correlated. Therefore, it was concluded that the GSV and ratio of the three scattering powers have a certain sensitivity [64]. In the present study, there was a close relationship between AGB and GSV. The products of the different scattering mechanisms of the Yamaguchi three-component decomposition and the other two scattering mechanisms were found to possess a ratio relationship. This study established new parameters with reference to the above-mentioned study. Each new parameter with AGB was used to build a linear model. The construction of the new parameters was as follows: • Ground scattering-scattering parameter ratio; Even-scattering molecular parameters; Volume-scattering molecular parameters.

Multivariate Linear Model
The parameters obtained through the three polarization decomposition methods were not all applicable to the study of the forest. Therefore, we analyzed the correlation between decomposition variables and AGB to obtain significant correlation variables. A multivariate linear model with all the significant correlation variables was set up to predict AGB. The model variance inflation factor (VIF) test showed that the variables had significant multicollinearity.

Ridge Regression
Due to the complexity of the radar signal, there was a degree of information overlap between the variables, resulting in collinearity. This study attempted to use the ridge regression model to solve the collinearity problem. Ridge regression is a regularization method for the regression analysis of ill-posed problems.

Random Forest
The RF method is a classifier that consists of multiple decision trees. It belongs to the Bagging ensemble learning algorithm. This method was used to collect multiple subdatasets from the original dataset and train multiple different decision trees. The prediction results of multiple decision trees were then averaged to obtain the final result. This method was not affected by collinearity between variables.

Principal Component Analysis
Principal component analysis (PCA) is suitable for populations of high-dimensional variables with a certain correlation between samples. There was collinearity in the above parameter set. Nevertheless, PCA is more suitable for removing collinearity [65]. The principal components express the additive combination with the variance of each sample. The linear combination is added when the current component is not enough to represent the information of the original parameter set. The principal component was calculated as follows: The eigenvector corresponding to the covariance matrix is a 1i , a 2i · · · a pi (i = 1, · · · m). Z x1 , ZX 2 Z x2 ,· · · , Z xp are the standard variables, R is the correlation coefficient matrix, and γ i , a i are the corresponding eigenvalue and eigenvectors.
We used IBM-SPSS 23.0 to perform principal component analysis on the dataset [66]. The two principal components were used to build a multivariate model to estimate AGB.

Verification and Prediction
Due to the small number of samples in this study, the cross-leave-one-out method was used for verification [67]. In total, 37 samples were used to model, and one sample was used for verification, resulting in 38 models. Through this method, the predicted AGB of SAR variables was obtained. Some of the evaluation indicators were used to describe the difference between the true AGB and the predicted AGB. The indicators selected in this study were goodness of fit (R 2 ), root mean square error (RMSE), mean absolute error (MAE), mean absolute percentage error (MARE), mean error (ME), and mean relative error (MRE) [68]. The best model was selected based on these indicators. The predicted biomass map was then obtained. The accuracy of the final model was referred to as the accuracy of the biomass map.

Backscatter Coefficient and Its Combination
The correlation analysis between the backscatter coefficient and AGB (Table 3) showed that the horizontal cross-polarization in this region was more sensitive to AGB [9,11]. A univariate linear model was established between the backscatter coefficient and AGB ( Figure 4). The accuracy of the model was low, and there were large deviations in estimating low-level AGB. The backscatter coefficient estimation of AGB was found to produce a saturation point, which led to greater limitations in the estimation results. The most accurate linear models were the σ X and AGB. The formula of this model was AGB = 280.394 + 12.591 × σ X . The backscatter coefficients were combined to determine whether they had the potential to improve AGB estimation (Table 4).
In this study, the combination of poorly correlated backscatter coefficients was not significant. The correlation of the three significantly correlated and newly combined variables (σ HH×VV , σ HH×X and σ HH×X×VV ) was better than the σ HH and σ VV polarization channels . A univariate model with AGB was established for the three new variables ( Figure 5). The most accurate were the σ HH×X×VV and AGB linear models. The formula of this model was AGB = 152.822 + 0.039 × σ HH×X×VV . However, it was found that the accuracy of these models was the same as that of the backscatter coefficient model. The saturation point of this model was about 125 t/ha.  The backscatter coefficients were combined to determine whether they had the p tential to improve AGB estimation (Table 4).  significant. The correlation of the three significantly correlated and newly combined va iables ( σ × , σ × and σ × × ) was better than the σ and σ polarizatio channels. A univariate model with AGB was established for the three new variables (Fig  ure 5). The most accurate were the σ × × and AGB linear models. The formula of th model was AGB = 152.822 + 0.039 × σ × × . However, it was found that the accurac of these models was the same as that of the backscatter coefficient model. The saturatio point of this model was about 125 t/ha.

Influence of Topographical Factors
Correlation analysis revealed that slope was the most important factor affecting AGB in this study (Table 5). During the field investigation, the slope of the study area was found to change greatly. However, the effect of aspect on AGB was not obvious, possibly due to the small number of samples.

New Parameters and AGB Estimation
The correlation between the three newly constructed parameters and AGB was determined, and it is illustrated in Table 6. The univariate model with R 1 as the independent variable produced better AGB estimation results. The formula of this model was AGB = 181.427 − 3.822 × R 1 . This model achieved the highest accuracy among all univariate models. The new parameters R 2 and R 3 predicted that the AGB results were poor ( Figure 6). The saturation point of the model was relatively high (140 t/ha) when compared to the backscattering coefficient model.

New Parameters and AGB Estimation
The correlation between the three newly constructed parameters and AGB was determined, and it is illustrated in Table 6. The univariate model with R1 as the independent variable produced better AGB estimation results. The formula of this model was AGB = 181.427 − 3.822 × R . This model achieved the highest accuracy among all univariate models. The new parameters R2 and R3 predicted that the AGB results were poor ( Figure  6). The saturation point of the model was relatively high (140 t/ha) when compared to the backscattering coefficient model.

Multivariate Linear Model
A correlation analysis of the polarization decomposition variables and AGB was performed, and it is summarized in Table 7 (only relevant significant variables are displayed). The variable R1 had a stronger correlation with AGB compared to the Yamaguchi scattering mechanism. This showed that there was still a relationship between the mechanisms of polarization decomposition. All the variables that were significantly correlated with AGB were used to build a multivariate linear model (Figure 7). These variables included three original backscatter coefficients, three combined backscatter coefficients, slope, nine polarization decomposition variables, and R1. However, the VIF test proved that there was multicollinearity between them ( Table 8). The joint hypothesis F value of the model was 17.536, and there was no significant saturation point (sig) ≤ 0.001. This model provided poor residual test results (Figure 7). Meanwhile, existing studies have showed that AGB cannot be estimated with a simple multivariate model [69,70]. Although this model provided reasonable results, it

Multivariate Linear Model
A correlation analysis of the polarization decomposition variables and AGB was performed, and it is summarized in Table 7 (only relevant significant variables are displayed). The variable R 1 had a stronger correlation with AGB compared to the Yamaguchi scattering mechanism. This showed that there was still a relationship between the mechanisms of polarization decomposition. All the variables that were significantly correlated with AGB were used to build a multivariate linear model (Figure 7). These variables included three original backscatter coefficients, three combined backscatter coefficients, slope, nine polarization decomposition variables, and R 1 . However, the VIF test proved that there was multicollinearity between them ( Table 8). The joint hypothesis F value of the model was 17.536, and there was no significant saturation point (sig) ≤ 0.001. This model provided poor residual test results (Figure 7). Meanwhile, existing studies have showed that AGB cannot be estimated with a simple multivariate model [69,70]

Ridge Regression Model
Ridge regression was used to estimate AGB and solve the collinearity problem. Variables in the ridge regression model were consistent with the multivariate model. The variables were standardized before ridge regression. However, the ridge regression model was found to have a poor fitting effect (Figure 8). This method solved the collinearity between the variables. However, the normality of the residuals of the model was poor. Based on these results, the model estimated that the AGB saturation point was low (~145 t/ha). We determined the ridge parameter (K) = 0.141 based on the variance expansion factor method.

Ridge Regression Model
Ridge regression was used to estimate AGB and solve the collinearity problem. Variables in the ridge regression model were consistent with the multivariate model. The variables were standardized before ridge regression. However, the ridge regression model was found to have a poor fitting effect (Figure 8). This method solved the collinearity between the variables. However, the normality of the residuals of the model was poor. Based on these results, the model estimated that the AGB saturation point was low (~145 t/ha). We determined the ridge parameter (K) = 0.141 based on the variance expansion factor method. The formula of this model was AGB = 83.396 − 0.161 × σ HH×VV − 12.726 × Yamaguchi vol + 405.828 × Yamaguchi Dbl + 2.125 × Entropy 2 − 13.587 × I3 + 78.432 × I2 + 50.428 × Entropy 3 − 1.316 × Entropy

Random Forest
There were 200 decision trees in this dataset. In order to reduce the result volatil caused by bootstrap sampling, all the models were trained 50 times and the average w obtained. We obtained the predicted AGB and computed the residuals (Figure 9). T saturation point of this model was about 155 t/ha.

Random Forest
There were 200 decision trees in this dataset. In order to reduce the result volatility caused by bootstrap sampling, all the models were trained 50 times and the average was obtained. We obtained the predicted AGB and computed the residuals (Figure 9). The saturation point of this model was about 155 t/ha.

Principal Component Analysis
Although the above results showed that multiple variables had the ability to improve AGB estimation, obtaining a stable model was still a challenge. However, the problem of collinearity can be solved through PCA. This method uses the same variable set as ridge regression. PCA processing was performed on the new parameter set. In addition, the Kaiser-Meyer-Olkin and Bartlett's tests were performed (Table 9), and the suitability of Kaiser-Meyer-Olkin sampling was between 0 and 1. A larger value indicated that it was convenient for PCA. Bartlett's spherical significance test showed that the selection of the parameter population was suitable for PCA. We calculated the principal components by the principal component coefficients (Table 10). Two principal component variables were extracted by default. The cumulative variance described 80.606% of the original parameters (Table 11). Additionally, the AGB was estimated using a multivariate model ( Figure  10). The formula of this model was AGB = 120.895 + 36.028 × Factor1 − 31.266 ×

Principal Component Analysis
Although the above results showed that multiple variables had the ability to improve AGB estimation, obtaining a stable model was still a challenge. However, the problem of collinearity can be solved through PCA. This method uses the same variable set as ridge regression. PCA processing was performed on the new parameter set. In addition, the Kaiser-Meyer-Olkin and Bartlett's tests were performed (Table 9), and the suitability of Kaiser-Meyer-Olkin sampling was between 0 and 1. A larger value indicated that it was convenient for PCA. Bartlett's spherical significance test showed that the selection of the parameter population was suitable for PCA. We calculated the principal components by the principal component coefficients (Table 10). Two principal component variables were extracted by default. The cumulative variance described 80.606% of the original parameters (Table 11). Additionally, the AGB was estimated using a multivariate model (Figure 10). The formula of this model was AGB = 120.895 + 36.028 × Factor1 − 31.266 × Factor2. The F value of the multiple regressions model was 164.421, sig ≤ 0.001 (Table 12), and it passed the significance test, indicating a significant improvement in accuracy when compared with other multivariate models. PCA solved the problem of collinearity and the residuals of the model were normal. The residuals were evenly distributed, and thus the variance was considered to be homogeneous ( Figure 9). All the results obtained through this model were acceptable.

Model Selection and Prediction of the Study Area AGB
We evaluated the relationship between the predicted AGB of each model and the real AGB (Table 13). This study concluded that the principal component model was the best model when compared with the evaluation indexes of the models. As such, the principal component model was used to predict the AGB in the study area ( Figure 11). All the variable matrices required by the principal components were entered in Matlab2014b. All the matrices were additively combined according to the principal component coefficients to obtain the principal components. Finally, large-area AGB prediction was based on the multivariate model formula of the principal component. The accuracy of the biomass map was 88%.

Model Selection and Prediction of the Study Area AGB
We evaluated the relationship between the predicted AGB of each model and the real AGB (Table 13). This study concluded that the principal component model was the best model when compared with the evaluation indexes of the models. As such, the principal component model was used to predict the AGB in the study area ( Figure 11). All the variable matrices required by the principal components were entered in Matlab2014b. All the matrices were additively combined according to the principal component coefficients to obtain the principal components. Finally, large-area AGB prediction was based on the multivariate model formula of the principal component. The accuracy of the biomass map was 88%.

Discussion
The use of large-scale remote sensing analyses to accurately estimate AGB is of great significance to global carbon-neutrality research. This study used a combination of backscatter coefficients, terrain parameters, and polarization decomposition parameters to estimate AGB. The adjusted r 2 increased from 0.45 to 0.90 through different processes. The accuracy of the processing results increased with the progress in various steps. The RMSE was 14.24 t/ha, as shown in step 4.6. Meanwhile, the ground information carried by the backscatter coefficient was limited. Therefore, AGB could not be accurately estimated. It was found that the combination of backscatter coefficients was more effective than HH and VV polarizations. Given that the topography of the study area was relatively complex, the slope was considered as an important factor affecting AGB. This study selected the polarization decomposition parameters that were suitable for forest AGB estimation. The univariate and the multivariate models were then compared. The results showed that the multivariate model estimates a high saturation point of the AGB. The saturation point of the backscattering coefficient was about 120 t/ha, and no obvious saturation point was estimated by the multivariate model. The saturation point of the variable R 1 was about 160 t/ha, which was higher than the backscattering coefficient model. This showed that the polarization decomposition parameters carried more ground information. The principal component analysis was found to be more suitable for the collinearity variable sets. As such, we used two principal components to build a multivariate model for estimating AGB without collinearity. No saturation point was found in this model, suggesting that the saturation point had been effectively improved. Finally, the principal component multivariate model was used to predict the AGB in our study area.
Previous reports have shown that the accuracy of the non-parametric model and the linear model is consistent in estimating AGB [71]. The accuracy of the multivariate model was proved to be higher than that of the univariate model [2]. Our findings were consistent with these previous findings. In the backscatter coefficient, cross-polarization had the strongest correlation with AGB [9,11]. The backscatter coefficient previously estimated the saturation point of AGB to be about 100 t/ha [17,18]. However, the saturation point of the backscattering coefficients in this study was higher than that previously reported. Many factors have been found to affect the saturation point of AGB [21]. For instance, variation in environmental conditions in a given study area was found to play a key role in causing variation in the saturation point [19]. Therefore, the different environments in our study area could have affected the saturation point. Reports have shown that the ratio of polarization backscattering coefficients has a high correlation with AGB [11]. However, the present study could not verify this finding. There is a possibility that no suitable combination of backscattering coefficients was found. Among several polarization decomposition methods, the most relevant parameter was Yamaguchi Dbl . Yamaguchi Dbl represented the secondary scattering between forest trunks, and 90% of the forest AGB was tree trunks. These results are consistent with previously reported L-band characteristics [72]. Meanwhile, the Yamaguchi three-component decomposition corresponded to the physical model. In some aspects, the performance was better than the characteristic decomposition parameters. The several scattering mechanisms of the Yamaguchi three-component decomposition make it impossible to correctly distinguish land units. However, H/A/alpha decomposition provides a different decomposition method. The H/A/alpha polarization decomposition theorem is based on the coherence matrix analysis of eigenvalues and eigenvectors. The decomposed parameters describe the main relationship between the scattering mechanisms [45]. This suggests a lack of conflict between the two polarization decomposition methods. Previous studies showed a certain connection between the scattering mechanisms [64]. This finding is supported by the variable R 1 reported in the present study. In the present study, the RF method was found to be less effective and unsuitable for small sample studies. This was consistent with previous reports [39]. The additive model was found to be suitable for estimating AGB, as previously reported [27][28][29].
The L-band was used to obtain higher accuracy. This study showed that SAR data have the potential to estimate AGB, and are not limited by the saturation point of the estimated AGB. The unary model proved to be unsuitable for accurately estimating AGB. The multivariate model was proved to have a higher saturation point. However, the results estimated by principal component analysis were the closest to the real AGB. This study showed that AGB could be accurately estimated by one SAR image. However, the present research did not achieve such results; the obtained results were more suitable for the prediction of AGB in large areas.
Future studies could, however, use more penetrating P-band data, and select study areas with high biomass levels, such as tropical rain forests. Such studies may therefore enhance the ability to estimate AGB based on model selection. The present study employed an estimation technique that obtains better accuracy at the biomass level. It proved that estimating AGB using a combination of long-wave SAR data parameters and non-remote sensing factors can address actual needs. Although previous studies investigated polarization decomposition methods, the present study obtained better results due to the choice of images and reasonable processing methods used.

Conclusions
This study investigated the effectiveness of using an L-band image to estimate AGB. The obtained results could be widely applied to estimate AGB. The problem of complex radar signals that generate high-dimensional parameter sets was also addressed, further emphasizing the wide applicability of the method. The key findings of this study were as follows: (1) The use of the backscatter coefficient to estimate AGB was more limited. The multivariate model provided better estimation capabilities than the univariate model. However, there was collinearity among the variables. (2) The backscatter coefficient estimated that the AGB saturation point was low. The variable R 1 improved the estimation of the saturation point. (3) The Earth-scattering ratio was more suitable for estimating AGB. This indicated that there was a degree of information complementarity between the variables. The combined backscatter coefficient was weak at estimating AGB. (4) The model established by combining the backscatter coefficients, terrain factors, and polarization decomposition parameters achieved high accuracy. The principal component analysis method was suitable for analyzing SAR data to estimate AGB. The final model effectively improved the saturation point of AGB.
It is noteworthy that the study did not require a large amount of SAR data to accurately estimate AGB. L-band PALSAR data can be used in most areas of the world, making this research widely applicable to the estimation of AGB in forest-covered areas. However, the level of AGB in this study was not among the highest recorded in the world. Therefore, the applicability of this method in areas with high biomass levels is still uncertain.