First Demonstration of Space-Borne Polarization Coherence Tomography for Characterizing Hyrcanian Forest Structural Diversity

: Structural diversity is recognized as a complementary aspect of biological diversity and plays a fundamental role in forest management, conservation, and restoration. Hence, the assessment of structural diversity has become a major effort in the primary international processes, dealing with biodiversity and sustainable forest management. Because of prohibitive costs associated with the ground measurements of forest structure, despite their high accuracy, space-borne polarization coherence tomography (PCT) can introduce an alternative approach given its ability to provide a vertical reﬂectivity proﬁle and spatiotemporal resolutions related to detecting forest structural changes. In this study, for the ﬁrst time ever, the potential of space-borne PCT was evaluated in a broad-leaved Hyrcanian forest of Iran over 308 circular sample plots with an area of 0.1 ha. Two aspects of horizontal structure diversity, including standard deviation of diameter at breast height ( σ dbh ) and the number of trees ( N ), were predicted as important characteristics in wood production and biomass estimation. In addition, the performance of prediction algorithms, including multiple linear regression (MLR), k -nearest neighbors ( k -NN), random forest (RF), and support vector regression (SVR) were compared. We addressed the issue of temporal decorrelation in space-borne PCT utilizing the single-pass TanDEM-X interferometer. The data were acquired in standard DEM mode with single polarization of HH. Consequently, airborne laser scanning (ALS) was used to estimate initial values of height h v and ground phase ϕ 0 . The Fourier–Legendre series was used to approximate the relative reﬂectivity proﬁle of each pixel. To link the relative reﬂectivity proﬁle averaged within each plot with corresponding ground measurements of σ dbh and N , thirteen geometrical and physical parameters were deﬁned ( P 1 − P 13). Leave-one-out cross validation (LOOCV) showed a better performance of k -NN than the other algorithms in predicting σ dbh and N . It resulted in a relative root mean square error (rRMSE) of 32.80%, mean absolute error (MAE) of 4.69 cm, and R 2 * of 0.25 for σ dbh , whereas only 22% of the variation in N was explained using the PCT algorithm with an rRMSE of 41.56%. This study revealed promising results utilizing TanDEM-X data even though the accuracy is still limited. Hence, an entire assessment of the used framework in characterizing the reﬂectivity proﬁle and the possible effect of the scale is necessary for future studies.


Introduction
Forest structure is an indicator of forest succession, development, and sustainability, which could be described by indices-based approaches [1,2]. Different aspects of stand density, species composition, horizontal and vertical tree distribution patterns, tree size distribution, and age composition characterize forest structure [2]. Thus, forest structure is interferometer has formed [39]. They provide high-resolution polarimetric acquisitions with a temporal baseline of zero (bistatic mode) and near-zero (pursuit monostatic) that could be a compensation for the key issues of space-borne PCT (see [40]). Although Xband is a rather suboptimal frequency in forest studies because of the lower penetration depth into the canopy, the sensitivity of InSAR measurements at the X-band to forest vertical structure has been shown for cases of both single-baseline [41][42][43] and multibaseline [39,44,45]. Such results stimulate more attempts to investigate X-band potentials in a wide range of forest sites. Praks et al. [30] demonstrated the sensitivity of PCT accuracy to the initial values of height and especially the ground phase. Their results showed initial values derived from airborne laser scanning data (ALS) have improved the performance of fully polarimetric repeat pass L-band PCT than when they are derived from the random volume over ground (RVoG) model [46]. On the other hand, the standard DEM mode of the TanDEM-X interferometer is single-pol operated in HH (or VV) polarization. Consequently, additional research would also be necessary for parametrizing complex coherence [30,43]. In this study, for the first time ever, single-pol (HH) TanDEM-X interferometer was used for approximating the vertical reflectivity profile of the Hyrcanian forest using the PCT technique. In addition, multiple prediction techniques, each based on different underlying assumptions and complexity, have been used to predict forest structural parameters [47][48][49][50]. Regression models are among them with demonstrated capability in predicting structural attributes while the results may be poor in the cases of weak regression relationship and/or insufficient ground sample sizes. Non-parametric algorithms, some of which use machine learning, have been shown to be an alternative approach. Therefore, the main research question is how accurately the variation in forest reflectivity profile can describe the horizontal structural diversity of the Hyrcanian forest, including the standard deviation of dbh (σ dbh ) and the number of trees (N). Moreover, we compare prediction algorithms of multiple linear regression (MLR), k-nearest neighbors (k-NN), random forest (RF), and support vector regression (SVR) in their performances. Our study provides insight into the feasibility of TanDEM-X interferometer in studying forest structure diversity by approximating the vertical reflectivity profile.

Site Description
The study area is located in the Shast-Kalateh forest, which is a part of the temperate deciduous Hyrcanian forests of Iran on the shores of the Caspian Sea ( Figure 1). It includes logged managed and unmanaged forests dominated by Parotia persica, Carpinus betulus, Fagus orientalis, and Diospyrus lotus. The area spreads over 1100 ha and is bounded by 54 • 31 -54 • 42 E and 36 • 73 -36 • 78 N. The elevation ranges from 270 to 740 m above sea level and is characterized as a middle humid climate with mean annual precipitation of 649 mm and a mean annual temperature of 15.4 • C.

Field Data and Structural Diversity Indices
The field survey was conducted across 308 circular sample plots with a size of 0.1 ha (radius = 17.84 m) in 2011 [51]. A systematic sampling design on the grid size of 150 × 200 m with a random starting point was used to locate sample plots ( Figure 1). The plot center coordinates were also estimated by the differential global positioning system (DGPS). All the trees with a diameter at breast height (dbh) of at least 12.5 cm were measured for the dbh with a caliper, height with a Vertex (VL 402), and tree species. Accordingly, the basal area, as a cross-sectional area of a tree that is assumed to be circular, and volume based on species-specific local volume tables with dbh and height as predictor variables were calculated. Table 1 shows the summary of plot-level field data, including basal-area-weighted mean dbh and height, mean volume, the standard deviation of dbh, and the number of trees.

Field Data and Structural Diversity Indices
The field survey was conducted across 308 circular sample plots with a size of 0.1 ha (radius = 17.84 m) in 2011 [51]. A systematic sampling design on the grid size of 150 × 200 m with a random starting point was used to locate sample plots ( Figure 1). The plot center coordinates were also estimated by the differential global positioning system (DGPS). All the trees with a diameter at breast height (dbh) of at least 12.5 cm were measured for the dbh with a caliper, height with a Vertex (VL 402), and tree species. Accordingly, the basal area, as a cross-sectional area of a tree that is assumed to be circular, and volume based on species-specific local volume tables with dbh and height as predictor variables were calculated. Table 1 shows the summary of plot-level field data, including basal-areaweighted mean dbh and height, mean volume, the standard deviation of dbh, and the number of trees.   A variety of structural indices have been developed to measure the diversity of forest stands in the literature [9,10,52,53]. In this study, two commonly used structural diversity indices of standard deviation of dbh (σ dbh , Equation (1)) and the number of trees (N) were used (Table 1).
where n denotes the trees and N is the number of trees in sample plot. The following criteria were met in the selection of indices. First, they are easy to measure, reliable, and relevant to biodiversity [9,[53][54][55][56]. The stands with high variation in size and number of stems are prone to having large distributions of forest-dependent species. Second, both the indices can be estimated from the national forest inventory (NFI) dataset, which can be an advantage for future global biodiversity monitoring networks [57,58]. Third, these are indicators introduced in the primary international processes dealing with biodiversity

TanDEM-X Data
Bistatic single polarized (HH) TanDEM-X data was acquired on February 2011 in strip map mode. It was collected at ascending pass with look angles of 46.22 • m and height of ambiguity (HOA) of 40.39 m. The delivered product was in a co-registered single-look slant range complex (CoSSCs) format with range and azimuth spacing of 1.36 and 2.16 m. By calculating the normalized complex interferometric coherence of s 1 and s 2 signals received at either end of the baseline ( γ, Equation (2)), the interferogram was generated and filtered to reduce speckle noises.
where * represents the complex conjugate, E is the expectation, and 0 < | γ| < 1 [46]. Furthermore, the flat earth effect was subtracted from the phase image to obtain valid height information.

Airborne Laser Scanning Data
ALS data were collected in 2011 across the study area. Riegl LMS Q560 were used to collect row data at a mean flying altitude of 1000 m above sea level under leaf-on condition. The pulse repetition frequency of 240 kHz and a scan angle of 22.5 • -30 • resulted in an average density of 4 pulses m 2 . The digital terrain model (DTM) was generated using the Kraus and Pfeifer [59] algorithm with the triangulated irregular network (TIN) interpolation. Its accuracy was investigated based on 90 ground control points and a 40 cm accuracy was achieved. TIN interpolation was also used to obtain DSM from the highest point within 1 m grid, and, simply by subtracting DTM from DSM a canopy height model was generated.

Forest Height and Ground Phase Estimation
Upper canopy height is a standard forestry parameter, defining the basal area-weighted height average of the 100 highest trees per hectare. Many studies showed its relevance to volume height measured by SAR data [30,43,60]. Considering the laser penetration into the forest, an estimate of upper canopy height was achieved by filtering CHM to the highest point in a 10 × 10 m window, called h v . This allows compensation for underestimated lidar forest height estimation [60,61]. Finally, DTM and H100 were converted to slant range TanDEM-X coordinate to use as initial values of the PCT algorithm (Section 2.5).
Moreover, unfiltered DTM was wrapped into the ground phase ϕ 0 using SAR vertical wavenumber k z and the train elevation h DTM (Equation (3)) [30,43].
where h f and ∅ f are unknown parameters. They were computed by fitting obtained ϕ 0 and SAR phase ϕ γ on open areas with sufficient high coherence, i.e., | γ| > 0.90 by minimizing the cost distance function E using the Nelder-Mead simplex method (Equation (4)). Figure 2 shows a view of ϕ 0 , ϕ γ , and h v along a transect in slant range coordinates. As you can see, the ground phase has good accordance with the dataset, especially in the open areas.
Remote Sens. 2023, 15, 555 6 of 17 minimizing the cost distance function using the Nelder-Mead simplex method (Equation (4)). Figure 2 shows a view of , , and ℎ along a transect in slant range coordinates. As you can see, the ground phase has good accordance with the dataset, especially in the open areas.

= ∑ −
(4) Figure 2. A view of ground phase ( ) estimated from ALS data against TanDEM-X SAR phase ( ). A transect has been considered as an example in elevation scale. is represented by the black line and is represented by the blue circles. Additionally, the canopy height model (CHM) and upper canopy height (ℎ ) were included in the transect by orange circles and red line, respectively.

Polarization Coherence Tomography (PCT)
PCT employs the variation of the complex interferometric coherence to reconstruct forest vertical structure function in the penetrable depth of volume scattering (Section 2.

Polarization Coherence Tomography (PCT)
PCT employs the variation of the complex interferometric coherence γ to reconstruct forest vertical structure function in the penetrable depth of volume scattering (Section 2.3). Moreover, the initial values of h v and ϕ 0 , estimated in Section 2.4.1, are used in the reconstruction. According to the RVoG scattering model, in forested areas with the randomly vertical distribution of scatterers, γ is formulated in Equation (5) [62].
where w is the polarization unitary vector, ϕ 0 is the ground topographic phase, h v is the vegetation height, k z is the vertical wavenumber, and F(z) is the vertical structure function. F(z) is defined as the vertical variation in microwave reflectivity at a point in 2D radar image. We reconstructed a new structure function, called F(ź), based on the dependency between γ(w) and F(z) in the absence of temporal decorrelation, as shown in Equation (5) [63]. For this purpose, ϕ 0 was removed and the integral range normalized to derive Equation (6). Therefore, F(ź) can be expanded as simpler functions of the Fourier-Legendre series. (Equation (7)) [29].
F(ź) = ∑ n a n p n (ź), a n = 2n + 1 2 where p n (ź) denotes Legendre polynomials with the vertical variable ofź, and a n denotes unknown Legendre coefficients. Then, γ(w) can be directly related to the unknown coefficients of Legendre by Equation (8) [63]: where f (n) are weighted integrals over the Legendre polynomials, which are a function of a single parameter k v = k z h v 2 . As it can be seen, γ(w) is the algebraic sum of series structure functions. In our single baseline case, the series was truncated for the second order, i.e., only f 0 , f 1 , and f 2 were computed (Equation (9)).
Although improved tomographic resolution can be achieved by multi-baselines, it increases the number of unknown parameters and computational complexity [35]. Hence, single baseline PCT is the simplest way of forest vertical structure approximation, which only requires a 10 and a 20 . They were estimated through matrix inversion, as shown in Equation (10). Finally, the normalized structure function for each pixel at a known layer depth of h v was obtained using estimated Legendre coefficients (Equation (11)) [40]. We approximated the relative tomographic profiles with 0.2 m intervals in vegetation height [34,35,64,65].
Characterizing Tomographic Profiles The objective of this section was to define a framework to link tomographic profiles with forest structural diversity. A direct transformation of ecology and forestry metric into PCT profile is a difficult challenge as individual tree measures are usually impossible to retrieve from tomographic profiles [66]. In this study, we mainly extracted geometric descriptors of the tomographic profile based on the method originally described by Lue et al. [37]. The arithmetic average of tomographic profiles was computed for the pixels inside each plot to further plot-level analysis [34,35,37]. Because of irregularly obtained mean profiles, i.e., the highly varied vertical reflectivity between adjacent pixels, we fitted the gaussian model for further use. Two plot-level mean relative reflectivity profiles are shown in Figure 3a, and b as examples corresponding to low (σ dbh = 9.38 cm) and high (σ dbh = 39.65 cm) structural diversity, respectively. By dividing the curve into two envelopes and defining h1, h2, and h3, ten geometrical parameters were defined (P1 − P10, Table 2).
The first envelope comprised the reflectivity values between h1 and h3 corresponded to the canopy phase zone, where h1 is the inflection point between the lower and upper parts of the mean relative reflectivity curve and h3 is considered as a height with mean relative reflectivity closest to 0.001. The h2 corresponds to the height position where the mean relative reflectivity is maximum. Notably, the data between 0 and h2 constitutes the second envelope. Moreover, the mean relative reflectivity profiles were also characterized as reflectivity power distribution, according to Zhang et al. [35], and P11 was computed. It is a physical scattering attenuation parameter obtained by introducing the area between h2 and h3 as a reflectivity power loose zone and above h3 that is mostly contributed by the noise (Table 2). In addition, the location of 3D reflectivity peaks appears significantly more robust to reflectivity variation induced by system features and environmental conditions [66]. Hence, we used the number and 3D distribution of reflectivity peaks for structure diversity estimation that were estimated by package splus2R of R (P12 − P13, Table 2) [67].
The objective of this section was to define a framework to link tomographic profiles with forest structural diversity. A direct transformation of ecology and forestry metric into PCT profile is a difficult challenge as individual tree measures are usually impossible to retrieve from tomographic profiles [66]. In this study, we mainly extracted geometric descriptors of the tomographic profile based on the method originally described by Lue et al. [37]. The arithmetic average of tomographic profiles was computed for the pixels inside each plot to further plot-level analysis [34,35,37]. Because of irregularly obtained mean profiles, i.e., the highly varied vertical reflectivity between adjacent pixels, we fitted the gaussian model for further use. Two plot-level mean relative reflectivity profiles are shown in Figure 3a, and b as examples corresponding to low ( = 9.38 cm) and high ( = 39.65 cm) structural diversity, respectively. By dividing the curve into two envelopes and defining h1, h2, and h3, ten geometrical parameters were defined ( 1 − 10, Table 2).   Table 2. Plot-level descriptors of mean relative reflectivity profile.

P1
The ratio of first envelope span to the relative reflectivity at h2 The integral of relative reflectivity multiplied by the corresponding height in the first envelope

P6
Reciprocal of relative reflectivity summation for the first envelope Reciprocal of relative reflectivity summation for the second envelope The ratio of P6 to P7 P8 = P6/P7

P9
The ratio of relative reflectivity summation in h1 and h2 to h2 and h3 The integral of relative reflectivity multiplied by the corresponding height in the whole curve Height corresponding to power loss in a z ranging from h2 to h3 with power loss value k from 0 to 1 optimized by the lidar true height value Number of reflectivity peaks with span=15

P13
The 3D distribution of reflectivity peaks

Prediction Algorithms
Multiple linear regression (MLR) and three non-parametric algorithms, k-nearest neighbors (k-NN), random forest (RF), and support vector regression (SVR), were employed to estimate plot-level forest structural diversity of σ dbh and N using P1 − P13. These algorithms and tuning parameters are described briefly below.
MLR algorithm was conducted with the stepwise AIC (Akaike information criterion) method to obtain the best combination of predictors to estimate forest structural diversity. The algorithm can treat the problem of collinearity and evaluation of predictor variables using leave-k-out stepwise AIC and clustering input variables [47].
The k-NN has been found as the most popular technique in remote sensing-assisted forest attribute prediction [48,68,69]. To predict ith population unit by the k-NN algorithm, a linear combination of observation in population units in a k nearest sample units (neighbors) in the feature space is calculated [68]. In this study, the rectangular kernel (standard unweighted k-NN) with k values ranging from 1 to 20 was considered. Additionally, four different distance metrics of Euclidian, Manhattan, Chebyshev, and Euclidian squared were compared to optimize k-NN algorithm [70].
RF is a widely used prediction algorithm based on regression trees and has been shown to be useful in reducing systematic errors and overfitting [49]. In this algorithm, the regression trees continue to grow until a minimum error in the aspect of response variable is achieved. All regression trees were implemented in a range of square root of the number of predictor variables ± 2 (F).
SVR assumes a unique relationship between each set of predictor variables and response variables and that the grouping among predictor variables is sufficient in formulating prediction rules [71,72]. This task is accomplished by hyperplanes in multi-dimensional space built from axes, representing predictor variables [50]. Here, kernels of linear, polynomials of degrees 2 and 3, radial base function, and sigmoid were compared in terms of prediction accuracy. A grid search approach based on gradient descent convergence is applied for optimizing SVR parameters, including cost (C), which regularized the slack from observation-to-kernel margin [50].

Accuracy Assessment
We used leave-one-out cross validation (LOOCV) to train prediction models and validation. In this method, the number of folds is equal to the number of sample plots in dataset. So, the learning algorithm is applied to each sample once that is withheld as a validation sample, while the remaining samples are used in training the model [73,74]. Although LOOCV ignores the independence requirement for accuracy assessment, it improves the reliability of accuracy assessment by reducing the random error caused by splitting sample data into train and test subsets [75][76][77]. All model predictions were evaluated by the absolute and relative root mean square error (RMSE), mean absolute error (MAE), and pseudo-R 2 (R 2 *) (Equations (12)-(16)).
whereŷ i is the predicted value, y i is the observed value, y is the mean of observed values, and n is the sample size. Table 3 shows the correlations of known coherency, initial value of h v , and Legendre coefficients with σ dbh and N at sample plots. Moreover, the thirteen parameters extracted from the mean relative reflectivity profile were included in the correlation analysis ( Table 3). As can be seen, some extracted parameters were meaningful when compared with a 10 and a 20 as intermediate products of PCT and coherency itself. However, upper canopy height showed the highest correlation of 0.4 with σ dbh . On the other hand, PCT parameters were more gainful to correlate with N when compared with h v . Generally, P9, P10, P11, P12, and P13 are common PCT parameters with the highest correlation with σ dbh and N.  Table 4 shows the results of σ dbh prediction with different algorithms of MLR, k-NN, RF, and SVR. The best MLR algorithm (AIC = 1085.23) predicted σ dbh with rRMSE = 33.09% and R 2 * = 0.25. Among different distance metrics of k-NN, Manhattan with optimum k = 18 had the best performance for predicting σ dbh . It resulted in an RMSE of 5.99 cm (32.80%) ( Figure 4). Whereas we did not find any obvious difference in σ dbh prediction accuracies by changing the number of F predictor variables in each tree node. Although, the RF model with F = 6 had marginally better accuracy than the others with an RMSE of 5.99 cm while explaining 25% of the σ dbh variation in our study area. It is worth mentioning that P11, P13, P9, P10, and P3 are the most important variables according to the RF algorithm. SVR with optimum RBF kernel was able to predict σ dbh with rRMSE of 33.09%, and rMAE of 25.44% that had a similar performance to what was obtained with the MLR algorithm. Table 4. Prediction accuracy of the standard deviation of dbh (σ dbh ) using multiple linear regression (MLR), k-nearest neighbors (k-NN), random forest (RF), and support vector regression (SVR) algorithms.

Algorithm
Tuning

Model Developments for Prediction of
The performance of MLR and non-parametric algorithms of k-NN, RF, and SVR in predicting are shown in Table 5. The best model of MLR with AIC of 1344.05 was able to predict with an MAE of 69.19 n and R 2 * of 0.16. Manhattan with an optimum number of k = 18 led to better results when compared to the other distance metrics of Euclidian, Euclidian squared, and Chebyshev (rRMSE = 41.56%, R 2 * = 0.22). A graph of the observed plot-level against k-NN predictions has illustrated in Figure 5. The accuracy of prediction using the RF algorithm with three predictor variables in each node was MAE and RMSE of 67.09 n and 91.42 n, respectively. The first variables with large importance, as estimated from the RF algorithm, were 11, 9, 5, 3, and 1. SVR with RBF kernel tuned out to predict more accurately than the other kernels with rRMSE of 42.69% and rMAE of 30.55%. Table 5. Prediction accuracy of the number of trees ( ) using multiple linear regression (MLR), knearest neighbors (k-NN), random forest (RF), and support vector regression (SVR) algorithms.

Model Developments for Prediction of N
The performance of MLR and non-parametric algorithms of k-NN, RF, and SVR in predicting N are shown in Table 5. The best model of MLR with AIC of 1344.05 was able to predict N with an MAE of 69.19 n ha −1 and R 2 * of 0.16. Manhattan with an optimum number of k = 18 led to better results when compared to the other distance metrics of Euclidian, Euclidian squared, and Chebyshev (rRMSE = 41.56%, R 2 * = 0.22). A graph of the observed plot-level N against k-NN predictions has illustrated in Figure 5. The accuracy of N prediction using the RF algorithm with three predictor variables in each node was MAE and RMSE of 67.09 n ha −1 and 91.42 n ha −1 , respectively. The first variables with large importance, as estimated from the RF algorithm, were P11, P9, P5, P3, and P1. SVR with RBF kernel tuned out to predict N more accurately than the other kernels with rRMSE of 42.69% and rMAE of 30.55%. Table 5. Prediction accuracy of the number of trees (N) using multiple linear regression (MLR), k-nearest neighbors (k-NN), random forest (RF), and support vector regression (SVR) algorithms.

Algorithm
Tuning

Discussion
In this study, for the first time ever, the potential of space-borne PCT technique was investigated to predict structural diversities in a mixed uneven-aged deciduous forest. Two structural diversity indices of the standard deviation of dbh (σ dbh ) and the number of trees (N) were predicted and validated based on plot-level field data. Unlike traditional tomographic studies concentrating on profile generation, it was characterized into geometrical descriptors (P1-P10, P12, and P13). We also included a P11 parameter through analyzing the vertical backscatter power variation. Moreover, the performance of MLR, k-NN, RF, and SVR algorithms were compared for forest structural diversity estimation. The results revealed no obvious difference between parametric and non-parametric algorithms in predicting σ dbh . However, k-NN with rRMSE of 32.80%, rMAE of 25.69%, and R 2 * of 0.25 showed the greatest accuracy in predicting σ dbh with the PCT technique. It produced an accuracy similar to the RF algorithm while improving the rRMSE of MLR and SVR by 0.29% (Table 4). σ dbh is a measure of variability in tree size and indicative index of the presence and diversity of microhabitats within the forest [78]. Neumann and Starlinger [9] observed a high correlation between σ dbh and a set of more complex structural diversity indices. Similarly, Zenner and Hibbs [79] found σ dbh to be a highly correlated index with structural complexity index resulting from the 3D model of forest structure. However, most of the studies related to σ dbh have been in coniferous rather than broad-leaved forests, where the simple structure of coniferous forests was the main driver of reliable predictions [8]. In particular, tomographic studies on the quantification of forest structural diversity, especially in horizontal dimension, are limited. For instance, Tello et al. [80] proposed a method to evaluate field and TomoSAR-derived σ dbh maps in specific spatial scales. They showed a rather high level of accordance (r = 0.77) between them while it was r = 0.75 when field-derived σ dbh was assessed against ALS-derived σ dbh . Notably, the study was implemented using multi-baselines' experimental tomographic data acquired at the L-band over the temperate forest in Germany. So, their higher level of accuracy compared to our predictions was expected. Considering AGB as a volumetric index related to height, a few PCT studies have addressed AGB prediction by long wavelength airborne tomographic acquisitions (L/P bands), and a high level of accuracy with R 2 of 0.77-0.86 and RMSE of 18.32-47.76 Mg.ha −1 have been obtained [34,35,37]. N is an important index in forestry and does not exclusively means the number of trees per area but also reflects the site occupancy and growing volume stock [55]. In this study, we also predicted N by means of characterizing PCT vertical reflectivity profile. The greatest accuracy of N prediction was found using k-NN algorithms; the accuracy was rRMSE = 41.56% and R 2 * = 0.22. It is worth mentioning that different parameters in prediction algorithms caused differences in their performance for predicting N in our study area. The k-NN algorithm with Manhattan distance as the most accurate algorithm improved the rRMSE of RF, SVR, and MLR by 1.36%, 1.13%, and 3% (Table 5), whereas Tello et al. [80] have reported a high correlation of 0.83 between field-measured and TomoSAR-derived N that can be explained by the high vertical resolution of multi-baseline TomoSAR.
According to natural forest dynamics, greater height variation that appears in the multi-stories canopy corresponds to a higher value of σ dbh and N resulting from different tree sizes of dominant, codominant, and suppressed trees that are competing and vice versa. Hence, the high potential of ALS data is expected for structural diversity estimation. Our results were comparable with Bottalico et al. [5], where σ dbh was estimated by the R 2 of 0.23-0.25 and the rRMSE of 41.2%-41.9%, using ALS data in the Mediterranean forests. They also investigated the capability of ALS in N estimation and an rRMSE of 82.9-89.6% was achieved. It should be noted that they adopted a linear conservative regression modeling approach with a few predictor variables and the study area was subjected to a complex forest in terms of structure and composition. In a similar study using ALS data in central Italy, the estimated accuracy for predicting σ dbh was RMSE of 3.64 cm, rRMSE of 65.47%, and adjusted R 2 of 0.35 [81]. They further predicted σ dbh using k-NN, and R 2 of 0.50 was achieved in the optimized configuration [8]. Although, our accuracy was lower than what Monnet et al. [82] obtained in a French alpine forest dominated by broad-leaved species using ALS data. They predicted σ dbh and N with rRMSE of 23.5% (R 2 of 0.63) and 29.2% (R 2 of 0.43), respectively. They also showed higher accuracy of ALS-derived estimates than values obtained by interpolation of field data. These differences are likely attributable to our more heterogenous structure that is deduced from Table 1.
This study provides promising results for the use of TanDEM-X data and singlebaseline PCT technique in highly diverse Hyrcanian forest structure, even though the accuracy is still limited and the predictions involves uncertainties. Constructing the link between PCT vertical reflectivity profile and forest structure is not well established yet. Forest structure is defined based on the geometrical properties of tree and forest stands, while the SAR reflectivity additionally depends on dielectric properties of forest elements, system frequency, polarization, and acquisition geometry. Hence, the interpretation of approximated vertical reflectivity profile and its change would be ambiguous. Notably, forest structure descriptors are computed on an individual-tree basis and do not have a direct correspondence in the SAR reflectivity profile because of its insufficient spatial resolution. Cloud [40] outlined the key issues in implementing space-borne PCT. We met the temporal decorrelation using a single-pass TanDEM-X interferometer; however, signalto-noise ratio (SNR) and the effective number of looks in coherence estimation, as well as its consequent error on Legendre coefficients estimates, can be studied further. Moreover, the quantitative characterization of forest structure that we used unavoidably incurs a loss of information as the 3D vertical reflectivity profile was projected into a 2D plan by the defining parameters of P1-P13. In particular, we approximated the relative vertical reflectivity profile at a rather small scale of 0.1 ha, which could have affected the results. Therefore, future work needs to entirely assess the limitation of the used framework and the scale in representing forest structural diversity.

Conclusions
Forests are 3D ecosystems whose structure is affected by dynamic procedures, such as growth, regeneration, decay, and disturbance. Accordingly, 3D information on forests is important for understanding and modeling forest ecosystem succession and function. In particular, structure diversity is an essential criterion in biodiversity and sustainable assessments. Hence, the PCT technique with the approximating 3D vertical reflectivity profile of forests can introduce the potential for modeling structural diversity. Moreover, utilizing the single-pass TanDEM-X interferometer makes it applicable for large-area predictions and monitoring purposes. In this study, the potential of space-borne PCT at a highly diverse forest structure using a single-pol (HH) TanDEM-X interferometer was evaluated for the first time ever. We characterized PCT vertical mean reflectivity profile in each plot by defining geometrical and physical parameters of P1-P13 to predict forest horizontal structural diversity, including the standard deviation of dbh (σ dbh ) and the number of trees (N). The performances of different prediction algorithms of MLR, k-NN, RF, and SVR were also compared. Generally, tomographic-derived parameters were meaningful when compared with Legendre coefficients of a 10 and a 20 as intermediate products of PCT and coherency itself. Interestingly, N was more correlated with tomographic parameters than upper canopy height h v . According to the results, σ dbh was predicted more accurately than N with an RMSE of 5.99 cm, rRMSE of 32.80%, rMAE of 25.69%, and R 2 * of 0.25, whereas only 22% of the variation in N was explained using the PCT algorithm with rRMSE of 41.56%. The k-NN was the most accurate prediction algorithm for both σ dbh and N. Considering the deficiency of the used framework in characterizing the vertical reflectivity profile from the 3D into the 2D plan and the possible effects of scale, an entire assessment of limitations is needed for further studies. Notably, we only evaluated the constructed models based on sample plots, while inference at population units would also be necessary for forest management and planning.