Automated Estimation of Sub-Canopy Topography Combined with Single-Baseline Single-Polarization TanDEM-X InSAR and ICESat-2 Data

.


Introduction
Digital elevation models (DEMs) are indispensable data sources for natural resource investigations, disaster monitoring, and climate and ecological change analyses [1,2].With the development of the economy and society, higher requirements have been proposed for large-scale, high-precision, and high-spatial-resolution DEMs.For this purpose, interferometric synthetic aperture radar (InSAR) is widely used for DEM extraction [2][3][4] and is considered to be a favorable tool for global DEM acquisition and updating because of its spatial continuity and independence from weather conditions [5,6].
TanDEM-X, the first twin-satellite spaceborne interferometric system, has been successfully used to acquire a global DEM with a resolution of 12 m [2,7,8].However, owing to its limited penetrability, the DEM obtained from TanDEM-X InSAR data contains significant forest height signals [9].Consequently, the obtained DEM does not reflect subcanopy topography, which is important for many applications.Therefore, it is necessary to remove the forest height signals from TanDEM-X InSAR-derived DEM.
To obtain a high-precision sub-canopy topography, methods based on polarimetric InSAR (PolInSAR) have been extensively developed [10][11][12].Among them, the random volume over ground (RVoG) model has been widely used to describe the scattering induced by forests and this can be utilized to distinguish forest and ground scattering contributions in a common resolution cell [13,14].In addition, methods based on sub-aperture [15] and tomographic SAR (TomoSAR) [16][17][18] have been used to remove vegetation signals from the InSAR-derived DEM.Such methods do not depend on any scattering model and use only InSAR observations without any external data.However, they require multibaseline, multi-polarization, time-frequency, or long-wave SAR data.Therefore, these methods cannot be applied to standard TanDEM-X InSAR data, owing to the limited number of interferometric pairs, single polarization, and poor penetrability.
The goal of estimating large-scale sub-canopy topography was typically achieved by estimating forest stand heights at their boundaries and subtracting average values from the InSAR-derived DEM [19].However, forest boundary height does not truly represent the height of the entire forest.To overcome this limitation, some studies have suggested employing external data acquired by spaceborne LiDAR systems [20][21][22][23], including the Ice, Cloud, and Land Elevation Satellite (ICESat) and the Global Ecosystem Dynamics Investigation (GEDI).There are two methods for obtaining sub-canopy topography by introducing spaceborne LiDAR data: (1) using forest heights obtained by the spaceborne LiDAR satellite combined with multi-source remote sensing data to estimate forest height, and then removing the forest signal in the InSAR-derived DEM in proportion [9,21,23,24].Alternatively, Wang et al. estimated the penetration depth of the InSAR signal based on the infinite depth hypothesis model and established a relationship between forest height and penetration depth to obtain the sub-canopy topography [9].Unfortunately, the synchronization of InSAR, spaceborne LiDAR, and other remote sensing data is difficult to guarantee, leading to great uncertainty in forest dynamic change areas.(2) Using spaceborne LiDAR ground elevation as a reference, combined with parameters for optical characteristics for regression analysis to train and predict the scattering phase center (SPC) height, and then the SPC height is subtracted from the InSAR-derived DEM to obtain subcanopy topography [20,22,[25][26][27][28].However, it is difficult to guarantee consistency of the temporal and spatial resolutions of these optical variables.More importantly, introducing optical data to estimate SPC height may have limitations related to the availability of cloudless data, spatial heterogeneity of forest attributes, and lack of sensitivity to forest density in tall and dense forests [29].
To overcome these limitations, this study combines the following innovations and proposes a framework for estimating sub-canopy topography: (1) Modeling to estimate SPC height using only the interferometric coherence and slope obtained from TanDEM-X InSAR data.Compared with optical parameters, Tan-DEM-X InSAR interferometric coherence records the scattering process of InSAR signals in the forest and is sensitive to forest height, vertical structure, and dielectric properties.More importantly, it exhibits time synchronization and consistency in spatial resolution with the InSAR-derived DEM.(2) ICESat-2 terrain control points (TCPs) with similar forest and slope conditions were adaptively screened within a local area to establish the model.The highlight of the framework for the estimation of SPC height is that the screening strategy comprehensively considers the spatial heterogeneity of forest characteristics and the influence of slope.In addition, the adaptive selection of TCPs enables the framework to estimate the SPC height well when ICESat-2 data are sparse.
(3) A weighted least-squares criterion was used to solve the model to estimate the SPC height, which can adapt to the influence on the results of different distances of the ICESat-2 TCPs.
The remainder of this paper is organized as follows.Section 2 describes the proposed framework for sub-canopy topography estimation.Section 3 describes the study area and the available experimental data.In Section 4, the effectiveness of this method is tested using single-polarization and baseline TanDEM-X InSAR data from four test sites with different topographies and forest conditions.Section 5 further discusses the results of subcanopy topography estimation.Finally, our conclusions are presented in Section 6.

Methods
The purpose of this study was to estimate high-precision sub-canopy topography by combining TanDEM-X InSAR data and ICESat-2 TCPs.The method proposed correlates elevation differences (i.e., SPC height) from ICESat-2/InSAR-derived DEMs with interferometric coherence and slope.The overall flowchart of the algorithm is shown in Figure 1.The first step was to obtain a high-precision InSAR-derived DEM and corrected coherence through interferometric processing of the TanDEM-X InSAR data.Simultaneously, InSAR-derived DEM, interferometric coherence, and slope were matched to the closest ICESat-2 orbital geodetic (latitude and longitude) locations.The second step considered the influence of interferometric coherence and slope on SPC height to model and obtained sufficient TCPs through the screening strategy to solve the model coefficients using the least squares criterion (   = ).The second step is key in the proposed method to estimate the sub-canopy topography and highlight the innovative contribution of the proposed method, which is described in detail in the following sections.The third step is to predict the SPC height of a specified pixel and remove it from the InSAR-derived DEM to obtain the sub-canopy topography.

Scattering Phase Center Height Retrieval Algorithm
The TanDEM-X system can simultaneously acquire two complex SAR images,  1 and  2 , and perform interferometry as follows [30]: where ⟨•⟩ represents expectation operation,  is the complex interferometric coherence.In forested areas, although  is not affected by temporal decorrelation, many sources of decorrelation still reduce interferometric coherence [30,31].Among them, the decorrelation associated with the SAR sensor parameters, acquisition geometry, thermal noise, and quantization errors can be canceled or minimized using known specific strategies [6,32,33].Therefore, the compensated interferometric coherence directly manifests as volume decorrelation, which is related to forest height, vertical structure, and dielectric properties and is an important parameter for the estimation of SPC height.The interferometric coherence subsequently used for modeling in this study refers to the volume decorrelation after compensation.
The interferometric phase was used to estimate DEM [9,34], but in forested areas, the InSAR-derived DEM cannot represent the real surface elevation [20,35].As shown in Figure 2, in mountainous areas covered by vegetation, the distance D was affected by forest height and terrain slope.Owing to the influence of the forest, the actual distance Dr of the TanDEM-X InSAR signal propagation was shorter than D (Figure 2a).This part is the SPC height caused by forest volume scattering.In addition, as shown in Figure 2b,c, when there is terrain slope α of the ground surface, the slope causes the distance D to increase or decrease.A larger slope will further increase the change in D [20,36,37], which will affect the accuracy of obtaining the sub-canopy topography using InSAR technology.In conclusion, the SPC height is related to forest volume scattering and is determined by the forest height, forest density, and dielectric properties [22,38,39].In other words, the SPC height can be linked to interferometric coherence [13,14,40].In addition, the slope is also a key factor affecting the SPC height.However, single-baseline and single-polarization TanDEM-X data acquired in standard mode makes it difficult to satisfy the existing physical models (e.g., random volume over ground, RVoG) to estimate the SPC height.Fortunately, spaceborne LiDAR systems (e.g., ICESat-2), although they have a low spatial resolution, can estimate some SPC heights from TanDEM-X InSAR data.Using the sparse SPC heights, a multi-regression model can be produced with the form: where  is the real SPC height,  is the fitting residual, (, ) is the SPC height of the polynomial fitting,   ( = 0, 1,•••,7) is the polynomial coefficient,  represents the interferometric coherence,  represents the slope.The slope (derived from the Tan-DEM DEM) represents the terrain slope in the range direction and is positive when facing toward the SAR satellite.

Estimation of the SPC Height
This model solution requires sufficient ICESat-2 TCPs to improve robustness.For the entire SAR image, the existing regression analysis strategy uses all screened ICESat-2 TCPs to fit and solve a set of coefficients to predict the SPC height of the ICESat-2 uncovered area [20,22,26].However, in many cases, it is difficult to predict the SPC height using interferometric coherence because the same interferometric coherence can be measured in forests with different vertical structures and dielectric properties.That is, the same interferometric coherence may correspond to different SPC heights.Therefore, to estimate SPC height more accurately, the orthogonal model applies a local fitting strategy and selects similar ICESat-2 TCPs for any pixel to fit and predict SPC height.
As shown in Figure 3, the red rectangle represents the pixel within which SPC height is to be predicted, and the dot with the same color (red) represents the ICESat-2 TCPs used for fitting.The ICESat-2 TCPs were selected for fitting based on the following two conditions: (1) Condition 1: Search within a local circular area with the pixel to be predicted as the center and R as the radius.The initial value of R can be empirically set based on the existing coverage density of ICESat-2, which is 100 pixels in this study.We assumed that forest conditions (i.e., type, density, and dielectric properties) were the same or similar within a local area.When the slope is zero, different interferometric coherences correspond to different SPC heights.(2) Condition 2: Based on Condition 1, ICESat-2 TCPs with the same slope direction (positive or negative) were selected to solve the orthogonal polynomial model.As shown in Figure 2, when a certain terrain slope exists, it leads to a higher or lower SPC height at the same geographical location [20].In addition, the slope distorts the relationship between interferometric coherence and SPC height.In the positive slope area, the interferometric coherence decreased with an increase in slope, because the increase in slope leads to an increase in volume scattering caused by the forest canopy in the same forest environment [41].Conversely, in the negative slope area, the increase in slope increases the surface scattering that occurs at the top of the forest, whereas the volumetric scattering caused by the forest canopy decreases.Therefore, this selection condition was beneficial for avoiding the heterogeneity caused by different slope directions to better predict the SPC height.Based on the above two conditions, for each pixel whose SPC height was to be predicted,  ICESat-2 TCPs were selected for modeling using an iterative method.The least-squares criterion was then used to solve the coefficients of the model [42,43], and the following equations can be obtained from the values of  (e.g., 30) ICESat-2 TCPs: The matrix form of each can be expressed as follows: According to formula (4) and the principle of least squares    = , the values of the parameter   ( = 0, 1, •••, 7) to be determined is as follows: where  represents the weight function of different ICESat-2 TCPs.Although we selected points with similar forest conditions in a local area, there are still some differences between them and the predicted points.The closer the points, the more likely it is that the forest conditions are similar in structure, type, and dielectric properties.As the distance increases, the degree of similarity gradually decreases.Therefore, the weight function can be used to reflect the degree of correlation between the ICESat-2 TCPs and the points to be predicted [44].Considering that weight   is related to distance in this study, the inverse ratio of the square of the distance was used as the weight, which is expressed as follows: where   is the distance between the ICESat-2 TCPs and the point to be predicted.  and   are the coordinates of the ICESat-2 TCPs, and  and  are the coordinates of the points to be predicted.

Study Area
Four test sites (A, B, C, D in Figure 4a) with different forest types and terrain were selected to test the applicability of the proposed method for estimating sub-canopy topography.The first test site (Figure 4b) is located in Krycklan in northern Sweden, where boreal coniferous forest dominates, with an average forest height of approximately 18 m.Hills are the main terrain at this test site, with several canyons and altitude ranges from 50 to 350 m.The second test site (Figure 4c) is located in Remningstorp, southern Sweden.The forest area is divided into a semi-northern area, with the tallest forest approximately 35 m.The terrain is relatively flat, except for the hills in the southwest, with elevations ranging from 120 m to 200 m.The third (Figure 4d) and fourth (Figure 4e) test sites are located in the Lope and Mabounie regions of Gabon, Africa.These two test sites are dominated by tall and dense tropical rainforests, with the tallest forest exceeding 60 m.The average forest height at the Lope test site is approximately 50 m, whereas that at the Mabounie test site is slightly lower, with an average height of approximately 30 m.These sites show visible topographic relief, and there are many fragmented slopes west of the Lope test site and northeast of the Mabounie test site, and the ground elevations range from 200 m to 750 m.

TanDEM-X InSAR Data
Table 1 details the parameters of the TanDEM-X co-registered single-look slant-range complex (CoSSC) data used.All data were acquired with HH polarization between 2011 and 2015, and the interferometric baseline was between 103.4 m and 185.9 m.Although there are large baseline differences, their HoAs indicate that all data are suitable for measuring forest height in the corresponding test sites, with satisfactory sensitivity to forest height, density, and dielectric properties.The ICESat-2 satellite was launched in September 2018 and uses the Advanced Topographic Laser Altimetry System (ATLAS) to acquire global elevation information, whose measurements provide elevation data for a wide range of scientific disciplines [45,46].In this study, we used the ATL08 product, which is the only along-track product that provides elevation data for land and vegetated surfaces [47], and all data were acquired from October 2018 to June 2022.To obtain reliable control points, it is necessary to screen the ATL08 data and retain as many points as possible while ensuring accuracy [48].The yellow dots in Figure 4b-e show the tracks of the ICESat-2 control points after screening.Table 2 shows the number of ICESat-2 control points used for modeling, ranging from several thousand to tens of thousands.Compared to the number of pixels in the entire test site, the data of the control points accounted for less than 0.3%.The TanDEM-X Forest Non-forest (FNF) Map was obtained using DLR from Tan-DEM-X InSAR data [49], which included four categories: water, forest, non-forest, and urban areas.To avoid misestimating water and urban areas, we used the FNF map for masking.

Airborne LiDAR Data
As shown in Figure 4b-e, the blue dotted boxes show the coverage of high-precision and high-resolution airborne LiDAR data.The Swedish Defense Research Agency acquired high-precision LiDAR data from the Krycklan and Remningstorp test sites during the BIOSAR2007 and BIOSAR2008 missions [50,51], respectively.During the AfriSAR mission, NASA obtained high-precision LiDAR data from the Lope and Mabounie test sites [52].The above LiDAR products were used to analyze and verify the accuracy of the estimated sub-canopy topography.

Comparison of InSAR-Derived DEM and LiDAR Data in Vegetated Areas
The original TanDEM-X CoSSC data were used to estimate DEM by interferometry.Low interferometric coherence (less than 0.3) and geometric distortion can cause severe data processing bias, so we mask these areas.Additionally, water and urban areas were masked using FNF maps.
As shown in Table 3, the forest heights were grouped in intervals of 10 m, and the DEM was compared with the airborne LiDAR data.The performance of the TanDEM-X InSAR-derived DEM was analyzed by calculating SPC height and penetration depth.For the four test sites, the mean difference between the InSAR-derived DEM and LiDAR DTM increased significantly with vegetation height.For the mean, the average difference exceeded 30 m higher at forest heights ≥ 50 m than 0~10 m.In addition, compared to the LiDAR digital surface model (DSM), the InSAR signal exhibited penetrability, and the penetration depth increased with forest height.However, when forest height exceeded 20 m, the increasing trend of the penetration depth was significantly reduced.Considering the known limitations of X-band SAR signals in terms of forest penetration, this was not surprising.Compared to the other three test sites, the rate of increase in the mean difference in DEM caused by forests was relatively small at the Krycklan test site.Owing to the sparse distribution and short forest at this test site, the influence of forests on the accuracy of the InSAR-derived DEM was weakened.In addition, we counted the number of pixels whose difference between InSAR-derived DEM and LiDAR DTM exceeded 0.5 m in each forest height interval, which increased with the increase in forest height.As shown in Table 3, the percentage of pixels with significant differences increased to approximately 100% in the group with the greatest vegetation height among the four test sites.A similar situation emerged when compared with the LiDAR DSM.In addition, the standard deviation between the InSAR-derived DEM and LiDAR data was relatively random, which was caused by forest type and density at different forest height intervals.Notably, interferometric coherence showed a significant correlation with the mean deviation, which decreased significantly with increased forest height.Based on this, interferometric coherence can become a key variable in the estimation of SPC height, highlighting its innovative contribution to this study.

Validation of Estimated Sub-Canopy Topography with Airborne LiDAR DTM
Figure 5 shows the DEMs obtained from the TanDEM-X InSAR data for the four test sites.The topography of the Krycklan (Figure 5a) and Lope (Figure 5c) test sites was obviously undulating, whereas the topography of the remaining two test sites was relatively flat.Figure 6 shows the SPC height estimated using the proposed method.Because of the different forest heights and densities, the Krycklan test site (Figure 6a) was less affected by forest height, and the estimated average SPC height was approximately 5 m.The Lope test site (Figure 6c) was covered by dense tropical rainforests with an average forest height of approximately 50 m, and the estimated SPC height exceeded 30 m in most areas.These results are consistent with those presented in Table 3.In addition, the SPC height estimated by the proposed method also showed excellent performance for bare land and lowvegetation areas, especially at the Remningstorp and Lope test sites.As shown by the red rectangle in Figure 6b, the estimated SPC height of the farmland area was close to zero, and the boundary between farmland and forest was clearly apparent.In addition, the area shown by the red rectangle in Figure 6c is a grassland dominated by low shrubs.The estimated SPC height was significantly smaller than that of the rainforest area, and there is a very significant dividing line with the rainforest.This ability to discriminate demonstrated the robustness of the proposed method for different types of vegetation cover.To demonstrate that the SPC was highly separated from the InSAR-derived DEM, local regions m and n, marked by blue rectangles in Figure 5, and profile AB, marked by the black solid line, were selected for analysis.Figure 7 enlarges the marked areas m and n, which assists in the intuitive analysis of the performance of the proposed method in separating SPC height signals, and also shows an optical image (Figure 7(c1,c2)) for comparison and reference.Compared to Figure 7(c1,c2), the vegetation-covered InSAR-derived DEM (Figure 7(a1,a2)) contains obvious forest signals, especially in the area marked with red solid lines, and the height gradient between bare ground and vegetation is clearly apparent.The elevation of the sub-canopy topography (Figure 7(b1,b2)) is significantly lower than that of the InSAR-derived DEM.In addition, the gradient between forest and non-forest boundaries appeared smoother, and terrain texture features were more prominent.Surprisingly, at the Krycklan test site, the road originally covered by trees became more visible, as shown by the blue rectangle in Figure 7(b1).As shown in Figure 8, the estimated sub-canopy topography was further compared with the InSAR-derived DEM and LiDAR DTM by analyzing profile AB.For the four test sites, compared to the InSAR-derived DEM, the elevation of the sub-canopy topography was significantly reduced and was closer to the LiDAR DTM.Although there are deviations from the LiDAR DTM at some locations owing to model errors, the overall consistency demonstrates the feasibility of the proposed method for the estimation of subcanopy topography.To evaluate the acquired sub-canopy topography in detail, high-precision airborne LiDAR DTMs, represented by blue dashed boxes in Figure 4, were used for comparison.Figure 9(a1-a4) shows an overview of the InSAR-derived DEM of the four test sites, and Figure 9(b1-b4) shows the difference maps between the InSAR-derived DEM and the Li-DAR DTM.Due to the influence of forest scattering, the InSAR-derived DEM was significantly higher than the LiDAR DTM.As shown in Figure 9(c1-c4), using the proposed method, the SPC height in the InSAR-derived DEM can be effectively removed, and the differences between sub-canopy topography and LiDAR DTM are mostly concentrated around zero.Furthermore, we used the LiDAR DTM to obtain statistics on the accuracy before and after SPC height correction in the vegetated area.As shown in Figure 10, the InSAR-derived DEMs were much higher than the ground truth values and contained significant vegetation signals.After removing SPC height signals, the coefficient of determination (R 2 ) between sub-canopy topography and LiDAR DTM improved, particularly at the Remningstorp test site, from 0.705 to 0.811.Due to the different forest and terrain, the RMSE of InSAR-derived DEM ranged from 6.12 m to 28.09 m.For the sub-canopy topography, the RMSE of the forest area ranges from 2.78 m to 8.27 m, both of which were significant improvements.The improvement in accuracy of the four test sites after removing the SPC height was 54.5%, 53.1%, 70.5%, and 67.2%.The significantly improved RMSE further demonstrates the robustness of the proposed method for the estimation of subcanopy topography for different terrains, forest types, and forest heights.

Advantages of Using Coherence and Local Modeling
Many studies have attempted to predict forest height signals in an InSAR-derived DEM using regression analysis with a small number of optical variables and ICESat-2 elevation [20,22,[25][26][27][28].However, owing to the penetrability of InSAR signals, optical remote sensing cannot fully record forest properties measured by InSAR because it can only observe the forest surface and is limited by the availability of cloud-free data [29].To verify the robustness of the proposed method for sub-canopy topography estimation, it was compared with the following two cases: Case 1: Using Landsat Vegetation Continuous Field (VCF) data and slope for regression analysis to predict the SPC height [26].Similar to the existing work, we selected the latest version of VCF in 2015 [53] and downloaded it from the USGS website (https://e4ftl01.cr.usgs.gov/MEASURES/GFCC30TC.003/2015.01.01/, accessed on 1 March 2023).The VCF data files used in this study are listed in Table 4. Case 2: Replace the VCF data in Case 1 with the interferometric coherence obtained by TanDEM-X to verify the performance of the estimation of sub-canopy topography without relying on external forest information products.In contrast to the proposed method of fitting multiple models, both Cases 1 and 2 fit only one model.
Table 4 shows the RMSEs of the sub-canopy topography estimated by different methods compared with the LiDAR DTM, and shows the accuracy of the InSAR-derived DEM for reference.Compared with the InSAR-derived DEM, the accuracy of the sub-canopy topography obtained in Case 1 improved by more than 50% on average.When the VCF data were replaced with interferometric coherence (i.e., case 2), similar accuracies were observed, and the Krycklan and Lope test sites showed slightly better results.In summary, interferometric coherence can completely replace VCF data and is more suitable for estimations of SPC height in many cases.In addition, the proposed method using local modeling to estimate sub-canopy topography achieved better accuracy, with the accuracy of sub-canopy topography improved by nearly 60% on average, demonstrating the effectiveness and robustness of local modeling.

Correlation between Forest Height and the Accuracy of Sub-Canopy Topography
Figure 11 shows histograms of the RMSEs of InSAR-derived DEM and sub-canopy topography accuracy with forest height variation.The RMSE of the InSAR-derived DEM increased with forest height.In particular, at the Lope test site, when forest height exceeded 50 m, the RMSE reached 37.63 m.After removing SPC height, the accuracy of the sub-canopy topography significantly improved in each forest height range.However, the RMSE of sub-canopy topography remained larger in areas of tall forests than in areas of short forests.This is because the ICESat-2 system cannot detect the real surface in dense and tall forests [54,55].For the Lope and Mabounie test sites, the RMSE of sub-canopy topography exhibited a trend of increase-decrease-increase, compared with the Krycklan and Remningstorp test sites, showing slightly worse accuracy in the forest height ranges of 0 m to 30 m.This was because there are few low-vegetation areas in the Lope and Mabounie test sites, and the SPC height in the low-vegetation area was significantly overestimated because of the influence of the adjacent area of high vegetation.This phenomenon of SPC height overestimation can be observed by comparing Figure 9(b3-b4) and Figure 9(c3-c4).

Limitations and Future Enhancements
In this study, we proposed a method for estimating sub-canopy topography using only TanDEM-X and ICESat-2 data.Validated at four test sites, the accuracy of the estimated sub-canopy topography product is satisfactory for most applications.However, the following limitations and future improvements must be further investigated to generalize the proposed method and achieve more accurate estimates of sub-canopy topography at large scales.
First, the core of the proposed method was to select ICESat-2 TCPs with the same or similar forest and slopes in the local area and use orthogonal polynomial modeling to predict SPC height.However, in forest areas with complex forest stands, there may also be differences in local forest conditions; therefore, it is necessary to develop more appropriate strategies for selecting ICESat-2 TCPs.In addition, the proposed method needs to model each pixel individually, which greatly increases the computational cost compared with traditional global modeling.Therefore, quickly and accurately selecting appropriate control points or using external remote sensing data for assisted modeling is key to accurately estimating sub-canopy topography.
Second, the advantage of using inverse distance weighting is that it considers the spatial heterogeneity of forest attributes, and can better characterize the local characteristics of spatial data.However, it is sensitive to gross errors.When ICESat-2 TCPs have large errors in a local region, the accuracy of estimated SPC height will be reduced.Therefore, it is necessary to extract more accurate ICESat-2 TCPs, such as a more suitable filtering method for extracting TCPs from the ICESat-2 raw data.
Finally, the proposed method requires sufficient ground control points to support modeling.In this study, the number of ICESat-2 TCPs in the Lope and Mabounie test sites was less than 0.1% of the corresponding TanDEM-X pixels, which led to an overestimation of SPC height in areas with low vegetation.In addition, comparing Figure 4 and Figure 9 of the Lope and Mabounie test sites, the error in estimating sub-canopy topography in the sparse area of the ICESat-2 TCPs was significantly larger than that in the dense area.To solve this problem, we must investigate whether it is necessary to introduce more spaceborne LiDAR data, including GEDI data.These results will help us further determine the relationship between InSAR observations and SPC height and achieve large-scale and higher-precision estimates of sub-canopy topography.These issues should be investigated in future studies.

Conclusions
There are a variety of methods to remove forest signals from InSAR-derived DEMs to estimate sub-canopy topography, such as regression analysis relying on external optical data.However, it is difficult to guarantee the consistency of temporal and spatial resolution of optical data and InSAR-derived DEM and the availability of cloud-free data.In contrast, the observations obtained by InSAR technology are not limited by the above problems, and fully record the forest characteristics measured from the InSAR perspective.Based on this, this study proposed a practical method for estimating sub-canopy topography by combining single-baseline, single-polarization TanDEM-X InSAR and ICE-Sat-2 data.The proposed method has been tested in four test sites characterized by different vegetation and terrain.The results show that the SPC height caused by forests in In-SAR-derived DEM is effectively eliminated.Afterward, we used high-precision LiDAR DTM for verification, the accuracy of sub-canopy topography in four test sites ranges from 2.78 m to 8.27 m.The successful results provide a basis for future work and explore the application of this method to the estimation of large-scale sub-canopy topography.In addition, given the effectiveness and robustness of the proposed method, it can be used as a supplementary tool for future spaceborne SAR missions to estimate sub-canopy topography, such as LT-1, TanDEM-L, and BIOMASS missions, when only single-baseline and single-polarization data are available.

Figure 1 .
Figure 1.A schematic of the workflow for estimation of sub-canopy topography.

Figure 2 .
Figure 2. Effects of vegetation and different surface slopes on TanDEM-X InSAR signal acquisition DEM.(a) Flat surface, (b) The surface aspect toward the TanDEM-X satellite, (c) The surface aspect opposite to the TanDEM-X satellite.

Figure 3 .
Figure 3. Schematic diagram of ICESat-2 TCPs selection and weighting scheme.The red rectangle is the pixel of the height of the SPC to be predicted.The dots represent the ICESat-2 footprint points, where red indicates control points with the same/similar forests and terrain as the pixels to be predicted, and blue indicates control points that do not participate in the modeling.

Figure 4 .
Figure 4. Study area and data used.The red five-pointed stars in (a) indicate the geographic locations of the study areas.In the following four illustrations (b-e), the base map is the InSAR backscatter intensity map, the blue dotted box indicates coverage of airborne LiDAR verification data, and the yellow dots are ICESat-2 control points.

Figure 5 .
Figure 5. DEMs estimated from TanDEM-X InSAR data: (a-d) are the DEMs of the Krycklan, Remningstorp, Lope and Mabounie test sites respectively.The blue rectangle represents the area used for enlarged analysis in Figure 7, and the black solid line AB represents the location of the profiles in Figure 8.

Figure 6 .
Figure 6.The SPC heights estimated by the proposed method: (a-d) are the SPC heights of the Krycklan, Remningstorp, Lope and Mabounie test sites respectively.The red rectangular highlight results for forested and non-forested areas.

Figure 7 .
Figure 7. Enlarged view of regions m and n marked with blue rectangular boxes in Figure 5. (a1,a2) InSAR-derived DEM, (b1,b2) sub-canopy topography, (c1,c2) true color image acquired by the Landsat 8 satellite.The red line box is used to analyze areas where the SPC height is significantly removed from the DEM.

Figure 8 .
Figure 8. Elevation profiles at the location of line AB marked in Figure 5: (a-d) correspond to the profiles marked at the Krycklan, Remningstorp, Lope, and Mabounie test sites in Figure 5, respectively.

Figure 10 .
Figure 10.Scatterplots to verify the accuracy of the InSAR-derived DEM and sub-canopy topography using the airborne LiDAR DTM, with colors representing scatterplots density increasing from blue to red.(a1-a4) The accuracy of InSAR-derived DEM, (b1-b4) the accuracy of sub-canopy topography.

Table 1 .
Geometric parameters of the TanDEM-X CoSSC data for the four test sites.

Table 2 .
The number and percentage of ICESat-2 data used for modeling.

Table 3 .
Statistics of InSAR-derived DEM minus LiDAR products, classified by forest height in 10 m intervals.

Table 4 .
Statistical results of the performance of three different methods for sub-canopy topography estimation.The percentage represents the accuracy improvement of sub-canopy topography estimated by different methods compared with InSAR DEM.