1. Introduction
Monitoring forest growth and loss, particularly the above-ground biomass (AGB), has become increasingly essential for a range of purposes such as forest management and global carbon accounting and carbon cycle modeling [
1]. Deforestation and forest degradation in the tropics contribute to considerable fractions of anthropogenic greenhouse gas emissions globally [
2]. Mitigation approaches such as performance-based payments for reducing deforestation and degradation require regular national scale estimates of changes in forest biomass and the corresponding green-house gas emissions [
3]. In the absence of a national forest inventory for regular biomass estimation, a commonly used approach to estimate forest carbon emissions involves detecting deforestation area through remote sensing and applying biomass densities or emission factors. This method aligns with the IPCC’s Gain–Loss approach, which estimates carbon stock changes by combining activity data with emission factors [
4]. More generally, this means that areas of land use that change categories are multiplied by their corresponding emission factors [
5]. Detection of deforested areas is straightforward in using a range of methods, including optical or SAR sensors, and associated emission factors can be taken from large-scale data sources [
6,
7,
8].
However, this approach has three major limitations. First, it only detects biomass loss from clearcuts, while biomass loss from partial logging such as thinning, single-tree logging or forest degradation remains undetected. This is a technological limitation, as the commonly used methods require a considerable change in the signal, i.e., a change so clear that it only occurs through a change from forest to bare soil and ground vegetation. Secondly, changes in forest biomass contain both losses and gains, and the greenhouse gas emission depends on the net change. A sustainable forest management with net zero emission can contain clearcut areas, where emissions are compensated by carbon sequestration in other forest compartments. Finally, the emission factors used often have high uncertainty or are erroneous [
9,
10]. Alternatively, AGB change estimation could be based on changes in the elevation of a digital surface model (DSM). Such a DSM should preferably be based on a short wavelength sensor, i.e., in the range from visible light to short wavelength SAR, to be present in the upper part of the forest canopy. The rationale for this approach is that the height and density of trees largely determines forest AGB, while it also largely determines the height of the DSM above ground. One suitable type of DSM is the phase center from an across-track, single-pass interferometer [
11], such as TanDEM-X [
12], which we use in the present study. With this approach, forest growth would be picked up as an increase, while any type of logging spanning the range from single-tree logging to a clearcut would be seen as a decrease caused by more canopy gaps and deeper microwave penetration. In addition, instead of using fixed emission factors per area unit, with this approach we would need an emission factor per m height change in the DSM. This could be seen as an emission factor per m canopy height [
11,
13,
14,
15,
16,
17].
There is another requirement regarding new methods. Many tropical countries are lacking both a national forest inventory based on field plots and a Digital Terrain Model (DTM). With the launch of the space lidar Global Ecosystem Dynamics Investigation (GEDI) [
18], it is possible that for such countries, field inventory in remote and hardly accessible forest areas can be replaced by predicted biomass from GEDI. There is a need for methods that can work also for such countries.
The objective of this study was to demonstrate how an across-track, single-pass interferometer like TanDEM-X together with GEDI can be used for deriving changes in AGB at the national scale without requiring a national scale DTM nor a national dataset of AGB from field inventory. The approach can provide the basis for national MRV systems by providing consistent, spatially explicit biomass data essential for an accurate estimation of CO
2 emissions and removals from forest land, as recommended by the 2006 IPCC Guidelines for National Greenhouse Gas Inventories [
4].
This study builds on a previous national-scale study in Uganda that used InSAR height changes from SRTM to TanDEM-X [
19]. Compared to that study, the present study is an advancement by using repeated TanDEM-X acquisitions, by utilizing GEDI [
18], and by using results from a national forest inventory. Furthermore, the present study works on a national scale rather than a smaller test area as in Solberg et al. [
20], and uses the easily available global DEM products rather than single-acquisition datasets in a single-look complex format that requires considerable processing.
2. Materials and Methods
2.1. Study Area and Period
The study area was the entire Tanzania (
Figure 1). However, we partly limited the analyses to the mainland, i.e., excluding Zanzibar, because the reference data from Tanzania’s National Forest Resources Monitoring and Assessment (NAFORMA) only covered that. The period was eight years starting with 2012 and ending with 2019. We selected this period to closely match the time of the other datasets we used. This was the case for one of the TanDEM-X change DEMs, the NAFORMA field inventory around 2012, and the timing of the GEDI data acquisitions.
2.2. MODIS Land Cover
We derived land cover classes from the Moderate Resolution Imaging Spectroradiometer (MODIS) sensor. We used the MODIS Land Cover Type Product (MCD12Q1) and specifically Land Cover Type 1 (LC_Type1) which is generated for the International Geosphere-Biosphere Programme (IGBP). This land cover dataset has 17 classes, of which 15 were present in Tanzania. We used version 6.1 representing the year 2022. The spatial resolution was 500 m which we resampled to the TanDEM-X grid using the nearest neighbor. We downloaded the MODIS data from
https://lpdaac.usgs.gov/products/mcd12q1v061/ on 2 July 2024. We used QGIS v. 3.34.11 and Python v. 3.8.6 for data management.
2.3. Global Forest Watch
To compare our method in the present study with conventional approaches, we obtained forest loss data from Global Forest Watch (GFW) [
6]. We combined annual forest loss data from the eight years 2012–2019 to represent the same period as TanDEM-X change data, by using data that has ‘Lossyear’ version 2023-v11.1. As a loss area, we required that the area should have had at least 10% of tree cover prior to the deforestation by using the ‘treecover’ setting in GFW. For tree cover prior to the period 2002–2013 we used data for the year 2000, while for 2012–2019 we used year 2010. The spatial resolution of these data was 0.00025° which we resampled to the TanDEM-X grid using the nearest neighbor. We downloaded these data from
https://www.globalforestwatch.org/ on 26 September 2024.
2.4. Tandem-X
We downloaded three TanDEM-X datasets, i.e., the GlobalDEM from around 2012 (
Figure 1b), and the ChangeDEM which contained the two datasets
first and
last (
Figure 1c) representing elevation changes from the former [
21,
22]. All these datasets had complete coverage of Tanzania. The former was generated from acquisitions taken over four years from December 2010 to January 2015 [
23]. The two ChangeDEMs represented two alternative time periods. Since the majority of Tanzania was covered by more than one acquisition during the change period, the
first was based on the oldest acquisition and the
last on the newest. Each of these datasets was a mosaic from different acquisitions where each pixel had a time stamp. The GlobalDEM fitted reasonably well in time with NAFORMA field inventory and ChangeDEM
last fitted with the time of the GEDI acquisitions we used. We calculated the mean value of the ChangeDEM
last for the entire Tanzania and separately for each land cover type.
Prior to these analyses, we discarded a small fraction of pixels that we considered unreliable. In this filtering, we excluded pixels that had the following: (1) a change in DEM elevation
> 25 m, (2) an estimated height change error > 1 m, and (3) MODIS land cover types 13 (Urban and Built-up Lands), 16 (Barren) or 17 (Water Bodies). We derived the height change error from the auxiliary datafile “Height Accuracy Indication (HAI)” which contained pixel-wise estimates of height change error based on the height errors of the initial DEM (GlobalDEM) and the last DEM [
21]. These datasets had 1 arcsec spatial resolution, which corresponds to an area of 947 m
2, or 30.7 m × 30.9 m, for a pixel in the center of Tanzania.
2.5. GEDI
GEDI is a spaceborne LiDAR generating ~25 m diameter full-waveform footprints (
Figure 1c), which are processed into a suite of forest structure and topographic products. For each footprint we used the AGB variable, i.e., the Level 4A Predicted aboveground biomass density (
agbd) [T/ha]. This variable had been obtained through parametric models based on simulated GEDI Level 2A (L2A) waveform relative height (RH) metrics from airborne LiDAR and ground-based measurements of AGB [
24]. There were distinct models for various regions and plant functional types [
25]. For Tanzania, the most used AGB models were the following: (1) Deciduous Broadleaf Trees (DBT), (2) Grasses, Shrubs, and Woodlands (GSW), and (3) Evergreen Broadleaf Trees (EBT) [
26]. The R
2 values of those models were 0.63, 0.86, and 0.64, respectively [
26]. The predictor variables were RH50 and RH98, i.e., height relative to the ground elevation below which 50% and 98%, respectively, of waveform energy has been returned. We also used the elevation of the terrain, i.e., the Level 4A Elevation of the center of the lowest mode relative to reference ellipsoid (
elev_lowestmode) [m].
We used the location of the center of each GEDI footprint to extract raster values from MODIS and TanDEM-X data. In this way, each GEDI footprint was supplemented by MODIS land cover class and TanDEM-X variables, i.e., DEM in 2012, the DEM change last and the height change error. We combined the data to generate new variables for the GEDI footprints, i.e., InSAR DEM in 2019 and InSAR height (IH) both in 2012 and 2019 (
Figure 2):
There was a total of 153 million available GEDI footprints over Tanzania which we reduced to 62 745 after filtering. First, we reduced this to 25 million by selecting footprints that had the following: (i)
l2_quality_flag = 1 (high-fidelity waveforms suitable for height estimation); (ii)
l4_quality_flag = 1 (suitable for AGBD estimation)); and (iii)
sensitivity > 0.95 and
geolocation/sensitivity_a2 > 0.95 (ability to penetrate dense canopy cover and having high confidence in relative height metrics) [
24]. Secondly, we reduced this to 14.9 million by excluding footprints acquired after July 2021. Thirdly, we excluded footprints using the same filter as for TanDEM-X data (see above) and in addition those having |InSAR height| > 25 m. Finally, we generated a systematic and representative sample of GEDI footprints. The footprints were irregularly spaced over Tanzania (
Figure 1c), and we laid out a 2 arcmin grid containing 69,182 points. For each grid point, we selected the nearest GEDI footprint within a maximum distance of the diagonal of 1 × 1 arcmin. This maximum distance was set to avoid the same footprint being duplicated for more than one grid point.
2.6. Conversion Factor
To estimate a conversion factor from InSAR height (IH) to AGB, we used the GEDI grid sample. Based on this, we calculated the conversion factor,
k, separately for each MODIS land cover class, i = 1 … 11, based on proportionality, i.e., the ratio of the mean values of AGB and InSAR height as
2.7. Estimating AGB Change
The present method is the main method we pursued in this study, where we estimated net AGB change from change in TanDEM-X elevation and the conversion factor, i.e., as
where
i = 1–11 are the eleven MODIS land cover classes having forest or woodlands,
was mean InSAR elevation change for MODIS land cover type
i,
was the area, and
was the conversion factor per m height change.
2.8. Comparison with Other Approaches
We compared our results with four alternative approaches. First, we used a conventional approach that corresponds to the IPCC Gain–Loss approach [
4], however, without taking gain into account:
We obtained the loss area for a given MODIS class, , from GFW. We derived mean biomass stock per ha for each MODIS class, , as the mean GEDI derived AGB value of the grid sample of GEDI footprints for that MODIS class. This should correspond to emission factors that are often used in gain–loss calculations.
Secondly, we used a hybrid approach by combining the two approaches above. We used the same GFW loss areas as in Equation (5). However, we now multiplied these loss areas,
L, by the mean TanDEM-X elevation change
and the same conversion factors as used in Equation (4):
Thirdly, we compared this with the Tanzanian FREL (Forest Reference Emission Level) for the years 2002–2013 [
27]. The FREL used the conventional approach as described above. The deforestation, or loss, data were based on a nationwide, wall-to-wall Landsat change analysis. The loss area was multiplied by emission factors calculated as the sum of three carbon stocks: Above Ground Carbon, Below Ground Carbon, and Dead Wood Carbon for each land cover type as given for mainland Tanzania by [
28], supplemented by new data for Zanzibar, and finally multiplied by 3.667 to convert from carbon to CO
2 equivalents [
27]. In the present study, we recalculated this to AGB decrease. The dominating carbon fraction was the Above Ground Carbon, which can be recalculated to AGB by dividing it by 0.47. Hence, AGB and the emission factor in CO
2 equivalents were strongly related. The ratio between them was fairly stable across land cover types with an area weighted mean value of 0.40. Hence, the FREL of 43.7 MT CO
2e/year corresponded to a decrease of 17.5 MT AGB/year.
Finally, we redid the conventional approach (Equation (5)) for the FREL period 2002–2013. The rationale for this was to have one and the same method for the two periods 2002–2013 and 2012–2019 to clarify whether the differences between our present approach and the FREL could be attributed to differences in deforestation rate in the two periods or different calculation methods.
2.9. Uncertainty
We estimated the uncertainty of the estimated
. We did this using the equation for standard deviation of a product, since we estimated
as the product of the InSAR elevation change (
) and the conversion factor (k):
where
and
were the mean value and standard error of the conversion factor. In order to obtain these values for the conversion factor, we generated 4 more 2 arcmin grids, by moving the initial grid 0.5 arcmins towards the four cardinal directions (
Figure 3). In this way, we obtained five nationwide samples, each having a mean AGB and InSAR height value, which resulted in five values of the conversion factor
k. Here, for simplicity, we did not calculate this separately for each MODIS class but used the overall conversion factor.
To obtain mean and standard error for the height change, we used the ChangeDEMfirst and ChangeDEMlast as a sample of two datasets on change. The main uncertainty of a DEM change is likely to stem from temporal rather than spatial variation. This is so because each DEM may have its own vertical bias, and because InSAR height can be influenced by seasonal variations between leaf-on and leaf-off in some forest types such as a miombo woodland. These two datasets covered different lengths in time, and to make them comparable we recalculated them to annual height change. We did that by calculating the length of the time period for each pixel as the difference between the date of the ChangeDEM and the mean date of the GlobalDEM. Based on this we first obtained the period length given as the number of days for each pixel. Secondly, based on this we obtained the annual height change for each pixel, which we averaged over the entire area.
The mean values of k and differed somewhat from their initial values as described above, i.e., based on the initial 2 arcmin grid and based on the ChangeDEMlast; however, we here retained only the estimated uncertainty, .
2.10. Validation
Estimated AGB change at the national level cannot be validated, as the true value is unknown. However, we carried out supplementary analyses that could support or corroborate the results. Firstly, we compared our AGB change estimates with GFW and the geography of protected areas. We divided the Tanzanian forest and wooded land into two categories based on GFW, i.e., loss and no-loss. We did this by combining the annual loss dataset for the eight years 2012–2019. We calculated the mean InSAR elevation change and estimated mean AGB change for these two categories. Furthermore, we divided the land area of Tanzania into two categories, protected and unprotected. We then compared our AGB change estimates with these categories. The protected areas made up 29% of the forest and wooded land area. Therefore, understanding biomass dynamics within protected areas is important for interpreting broader temporal trends in Tanzania’s forest landscape.
Secondly, we used the combination of TanDEM-X and GEDI to estimate mainland Tanzania’s AGB stock in 2012 and compared that to the field-plot-based estimates of AGB stock data from NAFORMA [
28]. A national high quality DTM does not exist for Tanzania, and hence our approach was to estimate this based on a sample estimate using the GEDI grid sample described above. As the GEDI footprints had a DTM variable, we could derive InSAR height and recalculate that to AGB for each footprint. We estimated the national scale AGB stock from this sample as
where
.
4. Discussion
This study demonstrated that forest biomass’ change over time can be estimated based on X-band across-track, single-pass interferometry, i.e., InSAR elevation change from TanDEM-X, and a conversion from elevation change to AGB change based on a combination of TanDEM-X and GEDI. In addition to mean values for entire Tanzania, the method can be used to show maps of changes. Furthermore, in addition to mean values, the method provided results for both gain and loss of AGB. The advantage of this is that it generates the basis for a better understanding of forest dynamics. It is valuable to see that for Tanzania as a country in the tropics, where deforestation is a well-known and severe environmental issue, there are considerable AGB gains. These gains are normally not picked up, and if they are, only at rare intervals like decades. This study showed that AGB gain mainly occurred in the three MODIS forest classes having the highest standing AGB, i.e., Evergreen and Deciduous Broadleaf and Mixed Forests, and in protected areas. The AGB loss mainly occurred in savannas and grasslands outside protected areas. This shows how the present method can provide valuable input to national forest policy.
To obtain the conversion factors, we used proportionality, i.e., the ratio between mean AGB and mean InSAR height. An alternative would be to use ordinary least squares in the form of a linear regression model. However, there is one problem with that. Such a model will have an intercept. In most cases the intercept will have a positive sign, and this will result in predicted AGB values > 0 for areas without trees, i.e., when InSAR height is zero. In this way, the model will be counterintuitive and not suitable for operational applications. In addition, this would prevent the method to be used in areas without a DTM. Another alternative would be to use a no-intercept regression model. However, the problem with that is that such a model represents a biased estimator. The mean value of the predictions will not be equal to the mean value of the ground truth. Based on this and our experience from several similar studies, we selected proportionality as the preferred model. However, the model choice is not critical, as the three model types can generate similar conversion factors. Solberg et al. [
20] obtained the conversion factor 12.1 T/ha/m with proportionality, 11.8 with ordinary OLS regression, and 12.0 with OLS no-intercept regression.
GEDI appears to be a good alternative to forest biomass data from field inventory. Field inventories, like national forest inventories, are rare in the tropics. GEDI makes up an alternative to costly and time-consuming field work in remote and low-accessibility areas. The role of GEDI in our method is to calibrate the conversion from InSAR elevation change to AGB change. One way to evaluate the GEDI data is to compare our obtained conversion factors with those from earlier studies based on field inventory. We see from our results that the conversion factors we obtained in the present study are consistent with earlier studies, and they were also quite stable across different MODIS land cover types. The overall conversion factor across land cover types was 13.9 T/ha/m, which is close to what has been found in other studies. In Tanzania there are two intensively studied forest areas, having field inventory plots and airborne laser scanning. One is the Liwale area in the southeast with miombo woodland. This area was dominated by the MODIS land cover classes Savannas, Deciduous Broadleaf Forests, and Mixed Forests, and in this study, we obtained the conversion factors 13.3, 13.1, and 11.3 T/ha/m, respectively, for them. Three earlier studies using various versions of field inventory plot data found 14.1 [
14], 11.9 [
16], and 14.0 T/ha/m [
20]. Using GEDI instead of field inventory resulted in 12.1 T/ha/m [
20].
A second study area is Amani, located in East Usambara Mountains in the northeast and has exceptionally high AGB density values ranging up to >1000 T/ha. It was dominated by the MODIS land cover class Evergreen Broadleaf Forests for which we obtained a conversion factor of 12.3 T/ha/m. Although this value appears to be reasonable because it is close to other conversion factors values, it is clearly lower than the value 18.4 T/ha/m that has been found based on field plots in Amani [
15]. There are several possible reasons for this difference. First, it is possible that there is a limitation with GEDI. This forest type can have extremely high AGB. Perhaps the GEDI LiDAR pulses have problems penetrating down to the terrain in this dense forest with tall trees, leading to biased AGB estimates here. Secondly, there is much sloping terrain in that mountain area, which possibly leads to SAR-specific geometric problems, including differences in obtained InSAR height between hillsides facing towards the sensor and hillsides facing away. Thirdly, the MODIS land cover type Evergreen Broadleaf Forests did not cover the same area as the Amani study area with its field inventory plots. Altogether, the causes for the discrepancy between the conversion factors 12.3 T/ha/m using GEDI and TanDEM-X in the present study and 18.4 T/ha/m found in another study based on field plots and TanDEM-X [
15] are unknown. However, the Evergreen Broadleaf Forests covers only 2% of Tanzania’s land area, and a bias on the conversion factor here has minor influence on national statistics. Still, we recommend a separate study to clarify the causes. For other forest types, conversion factors have been found to be 14.9 in a Norway spruce forest in Norway [
29] and 13.0 T/ha/m based on published data from a tropical forest in Brazil [
25]. The majority of conversion factors found in the present study appears to be close to an almost generic value across forest types of 12–14 T/ha/m, with a possible exception for the Amani area.
The alternative approaches for AGB change at the national scale were consistent and supplemented each other. While the present method provided an estimated annual net AGB change of −3.0 MT, the conventional and hybrid approaches resulted in an estimated annual loss of about 9 MT. Hence, the annual gain can be derived as about 6 MT. The similarity between the conventional and hybrid approaches makes up a validation of the method; they both use as input the loss pixels from GFW, and for those pixels the AGB loss was similar either when calculated as a complete loss of standing AGB prior to logging taken as the mean AGB of the actual MODIS land cover classes, or calculated as the mean InSAR elevation decrease multiplied by the conversion factor. However, the Tanzanian FREL appeared to be an overestimation of the greenhouse gas emissions from forest loss. This was mainly a result of a very large estimate of the deforestation area, and this shows one weakness of using the conventional approach for FREL and forest loss statistics: estimation of the loss area from optical satellite imagery is uncertain.
In addition to the correspondence between different approaches and the similarity with conversion factors found earlier, our findings are also supported in other ways. First, GFW loss pixels and InSAR elevation change corresponded well with each other. The GFW loss pixels and that of decreasing InSAR elevation had a similar spatial distribution. This was particularly evident when taking protected areas into account, where GFW loss and decreasing InSAR elevation dominated in the unprotected areas. The protected areas made up 29% of Tanzania’s land area, and hence, the temporal changes seen here have a large influence on national statistics on forest loss and CO
2 emissions. Finally, our estimate of national mainland AGB stock in 2012 was almost identical to the estimate generated by Tanzania’s NAFORMA field based national forest inventory [
28]. Our estimate was 2.76 MT while the NAFORMA estimate was 2.69 MT, i.e., we had a 2% overestimation. Any further validation of our results was not feasible. The main ground truth dataset in Tanzania is the NAFORMA field inventory plots. However, for the present study that dataset was not available, and they represent only one point in time. An exception is for a part of the Liwale study area, where NAFORMA plots have AGB data from repeated field inventory. Those data on AGB change were used by Solberg, Bollandsås, Gobakken, Næsset, Basak, and Duncanson [
20] to validate the present method in that smaller area.
One source of errors is that the datasets were not completely aligned in time. In particular, the temporal mismatch varied from pixel to pixel in the TanDEM-X datasets. However, these errors are of negligible importance for the results on the national scale. The mean InSAR height across Tanzania based on the representative grid sample was 2.38 m with the GlobalDEM around 2012 and 2.37 m based on the same data and ChangeDEMlast around 2019. This means a net decrease of only 0.3% of standing AGB occurred during those eight years. Hence, even though there has been a net loss of AGB over time, this has been small compared to the standing AGB.
When the GEDI AGB estimator was trained [
26], three important Tanzanian datasets were used, i.e., the field plots of NAFORMA and field plots and airborne laser scanning data for the two intensively studied areas Liwale and Amani. Hence, GEDI predictions for Tanzania may be less erroneous than what can be expected in other countries.
There is today a problem for a practical implementation of the present method for FREL and REDD+ MRV reporting. There is a lack of suitable data for the coming years, as there is a lack of across-track, single-pass interferometers. The practical implementation could have started at the onset of the TanDEM-X mission or with support for the planned TanDEM-L mission. By now, TanDEM-X is close to its termination. The commercial aspect of that mission has also prevented routine acquisitions on the global scale, and although global datasets are now available, this is the case only for a few points in time. Conventional, optical satellite imagery has during the last 10–15 years received most attention at the expense of suitable SAR interferometers. There are, however, some missions that have the technical capability to provide such data. One is the European Earth Explorer C-band mission Harmony, which is scheduled to go in across-track formation for part of its time, although this time share is small. Another mission is the Chinese Hongtu-1 X-band mission, which with one active and three passive receivers provides several across-track baselines simultaneously, and has the technical potential to improve performance beyond TanDEM-X [
30]. However, this sensor is mainly set up to provide national data takes in China. There are other missions to be mentioned in this context, such as the P-band BIOMASS mission and the L-band NISAR mission, which both provide repeat-pass interferometry. With this study, we want to support initiatives for launching new across-track, single-pass SAR interferometers in the coming years.