Evaluating the Potential of ALS Data to Increase the Efﬁciency of Aboveground Biomass Estimates in Tropical Peat–Swamp Forests

: Estimates of aboveground biomass (AGB) in forests are critically required by many actors including forest managers, forest services and policy makers. Because the AGB of a forest cannot be observed directly, models need to be employed. Allometric models that predict the AGB of a single tree as a function of diameter at breast height (DBH) are commonly used in forest inventories that use a probability selection scheme to estimate total AGB. However, for forest areas with limited accessibility, implementing such a ﬁeld-based survey can be challenging. In such cases, models that use remotely sensed information may support the biomass assessment if useful predictor variables are available and statistically sound estimators can be derived. Airborne laser scanning (ALS) has become a prominent auxiliary data source for forest biomass assessments and is even considered to be one of the most promising technologies for AGB assessments in forests. In this study, we combined ALS and forest inventory data from a logged-over tropical peat swamp forest in Central Kalimantan, Indonesia to estimate total AGB. Our objective was to compare the precision of AGB estimates from two approaches: (i) from a ﬁeld-based inventory only and, (ii) from an ALS-assisted approach where ALS and ﬁeld inventory data were combined. We were particularly interested in analyzing whether the precision of AGB estimates can be improved by integrating ALS data under the particular conditions. For the inventory, we used a standard approach based on a systematic square sample grid. For building a biomass-link model that relates the ﬁeld based AGB estimates to ALS derived metrics, we used a parametric nonlinear model. From the ﬁeld-based approach, the estimated mean AGB was 241.38 Mgha − 1 with a standard error of 11.17 Mgha − 1 (SE% = 4.63%). Using the ALS-assisted approach, we estimated a similar mean AGB of 245.08 Mgha − 1 with a slightly smaller standard error of 10.57 Mgha − 1 (SE% = 4.30%). Altogether, this is an improvement of precision of estimation, even though the biomass-link

Abstract: Estimates of aboveground biomass (AGB) in forests are critically required by many actors including forest managers, forest services and policy makers. Because the AGB of a forest cannot be observed directly, models need to be employed. Allometric models that predict the AGB of a single tree as a function of diameter at breast height (DBH) are commonly used in forest inventories that use a probability selection scheme to estimate total AGB. However, for forest areas with limited accessibility, implementing such a field-based survey can be challenging. In such cases, models that use remotely sensed information may support the biomass assessment if useful predictor variables are available and statistically sound estimators can be derived. Airborne laser scanning (ALS) has become a prominent auxiliary data source for forest biomass assessments and is even considered to be one of the most promising technologies for AGB assessments in forests. In this study, we combined ALS and forest inventory data from a logged-over tropical peat swamp forest in Central Kalimantan, Indonesia to estimate total AGB. Our objective was to compare the precision of AGB estimates from two approaches: (i) from a field-based inventory only and, (ii) from an ALS-assisted approach where ALS and field inventory data were combined. We were particularly interested in analyzing whether the precision of AGB estimates can be improved by integrating ALS data under the particular conditions. For the inventory, we used a standard approach based on a systematic square sample grid. For building a biomass-link model that relates the field based AGB estimates to ALS derived metrics, we used a parametric nonlinear model. From the field-based approach, the estimated mean AGB was 241.38 Mgha −1 with a standard error of 11.17 Mgha −1 (SE% = 4.63%). Using the ALS-assisted approach, we estimated a similar mean AGB of 245.08 Mgha −1 with a slightly smaller standard error of 10.57 Mgha −1 (SE% = 4.30%). Altogether, this is an improvement of precision of estimation, even though the biomass-link model we found showed a large Root Mean Square Error (RMSE) of 47.43 Mgha −1 . We conclude that ALS data can support the estimation of AGB in logged-over tropical peat swamp forests even if the model quality is relatively low. A modest increase in precision of estimation (from 4.6% to 4.3%), as we found it in our study area, will be welcomed by all forest inventory planners as long as ALS data and analysis expertise are available at low or no cost. Otherwise, it gives rise to a challenging economic question, namely whether the cost of the acquisition of ALS data is reasonable in light of the actual increase in precision.

Introduction
Because of their important role in the global carbon cycles, forests are in the focus of international policy programs. They are considered important for mitigation of climate change because they can act both as a source and a sink of carbon contrary to all other sectors. With the growing awareness of the importance of forests for climate change, information on changes of forest carbon pools is increasingly demanded. Most relevant are here the tropical forests because they store large amounts of carbon and continue to be converted into other land use systems. For example, Pan et al. [1] estimated that tropical forest released 1.3 ± 0.7 Pg yr −1 of carbon in the period 1990-2007.
Because almost 50% of the plant biomass is carbon, estimates of the total aboveground biomass (AGB) in forest ecosystems are critical for understanding carbon dynamics [2]. Information on forest variables like AGB stocks are commonly collected from in situ forest inventories as part of the ongoing forest management and forms the basis for sustainable management decisions. Nevertheless, in tropical forests, such information is often not available since (i) these forests are often not systematically managed, and (ii) conducting forest inventories in tropical forests is quite demanding given the limited accessibility, the challenging field conditions, the complex forest structure (e.g., multiple layers, high number of species, wide range in DBH), and the limited resources available for conducting large area forest inventories. Thus, there is a gap between an increasing information demand on forest carbon stocks in tropical forests from the international forest related programs (e.g., REDD+) and the limited availability of forest inventory data. To resolve this conflict different international programs are currently supporting tropical countries in developing National Forest Inventory (NFI) projects and National Forest Monitoring (NFM) programs where remote sensing plays an essential role [3].
Because the AGB of a forest area cannot be measured or observed directly, models need to be employed. A well established and statistically sound approach to estimate AGB stocks is to follow standard forest inventory protocols. These use probability selection schemes to select a set of field plots on which AGB is predicted from allometric models that commonly include single tree variables like diameter at breast height (DBH) and tree height (h). Because of the probability-based sampling, upscaling based on the estimated mean and variance can be done using design-based estimators. However, design-based estimation requires that the inclusion probability of each sample is positive and known and thus field plots need to be established on pre-determined locations. The effort to implement such sampling designs can be huge, particularly in tropical forests with limited accessibility. Hence, there is a large interest in developing methods that reduce the efforts of field surveys by reducing the number of sample units required or by relaxing the design restrictions e.g., by using model-based inference. In both cases, multi-source approaches that utilize auxiliary information are used. Remote sensing data is such a data source for auxiliary information and may support the assessment of AGB if useful predictor variables are available and statistically sound estimates can be derived.
Remote sensing assisted AGB assessments have a long history and many studies have shown that information derived from remote sensing observations can be used to support predictions of AGB in forests. The earlier studies used optical sensors operating in the visible and near infrared spectrum [4]. However, one of the major problems identified with these type of sensors was the saturation effect of the relationship between spectral reflectance measures and forest AGB. Multiple studies proposed advanced image enhancement techniques for optical data such as including texture analysis to reduce the saturation effects [5,6]. The potential of AGB mapping has also been studied for Radio Detection And Ranging (RaDAR) sensors. The limited sensitivity for atmospheric disturbances makes RaDAR the preferred sensor type for tropical areas where it is often not possible to obtain high quality/cloud free optical images. However, the saturation problem is also evident for RaDAR data and limits the applicability because most of the tropical forest have higher AGB densities than the reported saturation thresholds. For example, Dobson et al. [7] showed that the threshold for AGB predictions using RaDAR data depends on the wavelength. They reported a threshold value where saturation starts at 150 Mg ha −1 to 200 Mg ha −1 . In recent years, airborne Light Detection And Ranging (LiDAR) scanners (ALS) have become prominent for forest assessments and are even considered the most promising technology for large-area forest inventory and AGB assessments by many researchers (e.g., [8]). Because of the better availability of ALS data in the northern countries, most of the earlier studies were conducted in temperate forests [9][10][11][12]. Reviews that summarize the different approaches for remote sensing supported biomass assessments including optical, RaDAR and ALS sensors can be found in Koch [13], Lu [14].
From the perspective of a forest manager or decision maker, the additional inclusion of remote sensing products into forest inventories increases the initial effort required (e.g., for data acquisition, data processing and analysis) which need to be justified either by a gain in efficiency when estimating the target variables or by the provision of additional products (e.g., maps). To evaluate the first, it is necessary to apply statistical methods that allow the quantification of the estimation error expressed e.g., in terms of a confidence interval. Different sampling and estimation approaches have been developed to predict forest variables by combining field-based forest inventory with remote sensing data. Of particular relevance is the series of papers from the study site in Hedmark county in Norway [15][16][17]. Even though the statistical concepts presented date back to earlier studies (e.g., [18][19][20]), it was the first time that they were presented and compared in detail for deriving forest statistics from multi-source inventories. Ståhl et al. [16] used a model-based approach to predict biomass. For the same study area, Naesset et al. [21] used a model-assisted estimation framework. For both methods, it was shown that the precision of the AGB estimates can be increased when remote sensing data is used as an auxiliary variable. The methods developed in Hedmark were so far mainly tested in temperate forests. The application of model-based or model-assisted frameworks to estimate biomass stocks in tropical forests have only been tested for a limited number of studies. Potential reasons for this are the limited availability of ALS data sets and difficulties in implementing fieldwork. Two of the few exceptions are the recent study from Naesset et al. [22] who compared the power of different remote sensing products to contribute to AGB assessments in a woodlands in Tanzania with relatively small mean AGB 53.7 Mg ha −1 . They found that the largest increase in precision of AGB estimates could be gained when using ALS data, which reduced the variance in comparison to pure field-based estimates by a factor of 3.6! D'Oliveira et al. [23] showed for a tropical forest in the Western Brazilian Amazon that the usage of model-assisted estimators using ALS height metrics could reduce the relative standard error of the mean AGB per hectare from 4.5% to 2.4%.
Peat forests' ecosystems in the tropics have one of the largest terrestrial carbon pools and Indonesia hosts the largest peat areas in the tropical zone [24]. Currently, they are an important source of carbon emissions from forest fires and peat degradation by drainage or deforestation. For example, in the last few years, severe forest fires in Kalimantan peat-swamp forests have caused international problems in air quality and aviation security, and airports in neighboring countries needed to be shut down temporarily. These disturbances alter the distribution of AGB in the peat-swamp forests e.g., as large quantities of carbon are released from the fires. However, because of the limited accessibility caused by its remoteness and the peat itself, only little is known about the AGB distribution in these ecosystems. To estimate the changes in carbon stocks, forest inventory methods need to be designed and implemented to provide statistically sound estimates. There are multiple papers that used remote sensing data to predict AGB in Kalimantan peat-swamp forests. Ballhorn et al. [25] used ALS data to predict AGB and found high model accuracies. For the same forest, Ballhorn et al. [26] also tested spaceborne Laser Altimeter profile data from ICESat/GLAS. Jubanski et al. [27] research into the potential of ALS to predict AGB in these type of forest as well as Kronseder et al. [28] did. All of these studies have in common that they use field data collected from non-probability designs and that the accuracy assessment is restricted to the evaluation of the model quality (e.g., RMSE and R 2 ).
Thus, even though the models that are presented show a high quality, it remains unclear whether they could increase the precision of AGB estimates for larger forest areas when combined with field inventory data.
The objective of this paper was to analyze from a case study whether the precision of AGB estimates in a logged-over tropical peat-swamp forests can be increased when combining ALS and field-based inventory data. We compared the precision of AGB estimates derived from two approaches: (i) from a field-based inventory data only and, (ii) from an ALS-assisted approach where ALS and field data were combined.

Study Area
The study area is located in Central Kalimantan, Indonesia, covering an area of 311 km 2 in the upper catchment of the Sebangau river. The exact boundaries have been defined based on the natural borders (Sebangau River) and by the accessibility. It is part of the Natural Laboratory of Peat-Swamp Forest (NLPF) managed by the Center for International Cooperation in Sustainable Management of Tropical Peatland (CIMTROP) hosted at the University of Palangka Raya, Central Kalimantan Province, Indonesia.
The topography of the entire study area is flat and covered by an organic peat soil layer with depth varying between 1 m to 9 m depending on the distance from the Sebangau river [24]. The study area is approximately 260 km south of the Equator with a typical tropical wet climate with a mean annual temperature (2002-2010) of 26.2 • C [29] and mean annual rainfall (1981-2010) of approx. 2600 mm yr −1 [30,31]. It is a habitat to a large number of ape species including Orangutan (Pongo pygmaeus wurmbii). Between 1972 and 1997, the Sebangau forest was managed as a private concession with selective logging of Ramin (Gonystylus spp.), and Meranti (Shorea sp.) species [32]. After a period of illegal logging activities between 1996 and 2004, the forest became protected as part of the Sebangau National Park by a government decree in 2004 (MoF, 2004). Altogether, forest can be described as a secondary logged-over lowland tropical peat swamp forest.

Forest Inventory Data and Allometric Biomass Model
Terrestrial field inventories were conducted by three field teams between July 2013 and June 2014. Because the ALS data did not cover the entire study area, we selected the sample plots using two independent systematic grids: (i) A Large Scale Inventory (LSI) sample grid (grid width: 2.5 km) covering the entire study area, and (ii) a High Intensity LiDAR (HIL) sample grid (grid width: 0.5 km) covering the largest patch of the ALS data. To increase the number of sample units covered by ALS data, additional samples were located along the southern border of the study area. These consists of cluster plots overlapping the center of LSI plots along with two neighboring plots at a distance of 100 m from the center of LSI plots, on each side along the main direction of the LiDAR strips (hereinafter cluster plots). In total, n = 55 plots were established in the study area: 34 HIL plots, 12 sub plots from six clusters and nine LSI plots (see Figure 1).
At each sample location, we established three nested concentric circular sub-plots with radii of 4, 8, and 16 m. Trees were selected within the three sub-plots based on their DBH values. For the smallest plots, we selected all trees with DBH ≥ 5 cm, for the medium plots those trees with DBH ≥ 17 cm and for the largest plots those with DBH ≥ 30 cm. The following attributes were observed for the selected trees: tree species, DBH, azimuth and relative distance of each tree to the plot center. The tree height was measured for 10 trees on each plot including the four trees closest to the plot center and the six trees with the largest DBH. Plot locations were measured with standard hand-held global navigation satellite system (GNSS) devices. To improve the accuracy of the localization, we averaged the position measurements for a time period of minimum 15 min. Because AGB cannot directly be measured, a model to estimate the biomass of a single tree from a measurable variable (e.g., DBH) need to be used. The selection of such a model is a critical task in AGB assessments because it has a direct and potentially great impact on the AGB estimates [33]. However, in this work, we focused on the impact of using ALS derived auxiliary variables on the precision of AGB estimates. Thus, we were more interested in the relative difference between the two estimation approaches than in the absolute values. We further consider that the the effects of allometric model prediction uncertainty on the precision of the large area estimates are negligible relative to the effects of sampling variability. Therefore, we did not evaluate different available allometric models here and used a regional model developed for Indonesian peat swamp forests [34] with DBH and tree height as input. The DBH measurements were taken from the field inventory to calculate the biomass of each selected tree. Tree height models fitted using the trees with height measurements were used to predict the tree height of every other tree. To estimate the AGB per plot, all single tree biomasses were multiplied by the expansion factor according to the sizes of the concentric sub-plots and summed up per plot to get a plot level estimate of AGB per unit area in Mg ha −1 .

ALS Data and Processing
The airborne ALS data were acquired during the dry season in October 2011 using a fullwave Optech Orion M200 sensor mounted in a fixed wing aerial platform, operated at 1064 nm, a pulse repetition rate of 100 kHz, a maximum scan angle of ±11 • , an average flying height of 800 m above the WGS84 reference ellipsoid. The data cover an area of approximately 62 km 2 and were delivered in tiles of 1 km 2 with a minimum return density ranging from 0.42-7.62 returns/m 2 and a mean value of 5.2 returns/m 2 not considering overlapping flight lines. The absolute horizontal location error was reported to be < 0.07 m by the data provider.
We used FUSION software V. 3.4.2 [35] for filtering and interpolating data and generating the Digital Elevation Model (DEM), the Canopy Height Model (CHM) and the normalized height of the ALS point cloud (NH), which were necessary as input data for identifying and locating individual dominant trees and for computing ALS metrics for height and canopy cover within the limits of the field sample plots. The following processing steps were carried out: 1. Outliers were removed using the FilterData tool, considering a window size of 100 m and a maximum and minimum ellipsoidal height bound of ±5.0 * Std.dev. 2. The GroundFilter tool, which implements a filtering algorithm (adapted from Kraus and Pfeifer [36]) based on linear prediction [37], was used to extract ground returns from all the ALS points with a cell size of 1 m (based on available ALS data density). In the fourth and fifth steps, we only used returns with normalized heights between 0 m to 60 m, after considering the tallest trees observed in the field and to avoid errors that can occur in the filtering and interpolation processes or in the signal processing because of multiple backscatter of the laser beam.
When linking field inventory and remote sensing data, it is critical to ensure a high quality of the spatial co-registration between both datasets. From an analysis of multiple GNSS measurements on a full census plot in the same study area (data not provided here), we assume the GNSS location error to be in a magnitude of order of 10 m to 16 m. For the given size of the inventory plots, this might lead to large uncertainties because the location error can be larger than the plot radius, which might lead to situations where none of the selected ALS returns is actually taken from the plot where the field observations were conducted. Depending on the spatial autocorrelation structure of the forest, this will lead to an increase in uncertainty. In order to improve the co-registration, we developed a semi-automatic co-registration protocol. The co-registration is based on the identification of dominant trees from the inventory plots and in the ALS point cloud (details are provided in [38]).
The prediction of AGB based on the ALS data followed the standard area-based approach first described by Naesset [39] that uses ALS-derived metrics calculated from the NH as explanatories.
To obtain these metrics, we performed the following processes using the FUSION software: (i) the normalized ALS point cloud was clipped using the PolyClipData tool within the limits of each field plot (which were previously stored as polygons in shapefiles), and we created an independent file for each plot of 16 m radius and (ii) the CloudMetrics tool was used to estimate height and canopy cover metrics of these 55 clipped and normalized point clouds. The ALS metrics were computed using all normalized returns. The minimum height threshold (MHT), which is commonly specified as the lower boundary for calculating height metrics (central tendency, dispersion, shape and percentile statistics), was established at 2.5 m to exclude grasses and shrubs. The height break threshold (HBT), which is the limit for separating the point cloud data into two sets to separate canopy returns from the understory returns, in order to estimate canopy cover metrics, was established at 6.5 m (based on field observations). In total, 39 metrics widely used to predict AGB [40] were extracted from NH and used as predictors in the statistical analysis. The ALS metrics and the corresponding descriptions for height distribution and canopy closure are summarized in Table 1. ratio of the number of the first laser returns above 6 meter height to the total number of first laser returns for each plot PARA 6 ratio of the number of the all laser returns above 6 m height to the total number of first laser returns for each plot CRR Canopy relief ratio ((mean−min)/(max−min)) Finally, because we were also interested in the construction of a AGB map for the HIL area, we generated 39 rasters of 30 m cell size (chosen to match the scale of prediction to the size of the field plot observations, i.e., 804 m 2 ) for each of the statistics described in Table 1 using the GridMetrics tool.

Building the Biomass-Link Model
As a biomass-link model to predict plot level AGB estimates from the ALS metrics, we used a nonlinear logistic regression model as described by McRoberts et al. [41]: where i is the plot index, j indexes the ALS predictor metrics so that X ij is j'th ALS metrics observed on plot i. β is the vector of regression parameters to be estimated and is the residual error term. This model type was selected because it fits the properties of the target variable AGB. All predictions are non-negative and in between a lower (ŷ = 0) and upper asymptotic bound (ŷ =β J+2 ), which is estimated from the inventory data [41].
The usage of the model-assisted difference estimators requires a probability sample [42] when estimating bias and variance. However, to build the model itself, such an assumption is not required and external data can be utilized (see e.g., [43]). In the study at hand, we follow this approach and use all n = 55 observations from the different sampling grids to fit the regression model. However, inference on AGB estimates are restricted to the HIL area where the points were collected following a systematic sample design.
The limited number of field plots (n = 55), the large number of ALS metrics as potential predictors (39) and the fact that many of the ALS metrics carry redundant information are causing problems when fitting parametric models. To reduce the chance of overfitting and to keep the model simple, a feature selection was done. However, feature selection for parametric nonlinear regression models is subject to active research and most of the suggested methods are complex and lag generality. Therefore, we decided to follow a simplified approach based on linear models. We are fully aware that a linear approach might not select the best set of predictor variables for nonlinear models in all cases. However, because of the lack of a general feature selection method for parametric nonlinear regression models and the fact that this approach has been proven before [44], we consider this option as a viable and valid one. For the feature selection, we tested all possible combinations of maximal five predictors in an exhaustive feature selection using the leaps package [45] of the statistical software R [46]. We selected the model with the lowest residual standard error (RSE) and removed all terms with a variance inflation factor (VIF) > 10. We then fitted the nonlinear model (see Equation (1)) with the selected features using unweighted least square estimates as implemented in the nls function of R [46]. Finally, we removed all non-significant (p-value > 0.005) terms from the nonlinear model.

AGB Estimation
For the estimation of the AGB meanμ and its error variancev ar, we used two approaches: (i) the field-based approach which uses the field inventory data only, and (ii) the ALS-assisted approach which uses the field data and the selected ALS metrics as information sources. For the field-based approach, we used the estimators described in Särndal et al. [20] and estimate the meanμ FIELD AGB in Mg ha −1 and its error variancev ar FIELD for n plots with the estimated varianceσ 2 as follows: var(μ FIELD ) =σ 2 n .
For the ALS-assisted approach, we used the model-assisted framework, which was presented in Särndal et al. [20] and which has been successfully utilized for ALS based forest biomass and volume assessments before [15,21,41]. We estimate the ALS-assisted meanμ ALS AGB in Mg ha −1 and its error variancev ar ALS for n plots with the model residuals i =ŷ i − y i for all N sampling units (here pixel) as follows:μ var(μ ALS ) = 1 n(n − 1)

Forest Inventory Results
The analysis of the forest inventory data showed that the part of the Sebangau forest covered by the HIL plots is very homogeneous with relatively small mean DBH values and a large number of stems (see Table 2). The DBH distribution shows the typical self-thinning curve of a secondary forest with a large number of trees with small DBH and an exponentially decreasing number of trees for higher DBH classes (see Figure 2). Because of the logging history, very large trees with DBH ≥ 60 cm are rare. The tree heights show an irregular distribution without much differentiation and a relatively low mean tree height of 17.5 m. The distribution indicates that there is no single dominant tree layer but a very complex vertical forest structure with trees in many different height classes as shown in the photo in Figure 3. Thus, the DBH and tree height distributions reflect the management history characterizing the forest as logged-over secondary forest, where larger trees have been taken out and natural regeneration is filling the gaps. The forest hosts a large number of tree species (129 species observed on all plots). However, the allometric model that was used for estimating the single tree AGB is a mixed species model [34]. Therefore, the large number of tree species does not contribute to the variation of AGB on the plot level in our analysis.
Among the presented forest structure variables, the stem density has the highest coefficient of variation, whereas the variation of mean tree heights between the plots is small. From the field work and the analysis of the tree height distribution, the vertical forest structure can be described as complex. However, the small CV of the mean tree heights indicates that the complex vertical structure is homogeneously covering the study area. The differences in the plot level AGB estimates are mainly attributed to variations in the stem density as shown in Table 2. As shown in Figure 4 more than 75% of the total AGB can be found in trees with DBH ≤ 30 cm.

Biomass-Link Model
We started building the biomass-link model by selecting the ALS predictor variables. The exhaustive search for the best linear AGB prediction model showed only little improvement for models with more than five independent variables. The final model with the smallest RSE included the 5% percentile (h 05 ) and the median of the absolute deviations from the overall median (h MADmedian ) as significant predictors, which resulted in an RMSE of 47.43 Mg ha −1 . To evaluate the model quality, we present diagnostic plots in Figure 5. The slope of the observed versus predicted was estimated to be 1.01. However, the subplot of the observed means versus predicted means indicates a poor quality of the fit because the slope of the observed against predicted line is significantly different from 1 and the intercept is different from 0. As a consequence, the predictions from the nonlinear logistic biomass-link model will overestimate AGB values up to a threshold of approx. 225 Mg ha −1 and underestimate the AGB for larger values. We used the biomass-link model to create a map of AGB predictions for the HIL area as shown in Figure 6. The visual check of the spatial distribution of the AGB seems plausible. For example, the tracks that were formally cut to extract saw logs on small train tracks are clearly visible. However, from the map, no spatial trends in AGB density can be observed nor any other particular spatial pattern. Thus, the AGB map confirms the results from the forest inventory showing a homogeneous forest structure for the HIL area.

Comparison of the AGB Estimates
The evaluation of the forest inventory data confirmed the homogeneous forest structure. With only 34 plots in the HIL area, the mean AGB stock was estimated with a quite high precision as expressed by the small relative standard error of 4.63% of the mean. Given this fact plus the low model quality of the biomass-link model, only little improvements in the precision of the ALS-assisted estimates can be expected. In Table 3, both estimates are compared. The mean AGB estimates of both approaches are very similar. The standard error is 0.6 Mg ha −1 smaller for the ALS-assisted approach, which indicates a slightly higher precision when using ALS and forest inventory data.

Discussion
The potential of remote sensing technologies for monitoring forest biomass stocks in the tropics has been evaluated by many [8,[47][48][49]. However, most of the works are focusing on the biomass-link model and evaluate their quality from the residuals between observations and model predictions.
Even though the quality of the biomass-link model is an important determinant of the precision of the model predictions, it is by no means a good predictor for the precision of the AGB estimates when applied for larger forest areas because models need to be evaluated in context of their application [50]. In cases where statistical point and interval estimates are to be given, further analyses beyond the evaluation of the biomass-link model are required, or, in other words: if statistical inference should be made instead of "pretty pictures", statistical inference frameworks need to be applied [51].
The analysis of the inventory data considering only the field-based information resulted in an estimated mean biomass of 241.38 Mg ha −1 , which corresponds well to other biomass assessments conducted in the same study area. Kronseder et al. [28] reported a mean AGB of 258.5 Mg ha −1 and Campbell [32] reported a higher mean AGB value of 297.8 Mg ha −1 from five plots in the same study area. Englhart et al. [52] reported a mean AGB of 203.0 Mg ha −1 for a peat-swamp forest area that partly covers the study area. Thus, the field-based point estimates can be considered plausible. Because the listed studies were not design-based nor were the standard errors given, we cannot compare our interval estimates e.g., the standard error of 11.17 Mg ha −1 to other studies conducted in this region.
The biomass-link model used in this study has an RSE of 47.43 Mg ha −1 , which is in the range of other studies that use ALS data to predict biomass in tropical forests. Zolkos et al. [53] conducted a meta-analysis of such studies and reported a median RSE for tropical forest types of 40 Mg ha −1 . Even though, the RSE was within the range of other studies, the predictive power of the model is limited since the relationship between AGB and ALS metrics was weak. This is indicated e.g., by the Pearson correlation coefficient of 0.65 for the predicted versus observed AGB values. A similar finding is reported by Palace et al. [54] who used discrete return ALS metrics to describe the forest structure, including biomass, of a complex tropical forest in La Selva, Costa Rica. They found no significant relationship between ALS height metrics and AGB and concluded that alternative metrics need to be found for predicting AGB at local scales [54]. In addition, Vaglio Laurin et al. [49] reported that the upper height percentiles have only a limited predictive power for AGB in a tropical forest in Africa. They attributed this effect to the multi-layered forest structure where most of the biomass is found in the sub-canopy trees. In Sebangau, forest variation is caused mainly by differences in the tree densities and not so much to variations in the canopy heights. Other studies showed stronger relationships between ALS height metrics and AGB for tropical forests. Ioki et al. [48] created univariate linear biomass-link models to predict AGB in tropical forests of northern Kalimantan using ALS data that had coefficients of variation > 0.7.
The ALS data used in the study is comparable to other studies that predict forest AGB. The ALS processing followed a standard area-based approach and was conducted with standard algorithms. Thus, we assume that the data processing had only a minor effect on the quality of the AGB and ALS metrics relation. Among the reasons why the relationships are weaker in our study may be the following: (i) the forest structure in the study area is characterized by a high tree density (>2000 trees per ha), a small mean DBH and little variation in DBH and height. In general, the forest can be described as a closed canopy forest with very little variation of the AGB estimates at plot scale. As many of the ALS metrics describe the height distribution of the canopy, their explanatory power is obviously limited for these forest conditions, (ii) in a pan-tropical study, Slik et al. [55] showed that the density of large trees (DBH > 70 cm) has a large influence on the variations of AGB. The largest DBH found in the HIL study area was 63.3 cm and only 1.5% were larger then 50 cm so that the general absence of scattered large trees is another potential explanation for the little variation of the AGB values in the HIL area, and (iii) the plot design can have a great influence on the relationship between any forest variable and remote sensing observation because it determines the scale at which the relation is modeled. Mascaro et al. [56] analyzed the model qualities of ALS biomass-link models for different resolutions and formulated a scaling rule for the RMSE. Mauya et al. [57] showed that the precision of ALS-assisted AGB estimates is also affected by the field plot size. They concluded that increasing the field plot size reduces boundary effects, which leads to better prediction models. However, larger plot sizes are not efficient when it comes to field sampling because implementing larger plots in the field is difficult and time consuming and thus reduces the number of plots that can be implemented with a given budget, and (iv) there is a time gap of up to three years between the ALS acquisition and the field observations, which could reduce the correlations in cases of changes in the forest structure.
However, because this is a protected forest area, we assume that there are no logging activities in the study area. This is also confirmed by other research groups who maintain a field station and carry out regular monitoring in that area. Thus, we consider structural changes in the period 2011-2014 to be very minor in the studied area.
Given the limited correlation between the ALS metrics and the observed AGB values in our study area, the gain in precision when estimating AGB by combining forest inventory and remote sensing data was small. Thus, for the given situation, the investment in the ALS data acquisition could not be justified by the gain in efficiency. To achieve a confidence interval similar to the ALS-assisted using the field-based approach, a sample size of n = 37 would be needed. Thus, increasing the sample size by three more plots would have resulted in a similar confidence interval but with much lower costs when compared to conducting an ALS data acquisition and analysis. In addition, the add-on product of a wall-to-wall map was not very informative in our study because the spatial distribution of AGB is homogeneous and thus only little additional information is being gained from the map. However, for a designer of a forest monitoring program, it is currently not easy to make a decision beforehand whether it is worth investing in ALS data or not. Therefore, techniques that allow a pre-survey determination of the expected gain in precision are demanded. One such approach could be based on the analysis of the spatial auto-correlation patterns in the freely available satellite images as provided by Landsat 8 or Sentinel 2. If such a pilot survey reveals a long-range spatial correlation of the plots with little variability, a gain in precision is unlikely. Thus, methods that allow (i) predicting the homogeneity in advance, and (ii) finding a threshold of variation where an economically meaningful inclusion is required.

Conclusions
Based on the findings from this study, we draw the following conclusions: • The usage of expensive ALS data in forest monitoring programs cannot always be justified by an actual gain in precision, • Model-assisted estimators provide a good framework to examine the gain in precision and to evaluate the advantage of using different remote sensing products, • As different studies report different 'gain' in precision from using ALS data, further research is required to inform forest monitoring systems on the expected gain of collecting additional remote sensing products before the monitoring is implemented.
Author Contributions: All authors actively contributed to the manuscript through their review, editing and comments. P.M. took the lead and was involved in the fieldwork and project design and all steps of the data analysis. E.G.-F. contributed to the LiDAR data processing in FUSION, E.S.P. was involved in the fieldwork and implemented the positional correction procedure. D.S. was involved in the organization of the fieldwork and contributed to the analysis of the data. C.P.-C. contributed to the modeling of AGB and tree height and in designing the study. C.K. was involved in all stages of the project and contributed to the discussion and inventory design.