Comparing 3D Point Cloud Data from Laser Scanning and Digital Aerial Photogrammetry for Height Estimation of Small Trees and Other Vegetation in a Boreal–Alpine Ecotone

Næsset, Erik; Gobakken, Terje; Jutras-Perreault, Marie-Claude; Ramtvedt, Eirik Næsset

doi:10.3390/rs13132469

Open AccessArticle

Comparing 3D Point Cloud Data from Laser Scanning and Digital Aerial Photogrammetry for Height Estimation of Small Trees and Other Vegetation in a Boreal–Alpine Ecotone

Faculty of Environmental Sciences and Natural Resource Management, Norwegian University of Life Sciences, P.O. Box 5003, 1432 Ås, Norway

^*

Author to whom correspondence should be addressed.

Remote Sens. 2021, 13(13), 2469; https://doi.org/10.3390/rs13132469

Submission received: 11 May 2021 / Revised: 15 June 2021 / Accepted: 20 June 2021 / Published: 24 June 2021

(This article belongs to the Special Issue UAV Photogrammetry for Environmental Monitoring)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

Changes in vegetation height in the boreal-alpine ecotone are expected over the coming decades due to climate change. Previous studies have shown that subtle changes in vegetation height (<0.2 m) can be estimated with great precision over short time periods (~5 yrs) for small spatial units (~1 ha) utilizing bi-temporal airborne laser scanning (ALS) data, which is promising for operation vegetation monitoring. However, ALS data may not always be available for multi-temporal analysis and other tree-dimensional (3D) data such as those produced by digital aerial photogrammetry (DAP) using imagery acquired from aircrafts and unmanned aerial systems (UAS) may add flexibility to an operational monitoring program. There is little existing evidence on the performance of DAP for height estimation of alpine pioneer trees and vegetation in the boreal-alpine ecotone. The current study assessed and compared the performance of 3D data extracted from ALS and from UAS DAP for prediction of tree height of small pioneer trees and evaluated how tree size and tree species affected the predictive ability of data from the two 3D data sources. Further, precision of vegetation height estimates (trees and other vegetation) across a 12 ha study area using 3D data from ALS and from UAS DAP were compared. Major findings showed smaller regression model residuals for vegetation height when using ALS data and that small and solitary trees tended to be smoothed out in DAP data. Surprisingly, the overall vegetation height estimates using ALS (0.64 m) and DAP data (0.76 m), respectively, differed significantly, despite the use of the same ground observations for model calibration. It was concluded that more in-depth understanding of the behavior of DAP algorithms for small scattered trees and low ground vegetation in the boreal-alpine ecotone is needed as even small systematic effects of a particular technology on height estimates may compromise the validity of a monitoring system since change processes encountered in the boreal-alpine ecotone often are subtle and slow.

Keywords:

forest monitoring; DAP; UAS; model-dependent inference; tree migration; alpine tree line

1. Introduction

The world’s climate will undergo distinct alterations over the coming decades leading to rapid changes in basic growth factors, such as temperature and precipitation. This will influence the alpine and tundra transition zones of the boreal forests. These areas are characterized by steep temperature–productivity gradients because the trees exist close to their tolerance limit in terms of temperature. Even a moderate increase in temperature may therefore lead to a rapid increase in growth of existing trees [1,2] as well as colonization of treeless areas and migration of the tree lines [3]. Other drivers of vegetation change also exist at the forest-tundra and forest-alpine interfaces, like herbivory [4,5,6,7]. Summer farms and grazing by domestic animals have been common in many montane areas. In areas where summer farming and grazing animals have previously limited tree growth, the tree line and the forest may expand towards higher elevations and latitudes when this activity is reduced. Migration of the alpine and northern tree lines will influence future carbon pools. A need therefore exists to monitor vegetation changes in these areas [8].

Airborne laser scanning (ALS) has been proposed as a technique to monitor subtle changes near the tree lines, such as colonization of treeless areas [9], to assist in prediction and estimation of tree height [9,10,11], and to estimate subtle changes in tree and vegetation height [12]. Numerous studies [9,11,13,14,15,16,17,18] have shown that ALS data with point densities of 7–11 points m⁻² may be applied to detect individual pioneer trees in the alpine tree line. With such pulse densities, about 90–100% of the trees with heights greater than 1 m are likely to be hit by laser pulses resulting in echoes with height values greater than zero, i.e., located above the terrain surface. Echoes with heights >1 m are mainly tree echoes. However, because laser beams of pulse lasers will tend to penetrate into the canopies before an echo is triggered, even the maximum of the recorded echo heights will typically underestimate the height of small trees. For example, Ref. [9] reported an underestimation of tree height by 0.43 m to 1.01 m, depending on tree species and tree height. The tree heights in their dataset ranged from 0.11 m to 5.20 m, which implies that the smaller trees will tend to have recorded laser echoes with height values equal to zero even when a tree is hit by a laser pulse. This will limit the sensitivity of ALS as a tool for early detection of recently established trees.

Three-dimensional (3D) photogrammetric point data produced in digital aerial photogrammetry (DAP) from imagery acquired by aircraft and unmanned aerial systems (UAS) have in recent years become a viable alternative to 3D data from ALS in forest and vegetation studies. DAP is an alternative to ALS in, for example, operational inventory of forest resources (e.g., [19,20]), although with somewhat lower accuracy than by use of ALS [21], but still economically competitive to ALS in forest inventory when both cost and utility of the data are taken into account [22]. 3D data from DAP based on UAS imagery have shown great promise in terms of precision of estimates of parameters such as volume and biomass in different forest types – from boreal forests ([23]) to dry African savannah [24].

It is usually cheaper to acquire 3D data from DAP than from ALS and UAS offer greater flexibility for small areas than acquisition from larger airborne platforms. Use of 3D data extracted by DAP from imagery acquired by UAS may therefore be a viable option to assist in detection and estimation of height of small trees in smaller areas in the forest–alpine or forest–tundra ecotones. Further, because a passive sensing technique like optical imaging may better capture the properties on the outer surface of a tree canopy than lasers, which tend to penetrate the canopy, DAP may in fact offer additional advantages compared to ALS for tree detection and estimation of height [19].

Still, to our knowledge, there is no scientific evidence of the performance of 3D data from DAP for small tree detection and height estimation in the forest–alpine or forest–tundra ecotones. A recent study by Ref. [25] may, however, give an interesting perspective on the potential usefulness of 3D data from UAS DAP for estimation of properties of small trees. Ref. [25] estimated mean tree height using DAP from imagery acquired with UAS over young forest stands under regeneration. The mean plot and stand height in their dataset ranged from 0.5 m to 13.0 m with an average value of 2.5 m. They found that the RMSE of the mean height estimate produced with assistance of 3D DAP data was substantially smaller than obtained with ALS. Although this study only addressed mean values of groups of trees (plots, stands) rather than individual trees, their findings may suggest that DAP also may be useful to assist in quantifying properties of individual small trees. In an assessment of UAS 3D point clouds from laser and DAP for individual tree height estimation over tree plantations with an average tree height of 2.6 m, Ref. [26] reported that DAP underestimated height to a greater extent than laser and that the accuracy was greater for laser than for DAP. However, their 3D data had an exceptionally high point density (443–939 points m⁻² for DAP and 325–649 points m⁻² for laser), which makes comparisons with previous findings based on data from airborne platforms and coarser resolution imagery difficult.

There are currently no commercial interests associated with small trees and other vegetation in the tree line ecotones, as there are in the productive forests where small trees represent young forest in an early stage of a rotation after clear-felling (cf. [25,26]). The primary purposes of quantifying and monitoring small trees and other vegetation in the tree line ecotones is therefore partly to keep track of how climate change affects this climate sensitive ecological environment and to enable consistent analysis of the net climate feedback of tree line migration caused by changes in biomass and soil carbon pools and changes in albedo. Ref. [12] proposed several statistical estimators to estimate changes in height of trees and other vegetation using bi-temporal data from ALS in a boreal-alpine ecotone. The estimators were applied to an observation period of six years and it was demonstrated that statistically significant increases in height could be found for relatively small monitoring units (1.5 ha primary monitoring units). The estimation framework was proposed as an operational methodology that could be applied to monitoring over vast tracts of land, and it was based on a so-called model-dependent approach to statistical inference. The method relies on bi-temporal 3D data from remote sensing and temporally consistent ground observations of heights of trees and other vegetation. Because the precision (confidence interval) of the height change estimates will determine the sensitivity of the method to detect subtle changes in height, it is important to quantify to what extent the source of the 3D information (ALS vs. DAP) influence the precision of the estimates.

The current study focused on the boreal-alpine ecotone in particular. The objectives were twofold. (1) We assessed and compared the performance of 3D data from ALS and DAP for prediction of tree height of small pioneer trees and evaluated how tree size and tree species affected the predictive ability of the two types of 3D data. (2) We compared the precision of vegetation height estimates (trees and other vegetation) across the chosen study area using 3D data from ALS and from DAP using the estimators proposed by Ref. [12]. As part of the latter objective, we also evaluated the different sources of uncertainty in the model-dependent mean square error estimators for vegetation height at different spatial scales. It should be noted that this analysis focused on height estimation rather than height change estimation because 3D data from DAP to be compared with ALS data were available just for one point in time. An operational methodology for change estimation could indeed exploit combined bi-temporal ALS and DAP data, but we wanted to quantify the effect of each of them on the precision of estimates without confounding the effects of the two 3D acquisition techniques.

2. Materials and Methods

2.1. Study Area

The study area is located in the municipality of Rollag in southern Norway (60°0′ N 9°01′ E, 910–950 m above sea level) (Figure 1). The entire study was conducted within a 200 m × 600 m rectangle (12 ha). The work took place in the boreal–alpine tree line, which at this location was around 900–940 m above sea level. The main tree species in the trial area are Norway spruce (Picea abies (L.) Karst.), Scots pine (Pinus sylvestris L.), and mountain birch (Betula pubescens ssp. czerepanovii). The total stem density when the study area was established in 2006 was estimated to be 97 trees ha⁻¹, of which only 15 trees ha⁻¹ were taller than 2 m [9].

2.2. Field Measurements

2.2.1. Overview

This study comprised field data from two complementary datasets which both have been subject to analysis in previously published work but updated with new and original measurements for the purpose of the current study.

First, we selected and georeferenced individual trees that could be used as ground-reference (1) for analysis of various types of remotely sensed data and the performance of such remotely sensed data to identify individual pioneer trees; (2) for detection of subtle changes at tree level over time using remotely sensed data; and (3) for development of methods for operational monitoring of vegetation changes over time by assistance of remotely sensed data, which subsequently could be adapted to monitoring over vast tracks of land. (4) An overarching purpose was to establish a long time series of individual tree data that could be used as reference for biological studies of changes in the tree cover in the boreal-alpine ecotone caused by anthropological drivers of change. The time series was established in 2006 and is still maintained, see details in Section 2.2.2. The individual tree dataset was subject to analysis under objectives #1 and #2.

Second, ALS data have to date been the primary source of remotely sensed data under study of pioneer trees. Because the sensitivity of ALS data for early detection of emerging pioneer trees depends on the accuracy of the digital terrain model (DTM) used for normalization of the ALS vegetation echoes, we collected ground reference points with known elevation and ground properties (e.g., terrain form and type of ground vegetation; see details in Section 2.2.3). The primary purpose was to assess systematic errors and accuracy of DTMs constructed from ALS data under different acquisition strategies, such as flying altitudes and pulse repetition frequencies. This dataset also contained valuable information on other vegetation than trees which complemented the individual tree dataset. It was subject to analysis under objective #2.

Figure 1. (a) Location of the study area and (b) design of the trial. Black dots are the ground reference points (n = 426); small circles indicate the locations of the point-centered quarter sampling points (black dots; n = 40), which were centers of the 25 m radius plots (gray circles) arranged along four sample lines; green is defined as forest according to the official N50 topographic map series; light yellow is above the tree line; the black triangle is the reference point (base station).

2.2.2. Individual Tree Data

The individual tree dataset was established in 2006 [9] and re-measured in 2012 [12] and 2017. In 2006, the point-centered quarter sampling method (PCQ; [27]) was used to select individual trees for the study. The tree sampling was conducted according to PCQ at 40 systematically distributed sample points within the 200 m × 600 m rectangle (Figure 1). It should be noted that due to time constraints, only four points were measured on line #4, see Figure 1.

At each point, the closest tree in each of four height classes (<1 m, 1–2 m, 2–3 m, >3 m) within each of the four quadrants around the points defined according to the cardinal directions, i.e., the NE, SE, SW, and NW quadrants, was selected. Thus, a maximum of 16 trees were selected at each point. It should be noted that the selection of trees was restricted to a maximum distance from each sample point of 25 m [28].

The stem positions of the trees were recorded with a real-time differential global positioning system (GPS) and global navigation satellite system (GLONASS) receiver, with a local reference receiver for differential correction located within the study area (Figure 1) at a national reference point of the Norwegian Mapping Authority. The expected accuracy was 3–4 cm [9]. For each tree, tree species, tree height, stem diameter at root collar, and crown diameter in two perpendicular directions (N–S and E–W) were recorded. In total, 342 trees were selected, ranging from 0.11 to 5.20 m in height. Details regarding the field work in 2006 can be found in Ref. [9]. The trees took many different forms, including distinct and solitary trees, groups of trees—for spruce often as krumholtz, and birch appearing as solitary trees as well as in the form of tall scrubby vegetation (Figure 2).

When the field work was repeated in 2012 and subsequently in 2017, the already defined 40 sample points were once again identified with real-time GPS + GLONASS. For each of the 40 points, the PCQ sampling was then conducted independently of the sampling in 2006, but according to the same protocol. Many of the trees selected in 2012 and 2017 were the same as those measured in 2006. During the 2017 campaign, which took place in the period 8 August to 21 September, we identified all trees that had been measured in 2006 and 2012 [12] and which were still alive, including those that were not selected for the 2017 PCQ sample. Thus, for the current study, we measured and analyzed all trees selected into the 2017 PCQ sample in addition to trees measured in 2006 and 2012 and which were alive in 2017. The individual tree recordings in 2017 followed the same protocol as in 2006. In total, 532 trees were recorded, ranging between 0.05 and 6.60 m in height (Table 1). The field-recorded heights of the trees were designated h. The 2017 data have not been published before.

2.2.3. Ground Reference Points

The ground reference point (GRP) dataset was acquired in August 2010 [29]. A total of 440 GRPs were distributed at 5 m intervals in the N-S direction through the center points of the 40 PCQ plots using a measuring tape and a hand-held compass. Real-time GPS + GLONASS was used to record the coordinates of each GRP. For some points the positioning was unreliable due to poor radio link between the base and rover receivers. Thus, 426 of the initial 440 points were available for analysis (Figure 1). For each GRP, three variables were recorded. They were “terrain form”, “terrain surface”, and “vegetation height”. Vegetation height was recorded according to three mutually exclusive height classes, namely <0.10 m, 0.10–0.20 m, and >0.20 m. Terrain surface was recorded according to three mutually exclusive classes, namely “rock/bare”, “lichen/heather”, and “green vegetation”. Heather comprised common heather (Calluna vulgaris), crowberry (Empetrum nigrum L.), cowberry (Vaccinium vitis-idaea), mountain heath (Phyllodoce caerulea), and alpine azalea (Loiseleuria procumbens), but not bilberry (Vaccinium myrtillus L.). The latter was classified as green vegetation.

In the analysis addressing objective #2, a model-dependent approach to inference was adopted, by which the height of trees and other vegetation was estimated for various domains of the study area. Under model-dependent inference, approximate unbiasedness of the estimators can only be assured if the model used for prediction is correctly specified for the domain of application. Since the vegetation in the study area is a mix of scattered trees of different species and other vegetation, the tree dataset alone would probably not warrant appropriate models for prediction of height of all vegetation, see detailed discussion in Ref. [12]. The GRPs with recorded vegetation height were considered complementary to the tree data for combined modelling of vegetation height for all vegetation, trees included.

Since the vegetation height was recorded in ordered classes only, we assigned a height value of 0.05 m to GRPs in the class <0.10 m. An exception was made for GRPs that were classified as “rock/bare” (see example in Figure 2E). They were assigned the value 0 m. A height value of 0.15 m was assigned to GRPs in the class 0.10–0.20 m. Because we did not know the upper height limit in the class >0.20 m, GRPs with vegetation height >0.20 m were discarded. In some cases, trees actually constituted the vegetation cover for points with height >0.20 m (see example in Figure 2C). Among the 426 recorded GRPs, 365 were subject to further analysis (Table 2). In a similar way as for the trees, the vegetation height for the GRPs was designated h.

2.3. Laser Scanner Data

ALS data were acquired under leaf-on conditions using a fixed-wing aircraft. The acquisition took place on 18 June 2017 using a LMS-Q1560 laser scanner system (Riegel, Horn, Austria) and was part of the governmental effort to construct a new detailed terrain model for Norway. The study area was located within a 1169 km² ALS block for which ground control points were established across the entire block for calibration of the height of the laser measurements. The contracted minimum point density for the block expressed as number of first echoes per 10 m × 10 m cell tessellating the block was 5 points m⁻². The data satisfied this criterion within our study area. In fact, in certain parts of the 12 ha study area the point density was >25 points m⁻² due to side overlap between adjacent, parallel strips and a single flight line perpendicular to the main direction of the scanned block. This dataset was used by the data vendor (TerraTec, Oslo, Norway) to produce the official national terrain model by classifying the points as ground and non-ground echoes using the progressive triangular irregular network (TIN) densification algorithm [30] in the TerraScan software [31].

The official national terrain model was used as terrain reference surface for the study. However, a harmonization of the point density was considered important because order statistics were used in the analysis of the tree and vegetation height. Order statistics, such as maximum height, are monotone increasing functions of number of points for a given target area [32]. In order to keep the point density stable across the study area, we discarded all data from the perpendicular flight line and from the overlap zone between adjacent, parallel strips. The resulting mean point density across the 12 ha area was reduced to 6.5 points m⁻² for “first” and “single” echoes.

Normalized height values were computed for all “first” and “single” echoes relative to the official TIN by linear interpolation. Only “first” and “single” echoes with normalized height values were used in the subsequent analysis. All classified ground and non-ground points with negative normalized height values were assigned the value zero. All classified ground points were assumed to lie on the official terrain surface and where therefore assigned the value zero.

2.4. Unmanned Aerial Systems Image Data

UAS image data were acquired under leaf-on conditions using a eBee fixed-wing drone (senseFly Ltd, Cheseaux-Lausanne, Switzerland) weighing approximately 0.41 kg without payload [33]. The acquisition took place on 21 June 2017, three days after the ALS acquisition, using a Canon IXUS127 HS (Canon Inc., Tokyo, Japan) red, green, and blue camera producing three separate 16.1 megapixel images in the red (660 nm), green (520 nm), and blue (450 nm) wavelengths. The drone was equipped with an inertial measurement unit and an on-board Global Navigation Satellite System (GNSS) to control the flight parameters and provide rough positioning during flight operations [33]. The eBee flight plan was managed through senseFly’s eMotion 2 software, ver. 2 [33], installed on a laptop computer. The longitudinal and lateral image overlaps were set to 90% and 80% respectively, although only a longitudinal overlap of 70% was achieved during the survey. The ground pixel resolution was set to 3.9 cm.

Prior to the image acquisition, the position of ten ground control points (GCPs) were determined and measured using the same RTK-based procedure as the one used to record positions of the GRPs. The GCP targets consisted of a set of 1 × 1 cross-shaped 4 cm × 46 cm timber planks painted orange to insure good contrast with the background vegetation.

The UAS images were processed in Agisoft PhotoScan Professional software, ver. 1.4.3 (Agisoft LLC, St.Petersburg, Russia), to produce a 3D point cloud [34]. The processing steps followed in the PhotoScan software together with the parameters used are described in Table 3.

After initial testing, an adaptive camera model fitting was used to perform the alignment. This function automatically selects the camera parameters to be included in the adjustment based on their reliability. The position of the GCPs were imported in the software to improve the estimates of the camera position and orientation. The GCP positions were manually refined and the camera alignment was optimized based on the GCPs to allow a more accurate model reconstruction. The average RMSEs associated with the estimated camera and GCP locations compared to the PhotoScan-estimated values were 0.92 m and 0.06 m, respectively. A dense point cloud was constructed using a medium quality parameter to reduce excessive processing time and a mild depth filtering parameter to remove outliers and reduce noise while allowing height variation between the 3D points. The point density of the resulting dense point cloud was around 50 points m⁻².

At this point we would like to clarify that the DAP methodology applied in this study is what is commonly known in the literature as structure-from-motion (SfM). Using an iterative least-squares solution, camera position and orientation, and scene geometry are simultaneously reconstructed by identification of matching features, or tie points, in multiple images. The output from SfM is fixed into a relative, not absolute, coordinate system [36]. The GCPs were used to transform the data to the absolute coordinate system adopted in this study. For the sake of simplicity, we refer to this methodology as DAP.

Normalized height values were computed for the DAP points relative to the official TIN by linear interpolation. Because the absolute height values of the DAP data were determined according to the elevation of the ten GCPs, we compared the elevation of the GCPs to the elevation of the official TIN. There was a mean difference in elevation of 0.055 m with a standard deviation of the differences of 0.028 m. The mean difference was subtracted from the normalized heights of all the DAP points. All DAP points with negative normalized height values after this subtraction were assigned the value zero.

2.5. Extracion of ALS Data and DAP Data for Trees, Ground Reference Points, and Population Elements

2.5.1. Trees

Crown polygons for the recorded individual trees were constructed as ellipses around the recorded stem positions with the perpendicular crown diameter measurements (N–S, E–W) as minor and major axes. The tree crown polygons were laid atop the ALS dataset and the DAP dataset. Three different polygon height metrics were calculated for each of the two remotely sensed datasets. They were the maximum, mean, and the 90th percentile. The analysis conducted in this study revealed, however, that maximum height produced consistently greater accuracies in for example the tree height modeling and prediction. This is consistent with previous findings showing maximum height to be a strong predictor for tree height of small trees [10]. The fact that only a single point would be present for a large fraction of the small trees, at least in the ALS dataset (see examples in Ref. [9]), would exclude the use of, for example, deciles or moments of the height distributions, which are commonly used for modeling biophysical properties of larger forest trees. Only the results for the maximum values within each crown polygon were documented, and they were designated h_ALSmax and h_DAPmax.

2.5.2. Ground Reference Points

Circular polygons were constructed for each of the GRPs for which the vegetation height recorded in field was < 0.20 m (Table 2). These polygons were laid atop the ALS and DAP datasets. When the GRP dataset was established in 2010, the variable “terrain form” was recorded within a circle with radius 1 m centered on the GRP [29]. However, the variable “vegetation height” was recorded for the point only without any further assessment of the vegetation height surrounding the point. In the current study, we chose a radius of 0.5 m for the circular polygon to which the recorded vegetating height was assigned. Even though we restricted the size of the polygon to a radius of 0.5 m, there was a risk of overhanging and non-recorded trees and bushes aside the GRP but with presence inside the polygon, which potentially could be represented by large positive height values in the remotely sensed point data and for which we had no field observations. We therefore inspected the ALS point data and the DAP data numerically and visually to identify such polygons where there would be a likely mismatch between the field-recorded vegetation height and the height in the remotely sensed data. There were 10 among the 365 polygons for which the ALS data had h_ALSmax > 1.00 m when h_ALSmax was defined in a similar way as for the tree polygons (Section 2.5.1). Because the point density of the ALS data was much smaller than for the DAP dataset, the DAP dataset contained a greater number of polygons with maximum heights > 0.20 m than the ALS dataset. Among the 365 polygons, 24 and 26 polygons in the ALS and DAP data, respectively, had maximum heights > 0.20 m. We decided to discard these polygons from both remotely sensed datasets with maximum height > 0.20 cm in either of the two datasets. Thus, 327 polygons were retained for the analysis addressed in objective #2. Because we did not have ground observations of the vegetation height within the polygons, there is uncertainty associated with the final data resulting from this data screening. For the retained polygons, h_ALSmax ranged between 0 and 0.19 m with a mean value of 0.07 m. For DAP, the maximum value (h_DAPmax) was in the range 0–0.19 m with a mean value of 0.03 m.

2.5.3. Population Elements

Under objective #2, we compared the precision of vegetation height estimates across the chosen study area using the different remotely sensed 3D data. The 200 m × 600 m study area was tessellated into regular population elements of 1.5 m² in size. This size was a compromise between the size of the GRP polygons (0.79 m²) and the tree polygons (2.10 m²) subject to modeling under objective #2 (Section 2.7.1). The resulting 79,242 population elements constituted the overall population in a statistical sense (Section 2.7.2). The number of elements was slightly smaller than the theoretical size of 80,000 elements due to a small water body in the study area that was excluded from the population. The maximum point height was extracted for each individual element for each of the three 3D remotely sensed datasets.

2.6. Analysis—Objective #1

Under objective #1, we assessed and compared the performance of 3D remotely sensed data from ALS and from DAP for prediction of tree height of small pioneer trees and evaluated how tree size and tree species affected the predictive ability of the two types of 3D data. The main steps of the analysis are shown in Figure 3 for the sake of clarity and overview.

A first step of the assessment was to analyse to what extent the different 3D data were sensitive to the small trees, i.e., if positive height values of the point clouds could be expected for a tree. Previous research (e.g., [9]) suggests that this will depend on factors such as tree height, size of the tree crown, tree species, the point density of the remotely sensed data which in the current study clearly differed between ALS and the DAP 3D data (Section 2.3 and Section 2.4), and degree of laser pulse penetration into the tree crowns for ALS as opposed to a likely depiction of the outer surface of a tree crown with DAP data.

A logistic regression analysis with binary response supported this assessment. Among the 532 field-measured trees (Table 1), two trees had a substantially higher maximum height in the 3D remotely sensed datasets (1.51–2.86 m) than field-measured tree height. These two trees (trees #48 and #2100) had likely overhanging branches from taller, neighbouring trees and they were discarded from all subsequent analysis. They were both spruce trees. Fifteen trees with maximum heights in the remotely sensed datasets 0.20–0.97 cm greater than the corresponding field-measured tree heights were retained because we were unable to identify a specific reason for this pattern. We were thus rather conservative in the treatment of potential outliers. The logistic regression analysis was based on the remaining 530 trees (Table 4).

For each of the two remotely sensed datasets (ALS, DAP) every tree was classified as POSITIVE if the maximum height for the tree polygon (h_ALSmax or h_DAPmax) had a positive value. If the maximum value was zero or the tree polygon did not contain any points for a given 3D remotely sensed dataset, the tree was classified as ZERO. The analysis was carried out in two steps. First, a general logistic regression model reflecting all effects mentioned above was fitted. This model of the probability of POSITIVE was formulated as follows:

\log (\frac{π_{POSITIVE}}{1 - π_{POSITIVE}}) = β_{0} + β_{1} {DATA}_{DAP} + β_{2} {SP}_{pine} + β_{3} {SP}_{birch} + β_{4} h + β_{5} A + ε

(1)

where

π_{POSITIVE}

is the probability of maximum height of a tree polygon with a value greater than zero using observations from datasets (DATA) ALS and DAP. DATA_DAP is a dummy variable for DAP (DATA_DAP = 1 if DAP). Further, SP_pine is a dummy variable for pine (SP_pine = 1 if pine), SP_birch is a dummy variable for birch (SP_birch = 1 if birch), h (m) is the tree height measured in field, and A (m²) is the elliptic tree crown area according to the field recordings of crown diameters. The betas (

β_{0}

,

β_{1}

,

β_{2}

,

β_{3}

,

β_{4}

,

β_{5}

) are parameters to be estimated. Maximum-likelihood computation for fitting of the logistic model in Equation (1) was performed with the LOGISTIC procedure of the SAS package [37].

It should be noted that the reference in the model is the ALS dataset and the tree species spruce. Thus, the estimated parameters for the DATA and SP variables express differences relative to this reference (differences in intercept of the model). The effects of, for example, DAP relative to ALS will be expressed directly by the parameter estimate of the former variable. Finally, a Wald chi-square test was performed to test the null hypothesis that the parameter estimates for the two dummy variables for tree species were equal.

One of the results of the first step of the logistic regression analysis was that the effects of tree species on probability of detected trees differed significantly in the statistical sense between some of the species (p < 0.001, p = 0.037, and p = 0.080, respectively), see Table 7. On the other hand, the effect of dataset was not significant in the statistical sense (p = 0.733; Table 7). Further, both tree height and crown area were statistically significant (p < 0.001, Table 7).

Although some effects in the basic model in Equation (1) were significant and others not, some of the effects are likely confounded which may lead to incorrect interpretations. For example, the point density of the DAP point cloud was around 50 points m⁻² whereas the corresponding density in the ALS data was 6.5 points m⁻². It is therefore reasonable that the area of a tree crown polygon is more critical for a crown polygon having a positive height value in the ALS data than in the DAP data. Likewise, tree species may affect the probability of positive height values differently in the two 3D remotely sensed datasets since laser pulses tend to penetrate the tree crowns before an echo is triggered while DAP may better capture the surface of a crown. Crowns of different species have different densities of biological matter (foliage and branches) and different shapes which may influence the point clouds for the two 3D remote sensing techniques differently. A more complex model was therefore formulated. In the model in Equation (1), it was assumed that the effect of dataset was similar for each individual tree species, i.e., that the different datasets only affected the intercept of the model. In addition to the basic effects accommodated by Equation (1), we allowed the effects of tree species to vary between the two 3D remotely sensed datasets. This was accommodated by introducing separate regression coefficients for tree species for the different datasets. Further, in the former model, it was assumed that the effect of dataset was constant across the entire range of tree heights and tree crown areas. In the second step of the analysis, we allowed the effects of dataset to vary according to the magnitude of the tree height and the tree crown area as well. This was accommodated by introducing separate regression coefficients for tree height and crown area for each individual dataset in the model:

\begin{matrix} \log (\frac{π_{POSITIVE}}{1 - π_{POSITIVE}}) = β_{0} + β_{1} {DATA}_{DAP} + β_{2} {SP}_{pine} + β_{3} {SP}_{birch} + β_{4} {SP}_{pine} \cdot {DATA}_{DAP} + \\ β_{5} {SP}_{birch} \cdot {DATA}_{DAP} + β_{6} h + β_{7} h \cdot {DATA}_{DAP} + β_{8} A + β_{9} A \cdot {DATA}_{DAP} + ε \end{matrix}

(2)

Similar to the model in Equation (1), the ALS dataset and the species spruce represent the reference in the model in Equation (2). Thus, the estimated parameter for the DATA variable (β₁) will express the overall difference in intercept relative to ALS, while parameters for the two SP variables (β₂ and β₃) will express the overall difference in intercept relative to spruce. The height and crown area parameter estimates (β₆ and β₈) will express the general effects of these two variables. The estimated parameters for the respective products of the DATA variable and the two species variables (β₄ and β₅), and the DATA variable and height and crown area (β₇ and β₉), will express differences in parameters for pine, birch, h, and A for DAP relative to ALS. Finally, a Wald chi-square test was performed to test the null hypothesis that the parameter estimates for pine and birch (β₂ and β₃) were equal. Likewise, Wald chi-square tests were performed to test the null hypotheses that the parameter estimates of the products of the DATA variable and the two SP variables (β₄ and β₅) were equal.

The second part of the analysis under objective #1 entailed modeling of tree height of the pioneer trees and evaluation of how tree size and tree species affect the predictive ability of the two types of the 3D remotely sensed data. Following a similar strategy as in the logistic regression analysis, we formulated a model with tree height observed in field as dependent variable and maximum height for each tree crown polygon in each of the 3D remotely sensed datasets and the factors to be evaluated as independent variables:

\begin{matrix} h = β_{0} + β_{1} {DATA}_{DAP} + β_{2} {SP}_{pine} + β_{3} {SP}_{birch} + β_{4} h_{\max} + β_{5} h_{\max} \cdot {DATA}_{DAP} \\ + β_{6} h_{\max} \cdot {SP}_{pine} + β_{7} h_{\max} \cdot {SP}_{birch} + ε \end{matrix}

(3)

where h_{max =} h_ALSmax when the dataset was ALS and h_{max =} h_DAPmax when the dataset was DAP. The other variables in the model were defined as above. The analysis was based on 389 of the trees for which h_max ≥ 0 in both datasets (Table 5). The least squares method for fitting the model was applied by using the REG procedure of the SAS statistical software package [37]. F-tests were performed to test the null hypotheses that (1) the parameter estimates for the two SP variables (β₂ and β₃) were equal and that (2) the parameter estimates of the products of the two SP variables and h_max (β₆ and β₇) were equal.

Finally, leave-one-out cross validation was adopted to assess the predictive ability of the two 3D datasets. However, the model in Equation (3) assumed the error variance to be the same for both 3D datasets. Separate models would be required if the error variances could be assumed to be different ([38], p. 173). The cross validation was therefore performed for separate models constructed according to:

h = β_{0} + β_{1} {SP}_{pine} + β_{2} {SP}_{birch} + β_{3} h_{\max} + β_{4} h_{\max} \cdot {SP}_{pine} + β_{5} h_{\max} \cdot {SP}_{birch} + ε

(4)

where h_{max =} h_ALSmax when the model was constructed by using the ALS data. h_{max =} h_DAPmax when the model was constructed by using the DAP data.

In the cross validation of the two respective models, the prediction accuracy was assessed separately for different classes according to tree height and different tree species. The assessment was based on the differences between predicted and observed tree height for individual trees according to the statistics (1) mean difference, (2) standard deviation of the differences (Stdev), and root mean square error (RMSE). These statistics were also calculated across all trees for each individual 3D dataset and the null hypothesis of homogeneity of prediction variances among the two 3D datasets was tested by Levene’s F-test [39] in the GLM procedure of the SAS package [37].

2.7. Analysis—Objective #2

To provide initial overview, the main steps of the analysis under objective #2 are shown in Figure 4.

2.7.1. Models for Vegetation Height

The first step of this analysis entailed construction of regression models used for prediction of vegetation height. These predictions were subsequently used to estimate mean vegetation height following model-dependent inferential principles (see details in Section 2.7.2). A critical assumption for model-dependent estimators to be approximately unbiased is that the model is correctly specified for the area of application. Misspecification of the model can lead to serious bias in the estimators [40].

Considerable research has been devoted to development of combinations of sampling designs and estimators that protect the inference from the adverse biasing effects of model misspecification [41]. A primary finding has been that model-dependent estimators tend to be biased unless the sample is balanced, i.e., the sample moments of the distribution of the independent variables equal the corresponding population moments [42]. In the current study, the dataset available for model construction was composed of the 389 tree polygons (Table 5) and the 327 polygons constructed for the 327 GRPs (see Section 2.5.2), i.e., 716 observations in total. The 327 GRP polygons were added to the tree dataset to better represent the vegetation structure in the study area. In the remote sensing community, criteria for characterizing the appropriateness of data and models for model-dependent inference have received little attention. A couple of exceptions are the studies by Refs. [12,43] who made explicit reference to the effects of sample imbalance on bias.

h_max was the independent variable to be used in the models that were to be constructed. Prior to choosing a specific model form and model fitting technique, we constructed the distributions of h_max for the sample of 716 observations and the 79,242 population elements that constituted the population and calculated the four first moments of the distributions where h_max = h_ALSmax when we used the ALS data and h_{max =} h_DAPmax when the DAP data were used. As is evident from the graphical presentation of the distributions (Figure 5), the population distributions were extremely skewed towards small values of h_max whereas the samples had few observations in the lower end of the distributions. This is also evident in the calculated moments (Table 6). With a very large majority of population elements in the lower end of the distributions it is obvious that we should strive for models with appropriate prediction properties in that part of the population.

Non-linear models of the form

y = β_{0} x^{β_{1}}

, where y is the dependent variable and x the independent variable, are commonly used to construct prediction models for biophysical parameters such as tree height and forest biomass with 3D data from ALS and DAP. One reason for choosing this model form is the fact that the predicted value of y will be zero when x is zero, which is a logical property in many applications. In our case, most of the population elements have a value of the independent variable equal to or close to zero. However, it is well documented that especially lasers tend to penetrate into tree and vegetation canopies before an echo is triggered. This is well illustrated for height of small trees in, for example, the study by Ref. [9], Figure 4. Therefore, forcing the model through origo will most likely result in a general tendency of under-prediction at the lower end of the range of the dependent variable and thus a biased estimator of vegetation height. The same would be the case with a simple linear zero-intercept model like

y = β_{1} x

. Based on the 716 observations, we therefore chose to construct simple linear models of the form:

h = β_{0} + β_{1} h_{\max} + ε

(5)

where h_{max =} h_ALSmax when the model was constructed by using the ALS data. h_{max =} h_DAPmax when the model was constructed by using the DAP data.

Now, given the large differences in population and sample distributions of h_max, measures should be taken to reduce the risk of inappropriate models for the population in question and thus biased estimators. We chose to adopt a weighting scheme in the model fitting by which weights were assigned to each of the 716 sample observations in such a way that the weighted sample distributions of h_max approximated the population distributions of h_max. First, for each of the two remotely sensed datasets, we calculated the h_max values corresponding to the nine percentiles p10, p20, …, p90 of the population distributions and formed ten equally large, ordered classes <p10, p10–p20, …, p80–p90, >p90. Then we assigned the 716 sample observations of each 3D dataset to these ten classes according to the h_max value of each individual sample observation. Since each class in the population constitutes exactly 10% of the population, each class was given a total class weight of 0.1 (1/10). Further, each sample observation was given a class-specific weight depending on how many sample observations that were assigned to a particular class. For example, if a class contained 100 sample observations, each sample observation would have a within-class weight of 0.01 (1/100). Finally, we calculated individual weights for each sample observation by multiplying the class weight (0.1) by the within-class weight. This weighing scheme ensured that the sum of the weights was always equal to 1. It should be noted that for DAP, 25% of the population distribution had an h_max value of zero. Thus, all sample observations falling in the three lowest classes (up to p30) where assigned the same weight. The weighted sample distributions of h_max are illustrated graphically in Figure 5 and their moments are presented in Table 6. As is evident from the graphical presentation as well as the calculated moments, the weighted sample distributions were generally much more similar to the population distributions than the sample distributions in their original form.

The models were constructed according to Equation (5) and the weighting scheme outlined above using the least squares method as implemented in the lm function of the stats package [44]. White’s and the Studentized Breusch-Pagan test statistics were calculated using the white_lm and breusch_pagan functions, respectively, of the skedastic package [45]. Both tests rejected the hypothesis of homoscedastic residuals for ALS as well as for DAP (p < 0.001). In the presence of heteroscedasticity, heteroscedasticity-consistent covariance matrix estimators were used, as recommended by Ref. [46]. The heteroscedasticity-consistent covariance matrix estimators of type HC₃, presented by Ref. [47], were computed using the sandwich package [48,49] in R.

The constructed models were subsequently applied for prediction for every single 1.5-m² population element across the entire population. The predictions were performed for each of the two individual 3D remotely sensed datasets with h_max of each element as predictor variable. The result was four prediction maps for vegetation height, two for each of the 3D remotely sensed datasets using the weighted and unweighted models, respectively. Two of these maps are presented in Figure 6.

2.7.2. Estimator for Vegetation Height

Based on the predictions, estimates of mean vegetation height were then produced for each of the 3D remotely sensed datasets and by using the weighted and unweighted models, respectively, and for different domains of the population (see details in Section 2.7.4). The general approach to estimation adopted in this study is known in the literature as the area-based approach. In the current study, we did not have a probabilistic sample that would have allowed design-based inference. Model-dependent estimation and inference was therefore adopted. Model-dependent inference has been applied frequently in recent years when estimating biomass, volume, and other biophysical parameters using remotely sensed data (e.g., [50,51,52,53]). An overview of the concept and a brief review of recent studies can be found in Ref. [54]. In the current study, we adopted the estimators in the way they were formulated by Ref. [12]. The context in the study by Ref. [12] was slightly more complex than in the current study. First, they estimated changes in height in bi-temporal data, not the height using single-date data. Second, they estimated height changes of trees in addition to height changes of all vegetation, which required two separate models—one for classification of trees and other (non-tree) vegetation and one for prediction of changes in height. This complicated in particular the estimation of the uncertainty.

Now, let U be the entire population of elements (the 79,242 elements of size 1.5 m² tessellating the study area) where U = {1, …, k, …, N}. Furthermore, let

{\hat{h}}_{k}

denote the predicted vegetation height according to the model in Equation (5) for element k. Thus, the collection of spatially distributed predictions for the N elements constitutes the vegetation height maps mentioned above (Figure 6C,D). Mean vegetation height across the entire study area can be estimated by the point estimator:

\bar{\hat{h}} = \frac{\sum_{k \in U} {\hat{h}}_{k}}{N} .

(6)

This estimator can be used for smaller domains within the population as well.

2.7.3. Estimator of Mean Square Error

The term mean square error (MSE) rather than variance was used to characterize the uncertainty of the model-dependent estimator in Equation (6) because the model-dependent estimator of the population mean cannot be assured to be unbiased. For large areas, model-dependent MSE estimators will, in general, depend mainly on the uncertainty of the estimates of the model parameters (e.g., [55,56,57]). For small domains, an additional source of uncertainty must often be accounted for, namely the residual variance. Ref. [58] derived a model-dependent variance estimator that accounted for residual variance and incorporated a spatial autocorrelation structure of the residuals. Ref. [52] demonstrated with empirical data of timber volume from small forest stands and auxiliary data from DAP that ignoring the residual variance component may induce bias in the mean square error estimator. In the study on vegetation height change conducted in the current study area [12], the analysis suggested that “… most of the mean square error estimates (>95%) of the estimators will be accounted for by quantifying the variance attributable to the model parameter uncertainty”. Nevertheless, it cannot be assumed that the magnitude and spatial structure of the residual variance of vegetation height predictions necessarily follow the same patterns as the height change predictions. In the current study, we therefore addressed all the mentioned components, i.e., (1) the variance due to uncertainty of the model parameters and the residual (2) variance and (3) spatial covariance components. These three components were treated as additive to obtain the total mean square error and will be described in detail below.

Under the model-dependent inferential framework, the variance due to uncertainty of the model parameters can be approximated either by using a closed-form formula based on a first-order Taylor series approximation (see e.g., [57]), or by using Monte Carlo simulations in the form of, for example, parametric bootstrap. Both approaches to estimation have been adopted in forest- and vegetation-related studies in recent years (e.g., [50,59]). Ref. [60] noted a few properties of parametric bootstrap that makes it attractive and demonstrated the technique with ALS data. In the current study, we chose to adopt this technique, as in Ref. [60]. In the following, it is assumed that the model parameter estimates of the predictive model (

β

in the model in Equation (5)) follow asymptotically multivariate normal distributions, i.e.:

\hat{β} ~ N (E [β], Σ_{\hat{β}}),

(7)

where the expected value of the vector of

\hat{β}

estimates is

E [β]

and

Σ_{\hat{β}}

is the heteroscedasticity-consistent estimates of the variance–covariance matrix of

\hat{β}

.

By sampling from the multivariate distribution in Equation (7), a large parametric bootstrap sample of random vectors

β_{PB} ~ N (\hat{β}, Σ_{\hat{β}})

was generated. The sample was denoted S_PB, where S_{PB =} {1, …, l, …, M}. This sample can be used to produce new predictions of

h

, according to Equation (5). Predictions of

h

were produced for all M random vectors

β_{PB}

and for all N population elements. Thus, we obtained unique predictions

{\hat{h}}_{PB, k, l}

for

k \in U

and

l \in S_{PB}

. A parametric bootstrap variance estimator for the point estimator in Equation (6) is:

var {(\bar{\hat{h}})}_{par} = \frac{1}{M - 1} \sum_{l \in S_{PB}} {({\hat{h}}_{PB, l} - {\hat{h}}_{PB})}^{2},

(8)

where:

{\hat{h}}_{PB, l} = \frac{1}{N} \sum_{k \in U} {\hat{h}}_{PB, k, l}

(9)

and:

{\hat{h}}_{PB} = \frac{1}{M} \sum_{l \in S_{PB}} {\hat{h}}_{PB, l} .

(10)

Analytical estimators for residual variance under heteroscedasticity have been adopted in previous analysis of important parameters encountered in forest surveys, for example timber volume [52]. In the current study, every geographical domain subject to estimation had sample units (ground observations of vegetation height) that could be used to provide estimates of residual variance. Thus, for a particular domain with a sample S with n sample units, S = {1, …, k, …, n}, the residual variance for the point estimator for mean vegetation height (Equation (6)) was formulated as [12]:

var {(\bar{\hat{h}})}_{res} = \frac{1}{N n} \sum_{k \in S} {(h_{k} - {\hat{h}}_{k})}^{2} .

(11)

Residual covariance of substantial magnitude, as compared to the other sources of uncertainty when estimating forest resource parameters for small areas, has been encountered for shorter distances in several studies (e.g., [52]), while at greater distances—and consequently for larger areas—the residual covariance is often assumed or found to be negligible in magnitude (e.g., [51]). However, as noted above, Ref. [12] found the residual covariance to be negligible even for areas as small as 1.5 ha. Analytical ways of addressing the residual covariance have been demonstrated by e.g., Ref. [52].

Quantifying the spatial autocorrelation of the residuals is essential in the analysis of the residual covariance. Spatial correlation (ρ) is often estimated from the model prediction residuals by constructing a correlogram. Assuming that a correlogram has been fitted and that ρ can then be predicted to obtain predicted values of the correlation for all combinations of the N population elements in U, the residual covariance for the point estimator of mean vegetation height for a particular domain for which we had actual observations of the residuals, can be estimated by [12]:

cov {(\bar{\hat{h}})}_{res} = \frac{1}{n N^{2}} \sum_{k \in S} {(h_{k} - {\hat{h}}_{k})}^{2} \sum_{k \in U} \sum_{l \in U} {\hat{ρ}}_{k l}, k \neq l,

(12)

where

{\hat{ρ}}_{k l}

is the predicted value of the residual correlation between elements k and l in U and the correlogram was fitted using the residuals

h_{k} - {\hat{h}}_{k}

and

h_{l} - {\hat{h}}_{l}

.

2.7.4. Calculations

The first step of the analysis was to construct the regression models for the ALS and DAP data according to Equation (5) based on the 716 weighted sample observations. As noted above, for the sake of comparison of estimates we also constructed simple models according to Equation (5) without adopting the weighting scheme. These models are nevertheless not documented in the current article.

Once the regression models were constructed, we proceeded with the estimation. First, we estimated mean vegetation height across the entire study area, according to Equation (6), by using model predictions from the height models (Equation (5)).

Two cases of special interest were identified. First (Case A), we tessellated the study area into 1 ha cells that may serve as the primary mapping and monitoring units in, for example, a tree line monitoring program (see Figure 6B). This size was chosen for demonstration purposes, and this size can be changed to any size found meaningful for a particular application. 1 ha resolution may be found useful for some applications, while, for example, 100 m² resolution may be found relevant for other applications. We could even consider the primary resolution of 1.5 m², but some level of aggregation is in many cases perhaps easier to comprehend and interpret. Another reason for choosing an area as large as 1 ha was that a restricted number of cells (12) would ease the interpretation of any potential differences in the properties of the estimates obtained from the two different 3D remotely sensed datasets.

Second (Case B), the current study area is subject to variations in wind exposure, temperature, soil depth, and moisture due to its location on and along a small ridge with variations in aspect. In particular, bedrock outcrops and mires are abundant in certain parts of the area. These factors tend to influence establishment of different types of vegetation, its height and density. We wanted to assess any potential differences in estimated height and precision for the two 3D remotely sensed datasets which could be attributed to different vegetation structures. The area was therefore delineated into four distinct sub-regions (see Figure 6A,B) based on visual interpretation of the orthophoto (natural colors). Thus, this delineation was independent of the height information in the ALS as well as the DAP point clouds. In the visual interpretation, we sought homogeneity within each polygon with respect to appearance of trees, greenness of the ground vegetation suggesting existence of bushes, and lighter areas suggesting lichens and bedrock outcrops.

Finally, we estimated the various MSE components for the entire study area and for each geographical domain (twelve 1 ha cells and four sub-regions) subject to analysis. First, we constructed separate correlograms for the residuals for each domain of interest. Correlations based on the observed residuals were graphed and then visually inspected. For natural phenomena, it is common to observe greater correlations at shorter distances and then declining correlations with increasing distances. Such spatial structures can be modeled, e.g., by some exponential model form. In the current dataset, no such spatial structure could be observed (see examples in Figure 7). Similar to the study on vegetation height change [12] linear regression models of the form:

ρ = β_{0} + β_{1} D + ε

(13)

appeared to be suitable for vegetation height as well, where D is the distance between pairs of residuals. In total, 34 correlogram models were constructed according to Equation (13), i.e., separate models for each domain subject to estimation for each of the two 3D remotely sensed datasets.

The results of the model construction revealed non-significant estimates (p > 0.05) of

β_{0}

as well as for

β_{1}

for 16 of the 17 models constructed for ALS and for 15 of the 17 models for DAP for

β_{0}

and 14 of the 17 models for

β_{1}

. Thus, the results suggested constant residual correlation equal to zero in 30 of the 34 cases. We therefore concluded that the residual covariance components of the estimates of MSE (

\hat{MSE}

) would be zero and could be ignored in these 30 cases. Even in the four remaining cases, the autocorrelation was negligible, even though the parameter estimates were statistically significantly different from zero. One of the three cases for DAP was the model constructed across the entire study area, which also had the smallest p-values (Figure 7, bottom). As is evident from visual inspection of the graphical presentation in the figure, there is hardly any structure in the spatial correlations of the residuals. For all the 34 cases, the residual covariance was therefore ignored.

In the characterization of the uncertainty of the various estimates of vegetation height, we proceeded with estimating MSE by ignoring the residual covariance and adding the two remaining variance components, accounting for model parameter uncertainty (Equation (8)) and residual variance (Equation (11)). Subsequently, we calculated the standard error of the mean estimate (SE; square root of

\hat{MSE}

). The proportions of

\hat{MSE}

that were attributed to the residual variance were also calculated.

In the bootstrap variance estimation used to characterize the model parameter uncertainty (Equation (8)), the bootstrap was repeated M times until the mean variance over replications stabilized. Stabilization was judged by visual inspection of graphical plots of the mean variance. For some of the estimates, stable variances were obtained with fewer than M = 1000 simulated realizations, while for other estimates, M = 2000 realizations were needed to reach stable estimates. Thus, M = 2000 realizations were used in all simulations reported in this study.

3. Results and Discussion

3.1. Performance of Tree Height Prediction with ALS and DAP Data (Objective #1)

3.1.1. 3D Data Sensitivity to Small Trees

As is evident in Table 4, among the 530 trees subject to analysis, most of the trees greater than 1 m in height had positive maximum height values in the ALS as well as the DAP data. It is also evident that a greater portion of the trees lacked height observations in the ALS data than in the DAP data, which can be attributed to the greater point density of the DAP data than the ALS data. A surprising feature was noticed for some of the trees in the DAP data. For example, for a birch tree (tree #375) with h = 2.52 m and a crown area of A = 0.66 m², the maximum height in the DAP data was h_DAPmax = 0 whereas the corresponding height in the ALS data was h_ALSmax = 2.23 m. Tree #375 appeared close by a pine tree (tree #1102) (gap between crowns of 0.2 m) with h = 3.00 m and crown width of 1.83 m². Tree #1102 had h_ALSmax = 2.33 m and h_DAPmax = 0.38 m. Obviously, the DAP algorithm was unable to follow the outline of the crowns of these trees. Both trees are clearly visible to the human eye in the orthophoto and appears as a solitary cluster of two trees. This illustrates that the DAP algorithm sometimes produced spurious results.

The results of the regression analysis of probability of trees having a maximum height in the ALS and DAP datasets greater than zero (model in Equation (1)) suggested statistically significant differences among two of the tree species (p < 0.037; Table 7) while the effect of dataset (ALS versus DAP) was not significant (p = 0.733; Table 7), see Section 2.6. Both tree height and crown area were statistically significant (p < 0.001, Table 7). However, a more detailed regression analysis following the model in Equation (2), suggested that the statistical significance of many of the effects in the initial regression analysis following Equation (1) were misleading because effects were confounded.

The detailed analysis showed that there is generally a higher probability of positive height values for the trees in the DAP data than in the ALS data (DATA_DAP: p = 0.009; Table 7). There was hardly any effect of tree species at all, with p-values for the various parameter estimates for the tree species dummy variables and the Wald chi-square test ranging from p = 0.194 to p = 0.637 (Table 7). There was, however, a slightly negative and significant effect for the interaction of pine and DAP data (

{SP}_{pine} \cdot {DATA}_{DAP}

: p = 0.045; Table 7), indicating a smaller probability of pine trees having positive height values in the DAP data than in the ALS data. The pine trees in the study area have to a large extent more compact tree crowns than the two other species, which may lead to an early triggering of ALS echoes as the laser pulses penetrate into the tree crowns. The greater probability of positive height values for ALS for pine trees might therefore be attributed to better conditions for laser echo triggering in compact tree crowns.

The most striking result of the detailed analysis in Equation (2) was found for tree height and crown area. There was a very strong effect of crown area for ALS (A: p < 0.001; Table 7) with substantially greater probability of positive height values for the trees with increasing crown size, whereas this effect was strongly dampened for DAP expressed by the interaction of crown area and DAP data (A · DATA_DAP: p < 0.001; Table 7). This is a very logical result and confirmed our expectations (see Section 2.6). Crown area is critical for even hitting the smaller trees with ALS pulses, given the point density in this study of 6.5 points m⁻², whereas this certainly is less critical for DAP, for which the point density was 50 points m⁻². Further, the height as such was not statistically significant for ALS (p = 0.540; Table 7), at least as long as crown area is included in the model. It should be noted that h and A were inter-correlated with Pearson r = 0.74. For DAP there was a tendency of greater probability of positive height values for the trees with increasing height than for ALS, but this effect was not significant in the statistical sense (h · DATA_DAP: p = 0.052; Table 7).

3.1.2. Influence of Tree Size and Tree Species on Tree Height Modeling

The second part of the analysis under objective #1 entailed modeling of tree height of the pioneer trees and evaluation of how tree size and tree species affected the predictive ability of the two types of the 3D remotely sensed data using the 389 trees documented in Table 5. The regression analysis following the model in Equation (3) confirmed a statistically significant overall greater height of trees measured in field than by the two 3D remote sensing techniques as expressed by the positive intercept of the model (p < 0.001; Table 8). This tendency was smaller for DAP than for ALS as expressed by the negative sign of the dummy variable DATA_DAP (p < 0.001; Table 8) but this difference between ALS and DAP was compensated for as the height of the trees increased. There was a strong effect of increasing differences between h and h_max with increasing tree heights in general (p < 0.001; Table 8) with an additional effect with increasing tree height for DAP relative to ALS (h_max · DATA_DAP: p < 0.001; Table 8). The stronger underestimation of tree height for the DAP data than for the ALS data with increasing tree height can easily be seen even by visual inspection of the data (Table 5). This is a somewhat surprising finding and contrary to the expectation that DAP may better capture the properties on the outer surface of a tree canopy of small trees than lasers, and that the DAP height points thus could resemble the surface of the crowns. This result is nevertheless consistent with recent findings reported by Ref. [26]. Whether this is a general phenomenon or other DAP algorithms/software packages and parameter settings would produce substantially different results is unknown and out of the scope of the current study. However, this might be a subject for further studies.

The analysis further revealed that the effect of pine as opposed to spruce and birch was negative on modelled tree height, meaning that differences between h and h_max for the two 3D remotely sensed datasets were greater for pine than for the two other species (p < 0.001; Table 8). Especially the difference between pine and birch was somewhat surprising (F-test: p < 0.001; Table 8) since the pine trees in most cases had compact and regularly shaped crowns which intuitively should be easier to handle using DAP and even result in early triggering of echoes in laser scanning, whereas the birch trees often constituted shrubby and more open vegetation forms (see illustrations in Figure 2). The separate analysis of ALS and DAP following the model in Equation (4) even showed that the specific effect associated with pine trees was more pronounced for DAP (p < 0.001; Table 8) than for ALS (p < 0.05; Table 8). This is counterintuitive, given the expectation that a more regular surface as represented by a pine tree crown would be better suited as an object for DAP. Thus, to get a better understanding of the performance of the DAP, we inspected the data for individual pine trees in more detail by looking at their spatial context in the 3D DAP data as well as in the orthophoto. Among the 64 pine trees with h_max = 0 m in the DAP data, 16 trees had h_max > 0 m in the ALS data despite the much smaller point density in the ALS point cloud. These 16 trees had a mean tree height of 0.68 m (range: 0.16–1.09 m) and mean crown area of 0.36 m² (range: 0.02–0.93 m²). Most of these trees appeared as solitary objects or as objects surround by only low vegetation. Thirteen of them were found in the open areas dominated by mosses and lichens in sub-region II, see orthophoto in Figure 6A. A reason for the DAP height values of zero for these trees might be an inability of the DAP algorithm to capture height variations when the objects have a small horizontal extension, i.e., a smoothing effect. As suggested below, the DAP algorithm may behave differently for larger groups of trees even if the height is not necessarily greater.

When the ability of ALS and DAP data to predict tree height by regression models was assessed by leave-one-out cross validation, separate models were fitted for ALS and DAP data respectively, following the model in Equation (2). The choice of fitting separate models was well justified by rejection of Levene’s F-test of the null hypothesis of homogeneity of prediction variances between the two 3D datasets (p = 0.0032). The performance of the two models in terms of R² and RMSE were 0.90 and 0.83, and 0.36 m and 0.46 m for ALS and DAP, respectively (Table 8). The RMSE of the model fitting corresponded closely with that of the cross validation of the same models (0.37 m and 0.47 m, Table 9). This is well in line with previous findings for small pioneer trees using ALS. For example, Ref. [18] reported an R² value of 0.86 and an RMSE value of 0.49 m in the same study area as the current study, but with data from another point in time and a laser scanning with greater point density. However, they used a simpler model with just a single independent variable (h_max). For DAP, the current study is to our knowledge the first effort to model and predict height of individual pioneer trees in the boreal-alpine ecotone. Ref. [25] nevertheless modelled height of young forest plots with trees in a similar or somewhat greater height range (range: 0.5–13.0 m; mean: 2.5 m) than in the current study. They addressed mean height of plots and stands rather than individual trees. They reported similar or better prediction performance in terms of RMSE for DAP data than for ALS. However, their plots had a mean stem number of 5572 ha⁻¹ as opposed to 97.2 ha⁻¹ for the current study area [9]. A reason for a better performance of DAP for dense plots as opposed to more solitary individuals or clusters of trees might be an ability of the DAP algorithm to follow the surface of larger objects in the form of a forest plot with denser and more continuous canopy surface as opposed to solitary objects, which in our data tended to be smoothed out.

The cross validation also revealed a tendency of greater standard deviation with increasing tree height, at least for DAP (Table 9). This heteroscedasticity should be mitigated by more sophisticated approaches to modeling when the models are to be used for estimation (see Section 2.7.1). For both ALS and DAP there was also a tendency of statistically significant over-prediction for small values of tree height and under-prediction of large values of tree height for spruce in particular (p < 0.001; Table 9).

3.2. Estimation of Vegetation Height with ALS and DAP Data (Objective #2)

3.2.1. Model Construction

Results for the models constructed for vegetation height based on the 716 sample observations are displayed in Table 10, which also includes the estimated heteroscedasticity-consistent variance–covariance matrices for the parameter estimates that were needed for the parametric bootstrap variance estimation. All parameter estimates for ALS as well as for DAP models using weighted as well as unweighted sample observations were highly significant (p < 0.001) and the models displayed an adequate fit to the data with R² values ranging from 0.80 to 0.91 and RMSE values ranging from 0.36 to 0.49 m. We are unaware of any previous studies that have constructed similar prediction models for vegetation height in the boreal-alpine ecotone. Model fit assessed by R² values and RMSE nevertheless corresponded well with models constructed for trees only, see e.g., Table 8 and Ref. [18].

3.2.2. Overall Vegetation Height Estimation and Inference

Based on the predicted vegetation height using the ALS and DAP data (see Figure 6C,D), the estimated mean vegetation height across the entire study area was 0.64 m (SE = 0.014 m) for ALS and 0.76 m (SE = 0.018 m) for DAP, respectively, when the prediction models were constructed with weighted sample observations (Table 11). An F-test revealed that the SE for DAP was significantly greater than for ALS (p < 0.001). Further, 95% confidence intervals of the mean vegetation height estimates clearly showed that the height estimate for DAP was significantly greater than for ALS, the difference in estimates being as great as 0.12 cm. This difference was surprisingly large, given that the same ground observations were used to construct the respective prediction models. This issue is further discussed in light of the different properties of the various sub-regions under Case B, see Section 3.2.3 below.

A systematic difference on the order of 0.12 cm can in fact have a rather detrimental effect on change estimates. Ref. [12] estimated the overall change in vegetation height for our study area to be 0.16 m (SE = 0.02 m) for the time period 2006–2012 using ALS data for both points in time. Clearly, small but statistically significant change estimates can easily be confounded with systematic effects associated with choice of remote sensing technology used in the estimation.

It was also revealed that the mean vegetation height estimates were 0.03 to 0.04 m greater for ALS and DAP, respectively, when unweighted sample observations were used. Despite the fact that the differences in estimates using weighted and unweighted sample observations were not significant in the statistical sense, these differences were fairly constant across the entire study area—even for the smaller 1 ha cells of Case A representing a great variation in vegetation properties (Table 11).

The fact that there were significant differences between ALS and DAP estimates of mean vegetation height and between estimates based on weighted and unweighted sample observations, demonstrates that the model-dependent estimators are likely biased. The magnitude of the bias is however unknown, and based on the data and the analysis we can hardly tell which of the estimators that are the least biased. Simulations based on a known population would be needed to assess estimator bias. It is nevertheless reasonable to assume that weighting the sample observations during model construction to obtain more similar distributions of the predictor variables in the population and in the sample will provide models that are more apt for prediction purposes for the study area as a whole. Thus, the results illustrate that the composition of the sample is an important aspect of model-dependent inference, which often seems to be neglected by the remote sensing community.

Further, these results demonstrate that change estimation based on bi-temporal data with a combination of ALS and DAP may be challenging even when field data are available for model calibration at each point in time.

At the spatial scale of the entire study area (200 m × 600 m) the residual variance had a negligible effect on overall MSE estimates of mean vegetation height. The residual variance contributed only 0.85% and 0.96% to the overall MSE for ALS and DAP, respectively (Table 11). This is on the same order of magnitude as for mean change in vegetation height (0.4%) using ALS data in the very same study area [12]. Thus, residual variance can clearly be ignored at this spatial scale without any adverse effect on the statistical inference.

3.2.3. Domain-Specific Vegetation Height Estimation and Inference

The mean vegetation height estimates for the 1 ha cells (Case A) ranged from 0.30 m to 1.36 m for ALS (Table 11, Figure 6E) and from 0.36 m to 1.61 m for DAP (Table 11, Figure 6F) when using the weighted sample observations. The mean height estimates were generally higher for DAP for all cells as noted for the overall study area (Section 3.2.2). Despite varying differences in height estimates between ALS and DAP among cells, the two remotely sensed datasets captured the same trends in differences in vegetation height among the cells, as is evident in Figure 6E,F. This illustrates that for monitoring units of, for example, 1 ha in size, it should be possible to identify gradients in vegetation height with both remote sensing techniques, although the absolute value of the height estimates may suffer from systematic errors due to potential estimator bias, as discussed above. By assessing 95% confidence intervals of the respective cell-wise mean vegetation height estimates, it appeared that most cell estimates were significantly different from the others in pair-wise comparisons. As pointed out by Ref. [12], it should be noted, that when multiple statistical comparisons (tests) are performed simultaneously, one may wish to alter the level of significance to control the total Type I error (Bonferroni approach [62]). Especially for studies of trends in vegetation height over larger areas covering many monitoring units, this may be an important consideration.

The standard error estimates were generally smaller for ALS than for DAP, and the cell-wise estimates of SE also varied less among the cells for ALS than for DAP. The cell-wise SE estimates ranged from 0.014 m to 0.018 m for ALS and from 0.019 m to 0.031 m for DAP (Table 11). At the 1 ha cell level the residual variance component accounted for 4.33% to 13.21% of the overall MSE estimate for ALS and 5.05% to 18.64% for DAP (Table 11). Thus, the contribution of the residual variance component was clearly much greater at 1 ha level than for the study area as whole (12 ha), as one would expect. Further, the contribution was also greater than at the intermediate spatial scale represented by the four sub-regions in Case B, see Figure 6A, which is also reasonable. Further, it should be noted that there did not seem to be a clear relationship between the height of the vegetation and the magnitude of the residual variance component. For ALS, for example, the greatest residual variance components were found for cells #2, #3, and #11 for which the vegetation height estimates were at the low as well as the high ends while the very smallest (cell #5) and very largest (cell # 1) height estimates showed smaller residual variance components. This may suggest that the relative magnitude of the residual variance may not be sensitive to the particular properties of the vegetation in the ecotone.

In an operational context, most spatial units subject to estimation will lack field observations and residuals cannot be estimated. The current results suggest that ignoring the residual variance component will lead to an underestimation of the overall MSE estimate by, say, 10–15% at a 1 ha level. The order of magnitude of this underestimation seems to be similar for ALS and DAP, and is somewhat greater than the ~5% underestimation for change in vegetation height using ALS reported by Ref. [12].

The intention of Case B was to offer an opportunity to explore and understand how vegetation properties affected the performance of estimation based on the ALS and DAP data, respectively. As is evident in Table 11 and Figure 6G,H, the vegetation height estimates followed the same crude trends for ALS and DAP and the estimates were logical in the sense that they reflected the subjective criteria used when delineating the four sub-regions (Figure 6A). Thus, it was rather surprising that the greatest differences in vegetation height estimates between ALS and DAP were found for one of the regions with low vegetation (sub-region I; difference of 0.20 m) and one of the regions with tall vegetation (sub-region IV; difference of 0.19 m). Likewise, the smallest differences in vegetation height estimates were found for one of the regions with low vegetation (sub-region II; difference of 0.06 m) and one of the regions with tall vegetation (sub-region IV; difference of 0.06 m). In fact, for these two latter regions the ALS- and DAP-based estimates were not significantly different in the statistical sense. We could not find any other properties of the sub-regions that could help explaining the differences in the estimates.

The observed patterns of the differences in estimates based on ALS and DAP therefore largely remain unexplained and further studies, preferably with a broader range of image acquisition settings, such as for example image overlap, and different parameter settings for the DAP would probably be useful.

During the analysis under objective #1 it was observed that solitary trees tended to be smoothed out in the DAP data (Section 3.1.1 and Section 3.1.2). Similar findings have recently been reported even for large, scattered forest trees over large areas subject to operational forest inventory based on DAP and the area-based method [63], despite the generally good performance of DAP in denser and tall forests for estimation of a number of biophysical forest variables, including height (e.g., [20]). Along with the unexplained patterns of the height estimates when comparing ALS and DAP, this suggests that there is a critical need for more in-depth and fundamental understanding of the behavior of DAP in studies of trees and forests—from biomass-rich and productive forests in the lowlands to small trees in the boreal-alpine ecotone.

4. Conclusions

As one would expect, the results confirmed that the probability of obtaining positive height values for small individual pioneer trees will increase with increasing point density of the 3D remotely sensed data. Thus, DAP data with 10 times as great point density as the ALS data showed a significantly greater probability of obtaining positive height values for trees. Still, it was surprising and contrary to our expectation, that there was a more pronounced underestimation of tree height with DAP data than with ALS data. This tendency was stronger for pine than the other species, which was counterintuitive, given our expectation that DAP points clouds would capture the properties on the outer surface of a more regular object as represented by a pine tree crown and thus could resemble the surface of the crowns. We noticed that particularly solitary trees tended to be smoothed out. We were unable to explain these results, and general conclusions on the species effect could therefore hardly be drawn. We were unable to identify whether this was a general phenomenon, or a result of the particular DAP algorithm and parameter values adopted in this study. Although not documented in this article, it should be noted that numerous different settings for the image matching were tried in a preliminary phase of the analysis without any substantial impact on the results. It was revealed that tree height was predicted with significantly smaller residual variance when using ALS data, which gives greater promise for estimation of vegetation height with ALS than with DAP data.

The proposed method for estimation of vegetation height of small trees and other vegetation has the potential to produce very precise estimates. The analysis suggested that most of the mean square error estimates (>85%) of the estimators will be accounted for by quantifying the variance attributable to the model parameter uncertainty when the size of the target areas subject to estimation is as small as 1 ha. For larger areas, the model parameter uncertainty will account for an even greater portion of the total uncertainty, as demonstrated for the entire 12 ha study area (>99%). The most precise estimates were found for ALS. Statistically significant differences were found between ALS-based and DAP-based height estimates for certain sub-regions of the study area. Because we were unable to explain the causes of these differences, caution should be exercised regarding interpretation of differences in precision as well as systematic differences between use of ALS and DAP data for vegetation height estimation. Further investigations are needed to understand how DAP data in particular behave for small trees and low vegetation in the boreal-alpine ecotone under different image acquisition and DAP settings. This is of great importance for future operational monitoring of vegetation change in the ecotone since multi-temporal applications often will have to rely on combinations of 3D data from ALS and DAP from different points in time. Even small systematic effects of a particular technology on height estimates may compromise the validity of a monitoring system since change processes encountered in the boreal-alpine ecotone often are subtle and slow.

Author Contributions

Conceptualization, E.N.; methodology, E.N. and T.G.; data collection, E.N. and E.N.R., software, T.G.; validation, E.N. and T.G.; formal analysis, E.N. and T.G.; investigation, E.N.; resources, E.N.; data curation, E.N., M.-C.J.-P. and T.G.; writing—original draft preparation, E.N.; writing—review and editing, E.N., E.N.R., M.-C.J.-P. and T.G.; visualization, T.G.; project administration, E.N.; funding acquisition, E.N. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Research Council of Norway, grant number 281066 “Changing forest area and forest productivity—climatic and human causes, effects, monitoring options, and climate mitigation potential”.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The field data are not publicly available due to privacy of the private landowners. The DAP data are available on request from the corresponding author. The ALS data (3rd party data) are available at https://hoydedata.no/ (accessed on 22 June 2021).

Acknowledgments

The authors wish to acknowledge Marek Pierzchala for participating in the acquisition of the UAV imagery.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

References

Kullman, L. Tree line population monitoring of Pinus sylvestris in the Swedish Scandes, 1973–2005: Implications for tree line theory and climate change ecology. J. Ecol. 2007, 95, 41–52. [Google Scholar] [CrossRef]
Kullman, L. Late Holocene reproductional patterns of Pinus sylvestris and Picea abies at the forest limit in central Sweden. Can. J. Bot. 1986, 64, 1682–1690. [Google Scholar] [CrossRef]
Danby, R.K.; Hik, D.S. Variability, contingency and rapid change in recent subarctic alpine tree line dynamics. J. Ecol. 2007, 95, 352–363. [Google Scholar] [CrossRef]
Gehrig-Fasel, J.; Guisan, A.; Zimmermann, N.E. Tree line shifts in the Swiss Alps: Climate change or land abandonment. J. Veg. Sci. 2007, 18, 571–582. [Google Scholar] [CrossRef]
Tasser, E.; Walde, J.; Tappeiner, U.; Teutsch, A.; Noggler, W. Land-use changes and natural reforestation in the Eastern Central Alps. Agric. Ecosyst. Environ. 2007, 118, 115–129. [Google Scholar] [CrossRef]
Speed, J.D.M.; Austrheim, G.; Hester, A.J.; Mysterud, A. Experimental evidence for herbivore limitation of the treeline. Ecology 2010, 91, 3414–3420. [Google Scholar] [CrossRef]
Bryn, A.; Hemsing, L.Ø. Impacts of land use on the vegetation in three rural landscapes of Norway. Int. J. Biodivers. Sci. Ecosyst. Serv. Manag. 2012, 8, 360–371. [Google Scholar] [CrossRef]
Callaghan, T.V.; Werkman, B.R.; Crawford, R.M.M. The Tundra-Taiga Interface and Its Dynamics: Concepts and Applications. Ambio 2002, 12, 6–14. [Google Scholar]
Næsset, E.; Nelson, R. Using airborne laser scanning to monitor tree migration in the boreal-alpine transition zone. Remote Sens. Environ. 2007, 110, 357–369. [Google Scholar] [CrossRef]
Hauglin, M.; Bollandsås, O.M.; Gobakken, T.; Næsset, E. Monitoring small pioneer trees in the forest-tundra ecotone: Using multi-temporal airborne laser scanning data to model height growth. Environ. Monit. Assess. 2017, 190, 12. [Google Scholar] [CrossRef]
Stumberg, N.; Ørka, H.O.; Bollandsås, O.M.; Gobakken, T.; Næsset, E. Classifying tree and nontree echoes from airborne laser scanning in the forest–tundra ecotone. Can. J. Remote Sens. 2012, 38, 655–666. [Google Scholar] [CrossRef]
Næsset, E.; Gobakken, T.; McRoberts, R.E. A Model-Dependent Method for Monitoring Subtle Changes in Vegetation Height in the Boreal–Alpine Ecotone Using Bi-Temporal, Three Dimensional Point Data from Airborne Laser Scanning. Remote Sens. 2019, 11, 1804. [Google Scholar] [CrossRef] [Green Version]
Næsset, E. Effects of different sensors, flying altitudes, and pulse repetition frequencies on forest canopy metrics and biophysical stand properties derived from small-footprint airborne laser data. Remote Sens. Environ. 2009, 113, 148–159. [Google Scholar] [CrossRef]
Næsset, E. Discrimination between Ground Vegetation and Small Pioneer Trees in the Boreal-Alpine Ecotone Using Intensity Metrics Derived from Airborne Laser Scanner Data. Remote Sens. 2016, 8, 548. [Google Scholar] [CrossRef] [Green Version]
Thieme, N.; Bollandsås, O.M.; Gobakken, T.; Næsset, E. Detection of small single trees in the forest-tundra ecotone using height values from airborne laser scanning. Can. J. Remote Sens. 2011, 37, 264–274. [Google Scholar] [CrossRef]
Stumberg, N.; Bollandsås, O.M.; Gobakken, T.; Næsset, E. Automatic Detection of Small Single Trees in the Forest-Tundra Ecotone Using Airborne Laser Scanning. Remote Sens. 2014, 6, 10152–10170. [Google Scholar] [CrossRef] [Green Version]
Stumberg, N.; Hauglin, M.; Bollandsås, O.; Gobakken, T.; Næsset, E. Improving Classification of Airborne Laser Scanning Echoes in the Forest-Tundra Ecotone Using Geostatistical and Statistical Measures. Remote Sens. 2014, 6, 4582–4599. [Google Scholar] [CrossRef] [Green Version]
Hauglin, M.; Næsset, E. Detection and Segmentation of Small Trees in the Forest-Tundra Ecotone Using Airborne Laser Scanning. Remote Sens. 2016, 8, 407. [Google Scholar] [CrossRef] [Green Version]
Bohlin, J.; Wallerman, J.; Fransson, J.E.S. Forest variable estimation using photogrammetric matching of digital aerial images in combination with a high-resolution DEM. Scand. J. For. Res. 2012, 27, 692–699. [Google Scholar] [CrossRef]
Gobakken, T.; Bollandsås, O.M.; Næsset, E. Comparing biophysical forest characteristics estimated from photogrammetric matching of aerial images and airborne laser scanning data. Scand. J. For. Res. 2015, 30, 73–86. [Google Scholar] [CrossRef]
Noordermeer, L.; Bollandsås, O.M.; Ørka, H.O.; Næsset, E.; Gobakken, T. Comparing the accuracies of forest attributes predicted from airborne laser scanning and digital aerial photogrammetry in operational forest inventories. Remote Sens. Environ. 2019, 226, 26–37. [Google Scholar] [CrossRef]
Kangas, A.; Gobakken, T.; Puliti, S.; Hauglin, M.; Næsset, E. Value of airborne laser scanning and digital aerial photogrammetry data in forest decision making. Silva Fenn. 2018, 52, 9923. [Google Scholar] [CrossRef] [Green Version]
Puliti, S.; Gobakken, T.; Ørka, H.O.; Næsset, E. Assessing 3D point clouds from aerial photographs for species-specific forest inventories. Scand. J. For. Res. 2017, 32, 68–79. [Google Scholar] [CrossRef]
Kachamba, D.; Ørka, H.; Gobakken, T.; Eid, T.; Mwase, W. Biomass Estimation Using 3D Data from Unmanned Aerial Vehicle Imagery in a Tropical Woodland. Remote Sens. 2016, 8, 968. [Google Scholar] [CrossRef] [Green Version]
Puliti, S.; Solberg, S.; Granhus, A. Use of UAV Photogrammetric Data for Estimation of Biophysical Properties in Forest Stands Under Regeneration. Remote Sens. 2019, 11, 233. [Google Scholar] [CrossRef] [Green Version]
Hartley, R.J.L.; Leonardo, E.M.; Massam, P.; Watt, M.S.; Estarija, H.J.; Wright, L.; Melia, N.; Pearse, G.D. An Assessment of High-Density UAV Point Clouds for the Measurement of Young Forestry Trials. Remote Sens. 2020, 12, 4039. [Google Scholar] [CrossRef]
Cottam, G.; Curtis, J.T. The Use of Distance Measures in Phytosociological Sampling. Ecology 1956, 37, 451–460. [Google Scholar] [CrossRef]
Warde, W.; Petranka, J.W. A Correction Factor Table for Missing Point-Center Quarter Data. Ecology 1981, 62, 491–494. [Google Scholar] [CrossRef]
Næsset, E. Vertical Height Errors in Digital Terrain Models Derived from Airborne Laser Scanner Data in a Boreal-Alpine Ecotone in Norway. Remote Sens. 2015, 7, 4702–4725. [Google Scholar] [CrossRef] [Green Version]
Axelsson, P. DEM generation from laser scanner data using adaptive TIN models. Int. Arch. Photogramm. Remote Sens. 2000, 33, 111–118. [Google Scholar]
Soininen, A. TerraScan User‘s Guide. Available online: https://www.terrasolid.com/download/tscan.pdf (accessed on 21 March 2017).
Harter, H.L. Order Statistics and Their Use in Testing and Estimation; US Government Printing Office: Washington, DC, USA, 1970.
senseFly. eBee—Extended User Manual; senseFly Ltd: Cheseaux-Lausanne, Switzerland, 2014. [Google Scholar]
Agisoft LLC. User Manual: Professional Edition, Version 1.4. 121 p. Available online: https://www.agisoft.com/pdf/photoscan-pro_1_4_en.pdf (accessed on 24 September 2020).
Agisoft LLC. Tutorial (Beginner level): Orthophoto and DEM Generation with Agisoft PhotoScan Pro 1.3 (with Ground Control Points). Available online: https://www.agisoft.com/pdf/PS_1.3%20-Tutorial%20(BL)%20-%20Orthophoto,%20DEM%20(GCPs).pdf (accessed on 6 February 2015).
Westoby, M.J.; Brasington, J.; Glasser, N.F.; Hambrey, M.J.; Reynolds, J.M. ‘Structure-from-Motion’ photogrammetry: A low-cost, effective tool for geoscience applications. Geomorphology 2012, 179, 300–314. [Google Scholar] [CrossRef] [Green Version]
SAS. SAS OnlineDoc®, Version 9.2; SAS Institute Inc.: Cary, NC, USA, 2007. [Google Scholar]
Weisberg, S. Applied Linear Regression, 2nd ed.; Wiley: New York, NY, USA, 1985; p. 324. [Google Scholar]
Levene, H. Robust Tests for the Equality of Variance. In Contributions to Probability and Statistics; Olkin, I., Ed.; Stanford University Press: Palo Alto, CA, USA, 1960; pp. 278–292. [Google Scholar]
Lohr, S.L. Sampling: Design and Analysis, 2nd ed.; Brooks/Cole: Boston, MA, USA, 2010. [Google Scholar]
Nedyalkova, D.; Tillé, Y. Bias-robustness and efficiency of model-based inference in survey sampling. Stat. Sin. 2012, 22, 777–794. [Google Scholar] [CrossRef] [Green Version]
Brewer, K.R.W. Design-Based or Prediction-Based Inference? Stratified Random vs Stratified Balanced Sampling. Int. Stat. Rev. 1999, 67, 35–47. [Google Scholar] [CrossRef]
Esteban, J.; McRoberts, R.E.; Fernández-Landa, A.; Tomé, J.L.; Nӕsset, E. Estimating Forest Volume and Biomass and Their Changes Using Random Forests and Remotely Sensed Data. Remote Sens. 2019, 11, 1944. [Google Scholar] [CrossRef] [Green Version]
R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2020. [Google Scholar]
Farrar, T.J. Skedastic: Heteroskedasticity Diagnostics for Linear Regression Models. R Package Version 1.0.0.; University of the Western Cape: Bellville, South Africa, 2020. [Google Scholar]
Long, J.S.; Ervin, L.H. Using Heteroscedasticity Consistent Standard Errors in the Linear Regression Model. Am. Stat. 2000, 54, 217–224. [Google Scholar] [CrossRef]
MacKinnon, J.G.; White, H. Some heteroskedasticity-consistent covariance matrix estimators with improved finite sample properties. J. Econ. 1985, 29, 305–325. [Google Scholar] [CrossRef] [Green Version]
Zeileis, A. Econometric Computing with HC and HAC Covariance Matrix Estimators. J. Stat. Softw. 2004, 11, 1–17. [Google Scholar] [CrossRef]
Zeileis, A.; Köll, S.; Graham, N. Various Versatile Variances: An Object-Oriented Implementation of Clustered Covariances in R. J. Stat. Softw. 2020, 95, 1–36. [Google Scholar] [CrossRef]
McRoberts, R.E.; Næsset, E.; Gobakken, T. Inference for lidar-assisted estimation of forest growing stock volume. Remote Sens. Environ. 2013, 128, 268–275. [Google Scholar] [CrossRef]
McRoberts, R.E.; Næsset, E.; Gobakken, T.; Chirici, G.; Condés, S.; Hou, Z.; Saarela, S.; Chen, Q.; Ståhl, G.; Walters, B.F. Assessing components of the model-based mean square error estimator for remote sensing assisted forest applications. Can. J. For. Res. 2018, 48, 642–649. [Google Scholar] [CrossRef]
Breidenbach, J.; McRoberts, R.E.; Astrup, R. Empirical coverage of model-based variance estimators for remote sensing assisted estimation of stand-level timber volume. Remote Sens. Environ. 2016, 173, 274–281. [Google Scholar] [CrossRef] [Green Version]
Saarela, S.; Grafström, A.; Ståhl, G.; Kangas, A.; Holopainen, M.; Tuominen, S.; Nordkvist, K.; Hyyppä, J. Model-assisted estimation of growing stock volume using different combinations of LiDAR and Landsat data as auxiliary information. Remote Sens. Environ. 2015, 158, 431–440. [Google Scholar] [CrossRef]
Ståhl, G.; Saarela, S.; Schnell, S.; Holm, S.; Breidenbach, J.; Healey, S.P.; Patterson, P.L.; Magnussen, S.; Næsset, E.; McRoberts, R.E.; et al. Use of models in large-area forest surveys: Comparing model-assisted, model-based and hybrid estimation. For. Ecosyst. 2016, 3, 1. [Google Scholar] [CrossRef] [Green Version]
Mandallaz, D. A Unified Approach to Sampling Theory for Forest Inventory Based on Infinite Population and Superpopulation Models. Ph. D. Thesis, ETH Zürich, Zürich, Switzerland, 1991. [Google Scholar]
Kangas, A. Small-area estimates using model-based methods. Can. J. For. Res. 1996, 26, 758–766. [Google Scholar] [CrossRef]
Ståhl, G.; Holm, S.; Gregoire, T.; Gobakken, T.; Næsset, E.; Nelson, R. Model-based inference for biomass estimation in a LiDAR sample survey in the county of Hedmark County, Norway. Can. J. For. Res. 2011, 41, 96–107. [Google Scholar] [CrossRef] [Green Version]
McRoberts, R.E. A model-based approach to estimating forest area. Remote Sens. Environ. 2006, 103, 56–66. [Google Scholar] [CrossRef]
Strîmbu, V.F.; Ene, L.T.; Gobakken, T.; Gregoire, T.G.; Astrup, R.; Næsset, E. Post-stratified change estimation for large-area forest biomass using repeated ALS strip sampling. Can. J. For. Res. 2017, 47, 839–847. [Google Scholar] [CrossRef] [Green Version]
Bollandsås, O.M.; Ene, L.T.; Gobakken, T.; Næsset, E. Estimation of biomass change in montane forests in Norway along a 1200 km latitudinal gradient using airborne laser scanning: A comparison of direct and indirect prediction of change under a model-based inferential approach. Scand. J. For. Res. 2018, 33, 155–165. [Google Scholar] [CrossRef]
Hosmer, D.W.; Lemeshow, S. Applied Logistic Regression; John Wiley and Sons: New York, NY, USA, 2000. [Google Scholar]
Miller, R.G. Simultaneous Statistical Inference, 2nd ed.; Springer: New York, NY, USA, 1981; p. 299. [Google Scholar]
Groesz, F.J.; Blom Norway AS, Lysaker, Norway. Personal communication, 2021.

Figure 2. Tree measurements during field work in 2006 (A,B) and measurements of ground reference points in 2010 (C,D,E). (A) Spruce tree appearing in a group of trees. (B) Birch tree part of a group of trees forming tall scrubby vegetation. (C) Ground reference point with vegetation height > 0.20 m and “green vegetation”. (D) Ground reference point with vegetation height 0.10–0.20 m and “green vegetation”. (E) Ground reference point with vegetation height < 0.10 m and “rock/bare” surface.

Figure 3. The sequence of analysis steps undertaken to address objective #1.

Figure 4. The sequence of analysis steps undertaken to address objective #2.

Figure 5. Distributions of h_max in the population (N = 79,242) and the sample with and without weighting of the sample observations (n = 716) for the two different 3D remotely sensed datasets.

Figure 6. (A) Orthophoto of study area with manual delineation of the four sub-regions of Case B; (B) Map with 5 m counter lines, the 12 tessellated cells of Case A (Cell 1–12), and boundaries of the sub-regions for Case B (I–IV); (C) Predictions of vegetation height for the 1.5 m² population elements using ALS data; (D) Predictions of vegetation height for the 1.5 m² population elements using DAP data; (E) Estimated mean vegetation height using ALS data for tessellated cells of Case A; (F) Estimated mean vegetation height using DAP data for tessellated cells of Case A; (G) Estimated mean vegetation height using ALS for sub-regions of Case B; (H) Estimated mean vegetation height using DAP for sub-regions of Case B. The vegetation height estimates for each domain in Figures (E–H) are given by numbers.

Figure 7. Autocorrelation of observed residuals in models for vegetation height (Equation (5)) based on the ALS (top) and DAP (bottom) datasets across the entire study area (solid lines) and predicted autocorrelation (dashed lines) according to models in Equation (13).

Table 1. Summary of field measurements of 532 trees recorded in 2017.

Tree Species	Characteristic	n	Range	Mean
Norway spruce	Tree height (m)	236	0.05–6.60	1.46
Norway spruce	Crown area ^a (m²)	236	0.0003–23.378	2.507
Scots pine	Tree height (m)	90	0.05–3.00	0.53
Scots pine	Crown area ^a (m²)	90	0.0007–2.114	0.241
Mountain birch	Tree height (m)	206	0.11–4.00	1.51
Mountain birch	Crown area ^a (m²)	206	0.0012–9.726	1.756
All trees	Tree height (m)	532	0.05–6.60	1.32
All trees	Crown area ^a (m²)	532	0.0003–23.378	1.83

^a Crown area calculated as the area of an ellipse with the perpendicular crown diameter measurements as axes.

Table 2. Frequency distribution of 365 ground reference points recorded in 2010.

	Terrain Surface
Vegetation Height	Rock/Bare	Lichen/Heather	Green
0–0.10 m	17	37	251
0.10–0.20 m	0	0	60

Table 3. Processing steps with corresponding parameters in Agisoft PhotoScan Professional software for the generation of 3D point cloud from UAS imagery.

Task	Parameter
Alignment	Accuracy: high ^a
	Generic preselection: yes ^a
	Reference preselection: yes ^a
	Key point limit: 40,000 ^a
	Tie point limit: 4000 ^a
	Adaptative camera model fitting: yes ^a
Guided marker positioning	Number of GCPs: 10
Depth maps and dense point cloud	Quality: medium ^b
	Depth filtering: mild ^b

^a Parameters suggested in PhotoScan online tutorial [35]. ^b Parameters chosen using a trial and error approach.

Table 4. Distribution of trees by 3D remotely sensed dataset for different tree species and tree height classes according to three categories of h_max ^a for the tree polygons (n = 530). Percent in brackets.

Tree Species	Height (m)	Total Number of Trees	Number of Trees with h_max > 0	Number of Trees with h_max = 0	Number of Trees with h_max Missing
ALS:
Norway spruce	0–1	115	50 (43)	2 (2)	63 (55)
	1–2	54	54 (100)	0 (0)	0 (0)
	2–3	26	26 (100)	0 (0)	0 (0)
	>3	39	39 (100)	0 (0)	0 (0)
Scots pine	0–1	77	25 (32)	6 (8)	46 (60)
	1–2	11	11 (100)	0 (0)	0 (0)
	2–3	1	1 (100)	0 (0)	0 (0)
	>3	1	1 (100)	0 (0)	0 (0)
Birch	0–1	63	36 (57)	3 (5)	24 (38)
	1–2	85	83 (98)	1 (1)	1 (1)
	2–3	48	48 (100	0 (0)	0 (0)
	>3	10	10 (100)	0 (0)	0 (0)
DAP:
Norway spruce	0–1	115	61 (53)	20 (17)	34 (30)
	1–2	54	54 (100)	0 (0)	0 (0)
	2–3	26	26 (100)	0 (0)	0 (0)
	>3	39	39 (100)	0 (0)	0 (0)
Scots pine	0–1	77	16 (21)	40 (52)	21 (27)
	1–2	11	8 (73)	3 (27)	0 (0)
	2–3	1	1 (100)	0 (0)	0 (0)
	>3	1	1 (100)	0 (0)	0 (0)
Birch	0–1	63	39 (62)	16 (25)	8 (13)
	1–2	85	85 (100)	0 (0)	0 (0)
	2–3	48	47 (98)	1 (2)	0 (0)
	>3	10	10 (100)	0 (0)	0 (0)

^a When the dataset is ALS or DAP, h_max is h_ALSmax or h_DAPmax, respectively.

Table 5. Mean of tree height measured in field, differences between h_max ^a in 3D remotely sensed datasets and tree height measured in field, and standard deviation for differences (Stdev) for different 3D remotely sensed datasets, different tree species and tree height classes (n = 389).

Tree Species	Height (m)	n	Observed Mean (m)	Difference (m)
				ALS		DAP
				Mean	Stdev	Mean	Stdev
Norway	0–1	49	0.61	−0.39	0.20	−0.37	0.21
spruce	1–2	54	1.45	−0.40	0.45	−0.73	0.33
	2–3	26	2.44	−0.53	0.39	−1.07	0.34
	>3	39	4.12	−0.62	0.38	−1.53	0.61
Scots	0–1	29	0.62	−0.45	0.30	−0.57	0.22
pine	1–2	11	1.32	−0.68	0.29	−1.17	0.24
	2–3	1	2.41	−0.58	-	−1.70	-
	>3	1	3.00	−0.67	-	−2.63	-
Birch	0–1	37	0.73	−0.55	0.24	−0.63	0.18
	1–2	84	1.45	−0.48	0.44	−0.96	0.33
	2–3	48	2.39	−0.47	0.37	−1.23	0.50
	>3	10	3.38	−0.24	0.41	−1.09	0.70
All		389	1.72	−0.48	0.37	−0.91	0.51

^a When the dataset is ALS or DAP, h_max is h_ALSmax or h_DAPmax, respectively.

Table 6. The first four moments of the distributions of h_max in the population (N = 79,242) and in the sample (n = 716) for the two different 3D remotely sensed datasets.

Dataset	Moments ^a
Dataset	First	Second	Third	Fourth
ALS:
Population	0.43	0.90	3.22	11.84
Sample	0.70	1.02	1.84	3.21
Sample, weighted	0.42	0.83	2.85	8.50
DAP:
Population	0.34	0.74	3.51	15.23
Sample	0.45	0.75	2.25	5.14
Sample, weighted	0.33	0.66	2.89	8.76

^a First: mean; second: variance; third: skewness; fourth: kurtosis.

Table 7. Estimation results for logistic regression models shown in Equations (1) and (2) (n = 2 × 530).

Coefficient	Estimate	Wald Chi-Square	p-Value
Model in Equation (1):
Intercept	–4.72	49.95	<0.001
DATA_DAP	0.08	0.12	0.733
SP_pine	−1.25	18.47	<0.001
SP_birch	−0.57	3.06	0.080
H	2.64	20.04	<0.001
A	5.24	26.95	<0.001
Model fit ^a		17.62	0.014
Wald chi-square tests:
SP_pine vs. SP_birch		4.35	0.037
Model in Equation (2):
Intercept	–2.53	40.08	<0.001
DATA_DAP	1.27	6.82	0.009
SP_pine	−0.65	1.69	0.194
SP_birch	−0.36	0.33	0.565
${SP}_{pine} \cdot {DATA}_{DAP}$	−1.31	4.03	0.045
${SP}_{birch} \cdot {DATA}_{DAP}$	−0.25	0.11	0.740
h	0.64	0.38	0.540
h · DATA_DAP	2.51	3.77	0.052
A	20.29	32.95	<0.001
A · DATA_DAP	−18.03	24.44	<0.001
Model fit ^b		7.54	0.274
Wald chi-square tests:
SP_pine vs. SP_birch		0.22	0.637
SP_pine · DATA_DAP vs. SP_birch · DATA_DAP		1.96	0.162

^a Hosmer–Lemeshow statistic with 7 degrees of freedom [61]. ^b Hosmer–Lemeshow statistic with 6 degrees of freedom [61].

Table 8. Estimation results for regression models shown in Equations (3) (n = 2 × 389) and Equation (4) (n = 389).

	Model According to Equation (3)		Estimates for Models According to Equation (4)
Coefficient	Estimate	p-Value	ALS	DAP
Intercept	0.67	<0.001	0.60 ***	0.73 ***
DATA_DAP	−0.35	<0.001
SP_pine	−0.31	<0.001	−0.26 *	−0.64 ***
SP_birch	−0.01	0.893	−0.05 ns	−0.09 ns
h_max	0.82	<0.001	0.91 ***	1.13 ***
h_max · DATA_DAP	0.41	<0.001
h_max · SP_pine	0.38	<0.001	0.24 *	0.79 ***
h_max · SP_birch	0.04	0.156	0.02 ns	0.15 **
Model fit:
R²	0.91		0.90	0.83
RMSE (m)	0.34		0.36	0.46
F-tests:
SP_pine vs. SP_birch		<0.001
h_max · SP_pine vs. h_max · SP_birch		<0.001

Level of significance: ns: not significant (p > 0.05); *: p < 0.05; **: p < 0.01; ***: p < 0.001.

Table 9. Results of leave-one-out cross validation of the models in Equation (4) (Table 7) based on differences between predicted and observed tree height for the different 3D remotely sensed datasets, tree species, and tree height classes (n = 389).

Tree Species	Height (m)	n	Difference (m)
Tree Species	Height (m)	n	Mean	Stdev	RMSE
ALS:
Norway spruce	0–1	49	0.19 ***	0.20	0.28
	1–2	54	0.11 ns	0.41	0.42
	2–3	26	−0.10 ns	0.36	0.37
	>3	39	−0.33 ***	0.38	0.50
Scots pine	0–1	29	0.02 ns	0.27	0.26
	1–2	11	−0.08 ns	0.30	0.29
	2–3	1	0.23	-	0.23
	>3	1	0.30	-	0.30
Birch	0–1	37	<0.01 ns	0.23	0.23
	1–2	84	0.01 ns	0.40	0.40
	2–3	48	−0.04 ns	0.34	0.34
	>3	10	0.10 ns	0.39	0.38
All		389	<0.01 ns	0.37	0.37
DAP:
Norway spruce	0–1	49	0.39 ***	0.22	0.44
	1–2	54	0.09 ns	0.37	0.37
	2–3	26	−0.17 *	0.39	0.42
	>3	39	−0.49 ***	0.64	0.80
Scots pine	0–1	29	<0.01 ns	0.11	0.11
	1–2	11	−0.03 ns	0.12	0.12
	2–3	1	0.48	-	0.48
	>3	1	−0.19	-	0.19
Birch	0–1	37	0.13 ***	0.17	0.21
	1–2	84	−0.04 ns	0.36	0.36
	2–3	48	−0.09 ns	0.59	0.59
	>3	10	0.36 ns	0.86	0.89
All		389	<0.01 ns	0.47	0.47

Level of significance: ns: not significant (p >0.05); *: p < 0.05; ***: p < 0.001.

Table 10. Estimation results for regression models for vegetation height shown in Equation (5). The models were used in the estimation of vegetation height (n = 716).

Coefficient	Estimates
	Weighted Sample Observations				Unweighted Sample Observations
	ALS		DAP		ALS		DAP
Intercept	0.17 ***		0.28 ***		0.20 ***		0.32 ***
h_max	1.10 ***		1.43 ***		1.08 ***		1.41 ***
Model fit:
R²	0.88		0.80		0.91		0.82
RMSE (m)	0.36		0.49		0.36		0.49
Heteroscedasticity-consistent variance-covariance matrix of parameter estimates:
	Intercept	h_max	Intercept	h_max	Intercept	h_max	Intercept	h_max
Intercept	0.00026	−0.00012	0. 00037	−0.00032	0.00025	−0.00015	0.00042	−0.00039
h_max	−0.00012	0.00026	−0.00032	0.00129	−0.00015	0.00027	−0.00039	0.00132

Level of significance: ***: p < 0.001.

Table 11. Estimates of mean vegetation height (

\bar{\hat{h}}

; Equation (6)) for various domains of the study area and corresponding standard error estimates ^a (SE), and contribution of the residual variance component (Equation (11)) to overall mean square error estimates of mean vegetation height (var(

\bar{\hat{h}})_{res}

).

Table 11. Estimates of mean vegetation height (

\bar{\hat{h}}

; Equation (6)) for various domains of the study area and corresponding standard error estimates ^a (SE), and contribution of the residual variance component (Equation (11)) to overall mean square error estimates of mean vegetation height (var(

\bar{\hat{h}})_{res}

).

	ALS				DAP
Domain	$\bar{\hat{h}} (m)$	$\bar{\hat{h}} (m)^{b}$	SE (m)	$var (\bar{\hat{h}})_{res} (%)$	$\bar{\hat{h}} (m)$	$\bar{\hat{h}} (m)^{b}$	SE (m)	$var (\bar{\hat{h}})_{res} (%)$
Study Area	0.64	0.67	0.014	0.85	0.76	0.80	0.018	0.96
Case A:
Cell 1	1.36	1.38	0.018	8.44	1.61	1.64	0.031	5.26
Cell 2	0.81	0.84	0.015	11.07	1.02	1.05	0.021	9.37
Cell 3	0.78	0.81	0.015	13.21	0.86	0.90	0.019	11.12
Cell 4	0.91	0.94	0.015	8.87	0.92	0.96	0.020	12.59
Cell 5	0.30	0.33	0.015	5.37	0.36	0.40	0.019	5.61
Cell 6	0.89	0.91	0.015	12.99	0.96	0.99	0.021	18.64
Cell 7	0.32	0.35	0.015	5.15	0.40	0.44	0.019	5.05
Cell 8	0.47	0.51	0.014	4.98	0.52	0.56	0.018	7.36
Cell 9	0.57	0.60	0.014	7.42	0.68	0.72	0.018	8.95
Cell 10	0.41	0.44	0.014	4.33	0.52	0.56	0.018	6.34
Cell 11	0.47	0.50	0.015	11.68	0.70	0.74	0.018	9.21
Cell 12	0.39	0.42	0.015	6.87	0.59	0.63	0.018	8.78
Case B:
Sub-region I	0.54	0.57	0.014	3.37	0.74	0.78	0.018	3.27
Sub-region II	0.29	0.32	0.015	1.29	0.35	0.40	0.019	1.37
Sub-region III	0.92	0.95	0.014	3.77	0.98	1.02	0.020	5.93
Sub-region IV	1.13	1.15	0.016	5.09	1.32	1.35	0.025	3.59

^a Standard error based on the sum of the variance due to parameter uncertainty (Equation (8)) and residual variance (Equation (11)). ^b Point estimates based on models constructed from unweighted sample observations.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Næsset, E.; Gobakken, T.; Jutras-Perreault, M.-C.; Ramtvedt, E.N. Comparing 3D Point Cloud Data from Laser Scanning and Digital Aerial Photogrammetry for Height Estimation of Small Trees and Other Vegetation in a Boreal–Alpine Ecotone. Remote Sens. 2021, 13, 2469. https://doi.org/10.3390/rs13132469

AMA Style

Næsset E, Gobakken T, Jutras-Perreault M-C, Ramtvedt EN. Comparing 3D Point Cloud Data from Laser Scanning and Digital Aerial Photogrammetry for Height Estimation of Small Trees and Other Vegetation in a Boreal–Alpine Ecotone. Remote Sensing. 2021; 13(13):2469. https://doi.org/10.3390/rs13132469

Chicago/Turabian Style

Næsset, Erik, Terje Gobakken, Marie-Claude Jutras-Perreault, and Eirik Næsset Ramtvedt. 2021. "Comparing 3D Point Cloud Data from Laser Scanning and Digital Aerial Photogrammetry for Height Estimation of Small Trees and Other Vegetation in a Boreal–Alpine Ecotone" Remote Sensing 13, no. 13: 2469. https://doi.org/10.3390/rs13132469

APA Style

Næsset, E., Gobakken, T., Jutras-Perreault, M.-C., & Ramtvedt, E. N. (2021). Comparing 3D Point Cloud Data from Laser Scanning and Digital Aerial Photogrammetry for Height Estimation of Small Trees and Other Vegetation in a Boreal–Alpine Ecotone. Remote Sensing, 13(13), 2469. https://doi.org/10.3390/rs13132469

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Comparing 3D Point Cloud Data from Laser Scanning and Digital Aerial Photogrammetry for Height Estimation of Small Trees and Other Vegetation in a Boreal–Alpine Ecotone

Abstract

1. Introduction

2. Materials and Methods

2.1. Study Area

2.2. Field Measurements

2.2.1. Overview

2.2.2. Individual Tree Data

2.2.3. Ground Reference Points

2.3. Laser Scanner Data

2.4. Unmanned Aerial Systems Image Data

2.5. Extracion of ALS Data and DAP Data for Trees, Ground Reference Points, and Population Elements

2.5.1. Trees

2.5.2. Ground Reference Points

2.5.3. Population Elements

2.6. Analysis—Objective #1

2.7. Analysis—Objective #2

2.7.1. Models for Vegetation Height

2.7.2. Estimator for Vegetation Height

2.7.3. Estimator of Mean Square Error

2.7.4. Calculations

3. Results and Discussion

3.1. Performance of Tree Height Prediction with ALS and DAP Data (Objective #1)

3.1.1. 3D Data Sensitivity to Small Trees

3.1.2. Influence of Tree Size and Tree Species on Tree Height Modeling

3.2. Estimation of Vegetation Height with ALS and DAP Data (Objective #2)

3.2.1. Model Construction

3.2.2. Overall Vegetation Height Estimation and Inference

3.2.3. Domain-Specific Vegetation Height Estimation and Inference

4. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI