Prediction of Species-Specific Volume Using Different Inventory Approaches by Fusing Airborne Laser Scanning and Hyperspectral Data

Fusion of airborne laser scanning (ALS) and hyperspectral data can offer a powerful basis for the discrimination of tree species and enables an accurate prediction of species-specific attributes. In this study, fused ALS data and hyperspectral images were used to model and predict the total and species-specific volumes based on three forest inventory approaches, namely the individual tree crown (ITC) approach, the semi-ITC approach, and the area-based approach (ABA). The performances of these inventory approaches were analyzed and compared at the plot level in a complex Alpine forest in Italy. For the ITC and semi-ITC approaches, an ITC delineation algorithm was applied. With the ITC approach, the species-specific volumes were predicted with allometric models for each crown segment and aggregated to the total volume. For the semi-ITC and ABA, a multivariate k-most similar neighbor method was applied to simultaneously predict the total and species-specific volumes using leave-one-out cross-validation at the plot level. In both methods, the ALS and hyperspectral variables were important for volume modeling. The total volume of the ITC, semi-ITC, and ABA resulted in relative root mean square errors (RMSEs) of 25.31%, 17.41%, and 30.95% of the mean and systematic errors (mean differences) of 21.59%, −0.27%, and −2.69% of the mean, respectively. The ITC approach achieved high accuracies but large systematic errors for minority species. For majority species, the semi-ITC performed slightly better than the ABA, resulting in higher accuracies and smaller systematic errors. The results indicated that the semi-ITC outperformed the two other inventory approaches. To conclude, we suggest that the semi-ITC method be further tested and assessed with attention to its potential in operational forestry applications, especially in cases for which accurate species-specific forest biophysical attributes are needed.


Introduction
Stem volume is one of the most relevant resource attributes of forest inventories. In most European countries, volume is usually estimated or modeled based on field measurements [1]. In the Nordic countries, conventional forest inventories at various geographical scales have over the past few decades been enhanced by using remotely sensed data such as airborne laser scanning (ALS) and stereo aerial photography [2,3]. It has been shown that for local management planning, data from ALS may reduce the overall costs by reducing the economic losses caused by incorrect decisions due to erroneous data [4]. One of the biggest challenges in remote sensing-assisted inventories is the discrimination among tree species [5]. However, combining ALS data with hyperspectral images may improve the accuracy of species identification [6]. The tree species information is needed for many forest applications, especially to retrieve species-specific forest biophysical properties, such as volume and diameter at breast height (DBH), to derive biodiversity indicators, and to plan silvicultural activities and cutting regimes.
Due to accurate three-dimensional measurements obtained by ALS [7,8], the ALS technology has become one of the most valuable remote sensing methods for providing forest information. By using ALS data, biophysical attributes such as volume, height, DBH, crown area, and stem density can be derived or modeled with high accuracies [2,9–12]. However, the estimation of biophysical attributes can be challenging in dense, multispecies, heterogeneous forest stands [13,14]. In the past, ALS data were examined to classify the tree species at the individual tree crown (ITC) level using shape, structure, and intensity features of the tree crowns [5,15]. In addition, when species-specific biophysical attributes are required, ALS data are often fused with optical images, i.e., color-infrared [16–18], multispectral [19,20], or hyperspectral [21,22] images, in order to improve the tree species classification [6,23,24]. Recently, multi- or hyperspectral ALS sensors were developed with a high potential to be used as a future single-sensor solution for forest mapping [25–27]. Currently, remotely sensed hyperspectral data are the most promising data source for classifying tree species due to their ability to detect subtle variations in the chemical and structural properties of the tree canopy. Likewise, other studies demonstrated the success of identifying tree species in other ecosystems [28,29]. Thus, the remote sensing technologies that currently are considered to have the greatest potential to improve forest attribute characterization are ALS, describing the 3D forest structure, and hyperspectral imagery, outperforming other techniques for the identification of tree species [30]. The fusion of both has the potential to improve the forest attribute prediction accuracy [31].
Most local forest management inventories assisted by ALS data follow the area-based approach (ABA). For field sample plots distributed across the area of interest, several variables describing structural properties are extracted from the ALS echoes, and statistical relationships between these variables and field-measured biophysical attributes are constructed. The relationships, typically in the form of regression models, are then used to predict biophysical attributes for larger areas [9,32]. As an alternative, individual trees can be identified and the forest resource variables can be estimated as an aggregate of individual tree properties. To obtain tree attributes, the ITC approach was introduced [33]. With the ITC approach, crown segments are detected and delineated by applying segmentation algorithms [34,35]. The crown segments, often referred to as ITCs, can contain none, one, or several trees. However, this approach presumes that one crown segment contains exactly one tree. For each crown segment, various ALS-derived structural variables are extracted, such as height and crown area. Based on these variables, the biophysical attributes, such as volume and DBH, are predicted for each segment and can be aggregated to other scales, e.g., a forest stand. Detection errors, i.e., omission error (failure to detect a tree or segmenting multiple trees into a single crown segment) and commission error (detecting an object that is not a tree or splitting a single tree into multiple crown segments), usually lead to underprediction of the forest biophysical attributes [36]. To overcome this problem, the semi-ITC approach has been proposed [37], which, in contrast to the ITC approach, allows a crown segment to contain multiple trees.
Several studies in Finland aimed to estimate species-specific forest volume using the combination of ALS data and aerial imagery based on ABA for three species groups: Norway spruce, Scots pine, and deciduous [16–18,38]. All these studies used a non-parametric k-most similar neighbor (kMSN) approach [39] based on Packalén and Maltamo [16]. Packalén and Maltamo [16] compared two approaches for determining species-specific volumes at the plot level. The first approach predicted the total volume using ALS data, upon which the species-specific volume was obtained by multiplying the total volume by the proportion of each tree species obtained from aerial photographs and fuzzy classification. The second approach used the kMSN method for simultaneous prediction of volumes by tree species. The kMSN method resulted in more accurate species-specific volume estimates, except for the total volume where the fuzzy classification performed slightly better. Packalén and Maltamo [17] extended the kMSN approach from the previous study [16] to a stand level and tested the simultaneous prediction of volume, stem number, basal area, basal area median diameter, and tree height for the same tree species groups. The attributes of coniferous tree species were predicted accurately and those of deciduous trees less accurately, since they were minority species. Vauhkonen et al. [38] tested the performance of alpha shape metrics calculated from ALS data combined with image variables for species-specific volume predictions at the plot level. They demonstrated that using only ALS variables resulted in less accurate estimates compared to using combined ALS and image variables. Niska et al. [18] compared the kMSN method with three artificial neural network modeling methods: the multilayer perceptron (MLP), support vector regression (SVR), and self-organizing map (SOM) at the plot and stand level. The results revealed that the SVR and MLP models reached the greatest prediction accuracy, the kMSN a lower accuracy, and the SOM the lowest prediction accuracy. It should be noted that the SVR and MLP yielded negative estimates to some extent, whereas the kMSN and SOM estimates were always within the range of the modeling data.
In Norway, Breidenbach et al. [37] determined species-specific volumes for four dominant tree species (spruce, pine, birch, and aspen) using the ABA and semi-ITC approaches, combining the ALS and multispectral data. They applied the kNN approach using MSN and random forest (RF) methods for calculating statistical distances between the neighbors. The volumes predicted with the semi-ITC approach resulted in smaller RMSE compared with the ABA results. Ørka et al. [23] evaluated tree species composition in a Norwegian forest using (1) ALS data, (2) multispectral imagery, (3) hyperspectral imagery, (4) fused ALS and multispectral data, and (5) fused ALS and hyperspectral data. Subsequently, they predicted the basal area for spruce, pine, and deciduous trees for three inventory approaches: ABA, semi-ITC, and ITC. For the ABA and semi-ITC approach, the kNN algorithm with a Euclidean distance matrix was applied. The results suggested that the greatest species accuracy was achieved by fusing the ALS and hyperspectral data. The ITC approach resulted in the highest accuracy for deciduous species, while the ABA performed the best for the other tree species.
As mentioned above, different remote sensing-assisted forest inventory approaches have been suggested to assess species-specific forest attributes. Each of them has its own pros and cons. The advantage of the ABA is that it provides predictions with small systematic errors, i.e., mean differences, but at the same time only the attributes of the main tree species can be predicted with high accuracy, while for the minority species the errors were large and consequently resulted in inaccurate species-specific attributes [23]. The ITC-based approaches are suitable for developing species-specific models which could lead to more accurate stand-level estimations, particularly for mixed stands. The main drawback of the ITC procedure is the underprediction caused by problems in detecting suppressed and understory trees, especially in stands with a complex forest structure. We expect that a solution for alleviating systematic errors lies in the semi-ITC approach, which has not been sufficiently explored and tested in various forest conditions and ecosystems. In fact, only a few studies have evaluated and compared the results of species-specific volumes and basal areas obtained from different remote sensing-assisted forest inventory approaches [23,37]. In particular, most of these studies were carried out in boreal forest conditions, accounting for spruce, pine, and deciduous tree species groups. Moreover, most of the remote sensing-assisted inventory methods combine ALS data with aerial photogrammetry or multispectral imagery to obtain species-specific volumes. The fusion of complementary data sources, in particular ALS and hyperspectral data, typically results in greater accuracies than using the respective data separately [6,23,40]. Therefore, the fusion of ALS and hyperspectral data can offer a powerful basis for discriminating tree species and enables an accurate prediction of species-specific attributes. The kNN approach has grown in popularity due to its ability to successfully and simultaneously relate multiple forest attributes derived from field observations to remotely sensed data [41]. However, to the best of our knowledge, there are no studies to date that have tested the ITC, semi-ITC, and ABA approaches on a comparative basis with emphasis on species-specific volumes.
Thus, the main goal of the present study was to predict and evaluate the species-specific volume in a complex Alpine forest based on fused ALS and airborne hyperspectral data using three remote sensing-assisted forest inventory approaches, i.e., the ITC, semi-ITC, and ABA. The performances of all three inventory approaches were analyzed and compared for the total volume and volume of five species classes.

Study Area
The 32 km² study area is located in an Italian Alpine forest in Pellizzano (46°17′22″N, 10°46′05″E), situated in the province of Trento. The altitude of the forest area ranges from 900 to 2200 m above sea level. The forest is dominated by Norway spruce (Picea abies (L.) Karst.), with the presence of other coniferous species (e.g., larch (Larix decidua Mill.), silver fir (Abies alba Mill.)) and broadleaved species (e.g., rowan (Sorbus aucuparia L.), aspen (Populus tremula L.), silver birch (Betula pendula Roth), and sycamore maple (Acer pseudoplatanus L.)). At higher altitudes, the forest is sparse, whereas at lower altitudes, the forest structure is more complex, varying from a one- to multi-layer forest with patches of mixed and homogeneous tree species composition. The forest is managed by selective logging focusing on the productive forest area, especially Norway spruce, and trees are harvested according to their stem diameter.

Field Data
Between the summers of 2013 and 2015, 47 circular sample plots were surveyed. The size of the sample plots was 700 m². The center location of each plot was determined with global navigation satellite system (GNSS) measurements, resulting in a positional error of less than 1 m. At each sample plot, the tree locations were recorded as polar coordinates by measuring the azimuth and range to the center of the plot. For all trees within the sample plot, stem diameter at breast height and tree species were recorded. Tree height for randomly selected trees was measured using a Vertex III hypsometer. Tree heights of the remaining trees were predicted with allometric equations [1]. In each sample plot, only trees with a DBH greater than 7.5 cm were considered in the analysis. Dead or damaged trees were also excluded. In total, 1283 field trees were available for the analysis. The tree-level and plot-level statistics are summarized in Table 1. The tree species observed for the 1283 trees were: 76.5% Norway spruce, 9.0% larch, 6.2% rowan, 1.7% silver fir, 1.5% silver birch, 1.4% sycamore maple, and 3.7% other minority broadleaves. For the following analysis, we grouped tree species into five classes to predict species-specific volumes using the ITC and semi-ITC approaches: Norway spruce, larch, rowan, silver fir, and other broadleaves (silver birch, sycamore maple, and other minority broadleaves). The field statistics for the total and species-specific volumes are presented in Table 2.
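The conversion of the recorded polar tree positions into map coordinates can be sketched as follows. This is only an illustration in Python, not the procedure used in the study: the function name `polar_to_xy` is hypothetical, and the azimuth is assumed to be measured in degrees clockwise from north, which the text does not state. The plot radius is derived from the stated plot size of 700 m².

```python
import math

# Radius of a circular 700 m^2 sample plot (about 14.93 m).
PLOT_RADIUS_M = math.sqrt(700.0 / math.pi)

def polar_to_xy(center_x, center_y, azimuth_deg, range_m):
    """Convert azimuth and horizontal range from the plot center to map x/y.

    Assumption: azimuth is in degrees, clockwise from north (y axis).
    """
    az = math.radians(azimuth_deg)
    return (center_x + range_m * math.sin(az),
            center_y + range_m * math.cos(az))

# A tree 10 m due east of a plot centered at (100, 200).
print(polar_to_xy(100.0, 200.0, 90.0, 10.0))
```

Because the plot centers carry a positional error of up to about 1 m, the resulting tree coordinates inherit at least that uncertainty.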

ALS and Hyperspectral Data and Pre-Processing
ALS data were acquired with a Riegl LMS-Q680i sensor between the 7th and 9th of September 2012. The flying speed was approximately 51 m s⁻¹ at an altitude of 660 m above ground level with a pulse repetition frequency of 400 kHz. For each emitted pulse, up to seven returns were recorded and the mean point density was 48 points m⁻². A digital terrain model (DTM) was extracted from the ALS points with a grid of 0.5 m by the vendor. In the preprocessing, underlying DTM elevations were subtracted from the ALS point elevations to convert ALS point elevations to heights above ground. From these ALS points, a raster canopy height model (CHM) of the area was created with a resolution of 0.5 m. Twenty-one hyperspectral images were acquired on the 13th of June 2013 by an AISA Eagle II sensor with a spatial resolution of 1 m. The minimum overlap among the images was 20%. The images consisted of 65 spectral bands between 403.09 nm and 995.31 nm. The hyperspectral strips were mosaicked into one image. For each pixel, the normalized difference vegetation index (NDVI) was computed based on the red and infrared wavelengths of 646.72 nm and 815.27 nm, respectively, and pixels with NDVI below 0.5 were removed in order to reduce noise and shadowing effects. Afterwards, the value of each pixel was normalized with respect to the sum of the values of the same pixel in all the bands to reduce the minor differences in radiance occurring between the different images [42]. The hyperspectral images were orthorectified using ALS data by the data vendor.
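The two per-pixel steps described above, NDVI masking and normalization by the band sum, can be sketched in a few lines. This is a minimal Python illustration under stated assumptions: the function names and the `red_idx` and `nir_idx` arguments are hypothetical, and the caller is expected to pass the positions of the 646.72 nm and 815.27 nm bands in the spectrum.

```python
def ndvi(red, nir):
    """Normalized difference vegetation index from red and near-infrared values."""
    return (nir - red) / (nir + red)

def preprocess_pixel(spectrum, red_idx, nir_idx, ndvi_min=0.5):
    """Return the sum-normalized spectrum, or None if the pixel is masked out.

    spectrum : list of band values for one pixel (65 bands in the study)
    red_idx, nir_idx : positions of the red and infrared bands (assumed inputs)
    ndvi_min : pixels with NDVI below this threshold are removed (0.5 in the text)
    """
    red, nir = spectrum[red_idx], spectrum[nir_idx]
    if nir + red == 0 or ndvi(red, nir) < ndvi_min:
        return None
    total = sum(spectrum)
    # Each band value is divided by the sum over all bands of the same pixel.
    return [value / total for value in spectrum]

# Toy three-band pixel: band 0 plays the role of red, band 2 of infrared.
print(preprocess_pixel([1.0, 1.0, 6.0], red_idx=0, nir_idx=2))
```

After this normalization the band values of each retained pixel sum to one, which is what suppresses brightness differences between flight strips.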

Overview
The flowchart for the species-specific volume prediction for the three remote sensing inventory approaches using ALS and hyperspectral data is presented in Figure 1. For the ITC approach, the predicted volumes were summed per plot and scaled to m³ per hectare, and for the semi-ITC and ABA approaches, the volumes were scaled to per-hectare values before modelling. In the case of the ITC and semi-ITC approaches, ALS and hyperspectral variables were calculated for each crown segment, whereas for the ABA, the variables were calculated for each subsample plot (sample plot halves). In the ITC approach, the tree species and DBH were predicted per segment. Subsequently, the allometric models were applied to predict species-specific volumes, which were summed to obtain the total volume. For the semi-ITC approach and the ABA, the total and species-specific volumes were predicted simultaneously by a non-parametric multivariate kNN method using leave-one-out (LOO) cross-validation at the plot level.
The basic idea of the kNN method is that biophysical attributes within the target data are predicted by imputing them from the k nearest neighbors within the reference data. The nearness of neighbors is measured with the distance matrix between predictors (e.g., ALS and hyperspectral variables), which are known in the reference and target data sets [43]. For the distance matrix, the most similar neighbor (MSN) method was applied, where the nearness of the k-nearest neighbors was defined in terms of weighted Euclidean distance in a projected canonical space. We chose this distance because a preliminary analysis indicated that it was the most suitable for this specific problem among the random forest proximity, Euclidean distance, Euclidean distance without normalization, and Mahalanobis distance. Thus, we applied the kMSN, which is a particular type of the kNN methods. One property of the kMSN method is that the predictions are always within the range of the observed distribution. The main advantages of the kMSN approach are that the multivariate responses include the simultaneous predictions of the volumes and preserve their complex variance-covariance structure [39], along with their robustness to outliers or extreme values in the data. The main disadvantage is that the kMSN approaches normally require a larger number of observations, although some studies have shown that reasonable results can be obtained also with a limited number of sample plots [37,44,45].
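The imputation idea can be illustrated with a simplified sketch. It is not the MSN implementation used in the study: the canonical-correlation step that derives the distance weights is omitted, the `weights` argument is a hypothetical stand-in for it, and the k neighbors are combined with a plain mean, which is only one of several possible imputation rules.

```python
import math

def knn_impute(target_x, ref_x, ref_y, k, weights=None):
    """Impute a multivariate response for one target unit from its k nearest references.

    target_x : predictor values of the target unit
    ref_x    : list of predictor vectors of the reference units
    ref_y    : list of response vectors (e.g. total and species-specific volumes)
    weights  : per-predictor weights of the Euclidean distance (assumed given)
    """
    weights = weights or [1.0] * len(target_x)
    distances = []
    for idx, x in enumerate(ref_x):
        d = math.sqrt(sum(w * (a - b) ** 2 for w, a, b in zip(weights, target_x, x)))
        distances.append((d, idx))
    nearest = [idx for _, idx in sorted(distances)[:k]]
    n_responses = len(ref_y[0])
    # All responses are imputed from the same neighbors, so their joint
    # structure is carried over from the reference data.
    return [sum(ref_y[i][j] for i in nearest) / len(nearest) for j in range(n_responses)]

ref_x = [[0.0], [1.0], [10.0]]
ref_y = [[100.0, 60.0], [200.0, 80.0], [900.0, 0.0]]
print(knn_impute([0.4], ref_x, ref_y, k=2))
```

Because every prediction is an average of observed reference values, it cannot fall outside the range of the reference data, which matches the property noted above.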
Remote Sens. 2017, 9, 400
Figure 1. Flowchart of the procedure for predicting total and species-specific volumes.

ITC Delineation Method
To avoid ground hits and effects of rocks and small vegetation, all echoes with a height below 2 m were removed from the point cloud in the preprocessing step. The ITC delineation for the ITC and semi-ITC approaches was conducted with the algorithm implemented in the "itcSegment" package in the R software [46] based on ALS data. For each crown segment, position, height ($h_{max}$), and crown area (CA) were calculated. The position and height were determined from the highest ALS point within a crown segment. The crown area (CA) was extracted with the convex hull. The height and crown area were used as ALS variables. The hyperspectral variables were computed for each crown segment as the average value of pixels for all 65 bands, and they are referred to as band means (from B1 to B65). This step was the same for both the ITC and the semi-ITC approach.
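The per-segment ALS variables can be sketched as follows. This is a generic Python illustration, not the itcSegment code: the function names are hypothetical, the hull is computed with the standard monotone-chain algorithm, and the area with the shoelace formula.

```python
def convex_hull(points):
    """Convex hull of 2D points (Andrew's monotone chain), counter-clockwise."""
    pts = sorted(set(points))
    if len(pts) <= 2:
        return pts

    def cross(o, a, b):
        return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])

    lower, upper = [], []
    for p in pts:
        while len(lower) >= 2 and cross(lower[-2], lower[-1], p) <= 0:
            lower.pop()
        lower.append(p)
    for p in reversed(pts):
        while len(upper) >= 2 and cross(upper[-2], upper[-1], p) <= 0:
            upper.pop()
        upper.append(p)
    return lower[:-1] + upper[:-1]

def polygon_area(vertices):
    """Polygon area by the shoelace formula (0 for fewer than three vertices)."""
    n = len(vertices)
    twice_area = sum(vertices[i][0] * vertices[(i + 1) % n][1]
                     - vertices[(i + 1) % n][0] * vertices[i][1] for i in range(n))
    return abs(twice_area) / 2.0

def crown_metrics(points_xyz):
    """Position and height from the highest echo, crown area from the convex hull."""
    top = max(points_xyz, key=lambda p: p[2])
    hull = convex_hull([(x, y) for x, y, _ in points_xyz])
    return {"position": (top[0], top[1]), "h_max": top[2], "CA": polygon_area(hull)}

# Echoes (x, y, height above ground in m) of one toy crown segment.
echoes = [(0.0, 0.0, 5.0), (2.0, 0.0, 6.0), (2.0, 2.0, 12.0), (0.0, 2.0, 7.0), (1.0, 1.0, 9.0)]
print(crown_metrics(echoes))
```

The band means would be computed analogously, as the average of each of the 65 normalized band values over the pixels that fall inside the segment.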

ITC Approach
The ITC approach assumes that each delineated crown segment contains one field tree. The matching procedure of the crown segments and field trees was different from the classical methods [34,47]. An empirical threshold for the permitted distance (horizontal or vertical) between the "matched" crown segment and field tree was avoided. In particular, each segment was matched to the closest field tree according to a 3D distance. To confirm a good match, we fitted a simple linear regression with the field tree height as response ($h$) and crown segment maximum height as predictor ($h_{max}$):

$$h_i = b_0 + b_1 h_{max,i} + \varepsilon_i, \qquad \varepsilon_i \sim N(0, \sigma^2)$$

where $i$ corresponds to a crown segment, $b_0$ and $b_1$ are fixed parameters, $\varepsilon_i$ is the error of tree $i$, and $\sigma^2$ is the model variance. The prediction interval was defined as:

$$\hat{h}_i \pm t_{1-\alpha/2,\,n-p} \sqrt{\hat{\sigma}^2 \left(1 + x_i^{\top} (X^{\top} X)^{-1} x_i\right)}$$

where $t$ stands for the t-distribution, $n$ is the number of segments, $p$ is the number of model parameters, $X$ is the design matrix, and $x_i$ is the row of $X$ for segment $i$. Out of 629 segments, 25 segments' fitted heights were not within the prediction interval (Figure 2). These 25 segments were excluded from the tree species and stem diameter modeling but not from the accuracy analysis. Breidenbach et al. [37] applied the same procedure to exclude segments with incorrect field trees from modeling. After matching, the ALS and hyperspectral variables were merged for each crown segment.
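The match check can be sketched as an ordinary least-squares fit followed by a prediction-interval test. This is a minimal Python illustration under stated assumptions: the function name `flag_mismatches` is hypothetical, and the critical value `t_crit` must be supplied by the caller (for example from a t table with n − 2 degrees of freedom), since the confidence level used in the study is not given in this section.

```python
import math

def flag_mismatches(h_field, h_max, t_crit):
    """Fit h = b0 + b1 * h_max and flag pairs outside the prediction interval.

    h_field : field-measured heights of the matched trees
    h_max   : maximum heights of the crown segments
    t_crit  : t quantile for n - 2 degrees of freedom (assumed input)
    Returns (b0, b1, flags), where flags[i] is True for a suspicious match.
    """
    n, p = len(h_field), 2
    x_bar = sum(h_max) / n
    y_bar = sum(h_field) / n
    sxx = sum((x - x_bar) ** 2 for x in h_max)
    b1 = sum((x - x_bar) * (y - y_bar) for x, y in zip(h_max, h_field)) / sxx
    b0 = y_bar - b1 * x_bar
    residuals = [y - (b0 + b1 * x) for x, y in zip(h_max, h_field)]
    sigma2 = sum(r * r for r in residuals) / (n - p)
    flags = []
    for x, r in zip(h_max, residuals):
        # For simple regression, x_i' (X'X)^-1 x_i = 1/n + (x_i - x_bar)^2 / Sxx.
        leverage = 1.0 / n + (x - x_bar) ** 2 / sxx
        half_width = t_crit * math.sqrt(sigma2 * (1.0 + leverage))
        flags.append(abs(r) > half_width)
    return b0, b1, flags

h_max = [10.0, 12.0, 14.0, 16.0, 18.0, 20.0, 22.0, 24.0, 26.0, 28.0]
h_field = [10.0, 12.0, 14.0, 16.0, 18.0, 35.0, 22.0, 24.0, 26.0, 28.0]  # one bad match
print(flag_mismatches(h_field, h_max, t_crit=2.306)[2])
```

In the toy data only the sixth pair is flagged, because its field height lies far from the height implied by the segment.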
The tree species were classified at the pixel level with a support vector machine (SVM) classifier using hyperspectral variables. We used the SVM implementation in the R package "kernlab" [48]. The penalty factor (C) was selected with a grid search strategy, testing values of 1, 10, 100, and 1000. The value of 10 reached the highest classification accuracy. The weights for different species classes were set as the ratio between the number of trees of the most frequent species and the number of trees of each species. The predicted species for each pixel were aggregated within each crown segment according to a majority rule.
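The class weighting and the majority rule are simple enough to sketch directly. The Python below is an illustration only: the function names are hypothetical, and the SVM itself (fitted with kernlab in the study) is not reproduced.

```python
from collections import Counter

def class_weights(tree_species):
    """Weight per class: count of the most frequent species / count of that species."""
    counts = Counter(tree_species)
    n_max = max(counts.values())
    return {species: n_max / n for species, n in counts.items()}

def segment_species(pixel_labels):
    """Majority vote over the pixel-level predictions inside one crown segment.

    Ties are resolved in favor of the label encountered first (an assumption;
    the text does not say how ties were handled).
    """
    return Counter(pixel_labels).most_common(1)[0][0]

trees = ["spruce"] * 6 + ["larch"] * 2 + ["rowan"]
print(class_weights(trees))
print(segment_species(["spruce", "larch", "spruce"]))
```

With these weights a misclassified tree of a rare species is penalized more heavily than one of the dominant species, which counteracts the strong dominance of Norway spruce in the training data.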
The stem diameter at breast height (cm) was predicted for each crown segment using a nonlinear regression model, taken from the study of Dalponte and Coomes [49] conducted on the same dataset. The fitted species-specific nonlinear model predicts the DBH from $h_{max,ij}$, the maximum crown segment height (m), and $CA_{ij}$, the crown area (m²), with the parameters $\varepsilon_j$, $\rho_j$, and $\vartheta_j$ determined in [49]. Indices $i$ and $j$ correspond to segment and species, respectively. Knowing the predicted species and stem diameter, the species-specific volumes were predicted with the allometric models for temperate species of Scrinzi et al. [1], whose parameters $a_j$, $b_j$, $c_j$, and $d_j$ were taken from species-specific tables derived from trees in the Trentino province [1]. Finally, the volume was summed for each species at the plot level.

Semi-ITC Approach
The semi-ITC approach presumes that a crown segment can contain none, one, or more field trees. Thus, each field tree was matched to the closest segment according to a 3D distance. The field-derived values of the total and species-specific volumes were assigned to each segment. For the segments that were matched with more than one field tree, the tree volumes were summed over the total and species-specific volumes. The segments with no match had zero values for all volumes. In the following analysis, all the segments were used for the modelling and accuracy analysis.
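The construction of the segment-level responses can be sketched as follows. This Python illustration rests on assumptions: the function name and data layout are hypothetical, and a segment is represented by the position and height of its highest echo when computing the 3D distance.

```python
import math

def semi_itc_responses(segments, trees, species_classes):
    """Assign every field tree to its closest segment and sum volumes per segment.

    segments : list of (x, y, h_max) tuples, one per crown segment
    trees    : list of dicts with keys x, y, h, species, volume
    Returns one dict per segment with the total and species-specific volumes;
    segments without any matched tree keep zero for all volumes.
    """
    responses = [{"total": 0.0, **{sp: 0.0 for sp in species_classes}} for _ in segments]
    for tree in trees:
        location = (tree["x"], tree["y"], tree["h"])
        nearest = min(range(len(segments)), key=lambda s: math.dist(segments[s], location))
        responses[nearest]["total"] += tree["volume"]
        responses[nearest][tree["species"]] += tree["volume"]
    return responses

segments = [(0.0, 0.0, 20.0), (10.0, 0.0, 25.0), (50.0, 50.0, 15.0)]
trees = [
    {"x": 1.0, "y": 0.0, "h": 19.0, "species": "spruce", "volume": 0.5},
    {"x": 0.0, "y": 1.0, "h": 8.0, "species": "rowan", "volume": 0.1},
    {"x": 9.0, "y": 0.0, "h": 24.0, "species": "larch", "volume": 0.8},
]
for response in semi_itc_responses(segments, trees, ["spruce", "larch", "rowan"]):
    print(response)
```

Because every field tree contributes to exactly one segment, the field volume of a plot is preserved in the segment-level reference data, including the volume of suppressed trees that the delineation did not detect as separate crowns.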
The kMSN prediction method, implemented in the R package "yaImpute" [50], was applied using the total and species-specific volumes per segment derived from the field observations as response variables and the selected predictor variables per segment derived from the ALS and hyperspectral variables as covariates. The predictor variables were selected with the variable selection method varSelection implemented in the R package "yaImpute" [50]. For the selection, the remotely sensed variables were added to the kMSN model based on the computed generalized root mean square distance (grmsd) between the predicted and observed response variables among the field observations. The grmsd computes the root mean square distance between the observed and predicted volumes over several variables simultaneously. The variable that was related to the largest grmsd was removed. Finally, the selected predictor variables were: CA, $h_{max}$, B4 (428.96 nm), B13 (509.16 nm), B14 (518.10 nm), B29 (655.98 nm), B31 (674.49 nm), B33 (693.00 nm), and B43 (786.82 nm). The number of the nearest neighbors was selected empirically and the distance to the nearest neighbors, i.e., crown segments, was defined by the MSN distance matrix based on the predictor variables.

ABA
In order to increase the number of plots, i.e., neighbors, important for the kMSN method, we subdivided the 47 sample plots into halves, resulting in 94 subsample plots with a size of 350 m². The same procedure was applied by Breidenbach et al. [37]. The ALS canopy height- and density-related variables calculated for each subsample plot were the ones also used in other studies [9,23], using the lower limit of the canopy defined by a threshold value of 2 m [9,51,52]. The ALS variables were computed for the first, intermediate, and last returned laser pulses, denoted $x$, from 1 to 3, respectively: maximum ($h_{max\_x}$), mean ($h_{mean\_x}$), skewness ($h_{sk\_x}$), kurtosis ($h_{ku\_x}$), and coefficient of variation ($h_{cv\_x}$) of the ALS heights (m) and quantiles corresponding to the 0, 10, 20, ..., 90 percentiles of the ALS heights ($h_{q10\_x}$ to $h_{q90\_x}$). Furthermore, 10 vertical slices of equal height were defined as a range between the lowest laser canopy height (2 m) and the 90th percentile of the canopy height. Then, the canopy density variables ($d_{1\_x}$ to $d_{10\_x}$) were calculated as the proportions of laser pulses above each vertical slice to the total number of pulses. Additionally, the canopy volume ($C_{Vol}$) was calculated from the cell-wise differences between the rasterized height values (m) of the first ($F_{ij}$) and last ($L_{ij}$) returned pulses of the $i$-th pixel in the $j$-th sample plot. The hyperspectral variables were the same as for the semi-ITC. Additionally, the NDVI and the difference vegetation index (DVI) were computed for each plot as both indices can be potentially good explanatory variables for the volume prediction [53,54].
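A subset of the height and density variables can be sketched for one return category as follows. This Python illustration makes several assumptions that the text leaves open: quantiles use linear interpolation, the density of a slice is the share of all echoes at or above the lower boundary of that slice, and the function names are hypothetical. Skewness, kurtosis, and the canopy volume are omitted for brevity.

```python
import statistics

def quantile(sorted_values, q):
    """Quantile of pre-sorted values by linear interpolation (one common convention)."""
    pos = q * (len(sorted_values) - 1)
    lo = int(pos)
    hi = min(lo + 1, len(sorted_values) - 1)
    return sorted_values[lo] + (pos - lo) * (sorted_values[hi] - sorted_values[lo])

def aba_metrics(heights, min_h=2.0, n_slices=10):
    """Height and density metrics from the echo heights (m) of one subsample plot."""
    canopy = sorted(h for h in heights if h >= min_h)  # echoes above the 2 m limit
    mean_h = statistics.fmean(canopy)
    metrics = {
        "h_max": canopy[-1],
        "h_mean": mean_h,
        "h_cv": statistics.stdev(canopy) / mean_h,
    }
    for pct in range(10, 100, 10):
        metrics[f"h_q{pct}"] = quantile(canopy, pct / 100)
    # Ten slices of equal height between 2 m and the 90th height percentile.
    slice_height = (metrics["h_q90"] - min_h) / n_slices
    for s in range(n_slices):
        lower = min_h + s * slice_height
        metrics[f"d_{s + 1}"] = sum(h >= lower for h in heights) / len(heights)
    return metrics

heights = [0.0, 1.0, 2.0, 4.0, 6.0, 8.0, 10.0, 12.0, 14.0, 16.0, 18.0, 20.0]
result = aba_metrics(heights)
print(result["h_q90"], result["d_1"], result["d_10"])
```

In the study these metrics were computed separately for first, intermediate, and last returns, so the function would be called once per return category.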
The variable selection and prediction method employed were the same as for the semi-ITC approach. For the simultaneous volume prediction, 18 variables were selected: $h_{cv\_2}$, $h_{max\_1}$, $h_{q20\_1}$, $h_{q50\_1}$, $h_{q10\_2}$, $h_{q40\_2}$, $h_{q20\_3}$, $h_{q30\_3}$, $d_{10\_2}$, $d_{4\_3}$, B3 (420.34 nm), B14 (518.10 nm), B16 (535.98 nm), B20 (572.65 nm), B27 (637.47 nm), B35 (711.62 nm), B38 (739.59 nm), and B54 (891.00 nm). To avoid overfitting, the number of variables should be smaller than the number of samples. In addition, adding more predictor variables in the training data does not always improve the kNN model accuracy [55]. Then, we applied the kMSN distance metric to find the reference plots for the target plots based on the selected variables above. The number of the nearest neighbors used was based on preliminary tests according to the balance between the highest accuracy and the smallest systematic error.

Accuracy Assessment
For the ITC approach, the tree species classification accuracy was validated at the ITC level with a 3-fold cross-validation using the overall accuracy (OA), the kappa coefficient (KA), and the producer's and user's accuracies derived from the confusion matrix. The accuracy of the predicted stem diameter at the ITC level was assessed with the root mean square error (RMSE), and the mean difference (MD), as an indicator of the systematic error, was calculated as

$$\mathrm{RMSE}_j = \sqrt{\frac{1}{n}\sum_{i=1}^{n}\left(y_{ij}-\hat{y}_{ij}\right)^{2}}, \qquad \mathrm{MD}_j = \frac{1}{n}\sum_{i=1}^{n}\left(y_{ij}-\hat{y}_{ij}\right)$$

where n is the sample size (number of segments), i denotes a segment, and y_ij and ŷ_ij are the observed and predicted attributes of segment i for the j-th species class. The relative RMSE and MD were calculated by dividing by the mean of the observed values. Additionally, the squared correlation (r²) was computed as the square of Pearson's correlation coefficient between the observed and predicted values.
The accuracy and the systematic error of the total and species-specific volume models at the plot level were calculated with the same statistics as above, where the sample size represented the number of sample plots and i denoted the plot. The reliability of the volumes predicted by the semi-ITC and ABA was tested by means of LOO cross-validation. For the semi-ITC and ABA approaches, the crown segments and subsample plots, respectively, belonging to the same sample plot were left out together in each step of the cross-validation. The accuracies of the total and species-specific volumes predicted at the plot level by both approaches were evaluated with the same measures as for the ITC approach.
We performed the Wilcoxon signed rank test to check whether the differences between the observed and predicted volumes were statistically significant for each inventory approach at a significance level of 0.05. The Friedman test at the same significance level was applied to check whether the distributions of the differences between the observed and predicted species-specific volumes among sample plots differed between the three inventory approaches. In addition, we conducted a Conover post-hoc analysis to determine which pairs of the three inventory approaches were significantly different from each other for each species class volume.
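The accuracy statistics described above can be sketched in a few lines. This is a minimal illustration, not the authors' code: the function name `accuracy_stats` and the example values are hypothetical, and the sign of the MD (observed minus predicted, so that a positive MD indicates underprediction) is an assumption inferred from how the results are reported.

```python
import math

def accuracy_stats(observed, predicted):
    """Return RMSE, MD, their relative versions (% of the observed mean), and r^2."""
    n = len(observed)
    if n == 0 or n != len(predicted):
        raise ValueError("observed and predicted must be non-empty and of equal length")
    # Assumed sign convention: observed - predicted (positive = underprediction).
    diffs = [o - p for o, p in zip(observed, predicted)]
    rmse = math.sqrt(sum(d * d for d in diffs) / n)
    md = sum(diffs) / n
    mean_obs = sum(observed) / n
    mean_pred = sum(predicted) / n
    # Squared Pearson correlation between observed and predicted values.
    sxy = sum((o - mean_obs) * (p - mean_pred) for o, p in zip(observed, predicted))
    sxx = sum((o - mean_obs) ** 2 for o in observed)
    syy = sum((p - mean_pred) ** 2 for p in predicted)
    r2 = (sxy * sxy) / (sxx * syy) if sxx > 0 and syy > 0 else float("nan")
    return {
        "rmse": rmse,
        "md": md,
        "rel_rmse": 100.0 * rmse / mean_obs,
        "rel_md": 100.0 * md / mean_obs,
        "r2": r2,
    }

# Hypothetical plot-level volumes (m3/ha), for illustration only.
stats = accuracy_stats([100.0, 200.0, 300.0], [90.0, 210.0, 270.0])
print(stats)
```

The same function applies at the ITC level (one value per segment) and at the plot level (one value per sample plot); only the unit of observation changes.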

ITC Approach
The ITC segmentation method delineated 629 segments within the 47 plots. The omission and commission errors were 51% and 4%, respectively. When calculating hyperspectral variables per crown, one crown was removed due to shadowing effects. Thus, the following analysis comprised 628 crown segments. The overall accuracy and kappa coefficient of the tree species classification were 85.2% and 0.57, respectively; the producer's and user's accuracies are shown in Table 3. The predicted stem diameter per crown segment resulted in an RMSE of 11.0 cm (relative RMSE = 23.40%) and an MD of 0.43 cm (relative MD = 0.92%), with an explained proportion of variance of 0.61. Given the predicted species, the species-specific volumes were predicted and aggregated at the plot level (Table 4 and Figure 3). We also checked whether the RMSEs of the predicted volumes were within the standard deviations of the field volumes (Table 2). In this regard, the RMSEs of the volumes for the larch, Norway spruce, and rowan classes were within the standard deviations of the observed volumes, while those of the other broadleaves and silver fir classes were not. Moreover, the Wilcoxon signed rank test showed that the differences between the observed and predicted volumes were statistically significant for all species classes (p ≤ 0.05), except for the larch and rowan classes.
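As a reminder of how the overall accuracy, the kappa coefficient, and the producer's and user's accuracies relate to a confusion matrix, the following sketch computes all four. The function name and the two-class matrix are hypothetical and are not the study's data; the example only illustrates that a high overall accuracy can coexist with a moderate kappa when one class dominates.

```python
def classification_accuracy(matrix):
    """matrix[i][j] = number of crowns of reference class i classified as class j."""
    k = len(matrix)
    total = sum(sum(row) for row in matrix)
    if total == 0:
        raise ValueError("empty confusion matrix")
    row_tot = [sum(row) for row in matrix]                              # reference totals
    col_tot = [sum(matrix[i][j] for i in range(k)) for j in range(k)]   # classified totals
    p_obs = sum(matrix[i][i] for i in range(k)) / total                 # overall accuracy
    p_exp = sum(r * c for r, c in zip(row_tot, col_tot)) / total ** 2   # chance agreement
    kappa = (p_obs - p_exp) / (1 - p_exp) if p_exp < 1 else float("nan")
    producers = [matrix[i][i] / row_tot[i] if row_tot[i] else float("nan") for i in range(k)]
    users = [matrix[j][j] / col_tot[j] if col_tot[j] else float("nan") for j in range(k)]
    return p_obs, kappa, producers, users

# Hypothetical, strongly imbalanced two-class example (majority vs. minority species).
oa, kappa, producers, users = classification_accuracy([[80, 5], [10, 5]])
print(oa, kappa, producers, users)
```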

Semi-ITC Approach
For the semi-ITC approach, the same segmentation method was employed as for the ITC approach, but with a different matching procedure in which every field tree was matched to the closest segment. Fifty-four of the 628 segments remained without any field match. The number of nearest neighbors was selected empirically, according to the smallest RMSE and MD values. We chose three neighbors, even though two or four neighbors gave similar results. The predicted volumes were aggregated at the plot level (Table 5) and are displayed in Figure 4. The highest accuracies were reached for the total, Norway spruce, and larch volumes, while the lowest accuracies were obtained for the silver fir, other broadleaves, and rowan volumes. The relative RMSEs of the predicted silver fir, other broadleaves, and rowan volumes were also greater than the relative standard deviations of the corresponding observed volumes (Table 2). The predicted total, larch, and Norway spruce volumes showed minor systematic errors, and their squared correlations (r²) were relatively high, except for larch. All volumes were underpredicted, except for the total and Norway spruce volumes. According to the Wilcoxon signed rank test, the differences between the observed and predicted volumes were not statistically significant (p > 0.05) for any species class.
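The empirical choice of the number of neighbors, combined with leaving out all segments of a sample plot at once, can be sketched as follows. This is a simplified illustration under stated assumptions: plain Euclidean distance replaces the most similar neighbor distance of the kMSN method, plot-level values are obtained by summing segment predictions, only the RMSE of the total volume is used for the selection, and all names and values (`units`, `leave_one_plot_out`, the feature vectors) are hypothetical.

```python
import math

def knn_predict(x, train, k):
    """Average the response vectors of the k training units nearest to x."""
    nearest = sorted(train, key=lambda u: math.dist(x, u["x"]))[:k]
    m = len(nearest[0]["y"])
    return [sum(u["y"][j] for u in nearest) / len(nearest) for j in range(m)]

def leave_one_plot_out(units, k):
    """Predict each unit without any unit of its own plot, then sum to plot level."""
    observed, predicted = {}, {}
    for plot in sorted({u["plot"] for u in units}):
        held_out = [u for u in units if u["plot"] == plot]
        train = [u for u in units if u["plot"] != plot]
        m = len(held_out[0]["y"])
        preds = [knn_predict(u["x"], train, k) for u in held_out]
        observed[plot] = [sum(u["y"][j] for u in held_out) for j in range(m)]
        predicted[plot] = [sum(p[j] for p in preds) for j in range(m)]
    return observed, predicted

def total_rmse(observed, predicted):
    """RMSE of the total volume (sum over species) across plots."""
    errs = [sum(observed[p]) - sum(predicted[p]) for p in observed]
    return math.sqrt(sum(e * e for e in errs) / len(errs))

# Hypothetical segments: x = (ALS height, spectral index), y = (spruce, fir) volume.
# In practice the predictors would be standardized before computing distances.
units = [
    {"plot": "A", "x": [10.0, 0.20], "y": [1.0, 0.0]},
    {"plot": "A", "x": [12.0, 0.30], "y": [1.5, 0.0]},
    {"plot": "B", "x": [11.0, 0.25], "y": [1.2, 0.1]},
    {"plot": "B", "x": [20.0, 0.60], "y": [3.0, 0.5]},
    {"plot": "C", "x": [19.0, 0.55], "y": [2.8, 0.4]},
    {"plot": "C", "x": [13.0, 0.30], "y": [1.6, 0.0]},
]
rmse_by_k = {k: total_rmse(*leave_one_plot_out(units, k)) for k in (1, 2, 3)}
best_k = min(rmse_by_k, key=rmse_by_k.get)
print(rmse_by_k, best_k)
```

Holding out whole plots rather than single segments keeps neighboring crowns of the same plot from leaking into the training set, which would otherwise make the plot-level errors look smaller than they are.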

ABA
The number of nearest neighbors was set to three, although the volumes of the majority species did not change substantially when one or two neighbors were used. The accuracy assessment and visualization of the volume predictions using the kMSN approach appear in Table 6 and Figure 5. The RMSEs of the predicted volumes (Table 6) were within the standard deviations of the observed volumes, except for the other broadleaves volume. The Wilcoxon signed rank test showed that the differences between the predicted and observed volumes were not statistically significant (p > 0.05).
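The Wilcoxon signed rank test applied to the paired observed and predicted plot volumes can be sketched with the standard library alone. This is an illustrative approximation, not necessarily the variant used in the study: zero differences are dropped, tied absolute differences receive average ranks, the two-sided p-value comes from the normal approximation without tie or continuity correction, and the function name and data are hypothetical.

```python
import math

def wilcoxon_signed_rank(observed, predicted):
    """Return (W+, two-sided p) using the normal approximation."""
    diffs = [o - p for o, p in zip(observed, predicted) if o != p]  # drop zeros
    n = len(diffs)
    if n == 0:
        raise ValueError("all differences are zero")
    order = sorted(range(n), key=lambda i: abs(diffs[i]))
    ranks = [0.0] * n
    i = 0
    while i < n:
        j = i
        # Extend j over a run of tied absolute differences.
        while j + 1 < n and abs(diffs[order[j + 1]]) == abs(diffs[order[i]]):
            j += 1
        for t in range(i, j + 1):
            ranks[order[t]] = (i + j) / 2 + 1  # average rank of the tied run
        i = j + 1
    w_plus = sum(r for r, d in zip(ranks, diffs) if d > 0)
    mean = n * (n + 1) / 4
    sd = math.sqrt(n * (n + 1) * (2 * n + 1) / 24)
    z = (w_plus - mean) / sd
    p_value = math.erfc(abs(z) / math.sqrt(2))  # two-sided normal tail probability
    return w_plus, p_value

# Hypothetical plot volumes with differences of alternating sign.
obs = [101.0, 198.0, 303.0, 396.0, 505.0, 594.0, 707.0, 792.0]
pred = [100.0, 200.0, 300.0, 400.0, 500.0, 600.0, 700.0, 800.0]
w_plus, p_value = wilcoxon_signed_rank(obs, pred)
print(w_plus, p_value)
```

A p-value above 0.05, as in this balanced example, means that there is no evidence of a systematic shift between observed and predicted volumes; it does not by itself indicate small errors.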

Comparison of the Different Inventory Approaches
For all three inventory approaches, the RMSEs and MDs of the total and five species-specific volumes were compared (Figure 6). The lowest relative RMSEs were achieved by the ITC approach, except for the total and Norway spruce volumes, for which the semi-ITC approach was the most accurate. The ABA and semi-ITC approaches resulted in quite large RMSEs for the minority species, i.e., silver fir, rowan, and other broadleaves. The accuracies for the total volume and the majority species (Norway spruce) were in the same range for all three inventory approaches. Regarding the relative MDs, the smallest values were obtained by the semi-ITC approach, except for silver fir, for which the ABA resulted in a smaller systematic error. The ITC approach resulted in the largest MDs, except for rowan, for which the ABA resulted in a larger MD. The Friedman test showed that the medians of the differences between the observed and predicted volumes per plot differed significantly among the three inventory approaches for all species classes, except for larch. Furthermore, the post-hoc analysis of the Friedman test demonstrated that all pairs of inventory approaches (ITC vs. semi-ITC, ITC vs. ABA, semi-ITC vs. ABA) were significantly different (p ≤ 0.05) from each other for the total, silver fir, other broadleaves, and rowan volumes. For the larch and Norway spruce volumes, the differences were also statistically significant for the pairs ITC vs. semi-ITC and ITC vs. ABA, but not for the pair semi-ITC vs. ABA (p > 0.05). Overall, the best balance between the RMSEs and MDs for all volumes was achieved by the semi-ITC approach.
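The Friedman test used for this comparison ranks the three approaches within each sample plot and checks whether their rank sums differ more than expected by chance. The sketch below is a minimal version for exactly three approaches, where the chi-square approximation with two degrees of freedom has the closed-form survival function exp(-Q/2). No tie correction is applied, the Conover post-hoc step is not included, and the function names and data are hypothetical.

```python
import math

def rank_within_block(values):
    """Ranks 1..k within one plot; tied values receive their average rank."""
    k = len(values)
    order = sorted(range(k), key=lambda i: values[i])
    ranks = [0.0] * k
    i = 0
    while i < k:
        j = i
        while j + 1 < k and values[order[j + 1]] == values[order[i]]:
            j += 1
        for t in range(i, j + 1):
            ranks[order[t]] = (i + j) / 2 + 1
        i = j + 1
    return ranks

def friedman_three(blocks):
    """blocks: one [ITC, semi-ITC, ABA] triple of (observed - predicted) per plot."""
    k = 3
    n = len(blocks)
    if n == 0 or any(len(b) != k for b in blocks):
        raise ValueError("each block needs exactly three values")
    rank_sums = [0.0] * k
    for block in blocks:
        for j, r in enumerate(rank_within_block(block)):
            rank_sums[j] += r
    q = 12.0 / (n * k * (k + 1)) * sum(r * r for r in rank_sums) - 3.0 * n * (k + 1)
    p_value = math.exp(-q / 2.0)  # chi-square survival function for 2 degrees of freedom
    return q, p_value

# Hypothetical differences (m3/ha): ITC underpredicts in every plot.
blocks = [[30.0, 1.0, -5.0], [42.0, 3.0, -2.0], [25.0, 0.5, -8.0],
          [38.0, 2.0, -1.0], [51.0, 4.0, -6.0], [29.0, 1.5, -3.0]]
q, p_value = friedman_three(blocks)
print(q, p_value)
```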

Discussion
Many methods have been proposed for fusing ALS data with stereo or airborne multispectral or hyperspectral images in order to achieve more accurate species recognition [23,56,57]. In this study, we showed that the fusion of ALS and hyperspectral data enabled the prediction of volumes at good accuracy levels. Moreover, hyperspectral variables were not only used for species classification, but were also incorporated in the common ALS framework, as they were selected as important variables for volume modeling. At the moment, hyperspectral data are the most powerful tool for species identification [6], and consequently they can improve the accuracy of the predicted biophysical attributes [31]. Forest inventories can be improved by the use of these combined data, as they can increase the spatial detail, coverage, and accuracy of forest biophysical attributes. Thus, the combined data also help forest managers to develop a broad and detailed database of forestry information to be coupled with a decision support system [58].
The accuracies of the species-specific volumes of the minority species obtained by the ITC approach were higher than those of the semi-ITC and ABA approaches, whereas the accuracies for the most frequent species, i.e., Norway spruce, were very similar. The semi-ITC approach had slightly smaller RMSEs than the ABA, except for the larch and other broadleaves classes, where the RMSEs were similar for both approaches. The largest systematic errors occurred in the ITC approach and were caused by non-detected trees. Although all delineated crown segments were used for the volume modeling, a large underprediction due to omission errors of the ITC delineation still appeared in the ITC approach, mainly because of the forest structure [14]. The semi-ITC approach underpredicted all volumes, except for the Norway spruce volume, which was negligibly overpredicted. Even though the semi-ITC approach compensates for the problem related to the omission error, it cannot eliminate it completely. On average, the species-specific volumes were predicted with the smallest systematic errors by the semi-ITC approach. The ABA also underpredicted the volumes, except for the Norway spruce and silver fir volumes, which were overpredicted. The overprediction in the ABA can result from the species present in the nearest sample plots. As the volumes were predicted based on selected nearest neighbors, the kMSN method can predict small volumes for tree species classes that do not actually occur in a sample plot. Since a kNN method was used, the predicted values have a smaller range than the observed values because of an averaging effect, which can cause the underprediction for the ABA and semi-ITC approaches. We have to consider that the systematic error of the semi-ITC approach results from the combined characteristics of the delineation and the kNN method. In most cases, the systematic errors obtained with the semi-ITC approach were smaller than those of the ABA. A similar observation was made in the study of Breidenbach et al. [37].
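The averaging effect of nearest neighbor prediction described above can be made concrete with a small example. The sketch assumes an unweighted mean over three neighbors; the function name and the volumes are hypothetical. A target plot without silver fir still receives a small silver fir volume because one of its neighbors contains that species, and each predicted value necessarily lies between the smallest and largest neighbor value, which narrows the range of the predictions.

```python
def knn_mean(neighbor_volumes):
    """Unweighted mean of the neighbors' species-specific volumes."""
    k = len(neighbor_volumes)
    species = neighbor_volumes[0].keys()
    return {s: sum(v[s] for v in neighbor_volumes) / k for s in species}

# Hypothetical volumes (m3/ha) of the three nearest reference plots.
neighbors = [
    {"norway_spruce": 420.0, "silver_fir": 0.0},
    {"norway_spruce": 380.0, "silver_fir": 0.0},
    {"norway_spruce": 310.0, "silver_fir": 60.0},
]
prediction = knn_mean(neighbors)
print(prediction)  # silver fir is predicted as present although two neighbors lack it
```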
In the literature, there are no similar studies comparing the species-specific attributes obtained by the ITC and semi-ITC approaches, except for Ørka et al. [23]. In general, the main problem of the ITC methods is the quite large omission error caused by undetected understory and suppressed trees [59], which mostly results in underprediction [12] of biophysical attributes when aggregated at the plot level [60]. This systematic error can be considerably reduced with the semi-ITC approach, as demonstrated in our study. Comparing the semi-ITC and ABA inventory approaches, Breidenbach et al. and Rahlf et al. [37,61] showed that the semi-ITC approach provided prediction accuracies that were higher than or similar to those of the ABA, while in the study of Ørka et al. [23], the ABA performed better than the semi-ITC approach. Based on a trade-off between the goodness of fit and the systematic error, the compared total and species-specific results suggest that the semi-ITC approach overall outperformed the other approaches.
For the kNN methods, a sufficient number of sample plots is required for the ABA in order to have enough neighbors available for the calculation of the distance matrix. For example, 300 to 500 sample plots are usually applied in operational forest inventories. On the other hand, many studies showed that nonparametric models performed in line with parametric models even with a lower number of observations (e.g., 200 sample plots) [44,62]. We assumed that 94 subsample plots with three nearest neighbors were sufficient to achieve good results. The reason for not applying a parametric method was that the field-observed species-specific volumes contained many zeros or values close to zero, due to the absence of minority tree species in the sample plots. Moreover, in complex stands with a wide variety of tree sizes and species, nonparametric methods might be preferred to avoid implausible (e.g., negative) predictions and to obtain a reasonable extrapolation beyond the range of the calibration data. In fact, among the nonparametric methods, the kNN methods, in particular the kMSN method, have been shown to be efficient in providing simultaneous multivariate predictions at a satisfactory accuracy level [17,37,54,63,64].
The volumes of the minority species were always predicted with lower accuracies than the volumes of the majority species. The reason could be the small share of some tree species, especially broadleaves, which made it difficult to obtain accurate results. Moreover, a large share of only one species is typical of European forests. The low accuracy for minority species is also typical of other studies [23,37], but despite the low accuracies, information on the presence of minority species in stands can be valuable, for example in tree-oriented silvicultural practices. From an economic perspective, minority species are of secondary interest, as the majority of the harvested volume comes from Norway spruce and larch. Other species are mainly used for energy wood, as is the case for the broadleaves volume, or for very specific uses (e.g., high-quality furniture). Minority species are important from the ecological perspective, and thus the use of the ITC or semi-ITC approaches is essential in this case, as it is difficult to monitor such species with the ABA [23]. The ITC and semi-ITC approaches can also be used to detect and report the presence and spread of non-native invasive species that can irreversibly alter the productivity of the systems they invade. The semi-ITC approach is overall quite good for predicting the volumes of both minority and majority species. However, in studies where the volume of the majority species is of high interest, the ABA is recommended, as the field data collection (e.g., tree positions) is less demanding and less costly than what is required for the semi-ITC approach.

Conclusions
The ITC approach reached high accuracies for the volumes of minority species but, in general, large systematic errors, whereas the ABA resulted in small systematic errors and relatively high accuracies for the dominant species. Considering the systematic errors and accuracies for the total and species-specific volumes, the semi-ITC approach achieved the best balance. We found that the ITC approach is important for applications in which information on the minority species is needed, such as biodiversity studies and silvicultural treatments. The ABA is recommended where the dominant species are of key value for management purposes, due to a less demanding collection of field data, relatively high accuracy, and minor systematic error for the dominant species. Overall, the semi-ITC approach has the potential to become competitive with the ABA, as it achieved more accurate results with negligible systematic error for the dominant species. Additionally, when the volumes of the minority species are requested, the semi-ITC approach might be preferred over the ITC approach, as it resulted in small differences from the field-measured values. In addition, when the ITC or semi-ITC approaches are applied, they could always be supplemented with the results of the ABA, as its computation time is short. We also suggest selecting the approach according to the application's tolerance to systematic errors. To conclude, the results of the semi-ITC approach are promising for providing accurate total and species-specific volumes. Further studies are needed to examine the accuracies of such an approach for other biophysical attributes and its use at the operational level over larger areas.

Figure 1. Flowchart of the procedure for predicting total and species-specific volumes.

Figure 2. The field-derived tree height of the matched field trees and the respective maximum airborne laser scanning (ALS) height of crown segments.

Figure 3. Observed versus predicted total and species-specific volumes at the plot level derived from the individual tree crown (ITC) method.

Figure 4. Observed versus predicted species-specific volumes at the plot level derived from the semi-ITC method.

Figure 5. Observed versus predicted species-specific volumes at the plot level derived from the area-based approach (ABA).

Figure 6. (a) Relative RMSEs and (b) relative MDs of the total and five species classes' volumes obtained by the three inventory approaches: ITC, semi-ITC, and ABA.

Table 1. Statistics of the field-observed biophysical attributes at the tree and plot level (SD = standard deviation, DBH = diameter at breast height, H = height, BA = basal area, V = volume).

Table 2. Statistics of the field-observed total and species-specific volumes for five species classes at the plot level.

Table 3. Producer's and user's accuracies of tree species classification at the ITC level.

Table 4. Results of the predicted volumes derived from the ITC approach, aggregated to the plot level (* = differences between the predicted and observed volumes are significantly different from zero (p ≤ 0.05), RMSE = root mean square error, and MD = mean differences).

Table 5. Results of the predicted volumes derived from the semi-ITC approach extended to the plot level.

Table 6. Results of the predicted volumes using the k-nearest neighbor approach derived from ABA at the plot level.
