Predicting Individual Tree Diameter of Larch (Larix olgensis) from UAV-LiDAR Data Using Six Different Algorithms

Sun, Yusen; Jin, Xingji; Pukkala, Timo; Li, Fengri

doi:10.3390/rs14051125

Open AccessArticle

Predicting Individual Tree Diameter of Larch (Larix olgensis) from UAV-LiDAR Data Using Six Different Algorithms

¹

Key Laboratory of Sustainable Forest Ecosystem Management—Ministry of Education, School of Forestry, Northeast Forestry University, Harbin 150040, China

²

School of Forest Sciences, University of Eastern Finland, P.O. Box 111, 80101 Joensuu, Finland

^*

Author to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Remote Sens. 2022, 14(5), 1125; https://doi.org/10.3390/rs14051125

Submission received: 21 January 2022 / Revised: 16 February 2022 / Accepted: 21 February 2022 / Published: 24 February 2022

(This article belongs to the Special Issue Applications of Individual Tree Detection (ITD))

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

Individual tree detection is an increasing trend in LiDAR-based forest inventories. The locations, heights, and crown areas of the detected trees can be estimated rather directly from the LiDAR data by using the LiDAR-based canopy height model and segmentation methods to delineate the tree crowns. However, the most important tree variable is the diameter of the tree stem at the breast height (DBH) which can seldom be interpreted directly from the LiDAR data. Therefore, the use of individually detected trees in forest planning calculations requires predictions for the DBH. This study tested six methods for predicting the DBH from laser scanning data collected by an unmanned aerial vehicle from Larix olgensis plantations located in northeast China. The tested methods were the linear regression model (LM), a linear model with ridge regularization (LMR), support vector regression (SVR), random forest (RF), artificial neural network (ANN), and the k-nearest neighbors (KNN) method. Both tree-level and stand-level metrics derived from the LiDAR point cloud data (for instance percentiles of the height distribution of the echoes) were used as potential predictors of DBH. Compared to the LM, all other methods improved the accuracy of the predictions. On the other hand, all methods tended to underestimate the DBH of the largest trees, which could be due to the inability of the methods to sufficiently describe nonlinear relationships unless different transformations of the LiDAR metrics are used as predictors. The support vector regression was evaluated to be the best method for predicting individual tree diameters from LiDAR data. The benefits of the methods tested in this study can be expected to be the highest in the case of little prior knowledge on the relationships between the predicted variable and predictors, a high number of potential predictors, and strong mutual correlations among the potential predictors.

Keywords:

random forest; artificial neural network; k-nearest neighbors; support vector regression; ridge regulation; machine learning

1. Introduction

Forest ecosystems play an important role in maintaining ecological balance and carbon cycle, regulating local and regional climate, and preserving biosphere stability [1]. Information on tree cover is required for the management of forest ecosystems and to support policies on ecological restoration and climate change mitigation [2]. Traditional forest inventories often employ intensive field samplings with accurate measurements in sample plots [3]. However, field-based forest inventories are labor-intensive and time-consuming, and therefore expensive for collecting data from large areas.

Remote sensing is an effective tool for monitoring wide forest areas [4]. As an active remote sensing technology, airborne light detection and ranging (LiDAR) can directly capture detailed information on forest canopies in three dimensions from large areas [2,5]. LiDAR has proven to have a high potential in predicting forest attributes and acquiring auxiliary information for sampling inventories [6].

However, the flying cost per hectare of airborne LiDAR inventory may be high if the inventory area is not large. Airborne LiDAR inventories may also be limited by adverse flight conditions [7]. Compared with airborne LiDAR, laser scanning from unmanned aerial vehicles (UAVLS) has the advantage of low material and operational costs and high flexibility [8]. UAVLS has been used successfully in several recent studies to predict the diameter distribution of trees [9], mean tree height [8], and aboveground carbon stock [10]. Furthermore, due to the high point density of UAVLS data, the crowns of individual trees can be delineated, improving the accuracy of individual tree detection (ITD) [11].

The diameter at breast height (DBH) correlates strongly with other tree attributes and is easy to measure accurately in the field [12]. It is the most common predictor variable in stem volume equations, tree growth models, and biomass equations. Several studies have been recently carried out where LiDAR data were applied to estimate the DBH of individual trees [2,8,12].

UAVLS data may not be sufficient for the direct detection of the stems of individual trees, due for instance to canopy obstruction and non-optimal scanning angle [13]. However, tree diameter can be predicted through case-specific models that account for the strong relationship between DBH and LiDAR-derived canopy height [12]. Estimates of tree locations and diameters over large continuous forest areas would make it possible to optimize forest management at the level of individual trees [14,15,16].

The existing studies on the estimation of DBH from ALS data mostly employ tree-level LiDAR-derived metrics [17]. However, the relationship between LiDAR metrics and DBH may vary between sites and stands [18]. For example, the diameter-height curve of an even-aged stand of shade-intolerant species is often higher in older stands, i.e., a tree of a given DBH is taller the older the stand is [19]. Therefore, it is logical to test stand-level LiDAR-derived metrics in the prediction of DBH, although it would greatly increase the number of potential predictors. Multicollinearity between different LiDAR-derived metrics is another potential problem when both tree- and stand-level LiDAR metrics are used in model development.

The traditional statistical model-fitting methods, such as the ordinary least squares (OLS) regression, require large sample sizes in the presence of multicollinearity. In the case of multicollinearity, the variances of regression coefficients are large, which makes it difficult to test hypotheses concerning the effects of predictors [20]. The assumptions of OLS regression include linearity, independence of predictors variables, normality of the distribution of residuals, and homogeneity of variance [20].

Nonparametric machine learning techniques, such as random forest (RF), support vector regression (SVR), k-nearest neighbors imputation (KNN), and artificial neural networks (ANN) are alternatives to the traditional regression analysis, especially in the case of a high number of potential predictors, high degree of multicollinearity, and little prior knowledge on the relationships between variables [21]. The use of ridge or lasso regularization in regression analysis also mitigates some of the problems of OLS since it increases the robustness of the model [22].

Nonparametric approaches have already been used in the prediction of forest attributes from LiDAR metrics. For example, Corte et al. (2020) used the support vector regression approach, which showed similar predictive performance as OLS for modeling diameter, height, and volume [17]. Hao et al. (2021) used the random forest approach to estimate the individual tree diameter [11]. Pascual et al. (2019) used the random forest approach to estimate forest attributes [23]. However, more research is needed on different procedures for modeling DBH and other tree attributes with LiDAR metrics.

In this study, we compared six alternative methods for predicting individual tree diameter from UAVLS data in northeast China. The tested methods were linear regression model (LM), linear model with ridge regularization (LMR), support vector regression (SVR), random forest (RF), artificial neural network (ANN), and the k-nearest neighbors method (KNN). For this purpose, we (1) calculated a large number of tree-level and stand-level LiDAR metrics for trees detected from UAVLS data, (2) used the recursive feature elimination (RFE) method to deal with the high data dimensionality, (3) determined the optimal hyper-parameter values for each method, and (4) compared the performance of the algorithms in an independent validation dataset. Our comprehensive analyses reveal the benefits and potential pitfalls of alternative prediction methods and help to understand the mechanisms of predicting DBH from LiDAR data.

2. Materials and Methods

2.1. Study Sites and Field Data

The study area is located at the Mengjiagang forest farm (45°30′16″–46°20′20″N, 130°32′0″–130°52′6″E) of Huanan country in Heilongjiang province of northeast China (Figure 1). The total forest area of the farm is 15,503 ha, of which 4438 ha are natural forests and the rest are plantations. The average elevation of the case study forest is 250 m above sea level [16]. The plantation forests are dominated by coniferous tree species, predominantly Pinus koraiensis, Pinus sylvestris, Larix olgensis, and Picea asperata.

Larch (Larix olgensis A. Henry) is one of the main economic tree species in Northeast China. It accounts for 31% (4800 ha) of the tree plantations of the Mengjiagang forest farm. The breast height diameter of the trees of the current plantations ranges from 5 to 40 cm and the age of the planted stands ranges from 12 to 52 years. Most of the larch plantations have been planted using a density of 3300 stems/ha. The stands are thinned regularly so that a typical density of a near-mature stand is 500 stems/ha [16].

This study developed models for the diameter at breast height (DBH) of larch. A total of 53 square-shaped sample plots (30 m × 30 m) were established shortly after the LiDAR flights were conducted in July 2019. The field-measured inventory data included DBH, total tree height, crown width, and tree position. A total of 4109 trees at least 5 cm in DBH were measured for the DBH (at the height of 1.3 m from the ground) using diameter tape. The total height of the tree was measured using a Vertex IV instrument (Haglöfs, Sweden). Crown width was measured in four directions by using a tape measure. The positions of the trees were measured using a real-time kinetic (RTK) global navigation satellite system (GNSS) (UniStrong G10A, Beijing, China) for which the positioning errors were less than 1 m. Summary statistics of the tree variables are presented in Table 1.

2.2. UAVLS Data

UAVLS data were collected with a RIEGL mini VUX-1UAV LiDAR scanner (RIEGL Laser Measurement Systems GmbH, Horn, Austria) on 12 August 2019. The scanner was carried by a DJI M600 Pro unmanned aerial vehicle. The collected raw data were processed with the software package RiPROCESS (http://www.riegl.com/products/software-packages/riprocess/, accessed on 8 October 2019) to generate UAVLS point cloud data [24]. The total scanned area was 669 ha, consisting of three sub-areas, each of which contained a rectangular survey area of 1 km × 1.5 km. The total scanning time was about one hour. The average point density was 136 pulses/m². The flight speed was 10 m/s and the flight altitude was 180 m above ground. The field of view (FOV) of the laser scanner was 330°. The operational scanning specifications are summarized in Table 2.

2.3. UAVLS Data Pre-Processing

The whole process of developing the prediction models for DBH is shown in Figure 2. After pre-processing, both stands and the crowns of individual trees were delineated using the UAVLS data, after which a high number of metrics were calculated for the stands and trees.

As the first step of data pre-processing, the Gaussian-smoothing filter was employed to remove the noise data points by removing small variations from the canopy surface. The degree of smoothness was determined by the standard deviation [25].

Then, cloth simulation filtering was used to separate non-ground and ground point clouds using 0.5 as the value of the grid resolution parameter, 0.6 as the time step, and 3 as the rigidness parameter [26]. Kriging interpolation was used to generate a digital elevation model (DEM) from the ground points in 1-m spatial resolution using the ArcGIS 10.6 software [27].

The point cloud data were height-normalized by using the DEM. A canopy height model (CHM) was created from the normalized point cloud using the GridSurfaceCreate function in the LiDAR360 software (www.lidar360.com, accessed on 8 October 2019) [2]. The CHM was subsequently used for individual tree delineation. Based on Hao et al. (2021), the resolution of the CHM was set to 0.1 m, which made it possible to detect tree crowns wider than 0.1 m [11].

2.4. Individual Tree Delineation

The first step in the tree-level LiDAR metrics extraction was the delineation of individual trees from the CHM. It was done with the watershed segmentation algorithm of the LiDAR360 software, using 90 as the buffer size and 0.8 as the crown base height threshold. The algorithm was used as explained in Chen et al. (2006) [28].

The algorithm segmented the CHM into different polygons, polygon boundaries representing the perimeters of individual tree crowns (Figure 3). The maximum elevation of each polygon indicated the treetop. Those LiDAR-detected trees that could be matched with field-measured trees were used to develop models for DBH. Those segmented trees that could not be matched with field-observed trees were either trees that failed to produce local maxima in the CHM, or they were local maxima that did not represent treetops (Figure 3). In total, 79% of the trees were detected with commission errors in 22% of the trees. Out of 4109 field-measured trees, 3815 trees were correctly matched and used in modeling.

2.5. Stand Delineation

The stands were delineated using the method of Sun et al. (2021) [16]. The method employs simulated annealing (SA) to maximize an objective function where the criteria are small within-stand variation in the UAVLS attributes (weight 0.7), sufficient stand area (weight 0.15), and roundish shape of the stands (weight 0.15). The result depends on the weights of the criteria, but the usual aim is to avoid delineations with a high number of very small stands or irregular stand shapes.

In this study, the 95% percentiles of the height distribution of the echoes with 1-m² raster cells were used as input data in the stand delineation (Figure 4). A detailed description of the algorithm and its parameters is available in Sun et al. (2021) [16]. The stand delineation obtained from the SA method explained 80% of the variation in canopy height. The average stand area was 1.2 ha. All trees that were within the same stand received the same values of stand level metrics.

2.6. Calculation of the LiDAR Metrics

The normalized point clouds and the delineations of trees and stands were used to calculate 82 tree- and stand-level LiDAR metrics that were used as potential predictors of DBH (Table 3).

The maximum echo height within the perimeters of an individual tree may be used to predict tree height. The coefficient of determination (R²) for the linear regression between the LiDAR-derived and field-measured height was 0.8834. However, predicted tree heights were not used to predict DBH. The following categories of metrics were computed: density-related metrics (D₁₀, D₂₀, …, D₉₀, and SD in Table 3); canopy volume metrics (CPA); topography-related metrics (Slope and Aspect); and height-related metrics (all other variables listed in Table 3).

2.7. Modeling Methods

Six methods, namely linear model (LM), linear model with ridge regularization (LMR), support vector regression (SVR), k-nearest neighbors imputation (KNN), random forest (RF), and artificial neural networks (ANN), were used for DBH modeling.

Preliminary model fittings showed that H_max^T and CPA (see Table 3 for the explanation of these abbreviations) were among the most important predictors of DBH. H_max^T represents LiDAR-estimated tree height and CPA represents LiDAR-estimated projection area of the tree crown. Since the shape of the relationship between tree DBH and tree height is known to be nonlinear [19], H_max^T was transformed into (H_max^T)² to linearize the relationship. In the same way, since DBH is a one-dimensional tree attribute and CPA is two-dimensional, the square root of CPA was used to linearize the relationship between CPA and DBH. These transformations, which were used as additional predictors compared to Table 3, allowed fair comparisons between the modeling methods, of which some have been developed only for linear relationships (especially LM and LMR).

All algorithms were implemented in the Python ‘sklearn’ (scikt-learn) package [29]. The LM is a parametric method that can be expressed as

{y = a}_{0} {+ a}_{1} x_{1} {+ a}_{2} x_{2} {+ \dots a}_{n} x_{n} + ε

(1)

where a₀ is the intercept, a₁, a₂, …, a_n are the regression coefficients that represent the effects of normalized features, x₁, x₂, …, x_n are normalized explanatory features, y is the DBH of a tree, and

ε

is the error term. In the linear model, the target value is expected to be a linear combination of the features. When the number of predictors increases, the estimates of the LM regression coefficients may be unstable and have high standard errors, especially in small datasets.

The LMR method addresses the harmful influence of high data dimensionality and multicollinearity on the robustness of the linear model by imposing a penalty on large absolute values of the coefficients. The regularization term of LMR is the L2-norm of the coefficient vector, which reduces the impact of redundant variables by shrinking their coefficients while increasing the generalization ability of the linear model [15].

The loss function of LMR can be expressed as

\min \frac{1}{2 n} \sum_{i = 1}^{n} {(y_{i} - {\hat{y}}_{i})}^{2} + λ \sum_{i = 1}^{m} a_{j}^{2}

(2)

where

y_{i}

is the observed value and

{\hat{y}}_{i}

is the predicted value for tree i, a_j is the regression coefficient, λ is the regularization coefficient, n is the number of observations, and m is the number of regression coefficients.

The minimized loss function is the mean of squared errors plus the sum of squared regression coefficients multiplied by a hyper-parameter (

λ

). Regularization restricts model complexity and decreases the likelihood of overfitting. The hyper-parameter defines the importance of aiming at small absolute parameter values. The hyper-parameter was “tuned”, which means that a value resulting in the best fitting statistics in cross-validation was searched.

SVR is a nonparametric machine learning regression algorithm, which is an extension of support vector machines [30]. The loss function of SVR can be expressed as

\min \frac{1}{2} \sum_{i = 1}^{m} a_{j}^{2} + C \sum_{i = 1}^{n} (L_{i} {+ L}_{i}^{*})

(3)

subject to

{f (x}_{i} {) - y}_{i} \leq {ε + L}_{i}

(4)

y_{i} - f (x_{i}) \leq {ε + L}_{i}^{*}

(5)

L_{i} \geq {0, L}_{i}^{*} \geq 0 i = 1, 2, \dots, n

(6)

where x_i is a vector of predictor variables for tree i, a_j is regression coefficient, y_i is the observed DBH and

f (x_{i})

is the predicted DBH,

L_{i}

and

L_{i}^{*}

are the relaxation variables, C is the regularization coefficient, ε is the allowed error, n is the number of observations, and m is the number of predictors. SVR allows predictions to deviate from the measured value by a small amount (ε). Deviations larger than ε are penalized.

SVR transforms nonlinear regression into linear regression by mapping the input data into a high-dimensional feature space through a kernel function [31]. In this study, we used the radial basis function (RBF) as the kernel function of SVR. The radial basis function can be expressed as

K (x, z) {= \exp}^{(- r {∥ x - z ∥}^{2})}

(7)

subject to

r > 0

(8)

where

x

is the vector of predictor variables, z is a high-dimensional feature vector which can be expressed as a new data distribution by mapping

x

into a higher dimensional feature space, and

r

is the parameter of the radial basis function. The larger the parameter r is, the faster the value of the feature decreases.

The advantages of SVR include efficiency in high-dimensional data, robustness, the possibility to control the effect of outliers, and a good generalization ability [32]. The kernel parameter r of the RBF controls the influence of a single observation and establishes a cost function to mitigate the impact of outliers [31]. Two of the hyper-parameters of SVR were tuned in this study: the cost of violating the restrictions (C) and the kernel parameter of the radial basis function (

r

) [33].

RF is a machine learning method that combines the predictions of several classification and regression trees (CART) to improve generalizability and robustness over a single estimator [34]. CART is trained to predict the value of the target variable through simple decision rules inferred from the input data. A simple tree usually consists of a set of constants that are used sequentially to provide the estimate. The prediction of the RF method is the averaged prediction of several CART estimators. Each estimator is generated by taking a bootstrap sample from the training dataset, and randomly selecting a specified number of features as predictive variables [35]. In this study, two hyper-parameters of the RF method were tuned: the number of trees that were used to provide the average estimate (ntree) and the number of predictor variables randomly sampled as candidates at each split (mtry) [29].

ANN is a supervised learning algorithm based on multi-layer perceptron (MLP) networks. ANN has already been widely used in forestry surveys and parameter estimation [36]. The advantages of ANN include the capability to learn nonlinear models, learn in real-time (on-line learning) and the use several neurons to model a nonlinear relationship in parallel [37].

ANN consists of two phases: the learning phase and the prediction phase. The learning phase finds the rules between the input layer and hidden layers, as well as hidden layers and the output layer, by establishing a weight matrix between the neurons. A neuron calculates an input signal value as a weighted linear combination of the predictors (Figure 5). An activation function converts this value to an output signal ranging from 0 to 1. This is the hidden layer value produced by a neuron. Through iteratively modifying the weight matrix, the ANN finds an estimator that minimizes the loss function.

The updated values of the weights are calculated as follows:

w^{'} = w - ƞ \frac{\partial LOSS}{\partial w}

(9)

where

w

is the weight of a predictor,

w^{'}

is the updated weight, ƞ is the learning rate that controls the step size of the updating process, and LOSS is the loss function.

In this study, the loss function minimized the root mean square error. The number of hidden layers was set to two. Three hyper-parameters were tuned: type of the activation function, number of neurons in the hidden layer (size), and the weight decay rate (η). The tested activation functions were logistic function, hyperbolic tangent function, and the rectified linear unit function. The rectified linear function converts negative input values into zero output values and uses a linear relationship between positive input values and the output value.

KNN is a nonparametric imputation method, which is based on the similarity of the LiDAR metrics calculated for a subject tree and field-measured trees. In this study, the Mahalanobis distance was used to calculate the similarity of two trees (a subject tree and its neighbors) [21]. The Mahalanobis distance can be expressed as

D = \sqrt{{(x - y)}^{T} \sum^{- 1} (x - y)}

(10)

where D is the Mahalanobis distance between observed values x and y and

\sum

is the covariance matrix. The KNN method produces a prediction for the subject tree as a weighted average of the field-measured DBH of several most similar neighbors. The weights of the neighbors were based on the inverse distances of the neighbors from the subject tree. One hyper-parameter of the KNN method was tuned: the number of nearest neighbors that were used to compute the prediction (k).

2.8. Feature Selection and Hyper-Parameter Tuning

The feature selection and hyper-parameter tuning consisted of the following steps:

Select the optimal feature subset using the initial (first repetition) or tuned (later repetitions) hyper-parameters and 10-fold cross-validation.
Tune the hyper-parameters by using 10-fold cross-validation and the feature subset found in Step 1.
Repeat Steps 1 and 2 using tuned hyper-parameters in Step 1 until the feature subset and the hyper-parameters stabilize.

Before the model training process, all LiDAR metrics were normalized (centered on the mean and scaled by the standard deviation) and the dataset was divided into a training set (75%) and testing set (25%) randomly.

The recursive feature elimination (RFE) algorithm was used for feature selection [31]. The RFE is a greedy method for finding the optimal feature subset, which is specific for each method [38]. The optimal feature subset size was based on the performance of the model, which was evaluated by the root mean square error (RMSE) of a 10-fold cross-validation scheme. The training dataset was divided into 10 subsets (Figure 6). One sub-set was used for testing and nine were used for training. This was repeated 10 times, using every time a different training and test dataset. The results are the averages calculated for the 10 training and test sets.

RFE was used to evaluate the effect of the number of input features on the model performance. The RFE algorithm is based on feature importance, where the relative importance of each metric is obtained by quantifying the increase in the mean square error of the model after the removal of the variable [34]. The optimal feature subset size was the one that minimized the average RMSE of the cross-validation scheme. The values of the hyper-parameters that were used in the feature selection step were taken from previous studies [8,11,17]: LMR, λ = 0.01; SVR, r = 0.0011, C = 64; RF, ntree = 400, mtry = m/3 (m is the number of LiDAR-derived metrics used in RF); KNN, k = 5; ANN, size = 1; ƞ = 0.6.

After selecting the set of features, a grid-search was used to determine the best values for the hyper-parameters. This was also done by using 10-fold cross-validation; the optimal parameters were those that resulted in the lowest mean RMSE. The hyper-parameter values tested for different algorithms are listed in Table 4.

2.9. Model Validation

The performance of the DBH prediction methods (LM, LMR, SVR, RF, ANN, and KNN) was assessed in terms of coefficient of determination (R²), square root of mean squared error (RMSE, in cm), relative RMSE (rRMSE, %), and mean error (BIAS, in cm) [39]:

R^{2} = 1 - \frac{\sum_{i = 1}^{n} {(y_{i} - {\hat{y}}_{i})}^{2}}{\sum_{i = 1}^{n} {(y_{i} - \bar{y})}^{2}}

(11)

RMSE = \sqrt{\frac{\sum_{i = 1}^{n} {(y_{i} - {\hat{y}}_{i})}^{2}}{n}}

(12)

rRMSE = \frac{RMSE}{\bar{y}}

(13)

RIAS = \frac{\sum_{i = 1}^{n} (y_{i} - {\hat{y}}_{i})}{n}

(14)

where

y_{i}

represents the observed DBH value of the tree i,

{\hat{y}}_{i}

is predicted value of tree i,

\bar{y}

is the observed mean value, and n is the number of observations.

3. Results

3.1. The Best LiDAR Metrics for Predicting DBH

The RMSEs of most of the DBH prediction methods were not much affected by the number of input features after about 8 variables (Figure 7). The average RMSE stabilized after reaching the best accuracy, except LM where a high number of predictors most likely resulted in overfitting, decreasing model performance in independent test data. The LMR method mitigated the overfitting problem of LM by using regularization. The ANN method was also negatively affected by a high number of input variables, becoming unstable beyond the best number of variables. Since ANN was sensitive to hyper-parameters, the overfitting pattern shown in Figure 7 may be explained by the fact that the hyper-parameters were not tuned simultaneously with feature selection. Figure 7 indicates that four to ten predictor variables are needed to predict DBH.

The most important LiDAR-derived metrics for estimating DBH were partly the same for the different methods (Figure 8). H_max^T (maximum point cloud elevation within the crown area of an individual tree) or its transformation (H_max^T)² usually had the greatest relative importance.

The LiDAR metrics used in the LM regression methods differed most from the metrics used in the other methods. The Pearson’s correlation coefficients between the predictor variables that were used in LM were greater than 0.75. The variance inflation factor of LM was greater than 10, which indicates high multicollinearity among the independent variables selected by RFE for the LM.

3.2. Performance of the Prediction Methods

The DBH was generally predicted well in the 10-fold cross-validation (Table 5). The prediction accuracy of LMR was close to the nonparametric machine learning techniques suggesting that the regularization of LMR improved the robustness of the model.

The average rRMSE of the nonparametric machine learning methods, SVR, RF, KNN, and ANN was 14.11%, 14.43%, 12.75%, and 11.75%, respectively, which are smaller than obtained for the LM method. The ANN method had the lowest average of RMSE calculated by cross-validation.

3.3. Performance of the Prediction Methods in Independent Validation Data

To compare the prediction and generalization ability, we calculated the predicted DBHs for the independent validation dataset (25% of the data, see Figure 2). This validation dataset was not used for feature selection, hyper-parameter tuning, and model fitting. In LM and LMR, the minimum predictions were negative, i.e., these algorithms were not able to logically predict the diameters of very small trees (Table 6).

Table 6 indicates that the predicted DBH values of the nonparametric machine learning algorithms were close to the field-measured diameters. However, in RF and ANN, the maximum predicted values were smaller than the largest measured values, which means that the RF and ANN algorithms were not able to properly predict the DBH of the largest trees. When looking at all the statistics in Table 6, it can be concluded that the predictions of SVR best reflect the range and the distribution of the measured breast height diameters.

The LM and RF methods had a wider scatter of the predicted values compared to the other algorithms (Figure 9). In addition, the RF systematically predicted the diameters of the largest trees too small. To some extent, this was also true for the ANN method. Otherwise, the scatterplots of the DBH prediction methods were reasonably close to the 1:1 trend line (red line in Figure 9).

The prediction results of the six different algorithms were assessed for three DBH ranges (Table 7). The rRMSE values were the worst for the DBH range less than 12 cm. Although the ANN provided the best overall estimation results (Table 6), SVR provided the best estimation (smallest RMSE and bias) for the DBH range greater than 30 cm. When looking at all the three diameter ranges, the SVR method had the best prediction accuracy. ANN performed slightly better than LMR and KNN.

4. Discussion

This study compared the performance of six different methods to predict individual tree diameter from UAVLS data. The average point density of the data was approximately 140 pts·m⁻², which was sufficient for delineating the crowns of individual trees. Metrics calculated on two scales (tree and stand) were used to predict individual tree diameter.

Previous studies have shown that the use of random stand effects improves the models for predicting tree height from DBH, or DBH from tree height [2,12]. Since the effect of stand variables on the relationship between DBH and tree height is often systematic, it can also be modeled through fixed model parameters. Previous studies show, for example, that the height of a tree with a given DBH increases with increasing stand age or mean height [19]. For example, Hao et al. (2021) used both tree and plot level LiDAR metrics as predictor variables in a nonlinear mixed-effects model, which also included random site effects [11]. This significantly improved the model compared to the use of fixed tree effects and random site effects. In this study, the set of the potential predictors of individual tree diameter included both tree and stand-level LiDAR metrics, which allowed the relationship between tree height and DBH to change along changing stand conditions.

In the present study, the conventional linear regression model was the poorest of the six tested prediction algorithms. This was partly because the method was unable to address nonlinear relationships, and partly because of the low robustness of the method in the presence of multicollinearity. It might also be that the method used to select the predictors was not ideal for LM because the selected predictors were highly correlated. The lacking ability to model nonlinear relationships was mitigated by using two transformed variables, which linearized the relationship between tree height and DBH, and between crown projection area and DBH.

The transformed variables were selected as predictors also in the other approaches (Figure 8), which suggests that several of the methods analyzed in this study may not be ideal to properly model nonlinear relationships. Common to all methods was that the diameter of medium-sized trees was overestimated and the diameter of the largest trees was underestimated. When the six algorithms analyzed in this study were applied without the transformed variables, the statistics calculated for the independent validation dataset (not shown) were always worse than obtained with transformed variables. The tendency of overestimating the DBH of medium-sized trees and underestimating the DBH of large trees was stronger when transformed variables were not used. On the other hand, the cross-validation results calculated for the training dataset did not always improve when transformed variables were used as additional predictors.

The linear model with regularization (LMR) resulted in a robust model, mitigating the overfitting problem of the LM (Figure 7). The performance statistics calculated from the independent validation dataset were also better for LMR than LM. They were also better than obtained for the random forest method and competitive with the ANN and KNN methods.

The nonparametric approaches do not require a priori specification of the functional form of the relationship between the predictor variables and the response variable [40]. Of the nonparametric machine learning algorithms, ANN, KNN, and SVR had the smallest average RMSE in the 10-fold cross-validation based on the training dataset. However, KNN and ANN underestimated the diameters of the largest trees, which is a serious problem in the calculation of stand volume or biomass, for example. The ANN and SVR showed good performance in the 10-fold cross-validation, which is in line with previous studies [17]. However, ANN had also a serious shortcoming problem as it could be weak in predicting the DBHs of the very largest trees. The time needed to tune the hyper-parameters was much longer for ANN than the other methods.

KNN differs from the other methods in such a way that the predicted diameter is obtained as the mean of the field-measured diameters of a few trees that are similar to it in terms of LiDAR metrics. Therefore, the method needs a database of field-measured diameters when it is used in prediction, which may restrict its usability. In addition, the method cannot predict diameters larger than the largest field-measured diameters, making it difficult to apply the model in a new area, or a few years after field-data collection. The advantage of the method is that several tree-level variables can be imputed to the LiDAR-detected trees without the need to develop separate models for all variables.

The RF algorithm had the poorest performance of the six tested algorithms according to the statistics calculated for the independent validation dataset. A serious shortcoming of the method was its inability to predict small and large tree diameters. The problem may be related to the number of decision nodes used in the decision trees of the RF method. Adding more nodes to classify the large trees in more fine-grained DBH classes would most probably mitigate the problem.

The use of the RFE method for variable selection reduced the number of predictors for reducing the complexity of the model. The RFE method seldom selected intensity-related LiDAR metrics as predictor variables. Only ANN used a few intensity-related metrics to predict DBH, which however had lower relative importance than the height-related metrics.

All of the nonparametric machine learning methods identified the maximum value of the individual tree point heights (H_max^T) or its transformation as the most important predictor variable for modeling DBH, which is logical as the H_max^T represents the total height of trees. The result is in line with previous studies [17].

Our study indicated that the relative prediction accuracy was the lowest for small trees. The trees with small DBH values usually represent young stands. Young stands are often dense and the tree crowns form a rather continuous surface, which may cause problems in individual tree detection. The difficulty in predicting the diameters of small trees agrees with the conclusions of previous studies [11].

Failures to detect trees or interpreting tree groups as one tree are most probably common when the forest stand is dense [41]. Previous studies have analyzed the influence of individual tree segmentation methods on the estimation of vegetation characteristics with LiDAR data. It has been found that dense young forests are the most prone to segmentation errors, affecting the overall accuracy of DBH estimation [11,42]. In young stands, there may be many overlapping crowns and the crowns often form a rather uniform canopy surface. This makes it difficult for the watershed segmentation algorithm to detect individual tree crowns. Using a higher value for the crown base height threshold might mitigate the problem to some extent, but it would not eliminate it. Therefore, better individual tree delineation approaches are called for in dense young forests.

The results of the present study were hardly affected by the individual tree segmentation errors because only matched trees were used in the analyses. However, when individual trees detected from LiDAR data are used in forest planning, which seems to be an increasing trend, non-matched trees cannot be omitted. There are already several studies on optimizing forest management at the tree level based on individual tree detection and airborne laser scanning data [15,43,44].

Due to the difficulties to detect all trees of all stands, there is a need to develop new types of methods for tree-level planning, such as the hybrid method discussed in Sun et al. (2022). The data for such methods can be produced by the combined use individual tree delineation (ITD) and area-based approaches (ABA), where the latter produces a complete diameter distribution of trees, of which a part can be detected individually [45]. The idea of the hybrid method for management optimization is to use a different decision rule for those trees that are detected individually and trees that are known only via the diameter distribution.

Using individual trees in forest planning can use also the information on tree locations. Knowing the tree locations would make it possible to use distance-dependent growth models, analyze spatial problems, or optimize the locations of harvest roads, for example.

Although LiDAR has been recognized as a reliable technology to provide detailed data from forests in three dimensions, it has a restricted spectral resolution, generally covering only a single spectral range in the near-infrared region [4]. Therefore, future studies should focus on integrating multi-source remote sensing data for improved prediction of tree diameters and other characteristics of trees and stands. Integrated use of multi-source information can lead to improved prediction accuracy, especially for mixed forests, where the tree species of each detected tree needs to be predicted.

5. Conclusions

The study showed that several nonparametric methods can be used to develop models for predicting individual tree diameter from LiDAR data. In addition, regularization methods can be used to increase the robustness of the ordinary linear regression model (LM) and decrease the overfitting problem of this model in the case of a high number of mutually correlated potential predictors. All the methods tested as alternatives for the ordinary linear regression model performed better than the LM. However, the improvements were not drastic, most probably because of the strong correlation between LiDAR-estimated canopy height and tree diameter. In addition, all the six prediction methods had the drawback that the diameters of the largest trees were underestimated, which might be because of the methods’ inability to properly model nonlinear relationships. This shortcoming could be at least partly overcome by using different transformations of the potential predictors. The underestimation of large diameters was the most severe with the random forest method. The support vector regression was evaluated to be the best method for predicting individual tree diameters from LiDAR data. The performances of the linear model with ridge regularization, the k-nearest neighbors imputation, and the artificial neural network method were close to each other. The benefits of the methods analyzed in this study might be larger for predicted variables other than tree diameter, for instance, crown length, crown volume, or tree biomass. The benefits of the methods tested in this study can be expected to be the highest in the case of little prior knowledge on the relationships between the variables, a high number of potential predictors, and strong correlations between the potential predictors.

Author Contributions

Conceptualization, T.P.; methodology, T.P. and Y.S.; formal analysis, T.P. and Y.S.; data curation, X.J. and F.L.; writing—original draft preparation, T.P. and Y.S.; writing—review and editing, X.J. and F.L.; supervision, X.J. and F.L.; project administration, X.J.; funding acquisition, F.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by Natural Science Foundation of China (U21A20244) and (32071758) and the Fundamental Research Funds for the Central Universities of China (No. 2572020BA01).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

The authors would like to thank the faculty and students of the Department of Forest Management, Northeast Forestry University (NEFU), China, who collected and provided the data for this study.

Conflicts of Interest

The authors declare no conflict of interest.

References

Pan, Y.; Birdsey, R.A.; Fang, J.; Houghton, R.; Kauppi, P.E.; Kurz, W.A.; Phillips, O.L.; Shvidenko, A.; Lewis, S.L.; Canadell, J.G.; et al. A Large and Persistent Carbon Sink in the World’s Forests. Science 2011, 333, 988–993. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Leite, R.V.; Silva, C.A.; Mohan, M.; Cardil, A.; de Almeida, D.R.A.; Carvalho, S.; Jaafar, W.S.W.M.; Guerra-Hernández, J.; Weiskittel, A.; Hudak, A.T.; et al. Individual Tree Attribute Estimation and Uniformity Assessment in Fast-Growing Eucalyptus spp. Forest Plantations Using Lidar and Linear Mixed-Effects Models. Remote Sens. 2020, 12, 3599. [Google Scholar] [CrossRef]
Kotivuori, E.; Kukkonen, M.; Mehtätalo, L.; Maltamo, M.; Korhonen, L.; Packalen, P. Forest inventories for small areas using drone imagery without in-situ field measurements. Remote Sens. Environ. 2020, 237, 111404. [Google Scholar] [CrossRef]
Lu, D.; Chen, Q.; Wang, G.; Liu, L.; Li, G.; Moran, E. A survey of remote sensing-based aboveground biomass estimation methods in forest ecosystems. Int. J. Digit. Earth 2016, 9, 63–105. [Google Scholar] [CrossRef]
Næsset, E. Predicting forest stand characteristics with airborne scanning laser using a practical two-stage procedure and field data. Remote Sens. Environ. 2002, 80, 88–99. [Google Scholar] [CrossRef]
Yu, X.; Hyyppä, J.; Holopainen, M.; Vastaranta, M. Comparison of Area-Based and Individual Tree-Based Methods for Predicting Plot-Level Forest Attributes. Remote Sens. 2010, 2, 1481–1495. [Google Scholar] [CrossRef] [Green Version]
Guo, Q.; Su, Y.; Hu, T.; Zhao, X.; Wu, F.; Li, Y.; Liu, J.; Chen, L.; Xu, G.; Lin, G.; et al. An integrated UAV-borne lidar system for 3D habitat mapping in three forest ecosystems across China. Int. J. Remote Sens. 2017, 38, 2954–2972. [Google Scholar] [CrossRef]
Liu, K.; Shen, X.; Cao, L.; Wang, G.; Cao, F. Estimating forest structural attributes using UAV-LiDAR data in Ginkgo plantations. ISPRS J. Photogramm. Remote Sens. 2018, 146, 465–482. [Google Scholar] [CrossRef]
Hao, Y.; Widagdo, F.R.A.; Liu, X.; Quan, Y.; Liu, Z.; Dong, L.; Li, F. Estimation and calibration of stem diameter distribution using UAV laser scanning data: A case study for larch (Larix olgensis) forests in Northeast China. Remote Sens. Environ. 2022, 268, 112769. [Google Scholar] [CrossRef]
Navarro, A.; Young, M.; Allan, B.; Carnell, P.; Macreadie, P.; Ierodiaconou, D. The application of Unmanned Aerial Vehicles (UAVs) to estimate above-ground biomass of mangrove ecosystems. Remote Sens. Environ. 2020, 242, 111747. [Google Scholar] [CrossRef]
Hao, Y.; Widagdo, F.R.A.; Liu, X.; Quan, Y.; Dong, L.; Li, F. Individual Tree Diameter Estimation in Small-Scale Forest Inventory Using UAV Laser Scanning. Remote Sens. 2021, 13, 24. [Google Scholar] [CrossRef]
Fu, L.; Duan, G.; Ye, Q.; Meng, X.; Luo, P.; Sharma, R.P.; Sun, H.; Wang, G.; Liu, Q. Prediction of Individual Tree Diameter Using a Nonlinear Mixed-Effects Modeling Approach and Airborne LiDAR Data. Remote Sens. 2020, 12, 1066. [Google Scholar] [CrossRef] [Green Version]
Xu, Q.; Li, B.; Maltamo, M.; Tokola, T.; Hou, Z. Predicting tree diameter using allometry described by non-parametric locally-estimated copulas from tree dimensions derived from airborne laser scanning. For. Ecol. Manag. 2019, 434, 205–212. [Google Scholar] [CrossRef]
Pascual, A. Multi-objective forest planning at tree-level combining mixed integer programming and airborne laser scanning. For. Ecol. Manag. 2021, 483, 118714. [Google Scholar] [CrossRef]
Packalen, P.; Pukkala, T.; Pascual, A. Combining spatial and economic criteria in tree-level harvest planning. For. Ecosyst. 2020, 7. [Google Scholar] [CrossRef] [Green Version]
Sun, Y.; Wang, W.; Pukkala, T.; Jin, X. Stand delineation based on laser scanning data and simulated annealing. Eur. J. For. Res. 2021, 140, 1065–1080. [Google Scholar] [CrossRef]
Corte, A.P.D.; Souza, D.V.; Rex, F.E.; Sanquetta, C.R.; Mohan, M.; Silva, C.A.; Zambrano, A.M.A.; Prata, G.; Alves de Almeida, D.R.; Trautenmüller, J.W.; et al. Forest inventory with high-density UAV-Lidar: Machine learning approaches for predicting individual tree attributes. Comput. Electron. Agric. 2020, 179, 105815. [Google Scholar] [CrossRef]
Dalla Corte, A.P.; Rex, F.E.; Almeida, D.R.A.d.; Sanquetta, C.R.; Silva, C.A.; Moura, M.M.; Wilkinson, B.; Zambrano, A.M.A.; da Cunha Neto, E.M.; Veras, H.F.P.; et al. Measuring Individual Tree Diameter and Height Using GatorEye High-Density UAV-Lidar in an Integrated Crop-Livestock-Forest System. Remote Sens. 2020, 12, 863. [Google Scholar] [CrossRef] [Green Version]
Dong, L.; Pukkala, T.; Li, F.; Jin, X. Developing distance-dependent growth models from irregularly measured sample plot data—A case for Larix olgensis in Northeast China. For. Ecol. Manag. 2021, 486, 118965. [Google Scholar] [CrossRef]
Osborne, J.; Waters, E. Four Assumptions of Multiple Regression That Researchers Should Always Test. Pract. Assess. Res. Eval. 2002, 8, 1. [Google Scholar] [CrossRef]
Gao, Y.; Lu, D.; Li, G.; Wang, G.; Chen, Q.; Liu, L.; Li, D. Comparative Analysis of Modeling Algorithms for Forest Aboveground Biomass Estimation in a Subtropical Region. Remote Sens. 2018, 10, 627. [Google Scholar] [CrossRef] [Green Version]
Duzan, H.; Shariff, N.S.B.M. Ridge Regression for Solving the Multicollinearity Problem: Review of Methods and Models. J. Appl. Sci. 2015, 15, 392–404. [Google Scholar] [CrossRef] [Green Version]
Pascual, A.; Bravo, F.; Ordoñez, C. Assessing the robustness of variable selection methods when accounting for co-registration errors in the estimation of forest biophysical and ecological attributes. Ecol. Modell. 2019, 403, 11–19. [Google Scholar] [CrossRef]
Brede, B.; Calders, K.; Lau, A.; Raumonen, P.; Bartholomeus, H.M.; Herold, M.; Kooistra, L. Non-destructive tree volume estimation through quantitative structure modelling: Comparing UAV laser scanning with terrestrial LIDAR. Remote Sens. Environ. 2019, 233, 111355. [Google Scholar] [CrossRef]
Yu, X.; Hyyppä, J.; Vastaranta, M.; Holopainen, M.; Viitala, R. Predicting individual tree attributes from airborne laser point clouds based on the random forests technique. ISPRS J. Photogramm. Remote Sens. 2011, 66, 28–37. [Google Scholar] [CrossRef]
Zhang, W.; Qi, J.; Peng, W.; Wang, H.; Xie, D.; Wang, X.; Yan, G. An Easy-to-Use Airborne LiDAR Data Filtering Method Based on Cloth Simulation. Remote Sens. 2016, 8, 501. [Google Scholar] [CrossRef]
Guo, Q.; Li, W.; Yu, H.; Alvarez, O. Effects of Topographic Variability and Lidar Sampling Density on Several DEM Interpolation Methods. Photogramm. Eng. Remote Sens. 2010, 76, 701–712. [Google Scholar] [CrossRef] [Green Version]
Chen, Q.; Baldocchi, D.; Gong, P.; Kelly, M. Isolating Individual Trees in a Savanna Woodland using Small Footprint LIDAR data. Photogramm. Eng. Remote Sens. 2006, 72, 923–932. [Google Scholar] [CrossRef] [Green Version]
Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V.; et al. Scikit-learn: Machine Learning in Python. J. Mach. Learn. Res. 2011, 12, 2825–2830. [Google Scholar]
Jiang, H.; Huang, K.; Zhang, R. Field Support Vector Regression. In Proceedings of the Neural Information Processing; Liu, D., Xie, S., Li, Y., Zhao, D., El-Alfy, E.-S.M., Eds.; Springer International Publishing: Cham, Switzerland, 2017; pp. 699–708. [Google Scholar]
de Almeida, C.T.; Galvão, L.S.; Aragão, L.E.D.O.C.E.; Ometto, J.P.H.B.; Jacon, A.D.; de Souza Pereira, F.R.; Sato, L.Y.; Lopes, A.P.; de Alencastro Graça, P.M.L.; de Jesus Silva, C.V.; et al. Combining LiDAR and hyperspectral data for aboveground biomass modeling in the Brazilian Amazon using different regression algorithms. Remote Sens. Environ. 2019, 232, 111323. [Google Scholar] [CrossRef]
Monnet, J.-M.; Chanussot, J.; Berger, F. Support Vector Regression for the Estimation of Forest Stand Parameters Using Airborne Laser Scanning. IEEE Geosci. Remote Sens. Lett. 2011, 8, 580–584. [Google Scholar] [CrossRef] [Green Version]
Cherkassky, V.; Ma, Y. Practical selection of SVM parameters and noise estimation for SVM regression. Neural Netw. 2004, 17, 113–126. [Google Scholar] [CrossRef] [Green Version]
Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef] [Green Version]
Belgiu, M.; Drăguţ, L. Random forest in remote sensing: A review of applications and future directions. ISPRS J. Photogramm. Remote Sens. 2016, 114, 24–31. [Google Scholar] [CrossRef]
McRoberts, R.E.; Næsset, E.; Gobakken, T. Optimizing the k-Nearest Neighbors technique for estimating forest aboveground biomass using airborne laser scanning data. Remote Sens. Environ. 2015, 163, 13–22. [Google Scholar] [CrossRef]
Morera, A.; Martínez de Aragón, J.; Bonet, J.A.; Liang, J.; de-Miguel, S. Performance of statistical and machine learning-based methods for predicting biogeographical patterns of fungal productivity in forest ecosystems. For. Ecosyst. 2021, 8, 21. [Google Scholar] [CrossRef]
Guyon, I.; Weston, J.; Barnhill, S.; Vapnik, V. Gene Selection for Cancer Classification using Support Vector Machines. Mach. Learn. 2002, 46, 389–422. [Google Scholar] [CrossRef]
Zhang, Z.; Cao, L.; She, G. Estimating Forest Structural Parameters Using Canopy Metrics Derived from Airborne LiDAR Data in Subtropical Forests. Remote Sens. 2017, 9, 940. [Google Scholar] [CrossRef] [Green Version]
Fehrmann, L.; Lehtonen, A.; Kleinn, C.; Tomppo, E. Comparison of linear and mixed-effect regression models and a k-nearest neighbour approach for estimation of single-tree biomass. Can. J. For. Res. 2008, 38, 1–9. [Google Scholar] [CrossRef]
Yang, Z.; Liu, Q.; Luo, P.; Ye, Q.; Duan, G.; Sharma, R.P.; Zhang, H.; Wang, G.; Fu, L. Prediction of Individual Tree Diameter and Height to Crown Base Using Nonlinear Simultaneous Regression and Airborne LiDAR Data. Remote Sens. 2020, 12, 2238. [Google Scholar] [CrossRef]
Wang, Y.; Hyyppä, J.; Liang, X.; Kaartinen, H.; Yu, X.; Lindberg, E.; Holmgren, J.; Qin, Y.; Mallet, C.; Ferraz, A.; et al. International Benchmarking of the Individual Tree Detection Methods for Modeling 3-D Canopy Structure for Silviculture and Forest Ecology Using Airborne Laser Scanning. IEEE Trans. Geosci. Remote Sens. 2016, 54, 5011–5027. [Google Scholar] [CrossRef] [Green Version]
Pascual, A. Building Pareto Frontiers under tree-level forest planning using airborne laser scanning, growth models and spatial optimization. For. Policy Econ. 2021, 128, 102475. [Google Scholar] [CrossRef]
Fransson, P.; Franklin, O.; Lindroos, O.; Nilsson, U.; Brännström, Å. A simulation-based approach to a near-optimal thinning strategy: Allowing harvesting times to be determined for individual trees. Can. J. For. Res. 2020, 50, 320–331. [Google Scholar] [CrossRef]
Vastaranta, M.; Kankare, V.; Holopainen, M.; Yu, X.; Hyyppä, J.; Hyyppä, H. Combination of individual tree detection and area-based approach in imputation of forest variables using airborne laser data. ISPRS J. Photogramm. Remote Sens. 2012, 67, 73–79. [Google Scholar] [CrossRef]

Figure 1. Map of the Mengjiagang forest farm in Heilongjiang province, northeast China, showing the location of the study area.

Figure 2. An overview of the workflow for DBH estimation by different methods using UAVLS data.

Figure 3. Examples of individual tree delineation. Polygon boundaries represent individual tree crowns. White cells represent no data cells.

Figure 4. Stand delineation overlaid with the 95% percentile of the height distribution of the echoes (1 km × 1.5 km). A lighter tone indicates a higher value of the attribute. Red points represent sample plots and yellow lines are stand boundaries.

Figure 5. An overview of the artificial neural network (ANN) method. Symbol x represents an input feature, W_ij⁽¹⁾ are the weights of predictors (normalized LiDAR metrics) to calculate intermediate values, a_j is the intermediate value (hidden layer value) of neuron j, obtained from the activation function, W_j⁽²⁾ are the weights of the neurons in the calculation of the prediction, and y is the predicted DBH value. In this ANN, there is one hidden layer.

Figure 6. The principle of the 10-fold cross-validation. The dataset is divided into 10 parts, of which nine are used for training and one for validation. The subset that is used for validation is different in each of the 10 repeated model fittings and validations.

Figure 7. Impact of the number of input features on the average of cross-validated RMSE for different prediction algorithms.

Figure 8. The relative importance of the 10 most important metrics for each DBH prediction method. LM used only used seven metrics to predict the DBH of individual trees. A description of the LiDAR-derived metrics is given in Table 3.

Figure 9. The predicted vs. field-measured diameter at breast height (DBH) in the independent validation data for six different prediction algorithms. The red line is the 1:1 trend line.

Table 1. Descriptive statistics of forest measurements.

Variable	Mean	Std Dev	Minimum	Maximum
DBH (cm)	17.39	7.80	5.00	38.80
Total height (m)	15.61	6.69	5.20	32.70

Note: Std Dev = Standard deviation.

Table 2. Details of the operational parameters for the UAVLS data collection.

Variable	Value
Laser pulse repetition rate	380 kHz
Accuracy	5 mm
Maximum echo number	5
Maximum scan speed	200 scans/second
Echo signal intensity	16 bit
Laser wavelength	1550 nm
Beam divergence	0.5 mrad

Table 3. Summary of the tree- and stand-level metrics derived from UAVLS data that were used for DBH estimation.

Abbreviation	Description
Tree level metrics
H₅^T, H₁₀^T, H₂₀^T, H₂₅^T, H₃₀^T, H₄₀^T, H₅₀^T, H₆₀^T, H₇₅^T, H₈₀^T, H₉₀^T, H₉₅^T, H₉₉^T	Percentiles of the height distributions of the echoes within the perimeters of individual trees (5th, 10th, …, 95th, and 99th).
H_max^T, H_min^T, H_mean^T, H_med^T	Maximum, minimum, mean, and medium of the echo heights within the perimeters of individual trees.
H_var^T, H_std^T, H_cv^T, H_ske^T, H_kur^T	Variance, standard deviation, coefficient of variation, skewness, and kurtosis of the echo heights within the perimeters of individual trees.
H_IQ^T	H₇₅^T–H₂₅^T
AIH₅^T, AIH₁₀^T, AIH₂₀^T, AIH₂₅^T, AIH₃₀^T, AIH₄₀^T, AIH₅₀^T, AIH₆₀^T, AIH₇₅^T, AIH₈₀^T, AIH₉₀^T, AIH₉₅^T, AIH₉₉^T	Percentiles of accumulated echo heights within individual tree perimeters (5th, 10th, …, 95th, and 99th).
RCH_max, RCH_mean	The ratios of a target tree’s height to the maximum and mean tree height of the stand.
D₁₀^T, D₂₀^T, D₃₀^T, D₄₀^T, D₅₀^T,D₆₀^T, D₇₀^T, D₈₀^T, D₉₀^T	The proportion of points above the quantiles (10th, 20th, …, 80th, and 90th) of the total number of points within the perimeters of a tree.
CPA	The projected area of the tree crown.
Stand level metrics
H₅^S, H₁₀^S, H₂₀^S, H₂₅^S, H₃₀^S, H₄₀^S, H₅₀^S, H₆₀^S, H₇₅^S, H₈₀^S, H₉₀^S, H₉₅^S, H₉₉^S	Percentiles of the echo height distributions (5th, 10th, …, 95th, and 99th) within a stand.
H_max^S, H_min^S, H_mean^S, H_med^S	Maximum, minimum, mean and medium values of the echo heights within a stand.
H_var^S, H_std^S, H_cv^S, H_ske^S, H_kur^S	Variance, standard deviation, coefficient of variation, skewness, and kurtosis of the echo heights within a stand.
H_IQ^S	H₇₅^S–H₂₅^S
AIH₅^S, AIH₁₀^S, AIH₂₀^S, AIH₂₅^S, AIH₃₀^S, AIH₄₀^S, AIH₅₀^S, AIH₆₀^S, AIH₇₅^S, AIH₈₀^S, AIH₉₀^S, AIH₉₅^S, AIH₉₉^S	Percentiles of accumulated echo heights (5th, 10th, …, 95th, and 99th) within a stand.
D₁₀^S, D₂₀^S, D₃₀^S, D₄₀^S, D₅₀^S, D₆₀^S, D₇₀^S, D₈₀^S, D₉₀^S	The proportion of points above the quantiles (10th, 20th, …, 80th, and 90th) of the total number of points within a stand.
SD	Stem density measured from individual tree segmentation results within a stand.
Slope	The average slope of the stand (degrees).
Aspect	The aspect of the stand, calculated as the azimuth of an inclined plane (degrees).

Table 4. Tested hyper-parameters for each machine learning algorithm.

Model	Hyper-Parameter Values
Linear Model (LM)	-
Linear Model with Ridge Regularization (LMR)	λ = 0.0001, 0.003, 0.001, 0.003, 0.01, 0.03, 0.1
Random Forest (RF)	ntree = 50, 100, 150,…, 300 mtry = n × m (n = 0.1, 0.2, 0.3,…, 0.9)
Support Vector Regression (SVR)	C = 1, 3, 6, 10, 30, 60, 100 r = 0.0001, 0.003, 0.001, 0.003, 0.01, 0.03, 0.1
K-Nearest Neighbors (KNN)	k = 1, 2, 3,…, 10
Artificial Neural Networks (ANN)	activation function: {logistic, tanh, relu} size = 1, 2, 3,…, 15 η = 0.1, 0.2, 0.3, …, 1

Note: m is the number of LiDAR-derived metrics used in RF, logistic is the logistic sigmoid function, tanh is the hyperbolic tangent function, and relu is the rectified linear unit function.

Table 5. Summary of hyper-parameter tuning and accuracy assessment results in cross-validation for different regression algorithms.

Model	Hyper-Parameter Values	Statistics	10-Fold Cross-Validation
Model	Hyper-Parameter Values	Statistics	R²	RMSE (cm)	rRMSE (%)
LM	-	Mean	0.82	2.56	15.59
LM	-	Std Dev	0.01	0.15	0.93
LMR	λ = 0.003	Mean	0.83	2.09	14.15
LMR	λ = 0.003	Std Dev	0.01	0.16	1.00
SVR	C = 30 r = 0.001	Mean	0.84	1.98	14.11
SVR	C = 30 r = 0.001	Std Dev	0.01	0.17	1.02
RF	ntree = 100 mtry = 6	Mean	0.81	2.36	14.43
RF	ntree = 100 mtry = 6	Std Dev	0.01	0.12	0.09
KNN	k = 9	Mean	0.83	2.09	12.75
KNN	k = 9	Std Dev	0.02	0.16	0.99
ANN	size = 10 ƞ = 0.8 activation function = logistic	Mean	0.84	1.92	11.75
ANN	size = 10 ƞ = 0.8 activation function = logistic	Std Dev	0.02	0.16	1.00

Note: Mean is the average value in the 10-fold cross-validation scheme and Std Dev is the standard deviation of the 10 cross-validation values.

Table 6. Summary of statistical results of the predicted DBHs for the independent validation dataset calculated using six different methods.

Models	Std Dev	Minimum	Maximum	Range	RMSE	rRMSE	BIAS
LM	7.66	−0.84	38.91	39.76	2.68	15.76	−0.01
LMR	7.70	−1.38	38.90	39.26	2.25	13.23	−0.04
SVR	7.67	3.82	36.37	32.54	2.22	13.06	0.02
RF	7.36	6.91	28.68	21.77	2.53	14.88	0.02
KNN	7.54	5.53	34.51	28.97	2.19	12.80	−0.06
ANN	7.71	5.20	32.57	27.37	2.11	12.41	−0.01
Field-measured DBH	7.98	4.50	37.60	33.10	-	-	-

Note: Std Dev = Standard deviation.

Table 7. Summary of accuracy assessment results for the validation dataset for different DBH ranges. Positive bias indicates underestimation.

Model	Statistics	DBH Ranges (cm)
Model	Statistics	<12	12–23	>23
LM	BIAS	−0.44	−0.50	1.28
	RMSE	2.28	2.28	2.75
	rRMSE	27.42	12.62	10.22
LMR	BIAS	−0.37	−0.55	1.18
	RMSE	2.22	2.24	2.54
	rRMSE	26.32	12.39	9.46
SVR	BIAS	−0.37	−0.53	1.05
	RMSE	2.12	2.18	2.48
	rRMSE	25.59	12.01	9.18
RF	BIAS	−0.63	−0.48	1.59
	RMSE	2.41	2.37	2.94
	rRMSE	28.88	13.09	10.94
KNN	BIAS	−0.39	−0.74	1.19
	RMSE	1.88	2.28	2.62
	rRMSE	22.59	12.57	9.74
ANN	BIAS	−0.01	−0.48	1.39
	RMSE	1.92	2.19	2.64
	rRMSE	23.18	12.11	9.75

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Sun, Y.; Jin, X.; Pukkala, T.; Li, F. Predicting Individual Tree Diameter of Larch (Larix olgensis) from UAV-LiDAR Data Using Six Different Algorithms. Remote Sens. 2022, 14, 1125. https://doi.org/10.3390/rs14051125

AMA Style

Sun Y, Jin X, Pukkala T, Li F. Predicting Individual Tree Diameter of Larch (Larix olgensis) from UAV-LiDAR Data Using Six Different Algorithms. Remote Sensing. 2022; 14(5):1125. https://doi.org/10.3390/rs14051125

Chicago/Turabian Style

Sun, Yusen, Xingji Jin, Timo Pukkala, and Fengri Li. 2022. "Predicting Individual Tree Diameter of Larch (Larix olgensis) from UAV-LiDAR Data Using Six Different Algorithms" Remote Sensing 14, no. 5: 1125. https://doi.org/10.3390/rs14051125

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Predicting Individual Tree Diameter of Larch (Larix olgensis) from UAV-LiDAR Data Using Six Different Algorithms

Abstract

1. Introduction

2. Materials and Methods

2.1. Study Sites and Field Data

2.2. UAVLS Data

2.3. UAVLS Data Pre-Processing

2.4. Individual Tree Delineation

2.5. Stand Delineation

2.6. Calculation of the LiDAR Metrics

2.7. Modeling Methods

2.8. Feature Selection and Hyper-Parameter Tuning

2.9. Model Validation

3. Results

3.1. The Best LiDAR Metrics for Predicting DBH

3.2. Performance of the Prediction Methods

3.3. Performance of the Prediction Methods in Independent Validation Data

4. Discussion

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI