A Comparison of Four Methods for Automatic Delineation of Tree Stands from Grids of LiDAR Metrics

Sun, Yusen; Jin, Xingji; Pukkala, Timo; Li, Fengri

doi:10.3390/rs14246192

Open AccessArticle

A Comparison of Four Methods for Automatic Delineation of Tree Stands from Grids of LiDAR Metrics

by

Yusen Sun

¹,

Xingji Jin

^1,*,†,

Timo Pukkala

^1,2,†

and

Fengri Li

¹

Key Laboratory of Sustainable Forest Ecosystem Management, Ministry of Education, School of Forestry, Northeast Forestry University, Harbin 150040, China

²

School of Forest Sciences, University of Eastern Finland, P.O. Box 111, 80101 Joensuu, Finland

^*

Author to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Remote Sens. 2022, 14(24), 6192; https://doi.org/10.3390/rs14246192

Submission received: 19 October 2022 / Revised: 26 November 2022 / Accepted: 4 December 2022 / Published: 7 December 2022

(This article belongs to the Topic Challenges, Development and Frontiers of Smart Agriculture and Forestry)

Download

Browse Figures

Review Reports Versions Notes

Abstract

Increased use of laser scanning in forest inventories is leading to the adoption and development of automated stand delineation methods. The most common categories of these methods are region merging and region growing. However, recent literature proposes alternative methods that are based on the ideas of cellular automata, self-organizing maps, and combinatorial optimization. The studies where these methods have been described suggest that the new methods are potential options for the automated segmentation of a forest into homogeneous stands. However, no studies are available that compare the new methods to each other and to the traditional region-merging and region-growing algorithms. This study provided a detailed comparison of four methods using LiDAR metrics calculated for grids of 5 m by 5 m raster cells as the data. The tested segmentation methods were region growing (RG), cellular automaton (CA), self-organizing map (SOM), and simulated annealing (SA), which is a heuristic algorithm developed for combinatorial optimization. The case study area was located in the Heilongjiang province of northeast China. The LiDAR data were collected from an unmanned aerial vehicle for three 1500-ha test areas. The proportion of variation in the LiDAR metrics that was explained by the segmentation was mostly the best for the SA method. The RG method produced more heterogeneous segments than the other methods. The CA method resulted in the smallest number of segments and the largest average segment area. The proportion of small segments (smaller than 0.3 ha) was the highest in the RG method while the SA method always produced the fewest small stands. The shapes of the segments were the best (most circular) for the CA and SA methods, but the shape metrics were good for all methods. The results of the study suggest that CA, SOM, and SA may all outperform RG in automated stand delineation.

Keywords:

region growing; cellular automaton; self-organizing map; simulated annealing; LiDAR data; unmanned aerial vehicle; laser scanning; forest segmentation

1. Introduction

Forest stands are homogeneous subareas of the forest, which often serve as the basic unit for field surveys, yield prediction, management planning, and implementation of management prescriptions. The use of automatic segmentation methods in the delineation of forest stands is increasing [1,2,3,4]. For example, in Finland, almost all stand delineations are currently based on fully automated or semi-automatic methods that employ LiDAR scanning data either alone or in combination with digital aerial photographs [2].

The main reason for this trend might be the increased availability of laser scanning data. There are already several methods available that can use these data to automatically delineate forest stands [1,2,3,4]. The traditional manual stand delineation by forest managers is time-consuming and subjective. Delineations by different forest managers may differ greatly.

Increased use of LiDAR data in forest inventories has accelerated the shift from manual stand lineation to automated methods. Besides segmenting the forest into stands, LiDAR data may also be used for stratification and in the process of imputing or predicting stand variables for the segments [5,6]. In Finland, the field data are imputed, besides segments, also for 16 m by 16 m raster cells. Therefore, the site and growing stock variables of the raster cells may also be used in automated stand delineation [7,8,9,10].

Many methods for automated stand delineation have been suggested in forestry literature [4,10,11,12]. The methods that have been used most commonly belong to the categories of region merging and region growing. The idea of region merging is to join adjacent spatial units if they are similar in terms of LiDAR metrics or other variables that are available for assessing similarity. The initial spatial units may be raster cells, so-called nano-segments [13], small micro-segments [6], or Voronoi polygons that correspond to the growing spaces or crown areas of individual trees [14].

In region-growing, the first step is usually to identify homogeneous areas from the forest, which are subsequently used as seeds in the region-growing process. Adjacent raster cells are joined to the seeds in such a way that the within-segment variation increases as little as possible [15].

Region merging and region growing are one-directional methods in the sense that the segments can only enlarge. Recent literature presents a few alternative approaches in which the segmentation process is more flexible, and the segments can shrink or enlarge, or even disappear. One of these methods is the cellular automaton where each cell of a grid is joined to one of its adjacent segments for many iterations [3,8,16]. The initial segments may be, for instance, rectangular areas. When the cellular automaton algorithm proceeds, the borders of the segments will move toward existing boundaries in the forest landscape.

Another approach suggested in recent literature is to use existing algorithms for combinatorial optimization for the delineation of forest stands [4]. The rationale behind this suggestion is the realization that finding the best stand number for a grid of raster cells is a combinatorial optimization problem. Therefore, simulated annealing, genetic algorithm, and other methods for combinatorial problems can also be used in automatic stand delineation [17,18].

The third new method that was recently adapted to stand delineation is the self-organizing map developed by Kohonen (1982) [19]. This method consists of separate training and classification steps. The training step creates a set of different combinations of values of those variables that are used in segmentation. These combinations are called neurons. In the training phase of the method, the neurons “learn” from the data. At the end of the training step, the set of neurons provides a summary of the data. In the classification step, each cell of the raster is connected to the neuron that is closest to it in terms of variables available for the raster cells. The result can be understood as stratification. When the coordinates of the raster cells are used as additional variables in training and classification, the strata are spatially continuous, corresponding to forest stands [9].

Recently, some studies have suggested that these segmentation methods could be used for automatic stand delineation [3,4,9]. All of the new methods are promising and able to delineate stands with small within-stand variations in LiDAR metrics or other variables that were used in segmentation. However, there are no comparisons of these algorithms with the prevailing segmentation methods, namely region merging and region growing. The new methods have not been compared to each other either, except that Pukkala (2021) provided a succinct comparison of the cellular automaton and self-organizing map [9].

The objective of this study was to compare the performance of the region growing, cellular automaton, self-organizing map, and simulating annealing in automated stand delineation. The purpose was to increase the knowledge and understanding of the potential usability of the new methods. The variables used in the delineation were three metrics calculated from LiDAR scanning data.

2. Materials and Methods

2.1. Study Sites and Field Data

The study area is located in the Mengjiagang forest farm of 15,503 ha, owned by the Huanan country in Heilongjiang Province of China (45°30′16″–46°20′20″N, 130°32′0″–130°52′6″E) (Figure 1). Most forests are plantations dominated by coniferous tree species, predominantly Pinus koraiensis, Pinus sylvestris var. mongolica, Larix olgensis, and Picea asperata. The case study forest has an average elevation of approximately 250 m a.s.l. with a range of 180–450 m. The area is characterized by relatively flat slopes [20].

2.2. Data Acquisition and LiDAR Metrics

Laser scanning from an unmanned aerial vehicle (UAVLS) was conducted on 12 August 2019, using a RIEGL VUX-1UAV LiDAR scanner (www.riegl.com/products/unmanned-scanning/rieglminivux-1uav, accessed on 19 October 2022) carried by a DJI M600 Pro unmanned aerial vehicle (eight-rotor UAV platform). The services provided by the RIEGL (RIEGL Laser Measurement Systems GmbH, Horn, Austria) also included basic data processing. The study area consisted of three rectangular 1 km × 1.5 km sub-areas, which were scanned from an altitude of 180 m above ground. The flight speed was 10 m/s. A total of 6 flights were carried out. The laser scanner had a 330° field of view (FOV). The scanning of the three sub-areas lasted for one hour. The maximum point density was 136 pulses/m², with up to 5 echoes. The main characteristics of the scanning are summarized in Table 1.

Before calculating the structural metrics from the UAVLS point cloud data, some data pre-processing was conducted. First, the noise points were removed from the LiDAR point clouds by using a Gaussian-smoothing filter [21]. Second, a cloth simulation filter was employed to separate non-ground and ground point clouds using parameters 0.5 as the value of the grid resolution, 0.6 as the time step, and 3 as the rigidness [22]. The average density of ground points was 17 pulses/m². Then, the digital elevation model (DEM) was constructed using the ground points and the Kriging interpolation method with a 1 m spatial resolution [23]. Finally, The UAVLS point clouds were height-normalized by the DEM. The normalized point clouds were clipped with a 1 km × 1.5 km rectangular boundary. Three subareas of this size were used in the study.

A set of UAVLS metrics were calculated for 1 m² raster cells using the normalized point cloud data and the LiDAR360 software (www.lidar360.com). Three categories of metrics were calculated: height-related metrics, intensity-related metrics, and topography-related metrics. Height-related and intensity-related metrics were calculated from the normalized point clouds, and they included the percentiles (1%, 5%, 10%, 20%, 25%h, 30%, 40%, …, 90%, 95%, 99%) of echo heights, cumulative heights, and intensities. Other standard metrics such as the variance, standard deviation, coefficient of variation, skewness, kurtosis, average absolute deviation, mean, maximum and median of the heights, and intensities of the echoes were also calculated [4]. Topography-related metrics were calculated from the DEM and included the elevation, aspect, and slope of the terrain.

Based on the study of Sun et al. (2021), the following three LiDAR metrics were used in stand lineation (Figure 2): 95th percentile of the height distribution of normalized echo height (referred to as HP95), 5th percentile of the accumulated echo heights (AH5), and variance of the intensity of the echoes (IV) [4]. Sun et al. (2021) concluded that while HP95 is the most important variable for the segmentation, the result would improve if other LiDAR metrics were used as well. Based on Sun et al. (2021), the weights of the three LiDAR metrics were as follows: HP95: 0.7, AH5: 0.2, and IV: 0.1. These weights were used in all four delineation methods tested in this study.

The three LiDAR variables (HP95, AH5, IV) were first calculated for 1 m² raster cells. Based on the recommendation of Jia et al. (2020), the 1 m² rasters were resampled to a 5 m by 5 m cell size because the use of smaller cells may delineate crowns of individual large trees and produce very rugged segment boundaries [3]. The metrics of the 5 m by 5 m cells were calculated as the means of the 1 m² cells that belonged to the larger 5 × 5 m² cell.

2.3. Methods

2.3.1. Region-Growing

Region-growing was performed using the segmentation algorithm introduced by Balasubramanian et al. (2008) and implemented in MATLAB (Mathworks, Inc., Natick, MA, USA) [24]. Region growing (RG) is the process of aggregating the cells of a grid into larger regions. It first determines initial seeds, which are subsequently enlarged into neighbouring cells following the parameters set for area growth and the stopping criteria of area growth [25,26].

The use of region growing started with the generation of initial seeds and the calculation of the mean and standard deviation of the variables used in the process. A recent study by Lee and Cok (1991) uses a vector-based gradient map to guide the growth of the regions [27]. In this study, a weighted mean of the normalized values of three LiDAR metrics within 5 m² raster cells was used in the generation of the gradient map. The weighted mean of the LiDAR metrics was calculated from:

C = 0.7HP95 + 0.2AH5 + 0.1IV

(1)

Low gradient values correspond to homogenous regions. To form initial growth seeds, we joined adjacent cells for which the gradient was below the sum of the mean and standard deviation within the seed by using 4-neighborhood connectivity (cells to the east, west, north, and south were considered). Seeds larger than 0.025 ha were retained for subsequent region growth.

In the region-growing step, the neighbouring cells of a region included the adjacent “side cells” (cells to the east, west, north, and south) and the “corner cells” (cells to the northeast, southeast, southwest, and northwest) of those cells that constituted the region, i.e., 8-neighbourhood was used (Figure 3). The initial growth seeds expanded toward neighbouring cells based on a pre-defined homogeneity criterion. The homogeneity criterion employed the vector-based gradient map [2]. The mean (m) and standard deviation (σ) computed for each initial growth seed were used to calculate the confidence interval for C (Equation (1)). If the value of variable C in the neighboring cell was between m − σ and m + σ, the neighboring cell was added to the growing seed and the neighboring cell was given the same segment number as the seed.

The expanding process was stopped when no cells were found that met the joining criterion. To obtain a segment number for all cells, the process was repeated for several iterations. At each iteration, the mean and standard deviation were calculated for the expanded regions, which were subsequently used as the initial growth seeds for the next iteration. The above steps were repeated until each cell belonged to a region (i.e., had a segment number).

2.3.2. Cellular Automaton

The cellular automaton used in this study is the same as that described by Jia et al. (2020) [3]. The idea of the method is to join each cell of a raster to one of its adjoining segments using the following formula for selecting the most suitable segment for the cell:

P_ij = w₁p₁(B_ij) + w₂p₂(A_j) + w₃p₃(D_ij) + w₄p₄(S_ij)

(2)

where P_ij is the priority, or score, if cell i is joined to segment j, D_ij is the Euclidean distance of the LiDAR metrics between cell i and segment j, A_j is the area of segment j, B_ij is the proportion of the common border between cell i and segment j (of the total border length of cell i), S_ij is the effect of joining cell i to segment j on the shape of segment j, p_k is the sub-priority function for criterion k, and w_k is the weight of criterion k. The sum of the weights was equal to one. The score was calculated for each segment adjacent to cell i, and the number of the segment with the highest score was given to cell i. When calculating the border length, it was assumed that the side cells to the east, west, south, and north have a length equal to 1 and the corner cells (to the northeast, southeast, southwest and northwest) have a “length” equal to 0.3.

The initial segments were obtained by dividing the area into square-shaped areas. Then, the segment number of each cell of the grid was determined by using Equation (2). This process was repeated for several iterations.

Compared to the first version of the cellular automaton [8], Jia et al. (2020) introduced a shape metric that affected the assignment of segment numbers for the cells [3]. The shape metric was based on the distance of the cells of a segment to the segment’s center. Minimizing this distance produces roundish segments without long and narrow extensions.

All four criteria used in Equation (2) had an associated sub-priority function, which determined the effect of segment area, common border, segment shape, and difference in LiDAR metrics between the cell and the segment on the sub-priority obtained from the criterion. We used the same sub-priority functions as Jia et al. (2020) [3]. As shown by Equation (2), the sub-priorities were multiplied by the criteria weights and summed.

The difference between LiDAR metrics in cell i and segment j was calculated from the normalized values of the metrics as follows:

D_{i j} = \sqrt{0.7 {({H P 95}_{i} - {H P 95}_{j})}^{2} + 0.2 {({A H 5}_{i} - {A H 5}_{j})}^{2} + 0.1 {({I V}_{i} - {I V}_{j})}^{2}}, j = 1, \dots, J

(3)

where J is the number of segments adjacent to cell i. The sub-priority was inversely proportional to the difference, i.e., a smaller difference in the LiDAR metrics between cell i and segment j increased the probability that cell i was joined to segment j. Since all LiDAR metrics were scaled to the range 0–1, the sub-priority also ranged from 0 to 1.

The sub-priority functions for the segment area and the common border between the cell and a segment were logistic curves, which resulted in sub-priorities ranging from zero to one. For example, the sub-priority function for the segment area was:

p_{2} = \frac{1}{1 + \exp (- 5 (A_{j} - 0 . 5))}

(4)

where A_j is the area of segment j in hectares. The sub-priority function is graphically depicted in Figure 4. It indicates that increasing the segment area increases the likelihood that a cell is joined to the segment, but the effect is almost over at about 1.5 hectares. As a result of the shape of the sub-priority function (Figure 4), the cellular automaton avoids joining cells to segments whose area is clearly less than one hectare. However, other criteria of the priority function (Equation (2)) may favour a small segment, which means that small segments are not ruled out completely.

2.3.3. Self-Organizing Map

The variant of the self-organizing map adapted to stand delineation by Pukkala (2021) was used in this study [9,19]. The algorithm begins with the creation of initial neurons. In Pukkala’s (2021) study as well as in ours, the initial neurons were obtained by dividing the area into squares (for instance, 1 ha squares) and calculating the mean values of the normalized LiDAR metrics and the x and y coordinates for each square [9]. Each neuron was associated with the values of HP95, AH5, IV, x coordinate, and y coordinate.

Then, a training process was initiated that consisted of selecting a random cell from the grid and finding the neuron most similar to it in terms of the five variables listed above (HP95, AH5, IV, x, y). The similarity was assessed by using the weighted Euclidean distance

D_{i j} = \sqrt{0.7 {({H P 95}_{i} - {H P 95}_{j})}^{2} + 0.2 {({A H 5}_{i} - {A H 5}_{j})}^{2} + 0.1 {({I V}_{i} - {I V}_{j})}^{2} {+ v (x}_{i} - x_{j})^{2} {+ v (y}_{i} - y_{j})^{2}}

(5)

where D_ij is the distance between cell i and neuron j, and v is the weight of coordinates. Before calculating the distance, all variables were normalized to a mean of zero and a standard deviation of one.

The neuron that is most similar to cell i is called the best matching unit (BMU). The attribute values (HP95, AH5, IV, x, y) of the BMU were updated using

z_{j k} (t + 1) {= z}_{j k} (t) + α (t) [z_{j k} (t) - z_{i k} (t)]

(6)

where z_jk is the value of attribute k in neuron j, z_ik is the value of the same attribute in cell i, t is the number of the current iteration and α(t) is the learning rate. The initial value of the learning rate parameter was 1, and it was updated after every iteration as follows

α (t + 1) = α (0) [1 - {t / t}_{M a x}]

(7)

The number of training iterations (t_Max) was 10000, which means that the process of selecting a random cell, finding the BMU for it, and updating the attribute values of the BMU was repeated 10,000 times. Then, classification was performed where each cell of the raster was linked to the neuron most similar to it in terms of Euclidean distance (Equation (5)). The number of the most similar neuron was given to the cell. Due to the use of coordinates, cells that constituted a class were usually adjacent, and the classes could therefore be interpreted to be segments.

As suggested by Pukkala (2021), the process that consisted of the production of initial neurons, training, and classification was repeated a few more times (10 times in this study) [9]. In the second and all later repetitions, the attribute values of initial neurons were obtained from the means of the classes of the previous classification. However, classes that had only a few cells were discarded, which means that the number of initial neurons may decrease when the self-organization process is repeated. In this study, all classes (segments) smaller than 0.1 ha were discarded at the beginning of a new self-organization round.

2.3.4. Simulated Annealing

In the same way, as in cellular automaton, simulated annealing (SA) was started by dividing the area into square-shaped initial segments [4]. In the SA method, these segments are called the initial solution. Then, a random cell was selected from the grid. If this cell was located at the segment border (at least one of its adjacent cells belonged to a different segment), the possibility to change the segment number of the cell was considered. The choice depended on the effect of the change on the properties of the segment to which it would be joined (recipient segment) and the current segment of the cell (donor segment). In SA, changing the segment number of a raster cell was called a move.

The formula that was used to evaluate the effect of the move was as follows:

Q = w₁p₁(A) + w₂p₂(V) + w₃p₃(S)

(8)

The formula expresses the quality of the segment as a function of three criteria: segment area (A), variance of the LiDAR metrics (HP95, AH5, IV) within the segment (V), and shape of the segment (S). The quality measure was calculated for both the donor and the recipient segment before and after implementing the move. If the move improved the average quality score of the two segments, it was accepted. Otherwise, the move was accepted with the following probability (p):

p = \exp (\frac{Q_{A f t e r} - Q_{B f o r e}}{T})

(9)

where Q_Before is the average quality score of the two segments before the move and Q_After is the average score if the move is implemented. T is a “temperature” parameter that affects the probability of implementing inferior moves. The temperature parameter had a starting value and an ending value (“freezing temperature”). At each temperature, a certain number of candidate moves were produced and evaluated, after which the temperature was multiplied by a constant smaller than one. The process was terminated when the temperature reached freezing temperature.

In this study, the number of candidate moves (random cells) evaluated at each temperature was 50,000. This number also includes those randomly selected cells that were not located at the segment border and were therefore not eligible for a move. The initial temperature was 0.1, the freezing temperature was 0.0001, and the multiplier to obtain a new temperature was 0.95. These parameters are based on the analyses of Sun et al. (2021) [4].

In Equation (8), which was used to evaluate the quality of a segment, the sub-priority function for segment area was the same as used in cellular automaton (Figure 4). The sub-priority function for within-segment variance (V) was

p_{2} = \frac{1}{1 + e x p (3 (V - 0.3))}

(10)

where V is the weighted mean of the relative variances of HP95, AH5, and IV within the segment. The formula implies that the quality score from variance decreases with increasing within-segment variation in HP95, AH5, and IV. The relative variance was calculated by dividing the variance of the cell values by the mean value of the attribute within the segment. The weighted mean of the relative variances (RV) was calculated from

V = 0.7RV_HP95 + 0.2RV_AH5 + 0.1RV_IV

(11)

The priority function for segment shape was the one suggested by Sun et al. (2021) [4]. It is different from the one used in the cellular automaton, although in both cases the idea of the shape metric is to measure deviation from a circular shape. The quality points from the segment shape were calculated as follows:

p_{3} = \frac{1}{n} \sum_{i = 1}^{n} \frac{1}{1 + e x p (8 (R D_{i} - 1))}

(12)

where n is the number of cells that belong to the segment and RD_i is the relative distance of cell i from the segment centroid. RD was calculated by dividing the distance by the radius of a circle that has the same area as the segment. The shape score of the segment is calculated as the mean of cell-level distance scores.

2.4. Post-Processing

All three methods tested as alternatives for region growing (CA, SOM, SA) may split segments into disconnected parts. The methods may also produce rugged boundaries and result in a gradual change in stand number (Figure 5 top left). These outcomes are more common in SOM and SA as compared to CA [3,4,9]. Although the delineations obtained from the algorithms correspond to the spatial variation of the attribute values within the input raster, divided segments, gradual changes, and rugged segment boundaries may not be appreciated in forestry practice. In addition, very small segments are seldom desired as forest management units since they make it difficult to implement treatments.

To mitigate these problems, the segmentations obtained from CA, SOM, and SA were post-processed in three steps. First, a mode filter with a 3 × 3 cell window was applied to the segmentation: the segment number of every cell was replaced by the most common segment number of the 9-cell window. This process smoothed segment boundaries and reduced gradual transitions (Figure 5, top left and top right).

Second, non-connected parts of segments were given different segment numbers, i.e., they were interpreted to be different segments (Figure 5, bottom left). Third, another model filter was applied to cells that belonged to segments smaller than 0.1 ha. The segment number of these cells was replaced by the most common segment number within a 9 × 9 cell window. This post-processing step was called cleaning (Figure 5, bottom right).

Of these post-processing steps, mode filtering increases the within-segment variation in LiDAR metrics and renumbering decreases it. However, the effects are usually small [4]. Both mode-filtering and renumbering tend to improve the shapes of the segments [21]. Renumbering decreases the average area points of the stands as it creates new small segments, and cleaning has the opposite effect.

2.5. Fine-Tuning of CA, SOM, and SA

Most of the parameters of CA, SOM, and SA were taken from previous studies [3,4,9], which have analyzed the effects of parameters on the segmentation results and recommended certain parameter values. However, additional sensitivity analyses were conducted to find the most suitable size of initial stands and suitable weights for the quality criteria of the segments.

Three different sizes of the square-shaped initial stands were compared within each method (CA, SOM, SA): 1 ha, 1.5 ha, and 2 ha. In addition, three different sets of criteria weights were tested. In CA, the weight of Euclidean distance between a cell and a segment (criterion D in Equation (2)) was 0.1, 0.4, or 0.7, and the weights of the three other criteria (area, shape, common border) were equal (0.3, 0.2, or 0.1). This resulted in nine combinations of initial segment area and criteria weights.

In SOM, each initial segment area was tested with three different weights of coordinates in Equation (5). Equation (5) was used to measure the similarity between a cell and a neuron. The weight of the coordinates was 3, 6, or 9 while the weights of HP95, AH5, and IV were always 0.7, 0.2, and 0.1, respectively. An increasing weight of coordinates results in more compact, roundish, and even-sized segments.

In SA, the criteria weights of Equation (8) were modified so that the weight of within-segment variance was 0.5, 0.7, or 0.9. The weights of the other criteria (area and shape) were equal in such a way that the sum of the three weights was always 1.

2.6. Statistics Calculated for the Segmentations

The delineations produced by the four methods were evaluated by calculating the degree of the variance in HP95, AH5, and IV which was explained by the delineation (R²). The R² statistic was calculated from

R^{2} = 1 - SSE / SST

(13)

where SSE is the variation not explained by the delineation and SST is the total variation of the attribute within the grid. SST and SSE were calculated as follows:

SST = \sum_{j = 1}^{N} \sum_{i = 1}^{n_{j}} {(z_{i j} - \bar{z})}^{2}

(14)

SSE = \sum_{j = 1}^{N} \sum_{i = 1}^{n_{j}} {(z_{i j} - {\bar{z}}_{j})}^{2}

(15)

where N is the number of segments, n_j is the number of cells in segment j, z_ij is the value of the attribute (HP95, AH5 or IV) in cell i of segment j,

\bar{z}

is the overall mean of the attribute, and

{\bar{z}}_{j}

is the mean value of the attribute among the cells that belong to segment j.

In addition, the average segment area and the proportion of segments smaller than 0.3 ha were calculated for each segmentation. The shape of the segments was assessed using metrics that measure deviation from a circular shape. First, a radius of a circle having the same area as the segment was calculated for each segment. Then, the average distance of the cells from the segment center was calculated, using the length of the radius as the unit. For example, if the segment area is 1.2 ha, it corresponds to a 61.8-m radius. If the average distance of the cells from the segment’s center is 45 m, it corresponds to a relative distance of 0.727 (45/61.8 = 0.728, i.e., 0.728 radii from the center). The centre of the segment was defined by the mean x and y coordinates of the cells that constituted the segment.

In addition, the proportion of cells that were within one radius of the segment center was calculated for each segment. Then, the average distance and average proportion of cells within one radius were calculated over all segments. These averages were calculated with and without using segment area as the weight variable.

3. Results

3.1. Parameter Fine-Tuning Results

Different combinations of initial segment area and criteria weights were tested in the second case study area. The results were visualized by plotting the mean R² of the three LiDAR metrics (HP95, AH5, IV) against the shape metric (Figure 6) and the mean area of the segments (Figure 7). The results indicated that increasing the R² was achieved at the cost of a decreased average segment area and increased deviation from the circular shape of the segments.

Considering that small within-segment variation in the LiDAR metrics is the most important measure of a good stand delineation, the segmentations shown with large markers in Figure 6 and Figure 7 were selected for further analyses. These segmentations were judged to be the most efficient as they maximized the degree of variance explained by the segmentation while also being good in stand area and shape. In all cases, the area of the initial segments was 1 ha. In CA, the weights of the four criteria of Equation (2) were: D (difference in LiDAR metrics between a cell and segment) 0.4, A (segment area) 0.2, S (segment shape) 0.2 and B (the common boundary between cell and segment) 0.2. In SOM, the weight of the coordinates (v in Equation (5)) was 3 while the weights of HP95, AH5, and IV were the same as in Equation (5). In SA, the weights of the within-segment variance, segment area, and segment shape were 0.7, 0.15, and 0.15, respectively (Equation (8)). The results presented later for the three case study areas were obtained with these parameters.

The parameter fine-tuning results already give some information about the performance of alternative segmentation methods. For example, SA resulted in a higher mean R² than the region growing (RG) with all parameter combinations tested in fine-tuning. SOM always resulted in a better (more circular) stand shape than RG. Figure 7 suggests that SA might be the best method in terms of R² and segment shape, and all new methods (CA, SOM, and SA) may outperform RG in R² and segment shape.

Figure 7 shows that CA and SOM always resulted in a larger average segment area than RG. The overall conclusion is that decreasing the segment area increases the R² of the LiDAR metrics. Therefore, the parameters used in CA, SOM, and SA were selected in such a way that the mean segment area of CA, SOM, and SA was not smaller than that produced by RG. Based on Figure 7, CA seems to dominate all other methods since, with the selected parameter values, it resulted in almost the same mean R² as the best method (SA) but had a clearly larger mean stand area.

Plotting the percentage of small stands (less than 0.3 ha) against mean R² indicated better performance of CA, SOM, and SA, as compared to RG (Figure 8). The proportion of small stands was higher in the RG segmentation than in the other methods, irrespective of the parameter values, except for one SA case that produced as many small stands as obtained in RG.

3.2. Delineation Maps

The three case study areas posed different challenges to the delineation methods. Case study area 1 had a large continuous area of the natural forest without clear stand boundaries (Figure 9). In addition, there were narrow strips in the east–west direction that consisted of trees taller than the surrounding areas (lower-left corners of the maps of Figure 9). Usually, SA and SOM delineated these strip-like areas into distinct segments better than RG and CA. In general, all methods created delineations that look usable according to the maps in Figure 9. Careful inspection of the RG map shows, however, that a few stands consist of sub-areas with quite different tree heights (HP95 correlates closely with canopy height). In the SOM map, some boundaries are very rugged, and they most probably need to be smoothed for practical use.

The challenge in case study area 2 was the narrow strips of young trees between clear blocks of planted tall trees. The SOM and SA methods most often delineated them into separate segments. In the RG map, the continuous areas of a rather similar forest in the upper part of the maps of Figure 10 were divided into several small segments, which is a different outcome compared to the other methods. There seem to be some unnecessary segment borders in all maps of Figure 10, which is most probably due to the aim at the circular shape and the fact that increasing the stand area was considered to be an improvement only until about 1.5 hectares (see Figure 4).

The third case study area had large continuous areas where the tree height was almost constant. In addition, there was a long narrow shape caused by a road and another narrow shape near the western edge of the area. The maps in Figure 11 indicate that RG had difficulties in dealing with the narrow shapes. They were often joined to the adjacent very different segments or were demarcated too widely. In CA, the road area was demarcated narrow enough, but in some places, it was connected to the adjacent forest. SA and SOM were the best methods to deal with the long, narrow shapes. In practical use of the segmentation methods, the road would most probably be masked off (not included in automated segmentation), but we kept it to see if it could reveal systematic differences between the four segmentation methods.

3.3. Numerical Statistics

The proportion of variation in the LiDAR metrics that was explained by the delineation was mostly the best for the SA method, but in some cases, SOM was the best (Table 2). The RG method produced lower R² values than the three other methods, which means that the segments were the most heterogeneous in the RG method. The results of Table 2 suggest that the RG method is not competitive with the other methods if the aim is to create segments that are homogeneous in terms of the LiDAR metrics.

The CA method resulted in the smallest number of segments and the largest average segment area (Table 3). The results were similar in all three case study areas. The three other methods (RG, SOM, SA) were close to each other in terms of average stand area.

The proportion of segments smaller than 0.3 ha was the highest in the RG method, followed by the CA method. The SA method always produced the smallest percentage of small segments, and the SOM method was always the second best.

The statistics that describe the shape of the segments were the best for the CA and SA methods. A short mean distance and a high percentage of cells that are within a circular area indicate that the segments have roundish shapes. In this criterion, the SOM and RG methods performed worse than CA and SA and were close to each other. However, the differences between the methods in the shape metrics were small and the values of the shape metrics were good for all methods.

4. Discussion

To increase the knowledge and understanding of the potential usability of alternative methods proposed for automated stand delineation, this study analyzed the performance of four different segmentation algorithms, analyzed using LiDAR metrics calculated for 5 m by 5 m raster cells as the data source. The study showed that all the segmentation methods can be used to segment forests into stands when LiDAR scanning data are available. Laser scanning can capture such characteristics of forest canopies that are essential for delineating stands in large areas [28].

All the automated segmentation methods tested in this study provide feasible options for stand delineation in the context of forest management, especially in plantations. Our study corroborated the results of several recent studies [3,4,9]. In the analyses of this study, however, CA, SOM, and SA outperformed the region-growing method since they produced more homogeneous segments than region-growing. For wider generalization, this result needs to be confirmed in other types of forests and datasets.

The algorithms used in this study are easy to implement in computer programs. In this respect, the SOM might be the easiest one (see the meta code provided in Pukkala, 2021) [9]. It is also the fastest method to use. The simulated annealing algorithm needs much longer times to run, the cellular automaton being between SOM and SA. On the other hand, Table 2, Table 3 and Table 4 suggest that the delineations produced by the SA method may be evaluated to be slightly better than those obtained from CA and SOM.

We used 25 m² raster cells of three UAVLS variables to segment the forest, which together represent most of the information contained in the nearly 90 LiDAR metrics calculated for the raster cells [4]. Sun et al. (2022) also suggested that the use of a fourth metric, namely, a texture variable describing the variation of HP95 within the 5 m × 5 m raster, might be useful [21]. This possibility was also checked in the current study, and it was found that texture varied significantly only in one of the three sub-areas. The contribution of the texture variable to the delineation result would have remained small in the other sub-areas.

Increasing the initial stand area had a clear effect on the delineation result. Because SOM, SA, and CA methods cannot produce new segments (new segment numbers) but segments may disappear during the segmentation process, small initial segments usually lead to better results than the use of large initial segments. However, the conclusion might be different if renumbering is applied during the segmentation run [3]. Renumbering means that disconnected parts of the segments are given different stand numbers. We did not implement this possibility during the CA, SOM, and SA runs (but only afterwards) because we wanted to compare the basic versions of these methods to the region-growing algorithm. Renumbering during the segmentation process can be expected to improve the performance of the methods, and it makes the algorithms less dependent on the initial segment area.

In the present study, the region-growing method was the poorest of the four tested segmentation methods, especially in terms of R². Region growing was more competitive in the stand area and shape statistics, although these variables were not used as criteria in the region-growing algorithm. Perhaps the fact that the segment area was not used as a criterion was the reason why the number of small segments (< 0.3 ha) was higher in RG than in the other methods. On the other hand, the region-growing method can prevent the creation of small stands simply by setting a large enough minimum size for the initial growth seeds.

The use of very small initial segments may also be harmful since several adjacent initial segments may be demarcated within the same large homogeneous stand. Some of the methods, especially CA, do not move the initial segment boundaries easily in a homogeneous region, resulting in segmentations where large stands are unnecessarily divided into several small segments. This shortcoming could be mitigated by increasing the weight of the area criterion and modifying its priority function. Based on our study, it may be concluded that the optimal initial stand size is around 1 ha when UAVLS data are used for stand delineation in the forests of Heilongjiang province.

CA differs from the other methods in such a way that the common border between a cell and the adjacent segments is taken into account. Therefore, the method often leads to smoother stand boundaries than SA and SOM [4,7]. On the other hand, rugged stand borders can easily be smoothed afterward by using mode filtering.

The main objective of stand delineation is to find a balance between a low within-stand variation and suitable stand size and regular stand shape. In most cases, the forest managers require that the stands are homogeneous, the stand shape is regular, and the stand boundaries are smooth.

If the delineation criteria emphasize too much low within-stand variation, the result might be unsatisfactory in terms of stand size and shape. There are no official standards for stand delineation as the requirements depend on the preferences of the forest manager and the purpose of the delineation. For example, some forest managers focus on creating larger size segments that correspond to traditional stand compartments, while another possibility would be to create smaller segments for a more accurate prediction of the development of the forest [13,21]. Modern forest planning methods make it possible to aggregate small stands used in calculations into larger continuous treatment units [8,29]. Regarding the capability of modern forest planning methods, a large number of small homogeneous stands is a better option than a low number of large and less homogeneous stands [8].

Our study indicated that all segmentation methods were capable of producing good stand delineations in homogeneous plantation forests. Failures are more likely to happen in natural forests that are often spatially heterogeneous with small trees occurring in the gaps of a higher canopy. As discussed in Pukkala (2021), SOM has the advantage that it can divide a heterogeneous forest into two or more overlapping sub-stands, one for the small trees or canopy gaps and the other for areas of taller trees [9]. The overlapping stands may consist of several disconnected parts. This feature of the SOM may also be useful in retention forestry where groups of retention trees are left in regeneration areas. Delineating all groups of retention trees as different stands may not be an ideal solution as the groups can be very small.

Additional possibilities to improve the delineation of heterogeneous forests into stands would be to use K-means clustering or other similar methods to create the initial segments for the CA, SOM, and SA methods [30]. This would mitigate the problem that some methods may unnecessarily split large stands into several small segments when a systematic layout of small squares is used as initial stands.

In Chinese forest management, it is not permitted to change the boundaries of forest compartments. This increases the need for delineating compartments that are useful for forest management for long periods. Currently, the Chinese compartment demarcation uses topographic and land type data and can therefore distinguish the forest and non-forest. The methods analyzed in this study could also make use of additional layers, one of which could mask off cells that represent non-forest land uses [8,11,31]. When compartments are large and permanent, it might be worthwhile to repeat the segmentation separately for each compartment. This would make it possible to deal with within-compartment heterogeneity in the simulation of forest development.

LiDAR has a restricted spectral resolution, generally covering a single spectral range in the near-infrared region, which may not be optimal for the interpretation of species composition [32]. Therefore, new studies should be conducted on integrating multi-source remote sensing data for improved stand delineation. Hyperspectral data may be particularly useful in mixed forests to characterize the tree species composition [33], biochemical features [34], and some biophysical properties such as the leaf area index (LAI) and biomass [35].

5. Conclusions

The study showed that all four segmentation methods compared in this study (RG, CA, SOM, and SA) can be used for automatic stand delineation based on laser scanning data collected by an unmanned aerial vehicle. The performances of the methods were close to each other in terms of the shape and area of the segments but CA, SOM, and SA produced more homogeneous segments than RG. Overall, SA was evaluated to be the best method for automatic stand delineation. The results suggest that methods based on cellular automata, self-organizing maps, and combinatorial optimization should be used more in the automated delineation of forest stands.

Author Contributions

Conceptualization, T.P.; methodology, T.P. and Y.S.; formal analysis, T.P. and Y.S.; data curation, X.J. and F.L.; writing—original draft preparation, T.P. and Y.S.; writing—review and editing, X.J. and F.L.; supervision, X.J. and F.L.; project administration, X.J.; funding acquisition, F.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the Natural Science Foundation of China (U21A20244) and (32071758) and the Fundamental Research Funds for the Central Universities of China (No. 2572020BA01).

Data Availability Statement

Not applicable.

Acknowledgments

The authors would like to thank the faculty and students of the Department of Forest Management, Northeast Forestry University (NEFU), China, who collected and provided the data for this study.

Conflicts of Interest

The authors declare no conflict of interest.

References

Wulder, M.; White, J.; Hay, G.; Castilla, G. Towards automated segmentation of forest inventory polygons on high spatial resolution satellite imagery. For. Chron. 2008, 84, 221–230. [Google Scholar] [CrossRef]
Mustonen, J.; Packalen, P.; Kangas, A. Automatic segmentation of forest stands using a canopy height model and aerial photography. Scand. J. For. Res. 2008, 23, 534–545. [Google Scholar] [CrossRef]
Jia, W.; Sun, Y.; Pukkala, T.; Jin, X. Improved Cellular Automaton for Stand Delineation. Forests 2020, 11, 37. [Google Scholar] [CrossRef]
Sun, Y.; Wang, W.; Pukkala, T.; Jin, X. Stand delineation based on laser scanning data and simulated annealing. Eur. J. For. Res. 2021, 140, 1065–1080. [Google Scholar] [CrossRef]
Maltamo, M.; Packalen, P. Species-Specific Management Inventory in Finland BT-Forestry Applications of Airborne Laser Scanning: Concepts and Case Studies; Maltamo, M., Næsset, E., Vauhkonen, J., Eds.; Springer: Dordrecht, The Netherlands, 2014; pp. 241–252. ISBN 978-94-017-8663-8. [Google Scholar]
Packalen, P.; Pukkala, T.; Pascual, A. Combining spatial and economic criteria in tree-level harvest planning. For. Ecosyst. 2020, 7, 1–13. [Google Scholar] [CrossRef]
Pukkala, T. Using ALS raster data in forest planning. J. For. Res. 2019, 30, 1581–1593. [Google Scholar] [CrossRef]
Pukkala, T. Optimized cellular automaton for stand delineation. J. For. Res. 2019, 30, 107–119. [Google Scholar] [CrossRef]
Pukkala, T. Can Kohonen networks delineate forest stands? Scand. J. For. Res. 2021, 36, 198–209. [Google Scholar] [CrossRef]
Pukkala, T. Delineating forest stands from grid data. For. Ecosyst. 2020, 7, 13. [Google Scholar] [CrossRef]
Wang, Z.; Boesch, R.; Ginzler, C. Integration of high resolution aerial images and airborne LIDAR data for forest delineation. Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2008, 37, 1203–1207. [Google Scholar]
Koch, B.; Straub, C.; Dees, M.; Wang, Y.; Weinacker, H. Airborne laser data for stand delineation and information extraction. Int. J. Remote Sens. 2009, 30, 935–963. [Google Scholar] [CrossRef]
Pascual, A.; Pukkala, T.; de Miguel, S.; Pesonen, A.; Packalen, P. Influence of size and shape of forest inventory units on the layout of harvest blocks in numerical forest planning. Eur. J. For. Res. 2019, 138, 111–123. [Google Scholar] [CrossRef]
Olofsson, K.; Holmgren, J. Forest stand delineation from lidar point-clouds using local maxima of the crown height model and region merging of the corresponding Voronoi cells. Remote Sens. Lett. 2014, 5, 268–276. [Google Scholar] [CrossRef]
Baatz, M.; Schäpe, A. An Optimization Approach for High Quality Multi-Scale Image Segmentation. In Angewandte Geographische Informationsverarbeitung XII; Strobl, J., Blaschke, T., Griesebner, G., Eds.; Wichmann: Heidelberg, Germany, 2000; pp. 12–23. [Google Scholar]
Strange, N.; Meilby, H.; Bogetoft, P. Land use optimization using self-organizing algorithms. Nat. Resour. Model. 2001, 14, 541–574. [Google Scholar] [CrossRef]
Bettinger, P.; Graetz, D.; Boston, K.; Sessions, J.; Chung, W. Eight Heuristic Planning Techniques Applied to Three Increasingly Difficult Wildlife Planning Problems. Silva Fenn. 2002, 36, 561–584. [Google Scholar] [CrossRef]
Heinonen, T.; Pukkala, T. The use of cellular automaton approach in forest planning. Can. J. For. Res. 2007, 37, 2188–2200. [Google Scholar] [CrossRef]
Kohonen, T. Self-organized formation of topologically correct feature maps. Biol. Cybern. 1982, 43, 59–69. [Google Scholar] [CrossRef]
Hao, Y.; Widagdo, F.R.A.; Liu, X.; Quan, Y.; Liu, Z.; Dong, L.; Li, F. Estimation and calibration of stem diameter distribution using UAV laser scanning data: A case study for larch (Larix olgensis) forests in Northeast China. Remote Sens. Environ. 2022, 268, 112769. [Google Scholar] [CrossRef]
Sun, Y.; Jin, X.; Pukkala, T.; Li, F. Predicting Individual Tree Diameter of Larch (Larix olgensis) from UAV-LiDAR Data Using Six Different Algorithms. Remote Sens. 2022, 14, 1125. [Google Scholar] [CrossRef]
Zhang, W.; Qi, J.; Peng, W.; Wang, H.; Xie, D.; Wang, X.; Yan, G. An Easy-to-Use Airborne LiDAR Data Filtering Method Based on Cloth Simulation. Remote Sens. 2016, 8, 501. [Google Scholar] [CrossRef]
Hao, Y.; Widagdo, F.R.A.; Liu, X.; Quan, Y.; Dong, L.; Li, F. Individual Tree Diameter Estimation in Small-Scale Forest Inventory Using UAV Laser Scanning. Remote Sens. 2021, 13, 24. [Google Scholar] [CrossRef]
Balasubramanian, G.P.; Saber, E.; Misic, V.; Peskin, E.; Shaw, M. Unsupervised color image segmentation using a dynamic color gradient thresholding algorithm. In Proceedings of the Human Vision and Electronic Imaging XIII, San Jose, CA, USA, 18 February 2008; Volume 6806, p. 68061H. [Google Scholar]
Hao, Y.; Widagdo, F.R.A.; Liu, X.; Liu, Y.; Dong, L.; Li, F. A Hierarchical Region-Merging Algorithm for 3-D Segmentation of Individual Trees Using UAV-LiDAR Point Clouds. IEEE Trans. Geosci. Remote Sens. 2022, 60, 1–16. [Google Scholar] [CrossRef]
Wen, Y.; Su, D.; Lin, Q. Region-Growing Algorithm on CT Angiography Images for Detection of Gynecological Malignant Tumor. Sci. Program. 2021, 2021, 9875886. [Google Scholar] [CrossRef]
Lee, H.-C.; Cok, D.R. Detecting Boundaries in a Vector Field. Trans. Sig. Proc. 1991, 39, 1181–1194. [Google Scholar] [CrossRef]
Quan, Y.; Li, M.; Hao, Y.; Wang, B. Comparison and Evaluation of Different Pit-Filling Methods for Generating High Resolution Canopy Height Model Using UAV Laser Scanning Data. Remote Sens. 2021, 13, 2239. [Google Scholar] [CrossRef]
Heinonen, T.; Kurttila, M.; Pukkala, T. Possibilities to Aggregate Raster Cells through Spatial Optimization in Forest Planning. Silva Fenn. 2007, 41, 89–103. [Google Scholar] [CrossRef]
Hu, H.; Guo, Z. A U-net and KMeans based method for brain tumor segmentation and measurement. In Proceedings of the 2nd International Conference on Computer Vision, Image, and Deep Learning, Liuzhou, China, 5 October 2021; p. 57. [Google Scholar]
Eysn, L.; Hollaus, M.; Schadauer, K.; Pfeifer, N. Forest delineation based on airborne LIDAR data. Remote Sens. 2012, 4, 762–783. [Google Scholar] [CrossRef]
Lu, D.; Chen, Q.; Wang, G.; Liu, L.; Li, G.; Moran, E. A survey of remote sensing-based aboveground biomass estimation methods in forest ecosystems. Int. J. Digit. Earth 2016, 9, 63–105. [Google Scholar] [CrossRef]
Roth, K.L.; Roberts, D.A.; Dennison, P.E.; Peterson, S.H.; Alonzo, M. The impact of spatial resolution on the classification of plant species and functional types within imaging spectrometer data. Remote Sens. Environ. 2015, 171, 45–57. [Google Scholar] [CrossRef]
Asner, G.P.; Martin, R.E.; Anderson, C.B.; Knapp, D.E. Quantifying forest canopy traits: Imaging spectroscopy versus field survey. Remote Sens. Environ. 2015, 158, 15–27. [Google Scholar] [CrossRef]
de Almeida, C.T.; Galvão, L.S.; de Oliveira Cruz e Aragão, L.E.; Ometto, J.P.H.B.; Jacon, A.D.; de Souza Pereira, F.R.; Sato, L.Y.; Lopes, A.P.; de Alencastro Graça, P.M.L.; de Jesus Silva, C.V.; et al. Combining LiDAR and hyperspectral data for aboveground biomass modeling in the Brazilian Amazon using different regression algorithms. Remote Sens. Environ. 2019, 232, 111323. [Google Scholar] [CrossRef]

Figure 1. Map of the Mengjiagang forest farm in Heilongjiang Province, northeast China, showing the study area location.

Figure 2. Variation of the LiDAR metrics in 1 m² raster cells in a part of sub-area 2. Light tone indicates the high value of the metric.

Figure 3. Schematic diagram illustrating the expanding process of growth seeds in the region growing method. Blue cells represent the initial growth seed, red cells represent the neighbouring cells, and yellow cells represent neighbouring cells that meet the joining criteria.

Figure 4. A sub-priority function for segment area.

Figure 5. The effect of the post-processing steps (mode filtering, renumbering, and cleaning) on the segmentation produced by simulating annealing.

Figure 6. Relationship between the mean R² of the LiDAR metrics (HP95, AH5, IV) and a shape metric when CA, SOM, and SA were used with different parameter values. Larger markers show the results for those segmentations that were obtained with parameters selected for further analyses.

Figure 7. Relationship between the mean R² of the LiDAR metrics (HP95, AH5, IV) and the mean area of the segments when CA, SOM, and SA were used with different parameter values.

Figure 8. Relationship between the mean R² of the LiDAR metrics (HP95, AH5, IV) and the percentage of segments smaller than 0.3 ha when CA, SOM, and SA were used with different parameter values.

Figure 9. Segmentations produced by the four methods in case study area 1 (1.5 km × 1 km) overlaid with the 95th percentile of the height distribution of the echoes. Light tone indicates a high percentile value.

Figure 10. Segmentations produced by the four methods in case study area 2 (1.5 km × 1 km) overlaid with the 95th percentile of the height distribution of the echoes. Light tone indicates a high percentile value.

Figure 11. Segmentations produced by the four methods in case study area 3 (1.5 km × 1 km) overlaid with the 95th percentile of the height distribution of the echoes. Light tone indicates a high percentile value.

Table 1. Descriptive statistics of the operational parameters for the UAVLS data collection.

Characteristic	Value
Laser pulse repetition rate (kHz)	380
Accuracy/Precision (mm)	10/5
Maximum echo number	5
Maximum Range (m)	250
Beam divergence (mrad)	0.5
Laser wavelength (nm)	1550
Weight (kg)	3.75

Note: Accuracy is the degree of conformity of a measured quantity to its actual value and precision is the degree to which further measurements show the same result.

Table 2. Proportion of variance explained by the segmentation. The method with the highest (best) R² is in boldface.

	Region Growing	Cellular Automaton	Self-Organizing Map	Simulated Annealing
		Region 1
R² of H95	0.707	0.791	0.805	0.802
R² of AH5	0.641	0.729	0.752	0.748
R² on IV	0.503	0.544	0.556	0.561
Mean R²	0.672	0.754	0.770	0.767
		Region 2
R² of H95	0.755	0.862	0.851	0.861
R² of AH5	0.742	0.829	0.827	0.832
R² on IV	0.491	0.517	0.510	0.547
Mean R²	0.726	0.821	0.812	0.824
		Region 3
R² of H95	0.692	0.787	0.797	0.806
R² of AH5	0.632	0.717	0.733	0.740
R² on IV	0.576	0.604	0.612	0.637
Mean R²	0.668	0.755	0.766	0.776

Table 3. Mean segment area and other statistics related to the size of segments. The method with the best value of the statistic is in boldface.

	Region Growing	Cellular Automaton	Self-Organizing Map	Simulated Annealing
		Region 1
Mean area, ha	1.000	1.304	0.909	1.030
% small segments	14.7	13.4	4.8	3.8
		Region 2
Mean area, ha	0.872	1.229	0.903	0.955
% small segments	14.0	11.4	4.5	2.5
		Region 3
Mean area, ha	1.006	1.515	0.887	1.071
% small segments	10.7	8.8	7.2	5.6

Table 4. Statistics related to the shape of segments. The method with the best value of the statistic is in boldface. AW stands for “area-weighted” and means that the average value of the statistic among the segments was calculated using segment area as the weight.

	Region Growing	Cellular Automaton	Self-Organizing Map	Simulated Annealing
		Region 1
Mean distance	0.772	0.737	0.793	0.762
AW mean distance	0.793	0.732	0.771	0.744
% in circle	75.2	79.5	73.3	78.6
AW % in circle	72.6	80.6	75.6	80.7
		Region 2
Mean distance	0.772	0.758	0.788	0.771
AW mean distance	0.767	0.743	0.763	0.759
% in circle	75.6	77.3	75.0	80.6
AW % in circle	75.6	78.5	77.3	79.0
		Region 3
Mean distance	0.767	0.768	0.822	0.764
AW mean distance	0.764	0.736	0.761	0.741
% in circle	76.7	77.7	72.2	77.4
AW % in circle	76.0	81.1	77.4	83.4

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Sun, Y.; Jin, X.; Pukkala, T.; Li, F. A Comparison of Four Methods for Automatic Delineation of Tree Stands from Grids of LiDAR Metrics. Remote Sens. 2022, 14, 6192. https://doi.org/10.3390/rs14246192

AMA Style

Sun Y, Jin X, Pukkala T, Li F. A Comparison of Four Methods for Automatic Delineation of Tree Stands from Grids of LiDAR Metrics. Remote Sensing. 2022; 14(24):6192. https://doi.org/10.3390/rs14246192

Chicago/Turabian Style

Sun, Yusen, Xingji Jin, Timo Pukkala, and Fengri Li. 2022. "A Comparison of Four Methods for Automatic Delineation of Tree Stands from Grids of LiDAR Metrics" Remote Sensing 14, no. 24: 6192. https://doi.org/10.3390/rs14246192

APA Style

Sun, Y., Jin, X., Pukkala, T., & Li, F. (2022). A Comparison of Four Methods for Automatic Delineation of Tree Stands from Grids of LiDAR Metrics. Remote Sensing, 14(24), 6192. https://doi.org/10.3390/rs14246192

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Comparison of Four Methods for Automatic Delineation of Tree Stands from Grids of LiDAR Metrics

Abstract

1. Introduction

2. Materials and Methods

2.1. Study Sites and Field Data

2.2. Data Acquisition and LiDAR Metrics

2.3. Methods

2.3.1. Region-Growing

2.3.2. Cellular Automaton

2.3.3. Self-Organizing Map

2.3.4. Simulated Annealing

2.4. Post-Processing

2.5. Fine-Tuning of CA, SOM, and SA

2.6. Statistics Calculated for the Segmentations

3. Results

3.1. Parameter Fine-Tuning Results

3.2. Delineation Maps

3.3. Numerical Statistics

4. Discussion

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI