Inﬂuence of Scanner Position and Plot Size on the Accuracy of Tree Detection and Diameter Estimation Using Terrestrial Laser Scanning on Forest Inventory Plots

: This research tested how di ﬀ erent scanner positions and sample plot sizes a ﬀ ect the tree detection and diameter measurement in forest inventories. For this, a multistage density-based clustering approach was further developed for the automatic mapping of tree positions and simultaneously applied with automatic measurements of tree diameters. This further development of the algorithm reduced the proportion of falsely detected tree locations by about 64%. The algorithms were tested in di ﬀ erent settings with respect to the number and spatial alignment of scanner positions and under manifold forest conditions, covering di ﬀ erent age classes and a mixture of scenarios, and representing a broad gradient of structural complexity. For circular sample plots with a maximum radius of 20 m, the tree mapping algorithm showed a detection rate of 82.4% with seven scanner positions at the vertices of a hexagon plus the center coordinates, and 68.3% with four scanner positions aligned in a triangle plus the center. Detection rates were signiﬁcantly increased with smaller maximum radii. Thus, with a maximum radius of 10 m, the hexagon setting yielded a detection rate of 90.5% and the triangle 92%. Other alignments of scanner positions were also tested, but proved to be either unfavorable or too labor-intensive. The commission rates were on average less than 3%. The root mean square error (RMSE) of the dbh (diameter at breast height) measurement was between 2.66 cm and 4.18 cm for the hexagon and between 3.0 cm and 4.7 cm for the triangle design. The robustness of the algorithm was also demonstrated via tests by means of an international benchmark dataset. It has been shown that the number of stems per hectare had a signiﬁcant impact on the detection rate.


Introduction
The major purpose of forest inventories is to provide relevant information on the status and changes of forest landscapes. Forest inventory data not only are required for decision making in forest management, but also are used as an empirical platform for manifold research activities. In order to provide reliable and precise information on traditional attributes, such as average growing stock timber volume or tree count, measurement errors besides the design-based sampling variance should be kept as small as possible.
It turns out that, owing to the many possible combinations of parameter settings, such as hardware properties, scanner position variations, underlying forest structure diversity, and nonhomogeneous quality criteria, a comparative assessment of the results from the different TLS studies is difficult [6]. For this reason, an international benchmark project [6] was initiated in order to evaluate the performance of different algorithms under predefined reference conditions. With regard to the forest structure, the benchmark data were divided into "easy", "medium", and "difficult" plots. Furthermore, as consistent quality criteria, "completeness" (detection rate) and "correctness", and "mean accuracy" as a combination of the two, were defined. Within the concerted activities of the benchmark project, the mean accuracy achieved with the different approaches using a multiscan setting ranged between 74% and 53% under medium and difficult conditions, respectively [6].
Despite the promising benefits of TLS, in terms of providing high-resolution 3D information, a gold standard approach for automatic tree detection and stem diameter measurement is still lacking. Consequently, TLS is so far not exclusively used as a standard measurement technique in forest inventory practice. This is especially because even well-performing approaches generally show high variance with respect to the quality criteria and when tested with different scanner settings and under various forest structure scenarios.
The goal of the present study was to investigate the influence of scanner position and plot size on the accuracy of tree detection and diameter estimation in forest inventory. In the course of this, an algorithm for the automatic detection of stems, which has been successfully used for the mapping of complete forest stands in a forest monitoring context [7], was further developed. The aim of this further algorithm development is to reduce the proportion of falsely detected tree locations. The enhanced multistage clustering algorithm was tested on smaller forest inventory sample plots with varying scanner positions and numbers of scans. It was also examined how the accuracy of automatic stem detection was influenced by the structural diversity of the forest. Finally, the algorithm was applied to reference data provided by the international benchmark project in the work of [6].

Study Area and Sample Plots
The BOKU Institute of Forest Growth maintains a permanently repeated forest inventory in the forest district of Ofenbach located in the federal state of Lower Austria near the village of Forchtenstein. The forest inventory was initiated by Schodterer [60] in 1989 and comprises a total of 554 sample plot locations. Sample plots are systematically aligned in a regular grid with a mesh width of 141.4 m × 141.4 m. At each sample point, Bitterlich relascope sampling [61][62][63] was conducted using a constant basal area factor of 4 m 2 per ha. Trees were recorded only if their dbh (measured from one direction with caliper in 1.3 m height) was greater than or equal to a lower threshold of 5 cm. For the present study, a subsample of 23 plots was selected in such a way that possible variations in forest type (broadleaved, coniferous, and mixed), forest structure, and terrain property (flat to steep) were represented. The Bitterlich method produces an incomplete tree pattern that results from a size-related and distance-dependent thinning. In order to obtain reference information on the existing trees that were thinned by the Bitterlich process, their positions and dbh (measured from one direction in 1.3 m height) were manually measured in the 3D point cloud. By doing so, only trees with a dbh of 5 cm or greater and a distance of 20 m or less from the sample plot center were measured. In addition, covariates were derived for each sample plot, among them mixture type, main species, Shannon index [64], existence of regeneration, and terrain slope. Using the sample tree data, further attributes were derived: basal area per hectare (BA/ha), quadratic mean diameter (dm), stand density index (SDI) [65], stem density in terms of number of trees per hectare (N/ha), coefficient of variation of diameter at breast height (CVdbh), dbh differentiation according to Füldner [66] (Di f f _Fuel), Clark and Evans aggregation index [67] (CE), and proportion of deciduous trees (p_dec). Table 1 shows summary statistics (number of trees, mean, standard deviation, min, and max) of the metric sample plot parameters. A complete description of the 23 sample plots can be found in Appendix A, Table A1.

Instrumentation and Data Collection
Twenty-three sample plots were scanned in February-March 2018 using a FARO Focus3D X330 terrestrial laser scanner (Faro Technologies Inc., Lake Mary, FL, USA). This scanner uses phase-shift light detection and ranging (LIDAR) technology to measure the XYZ positions of objects. The maximum range of the scanner is 330 m and it can measure 9.76 × 10 5 points per second. The distances are measured with a continuous wavelength of 1550 nm and an accuracy of ± 2 mm at 10 m distance. The field of view is 360 • × 300 • . With a battery capacity of approximately 4.5 h, a weight of 5.2 kg, and a compact design (240 mm × 200 mm × 100 mm), the scanner is easy to handle in forestry measurement campaigns. The scan quality parameter was set to 4×, resulting in moderate noise reduction. The quality parameters 1× through 4× are different expressions for the observation time: 1× has the smallest observation time of 1 µs per scan point, and 4× has 8 µs per scan point. The resolution was set to r = 6.136 mm/10 m, which is 1/4 of the maximum possible resolution (r max = 1.534 mm/10 m). Furthermore, the hardware settings "Clear Sky" and "Clear Contour" were used to filter incorrect scan points. These settings resulted in a pure scanning time plus photographs of about 11 minutes per scan, and it took approximately 3 minutes more to set up the scanner at a different position, as is necessary in multiscan mode.
A set of nine scan variants was applied, differing in the position of the scanner and the number of scans per plot (see Table 2). The most intensive scan, variant 1, comprised seven scanner installations, among them six positions arranged in a regular hexagon with a constant edge length of 15 m and centered at the sample point location, plus an additional location positioned at the sample point. Scan variant 9 had the lowest intensity and was the single scan mode, in which the scanner was placed only at the sample plot center. The other seven scan variants were constituted as subsets of the full scan, variant 1, comprising a hexagon without a central scan (variant 2) and other geometrical arrangements in the form of a triangle, a rectangle, and a diagonal, each of which was applied in two subvariants, either including or excluding an additional central scan position.  with center  7  77  21  5  103  2  Hexagon without center  6  66  18  5  89  3  Triangle with center  4  44  12  5  61  4  Triangle without center  3  33  9  5  47  5  Rectangle with center  5  55  15  5  75  6  Rectangle without center  4  44  12  5  61  7  Diagonal with center  3  33  9  5  47  8 Diagonal without center 2 22 6 5 33 9 Single scan (only center) 1 11 3 0 14 Figure 1 shows the spatial arrangement of the scanner positions and reference spheres on a sample plot with a 20 m radius. For the most intensive scan, variant 1, the distance between each neighboring scanner position was constantly 15 m. However, if a scanner position was closer than 0.5 m to the nearest tree, it was allowed to be relocated by a maximum distance of 0.5 m in an appropriate direction. To enable easy co-registration of the individual scans, eight Styrofoam balls with a diameter of 150 mm and an additional ball with a diameter of 300 mm mounted on monopods were installed as reference targets on each sample plot. Two sets of the reference targets, each containing four targets, were arranged in regular squares with side lengths of 14.1 m and 7.1 m. That is, the target points of the inner rectangle were 5 m from the sample point, and those of the outer rectangle were 10 m from the sample point (see Figure 1). Once the center scan was completed, the plot center was marked with the reference target ball with a diameter of 300 mm. Figure 1. Scan settings. Numbers below scanner positions (marked as triangles) denote scan variants in which each respective scanner position was used. For the most intensive scan variant, variant 1, the distance between each neighboring scanner position (triangle) was constantly 15 m. The Styrofoam balls as reference targets (filled circles) were located in two concentric squares. The outer square had a side length of 14.1 m and the distance of each reference target to the center was 10 m. The inner square had a side length of 7.1 m and the distance of each reference target to the center was 5 m. In addition, the two squares were rotated by 45 • . In scan variants without a central scan (2,4,6,8), an extra reference target was placed at the center location.

Point Cloud Processing
Co-registration of the raw scan data was performed separately for each scan variant and by means of the Styrofoam reference targets using the FARO SCENE 6.2 program [68]. A constant cutoff distance of 30 m was chosen for each of the seven scanner positions. The aligned scans were then further processed to obtain a complete 3D point cloud for each plot and scan variant containing X, Y, and Z coordinates. These point clouds were then clipped with an upright-oriented cylinder with a radius of 21 m and centered at the sample point. For processing of the point clouds in the subsequent steps, the LAStools software package (rapidlasso GmbH, Gilching, Germany) [69] was used. The point clouds were split into square tiles with an edge length of 10 m plus an additional buffer of 2 m width on each side. Each tile was then processed with the LAStools "lasnoise" noise filter. Subsequently, all points were classified into ground and non-ground points, using a modified version of the algorithm by Axelsson [70], which is implemented in LAStools as "lasground". The non-ground points were normalized relative to the ground-derived digital elevation model (DEM), and the ground points were discarded from the data. After noise reduction and ground classification, the tiles were reassembled to form a single point cloud for each sample plot. The resulting point cloud data were exported in a tab-delimited text file format. Further data analysis was performed using R statistical software (R Foundation for Statistical Computing, Vienna, Austria) [71]. A step-by-step overview of the entire workflow is given in Appendix A, Table A2.

Clustering, Detection of Tree Positions, and dbh Measurement
Automatic detection of stem positions and automatic measurement of dbh were carried out using a methodology demonstrated by Ritter et al. [7] that was further refined during the present study. The workflow as well as the changes and innovations compared with Ritter et al. [7] are shown in Table A2. The fundamental principle of the automatic tree detection approach is a density-based clustering algorithm proposed by Rodriguez and Laio [72] that is implemented in the R-package densityClust [73]. The clustering algorithm is solely based on the Euclidean distance between data points. Cluster centroids are defined by local density maxima. The clustering algorithm is based on the rationale that cluster centers (i) are surrounded by areas of lower local density and (ii) are at a relatively large distance from other local density maxima [72]. For each data point i, the local density ρ i and distance δ i from points of higher density are calculated, depending on the distances between data points and a constant cutoff distance d c .
Our stem detection algorithm uses such a density-based clustering in two subsequent phases. In the first clustering phase, the 3D point cloud of normalized Z values was stratified into vertical layers, each of which had a height of 21 cm and a vertical overlap of 20 cm with the layer lying above. The lower border of the lowest layer was placed 1 m above the normalized zero height and the upper border of the topmost layer at a height of 2.88 m. This procedure resulted in 168 vertical layers. From each of these layers, 20,000 points were randomly sampled, and if a layer contained fewer than 20,000 points, the whole point set was selected. Layers with fewer than 500 points were excluded from cluster analysis. The density-based clustering was separately applied to the points selected from each layer.
In the second clustering phase, the cluster centroids from the first phase were additionally clustered. For that purpose, the cluster centroids from the first phase were projected onto a plane by simply dropping their Z-coordinates. Subsequently, the phase-1 cluster centroids were filtered so that the remaining set contained only points with at least 15 neighbors at a 5 cm distance. This filtering caused noise reduction and discarded first-phase centroids that represented branches or regeneration. The filtered phase-1 centroids were then subjected to the phase-2 clustering, which resulted in preliminary estimates of the existing tree positions.
The XY-coordinate range of all points contributing to the phase-2 cluster was used to construct a clipping window around every preliminary tree position estimate. The original 3D point cloud was then clipped accordingly and stratified into 13 vertical layers, each of which had a vertical extent of 15 cm. The lower boundary of the lowest layer was placed at a 1 m height above ground, and the lower boundary of the topmost layer was at a 2.5 m height. The distance between two neighboring layers was 12.5 cm, resulting in a 2.5 cm overlap of neighboring layers. The 3D points from each layer were projected onto the horizontal plane, creating 2-dimensional XY points. Subsequently, circles were separately fitted to the points of each layer using the circular cluster method of Muller and Garlipp [74]. The method was implemented in the R package edci [75]. A regular grid of 25 × 25 starting points with a mesh width of 3.2 cm was chosen as initial circle centers for the optimization. In addition, a sequence of five starting radii beginning at 1 cm, ending at 50 cm, and with a step width of 9.8 cm was chosen at each grid point. This procedure resulted in 3125 starting values for each of the 13 layers. After accomplishing the circle fit in each vertical layer, annulus-shaped buffer zones were constructed around the circle arcs. Each buffer annulus had a width of 20% of the respective circle radius and an outer radius of 110% of the circle radius. The proportion of points falling into this buffer zone was calculated in relation to the total number of points associated with each layer. The decision as to whether a tree position was finally proposed at a phase-2 cluster centroid location was determined by the results from the circle fits. Trials revealed that the decision could be safely made depending on the following three criteria: for 6 out of the total 13 layers, (i) the circles' buffer zones should include at least a 55% proportion of the total amount of clipped points, (ii) the standard deviation of the XY position coordinates must be less than or equal to 0.02 m, and (iii) the standard deviation of the diameter measurements must be less than or equal to 1.5 cm. The preliminary proposal of a tree location was rejected if at least one of the three criteria was not met for any of the 1716 total possible combinations of 6 samples from the 13 layers. If several combinations met the three criteria, the one with the highest average number of points in the buffer zone across the six layers was further analyzed to derive the final tree position and dbh measurement. For measurement of the final dbh, it proved to be appropriate to choose the layer closest to the 1.3 m height. However, if a circle fit for a layer around the 1.3 m height was not included by the collective of the six selected layers, the layer closest to the 1.3 m height was successively chosen and the diameter measurement for this layer was corrected by a positive/negative offset of ±1.34 cm m -1 per deviation from the 1.3 m height. The center of the respective fitted circle was then chosen as the final tree position.

Reference Data
Reference data for the actual dbh values and tree positions were derived from a combination of field data and manual measurements in the co-registered 3D point clouds. For trees that were included by the Bitterlich relascope sampling, the tree positions and dbh values from the field measurement data served as reference data. For trees that were located within a range of 20 m from the sample point, but were excluded by the Bitterlich sampling, the reference locations and dbh were measured manually within the point cloud using FARO SCENE 6.2 [68]. Because measurement errors could occur with both methods, either the automatic routines or the manual measurements in the field and within the point cloud, it was not guaranteed that the estimated reference position coordinates would be equal to their counterparts in the reference data for any given tree. Thus, a reliable assignment of estimated tree positions to corresponding reference coordinates was required. This assignment was automatically performed using the function pppdist() from the R-package spatstat [76]. This function performs a mapping of two similar point patterns by minimizing the average Euclidean distance between matching points. Automatically detected tree positions that could not be assigned to a tree measured in the reference data were manually double-checked in the TLS point cloud. Depending on the results of the check, either the automatically detected tree position was marked as a false positive, or if the tree was visible in the point cloud, but simply missed during the reference measurements in the field, the coordinates were added post hoc to the reference data.

Test by Means of Independent Reference Data from an International Benchmark Project
To test the performance of our algorithms by means of other independent reference data, our methodology was also applied to the publicly available TLS dataset, comprising six sample plots that were collected for an international TLS benchmarking project [6] in Evo, Finland. Eighteen partners from Asia, Europe, and North America participated in the project and submitted their retrieved tree attributes for evaluation. Out of a total of 18 participants, 15 partners provided results achieved with their own algorithms for tree detection and dbh measurement.
The TLS data of the benchmarking project was collected on 24 sample plots with different characteristics (categories), and 6 of them including reference field data have been made publicly available for nonprofit research purposes. The "easy" category represents sample plots that had clear stem visibility with minimal understory vegetation and low stem density, the "medium" category represents sample plots with moderate stem density and sparse understory vegetation, and the "difficult" category represents sample plots that had high stem density and dense understory vegetation. All scans were carried out in both single and multiple scan modes. The reader is referred to the original work by Liang et al. [6] for further details.

Accuracy of Tree Detection and dbh Measurement
The accuracy of tree detection was evaluated in terms of three measures: detection rate d r (%), commission error c(%), and overall accuracy acc(%). These measures were calculated as follows: where n match is the number of correctly found reference trees; n ref is the total number of reference trees; n falsepos is the number of tree positions, which could not be assigned to an existing tree in the reference data; n extr is the number of automatically detected tree positions (n match + n falsepos ); and o(%) is the omission error, defined as 100% − d r (%). The detection rate d r (%) measures the proportion of correctly detected tree locations. The commission error c(%) measures the proportion of falsely detected tree locations. The overall accuracy acc(%) is a combination of the latter two metrics and represents a global quality criterion. The precision of the automatic dbh measurements was assessed by means of root mean square error (RMSE) calculated as the square root of the average quadratic distance between the automatic measurementd bh i and the corresponding reference measurement dbh i .
The accuracy of the automatic dbh measurements was assessed in terms of bias: The latter two criteria were also calculated as relative measures, as percentage RMSE% and percentage bias%: with dbh = 1 n match n match i=1 dbh i (8) being the average dbh of the reference data.
In order to examine whether and to what extent the accuracy of the automatic tree detection and the accuracy and precision of the dbh measurements were influenced by the distance of a tree to the sample plot center, the performance measures described above were separately assessed for four subplot sizes defined by disks with radii of 20 m, 15 m, 10 m, and 5 m. In addition, whether the performance of the automatic routines showed trends depending on the size of the trees was also analyzed. For this purpose, the performance measures were evaluated for subsets defined by three lower dbh thresholds of 5 cm, 10 cm, and 15 cm. Finally, the nine scan variants were ranked with respect to (i) the associated labor costs, (ii) the overall accuracy on average achieved with the different sample plot radii and dbh thresholds, and (iii) the average RMSE.
As it turned out that scan variant 3 would provide a good compromise between performance and labor costs, its performance was analyzed in depth with respect to the influence of an applied maximum radius (plot size) and the effects of stand characteristics and topographic measures. To evaluate the significant effects of the above-mentioned covariates, we tested the relationship between detection rate/commission error and the stand and terrain characteristics given in Table 1. A logistic regression model was formulated using the glm() function in R. As the detection and commission rate are proportional data, a quasibinomial logit model has been fitted. Before that, the predictor variables were plotted against the dependent variable and in the course of modeling, and any plausible interactions were also tested. As interpreting the model parameters is difficult because of multicollinearity problems, the number of predictor variables was reduced with the support of correlation coefficients. Finally, the saturated model for the logistic regression was formulated as follows: with where y is either the detection rate or the commission error between 0 and 1, η is a linear predictor, and β 0 , . . . , β 7 are linear coefficients. A backward selection of predictor variables was applied to these full models, resulting in models from which Efron's pseudo coefficient of determination [77] was calculated.

Detection of Tree Positions
The analysis of the 23 plots showed that the detection rate d r (%) over all scan variants strongly depended on the lower dbh threshold and the sample plot radius applied as a maximum radius ( Figure 2). In general, the detection rate increased with an increasing lower threshold for dbh and a decreasing maximum radius. Furthermore, it can be seen that scan variants with fewer scans generally have lower detection rates. Scan variants without a scan position at the sample plot center (2, hexagon; 4, triangle; 6, rectangle; 8, diagonal) showed lower detection rates than their counterparts that included a central scan position (1, hexagon; 3, triangle; 5, rectangle; 7, diagonal). This difference between scan variants with and without a central scan position was greater with a smaller number of scans. This means that the difference is smallest between scan variants 1 and 2 (7 and 6 scanner positions, respectively) and largest between variants 7 and 8 (3 and 2 scanner positions, respectively). Under plot radii of 15 m, 10 m, and 5 m, single scan variant 9 showed higher detection rates than variant 8 (diagonal without center). The average detection rates over all 23 sample plots and across all scan variants, dbh thresholds, and plot radii ranged from 34.4% to 92% (see further details in Appendix A, Table A3). In comparison with the omission error (100% − d r (%)), the commission error (c(%) ) was less relevant (see Figure 3). However, the commission error was likewise dependent on the settings and increased with increasing lower dbh threshold and maximum radius. The commission error differed only slightly among scan variants, especially for lower plot sizes. However, single scan variant 9 mostly resulted in lower commission errors. The average commission error over all 23 sample plots and depending on scan variant, dbh threshold, and plot radius ranged from 0% to 3.8% (see Appendix A, Table A4 for further details).
Overall accuracy rates (acc(%)), as a combination of detection and commission rates, are presented in Figure 4. Overall accuracy showed similar behavior to the detection rate depending on dbh threshold, plot radius, and scan variant. Average overall accuracies across all sample plots were between 33.9% and 90.2% (details are provided in Appendix A, Table A5).

Estimation of dbh
The performance of the automatic dbh estimation is outlined in Table 3 (RMSE) and Table 4 (bias) for the different scan variants, lower dbh thresholds, and plot radii. Boxplots of the data can be found in Appendix A. It turns out that scan variants with fewer scans generally showed a higher RMSE and bias. Single scan variant 9 had the largest RMSE and bias of all scan variants. With an increasing lower dbh threshold, bias increased, whereas RMSE remained nearly constant. With decreasing plot radii, RMSE decreased, while bias remained nearly constant. Scan variants without a center scan (2, hexagon; 4, triangle; 6, rectangle; 8, diagonal) mostly had higher RMSE compared with their complete counterparts that included a central scan position (1, hexagon; 3, triangle; 5, rectangle; 7, diagonal); the differences were especially pronounced between scan variants 7 and 8. With regard to bias, a clear difference existed only between scan variants 7 (diagonal with center) and 8 (diagonal without center) when comparing variants with and without the central scan. Average RMSE over all 23 sample plots and across scan variants, dbh thresholds, and plot radii ranged from 2.7 cm to 7 cm. Average relative RMSE was between 8.9% and 23.4%. Absolute bias ranged from 0.5 cm to 4.5 cm and relative bias from 1.8% to 14.5%.     Table 5 shows the ranking of individual scan variants with respect to standardized labor costs (in terms of working time), standardized overall accuracy, and standardized RMSE of the dbh measurements. The overall accuracy and RMSE for each scan variant were calculated as average values for the different settings (plot radii and dbh thresholds) based on the figures presented in Tables 4  and A5. Scan variant 9 (single scan) was associated with the least working time (standardized value of 0), and scan variant 1 (hexagon with center) had the most working time (standardized value of 1) with seven scan positions. The best overall accuracy (78.15%, standardized value of 0) was achieved with variant 1 (hexagon with center) and the worst (51.47%, standardized value of 1) was with variant 8 (diagonal without center). In terms of RMSE, scan variant 1 (hexagon with center) showed the best performance (RMSE of 3.59 cm, standardized value of 0), and scan variant 9 (single scan) had the poorest performance (RMSE of 5.93 cm, standardized value of 1). According to an overall ranking, scan variant 3 could be regarded as superior (lowest standardized value of 0.257), and scan variant 9 showed the worst overall performance (highest standardized value of 0.257). Variant 3 (triangle with center), with a standardized working time of 0.528, a standardized overall accuracy of 0.046, and a standardized precision of automatic dbh measurement of 0.197, represented a good compromise between labor cost and accuracy of automatic stem detection and dbh measurement.

Relationship to Plot Radius and Environmental Variables
The relationship between overall detection accuracy and maximum radius, that is, sample plot radius, was analyzed in more detail for scan variant 3 (triangle plus central scan). The results are shown in Figure 5. The curves of the accuracy rates had a similar shape for all three lower dbh thresholds. However, the level of the curve, that is, the overall accuracy, was lowest for the 5 cm threshold and highest for the 15 cm threshold. For each of the three applied dbh thresholds, overall accuracy remained almost constant across a range of maximum radii between 5 and 10 m, and overall accuracy continuously declined for maximum radii greater than 10 m. Corresponding graphs for the other scan variants are shown in Appendix A (Figures A5 and A6). Analysis revealed that the curves of scan variants without a central scan position (variants 2, 4, 6, 8) had a flatter trend than the curves of scan variants with a central scan (variants 1, 3, 5, 7). Among all of them, the curves of scan variant 9 (single scan mode) had the steepest trend.  The results of the logistic regression analysis of the influence of environmental and stand variables on detection rate and commission error are presented in Table 6. For the detection rate, only the number of stems per hectare (N/ha) proved to be a significant covariate. It turns out that the odds for detection decrease as the number of trees increases. The pseudo-coefficient of determination was 0.627 for the full model and 0.469 for the final model. Figure A7 in Appendix A shows the response curve of the final model for the detection rate. Significant covariate effects could not be found for the commission error.

Results from Tests by Means of International Benchmark Data
The results of our algorithms applied to the independent international benchmark dataset are provided in Tables 7 and 8. In order to enable an exact comparison with the results presented in the benchmark study, additional quality metrics were computed according to suggestions by the authors of that study. The completeness criterion is synonymous with the detection rate used in our study; correctness is defined as correctness (%) = 100 − c(%); and mean accuracy as mean accuracy = 2×n match (n ref +n match ) × 100, where c(%) is the commission error, n match is the number of correctly detected tree positions, n ref is the total number of reference trees, and n extr is the number of automatically detected tree locations. Bias and RMSE were defined the same as in our study and as described in the Methods section. In the paper of the benchmark study, the results from 14 approaches were evaluated in terms of the aforementioned criteria. The performance of our algorithms was assessed in relation to the performance ranking of the 14 approaches. That means the performance of our routines could be ranked between 1 and 15. In multiple scan mode, the following results were achieved. The completeness ranged from 34.2%-42.6% for difficult plots to 85.7%-92.2% for easy plots. In contrast, the correctness ranged from 97.6%-100% for difficult plots to 84.7%-92.2% for easy plots. This resulted in a mean accuracy between 50.6% and 92.2%. Compared with the 14 benchmark algorithms, an average rank in completeness of 9.7, in correctness of 5.3, and in mean accuracy of 7.3 was achieved. The dbh could be measured on average with 2.4 cm RMSE (13.3% relative RMSE) and −0.9 cm bias (−5.1% relative bias). This corresponds with rank 5.7 in RMSE (rank 6.3 in relative RMSE) and rank 7.7 in bias (rank 9 in relative bias). For single scan data, we achieved an average completeness of 33.3%, correctness of 90.5%, mean accuracy of 45.2%, RMSE of 4.7 cm (23.1%), and bias of −2.3 cm (−11.0%). The average rank achieved with our routines and applied to single scan data was between 7.3 and 13.

Comparison with Other Studies
The algorithms demonstrated in the present study work fully automatically. For each sample plot and scan variant, constant parameter settings were applied for the underlying clustering algorithms and circle fitting routines. As a result, the proposed methodology proved to be robust and provide high flexibility under different forest structure scenarios and in different scanning setups. The further development of the algorithm successfully reduced the proportion of falsely detected trees (commission rate) in scan variant 1 from 4.51% (sd ± 1.52%), achieved by the method of Ritter et al. [7], to 1.67% (sd ± 1.16%). For scan variant 3, it was feasible to reduce the commission rate from 3.91% (sd ± 1.70%) to 1.36% (sd ± 0.87%). This resulted in a reduction of falsely detected trees of around 64%. Commission rates of 4%-5% are not a major problem for intensive forest monitoring purposes and on experimental stands, as the study sites are usually visited multiple times for additional measurements-thus, these falsely detected trees can be corrected relatively easy. In contrast, forest inventory sample plots are visited only once per inventory period, which makes it difficult to deal with falsely detected trees.
The algorithm used in this study was developed and trained based on 23 sample plots of the BOKU forest inventory (see Section 2.1). Using an independent data set [6], the performance of the algorithm was evaluated untrained and with the same parameter settings. In comparison with the 14 other approaches that were tested in the benchmark study [6], it turned out that our algorithms had medium performance. However, especially on difficult plots of the benchmark data set, high correctness rates of 100% and 97.6% could be achieved.
To our knowledge, so far only Zhang et al. [78] have used the benchmark dataset [6] to test the performance of novel algorithms. In multiscan mode, they achieved mean values for completeness of 59.94%, for correctness of 97.27%, and for accuracy of 71.98% [78]. The average RMSE of the diameter fit was 2.67 cm and the corresponding average bias was −1.42 cm [78]. In comparison, in multiscan mode, our tree detection algorithms performed slightly better in terms of completeness and a little worse in terms of correctness, thus the mean accuracy was at a similar level. Our diameter fit algorithms performed slightly better in terms of RMSE and bias.
In single scan mode, Zhang et al. [78] achieved a mean completeness of 40.16%, mean correctness of 96.65%, and mean accuracy of 52.21%. The average RMSE of the diameter fit was 4.10 cm and the average corresponding bias was −2.82 cm. In comparison to our study, in single scan mode, the algorithms of Zhang et al. [78] performed better in all terms.

Comparison of Scan Variants, Tree Detection, and dbh Measurement
The nine scan variants applied in the present study differed with respect to the number of scans and the geometric alignment of the scanner locations. Key to a successful application of automatic routines in TLS-based forest monitoring is the quality of the point cloud. Even on a sample plot with only a 20 m radius, the 3D point cloud could contain up to millions of points. This makes the data processing highly computational. Despite the vast number of pointwise measurements in 3D, the problems introduced by occlusion and noise are still relevant in the context of TLS-based surveys. Our results showed that regardless of scan variant, the application of a central scan can be regarded as mandatory, especially for the task of automatic stem detection. This especially came into effect when scan variants 8 and 9 were compared. In our practical tests, the single scan mode (variant 9) with only one scan position at the sample plot center achieved higher detection rates than double scan variant 8 with two peripheral scanner positions. The importance of the central scan could also be explained by the fact that the central scan basically offers the opposite direction to all the additional scans. Further, on average, the central scan is at closer distance to the plot trees compared with any other location. Hence, our results conformed with the findings in Abegg et al. [46], in that occlusion could be most efficiently avoided through the application of a central scanner location.
In general, the detection rate decreases with fewer scans. This is because the density of the 3D point cloud data becomes thinner and occlusion effects are more likely occur with fewer scanner locations. Taking this fact together with plot size, it could be shown that the difference is more pronounced for large plots, whereas for small plots, the difference decreases or even disappears. Another problematic challenge is that with an increasing number of scans, noise can be more easily produced because of the more challenging co-registration of multiple scans. This phenomenon was likewise confirmed by other existing studies (e.g., [5,6,42,79]).
It was also interesting to see that scan variant 5 with five scanner positions achieved similar and sometimes even lower detection rates than scan variant 3 with only four scanner positons. This was probably caused by two effects: first, the extra scan position in variant 5 could have introduced co-registration errors; and second, the rectangular alignment of the scanner positions in variant 5 was associated with unequal distances between the outer scanner positions, whereas the scanner positions in the triangle alignment of variant 3 had constant outer distances and constant distances between the central and outer scanner positions. That is, if the outer scanner positions were more evenly distributed around the plot in terms of angles, they had a better potential to observe different sides of the trees. In this regard, our experience was also in line with the findings in Abegg et al. [46], who found that the scanner locations should be nearly uniformly aligned in space, such that the inter-scanner distances become constant.
Furthermore, it could clearly be demonstrated that the accuracy of tree detection generally increases with decreasing plot size and increasing dbh threshold. Other studies (e.g., [5,6,46]) also showed that dbh and other tree-size metrics have a strong influence on detection rates. Abegg et al. [46] found a close relationship between the number of small laser beams hitting an object and the angular scanner resolution, object size, and distance to the scanner.
The accuracy of dbh measurements was almost stationary among scan variants. However, scan variant 8 and, in particular, scan variant 9 were exceptions and were associated with significantly worse accuracy and precision with regard to dbh measurement. Liang et al. [6] likewise found that the precision and accuracy of dbh estimation were almost constant among stand complexity categories. This would imply that the scan variant has only a minor effect on the precision and accuracy of dbh measurements.
It was demonstrated that plot size (maximum radius) has major impact on tree detection rates. Several studies (e.g., [1,5,38,80]), especially in the context of single scan mode, likewise confirm our results. It was also interesting that the influence of the plot size varies depending on the scan variant. Those scan variants without a center scan (2,4,6,8) have a much lower detection rate up to a plot radius of 10 m than the comparable full variants. As was found in Kankare et al. [81], the main problem of TLS data is that the point density decreases with the distance to the scanner. As a possible solution, the methodology of distance sampling [82,83] could be applied to correct for the nondetection of trees standing at larger distances [38,80] from the scanner location. The basic idea of this methodology is to use measured distances between sample objects and sample points to draw inference about distance dependent detectability, which is used to correct for non-detection [84]. Another approach was proposed by Olofsson and Olsson [85], who chose the visible region around a scanner position as a sampling window in which stem density and diameter distribution were estimated. In the context of single scans, an alternative approach was suggested by Seidel and Ammer [86], who showed that forest parameter estimates can be improved by compensating for the shaded areas behind existing trees. In our study, the outer scanner positions had a distance of 15 m from the sample point. On the basis of our results that the detection rate remained almost constant for plot radii up to 10 m, it might be worth testing a farther relocation of scanner positions toward the sample plot boundary.
In principle, the performance of the algorithms used for the point cloud processing, position finding, and dbh measurement should not be influenced by seasonal effects. However, in the leaf-on stage during the vegetation period, the quality (completeness) of the point cloud is probably lower thanks to shadowing effects. In particular, in deciduous forests, a high stem density and a multi-layered structure could more easily produce co-registration problems during the vegetation period. In order to provide clear results, further research is needed in the future on how the methods are affected by such seasonal effects.

Ranking of Scan Variants and Logistic Regression
With regard to the ranking of individual scan variants, it is difficult to assess which criteria should be used. Of course, different criteria could be weighted in different ways. In the present work, the working time, overall accuracy, and RMSE of dbh estimation were calculated as standardized values, between 0 and 1, for the best and worst performance. This resulted in scan variant 3 as the best choice (Table 5), followed by scan variants 7, 5, and 6. If a weighting were applied in favor of working time, implying a higher penalization of labor cost, the variants with less working time would receive lower (better) ranks. Scan variant 3 consumed approximately 60 min of working time, which surely reflects the upper limit of tolerable field working cost per forest inventory sample plot. Of course, it could also be argued that TLS provides a great deal of extra information that so far has not been collected in traditional forest inventories. However, the efficiency of TLS-based forest inventories could be enhanced by adapting the intensity, that is, the number of scanner positions, to the local complexity of the forest structure on each sample plot. The TLS-based inventory could likewise be modified with respect to plot size and dbh threshold. Thus, the idea of multiple nested sample plots with varying radii around the same sample point should be tested in future research. This would imply that plot size is individually determined by tree size. This principle of concentric sample plots also has a long tradition in forest inventory practice [1,2,87,88].
The logistic regression for the performance measures of scan variant 3 showed that only the number of stems was included in the final model for the detection rate. Olofsson and Olsson [85] also found a similar relationship between the stem number and the visible area on TLS sample plots. Other variables that showed no significant effect were slope, dm (diameter of mean basal area tree), CVdbh (coefficient of variation of diameter at breast height), CE (Clark and Evans aggregation index), and p_dec (proportion of deciduous). However, graphic diagnostics via scatterplots showed at least some signs of correlation between the performance criteria and further structural attributes (especially for CE and CVdbh). For the commission rate, no significant stand or environmental variable was found at all. Currently, the pseudo coefficient of determination was only approximately 0.47, and only a small proportion of variance of the detection rate could be explained. In order to develop models with better goodness of fit, trials should be made in future research to derive additional explanatory variables. However, the sample size of only 23 plots available for the present study, and thus the limited number of degrees of freedom, make it impossible to add additional variables without introducing overfitting problems. Thus, more data should be collected in the future. If it were possible to describe a larger proportion of the detection rate with simple stand and environmental variables, then it would be conceivable to formulate a correction model. Bauwens et al. [89] used a linear mixed model with a random effect on the plot ID to test significant effects on the detection rate. This approach seems promising for repeated TLS inventories, because the plot-level random effects derived from past inventories could be subsequently used to improve such correction models in future applications.

Conclusions
The major goal of the present study was to investigate the influence of scanner position and plot size on the accuracy of tree detection and diameter estimation in forest inventory by means of TLS data. For tree detection and dbh measurement, a further developed multiphase point clustering approach with simultaneous circle fitting was used. This further development of the algorithm reduced the proportion of falsely detected tree locations by around 64%, resulting in less than 4% false detections. Our method proved to be applicable to manifold forest structure conditions comprising different species combinations, varying stem densities, and dense understory vegetation. The parameter setup was kept fixed for the different scan variants. It could be shown that the number of trees per hectare has the greatest influence on the detection rate. Across all scan variants, the detection rate showed an increasing trend with increasing lower dbh threshold and decreasing plot size. It was clearly demonstrated that the application of a central scanner position coincided with a high benefit. Depending on plot size and dbh threshold, between 65.3% and 90.5% of trees could be detected using a hexagon-shaped alignment of scanner positions together with an extra central scan. The latter design is associated with a field working time of approximately 103 min. By using a triangle-shaped alignment of scanner positions, including an additional central scan (four scanner positions), it was possible to detect between 59.1% and 92% of trees, given a working time of approximately 61 min per plot. The RMSE of the dbh measurement was between 2.66 cm and 4.18 cm for the hexagon and between 3.0 cm and 4.7 cm for the triangle design. It turned out that the triangle design obviously provides a good compromise between labor work and overall accuracy, and can be thus considered as the best option for application in forest inventory. With regard to the partly low detection rates, which will occur in some TLS inventories, there is a need for further research. For this purpose, first approaches have been described to correct the detection rate based on stand characteristics and environmental variables. The algorithms were successfully tested by means of an international benchmark dataset.
Decision making on forest resources relies on the precise information that is collected using sample-based inventories. With traditional instruments, such as calipers and tapes, the entire workflow from data collection to data analysis can be regarded as inefficient. This is because measurement data are still often recorded with pencils on paper and hereafter manually transcribed into electronic data bases. By doing so, entry errors can easily occur. In contrast, information is kept electronically throughout the entire workflow of TLS-based surveys. Thus, the high-resolution TLS data remains accessible with novel software routines in the future. It is also expected that TLS will be more often used in forest inventories as soon as the currently developed programs will be available in terms of free software routines, which can be applied with high flexibility under various hardware settings and different forest land cover scenarios.

Conflicts of Interest:
The authors declare no conflict of interest.      Figure A1. dbh RMSE in cm for different scan variants, lower dbh thresholds, and plot radii. Black squares represent mean dbh RMSE. Figure A2. dbh RMSE in % for different scan variants, lower dbh thresholds, and plot radii. Black squares represent mean dbh RMSE. Figure A3. dbh bias in cm for different scan variants, lower dbh thresholds, and plot radii. Black squares represent mean dbh bias. Figure A4. dbh bias in % for different scan variants, lower dbh thresholds, and plot radii. Black squares represent mean dbh bias.