An Examination of Diameter Density Prediction with 2 k-NN and Lidar 3

5USDA Forest Service Pacific Northwest Research Station, University of Washington, P.O. Box 352100, Seattle, WA 98195-2100, USA, handersen@fs.fed.us 6 Dept. of Forest Engineering, Resources and Management, Oregon State University, Corvallis, Peavy 204, Oregon 973315706, USA, hailemariam.temesgen@oregonstated.edu Preprints (www.preprints.org) | NOT PEER-REVIEWED | Posted: 30 September 2017 doi:10.20944/preprints201709.0168.v1


Introduction
Airborne scanning lidar technology (henceforth referred to as simply lidar) provides vegetation measurements which are highly related to forest attributes needed for forest inventory and monitoring.
Examples of forest attributes which are highly related to lidar measurements include bole volume, basal area, quadratic mean diameter, and above-ground biomass [1][2][3].The lidar vegetation measurements can be obtained with high precision over large areas enabling wall-to-wall measurements and predictions of vegetation attributes, as well as precise estimates of population parameters.Despite demonstrated advantages to using lidar for inventory and monitoring, there are also omissions from most analyses which inhibit common usage.One of the limitations of typical lidar research studies, is that they do not evaluate prediction of tree diameter distributions.
The distribution of diameters at breast height (dbh) is an important component of most forest inventory, management, and monitoring strategies.Dbhs are needed to describe stand properties because variables such as growth, volume, value, conversion-cost, product specifications, and future forest prescriptions are dependent on trees' dbhs.The use of single-tree level growth and yield models almost always requires dbh distributions.Dbhs are also used to assess forest sustainability based on whether the quantity and sizes of growing stock are suited to replace the current population of harvestable trees [4].Information about dbhs also informs the type and timing of management strategies and economic value of the stand [5].Ecological analyses also use dbh density information including, for example, assessments of vegetative diversity [6], insect disturbance mechanics [7], habitat suitability [8], and suitability and distribution of parent stock for coarse woody debris [9].
Dbh distributions are often simplified using a mathematical function with parameters that can be estimated or recovered using lidar measurements or other ancillary data.Dbh distribution functions can be predicted with both parametric and non-parametric strategies.Parametric strategies are based on the assumption that the dbh density can be characterized by a theoretical probability density function.A variety of theoretical functions have been tested including the beta [10,10], Weibull [11,12] and Johnson's SB [13,14].Two methods have been used to predict parameters of theoretical functions, the parameter prediction method and the parameter recovery method [15].As the name indicates, in the parameter prediction method stand attributes (or remote sensing data) are used to predict parameters of the probability density function.In the parameter recovery method moments or percentiles of dbh distribution are predicted or measured using stand variables.The parameters of a theoretical distribution are then recovered by leveraging the known relationships between the predicted attributes and the distributional parameters.
Non-parametric strategies attempt to predict percentiles of empirical distributions [16,17] or directly predict dbh bins or classes.While parametric strategies are advantageous in being able to represent a complete distribution with a few parameters when the empirical distribution is unimodal, they may have limited ability to represent complex and mixed species stands that may not have unimodal densities [18].Empirical strategies which retain the original data or relative densities by dbh bins provide more flexibility to accommodate many types of stand tables [17], however, some prediction strategies for multiple dbh percentiles can result in illogical behavior, benefitting from the introduction of constraints [19].
Various strategies for predicting dbh distributions with lidar have been evaluated.Gobakken and Naesset (2004) compared parameter prediction and parameter recovery methods in the prediction of stem number and basal area distributions.The underlying probability theoretical distribution was a two-parameter Weibull distribution.The precision was slightly better for the parameter recovery method than for the parameter prediction method.Mehtatalo et al. ( 2007) also used the parameter recovery approach, however, they proposed a method where the parameters of the assumed dbh distribution and height-dbh curve are determined in such a manner that they are compatible with the predictions of stand attributes.Maltamo et al. (2007) proposed another approach to obtain a compatible stand description.First the parameters of a Weibull distribution and stand volume are predicted with lidar.Then, the estimated stem number distribution is modified to correspond to the volume obtained in the previous step by using the calibration estimation approach proposed by Deville and Sarndal (1992).
Percentiles of dbh distributions have also been predicted with lidar.to estimate parameters of theoretical distributions with lidar.A benefit of GLM is that they can be a onestep procedure, without the need to first fit a distribution and then predict its parameters.Thomas et al.
(2008) studied the prediction of both unimodal and bimodal dbh distributions using a finite mixture model approach.They successfully predicted the parameters of separate Weibull functions, but because there was no lidar-based method for separation of distribution type (unimodal or bimodal), the applicability of the method is somewhat limited for lidar inventory.
An alternative strategy to predict dbh distribution with lidar is using k-nearest-neighbor (k-NN) imputation.An advantage of k-NN is that the k-NN model can be used to simultaneously predict a suite of response variables, including a tree-list.k-NN also provides compatible stand if stand attributes are predicted simultaneously with tree-lists.This was the motivation for the work by Packalén and Maltamo (2008).They predicted stand attributes and tree-lists simultaneously for Scots pine, Norway spruce, and deciduous trees and compared the performance of a tree-list approach to the use of a Weibull distribution approach.The k-NN tree-list strategy was able to mimic bimodal dbh distributions of Norway Spruce (the only shade tolerant tree species in the study area) and in general provided clearly lower error index values.Individual tree detection (ITD; alternatively referred to as single tree detection in some parts of the world) is an entirely different approach to predict dbh distributions.Unlike the previously described methods, it does not depend upon areal sampling and prediction.ITD can directly produce a tree-list, or dbh distribution [29].The quality of the dbh distribution is determined by the rate of detected trees and the precision of dbh modelling.It is well-known that ITD results in tree-lists that include many of the largest trees, but disproportionally omits trees below the dominant tree layer, which are not easily detected in the lidar.Peuhkurinen, Mehtätalo, and Maltamo (2011) reported that the saw log size proportion of the dbh distribution was more accurately predicted by ITD than with an area-based approach, but for the entire dbh range, the area based approach was more accurate.The degree of concern with under-representing small trees clearly depends upon the application.
When a diameter prediction strategy is evaluated, inference is typically made on differences between the observed and predicted distribution using hypothesis or goodness of fit tests, such as the Kolmogorov-Smirnov test.However, for reasons described extensively in Reynolds et al. (1988)  predictor variables.Based on our findings using the proposed indices, we conclude with recommendations on effective diameter distribution predictions strategies with lidar and k-NN.

Study site
The study was conducted at the U. S. Department of Energy's Savannah River Site, an 80,267 ha National Environmental Research Park in Aiken and Barnwell counties, South Carolina USA (Figure 1).The Savannah River Site is characterized by sandy soils and gently sloping hills dominated by pines, with hardwoods occurring in riparian areas.Prior to acquisition by the Department of Energy in 1951, the majority of Savannah River Site uplands were agricultural fields or had recently been harvested for timber.
The U.S. Department of Agriculture Forest Service has managed the natural resources of the Savannah River Site since 1952 and reforested the majority of the uplands with loblolly (P.taeda), longleaf (P. palustris), and slash (P.elliottii) pines.These pine stands are actively managed for timber and wildlife habitat.

Ground data collection
Plot measurements were performed on a grid of fixed radius circular plots designed for modeling forest attributes with auxiliary lidar data.The plot design consisted of two concentric nested fixed area circular measurement plots.The innermost 0.004 ha plot was used to measure trees between 2.5 and 7.4 cm in dbh.
Larger trees were measured on a 0.04 ha plot if there were at least 8 dominant or co-dominant trees, otherwise trees larger than 7.4 cm dbh were measured on a .081ha plot.The heights, dbhs, heights to crown base, and species were recorded for trees on the two concentric plots, and additionally trees between 2.5 and 7.4 cm were tallied on a 0.04 ha plot.
Plot locations were selected purposively to cover the range of tree sizes and stand compositions that occur on the Savannah River Site.Plot locations were taken from a set of approximately 629 inventory plot locations measured in a 2001 inventory and supplemented with locations in desired vegetation types.Field measurements were taken on 194 field plot locations selected purposively to sample across multiple vegetation classes and sizes.Of the 194 plots, 4 were dropped because it was determined that they were measured in locations outside of our target population.A summary of the tree and plot variables used for this study is provided in Table 1.Additionally, a visual representation of the empirical dbh density functions for the 8 most common species occurring on plots is shown in Figure 2. Additional forest attributes (besides dbh) used in our analyses included trees per hectare (TPH), basal area per hectare (m 2 / ha, BA), Lorey's height (m, Lor.), and total cubic bole volume (m 3 / ha, Vol.).
Plot locations were surveyed using L1/L2 GLONASS enabled survey-grade GPS receivers.The receiver was placed at each plot's center on a 3 m pole and a minimum of 600 Plots were assigned to post-strata using the dominant species group for the stand in which the plot was measured (the most common dominant types include: Hardwood -29 plots; Loblolly P. -76 plots; Longleaf P. -54 plots).All of the hardwood species were combined into a single stratum.Forestry staff for the site developed a tract-wide map of species groups by visually classifying stands in the field.Plots were assigned to strata by intersecting plot locations with the strata map.

Lidar data
Lidar data were collected from February 21 to March 2, 2009 with two Leica ALS50-II lidar sensors in leafoff conditions.One hundred and eighty-five (185) flight lines of data were acquired in 10 sessions across the project site.Table 2 provides acquisition parameters.
Lidar heights were processed to create predictor variables for this study using the cloudmetrics executable included with FUSION software [34].This executable computes a large number of statistics from lidar including, but not limited to, height percentiles (e.g. 90 th , 50 th , and 30 th percentile heights in Table 3) and lidar cover (the percent of returns above a threshold, in our case 1.5 meters).We also examined fraction without foliage (fwof), a modeled variable which used normalized intensity to suggest the proportion of the leaf-off lidar which did not intersect live foliage.

k-NN tree list imputation
In k-NN imputation, response variables from measured sites are shared or imputed with sites without measurements based on the degree of similarity in their auxiliary variables.The "similarity" in auxiliary variables is evaluated using a distance metric, e.g.Euclidean distance, where a large number of distance metrics have been demonstrated in the k-NN literature.The distance metrics are functions which determine how one or more auxiliary variables should be weighted and combined.The coefficients of the weight function can also depend on the observed association between response and predictor variables, theoretically weighting predictors which can better predict the response variable(s).If more than one nearest neighbor (k greater than one) is used, then a rule must be formulated to average (continuous) and select (categorical) donor response values.This procedure can be used to simultaneously impute a large number of response variables in a single step.
Procedurally, the process is as follows: 1) a distance metrics is computed between measured and unmeasured (response) observations, then 2) the k observations with the smallest distances (donors) are transferred (imputed) to the observation without a measured response (target).
For this study we relied upon the yaImpute package [35] implemented in R [36] for k-NN imputation.The identities of the donor plots (nearest neighbors) were used to impute tree lists, as yaImpute is not currently set up to directly impute tree lists.Based on the donors' identities, all of the tree records from the imputed donor plots were copied to the target observation.Each copied tree was then distance weighted to generate a tree list for the target observation.The distance weighted tree lists from the k neighbors then became the basis for prediction of the empirical dbh density for the target observation.The choice of a weighting function for the K imputed neighbors has been shown to have limited effect on performance [27].

k-NN imputation strategy components
We examined the effects of 4 components of a k-NN tree list imputation strategy including 1) the choice of distance metric, 2) the selected predictors, 3) the set of response variables, and 4) the effect of poststratification.The distance metrics evaluated include Euclidean distance (EUC.),Mahalanobis distance (MAH), most similar neighbors (MSN), and random forest (RF).MSN and RF distances both use response variables for measured observations in computing distances.Additional details for how these distances are defined can be found in the yaImpute package documentation [35].The effect of post-stratification on k-NN performance was evaluated by stratifying the data, and predicting separately within strata.Strata with fewer than 10 response measurements were imputed from the pool of all observation.

Dbh densities
The proportions of trees falling in diameter bins (the empirical dbh density, or just "dbh density") were computed by first binning lists of trees into 2.54 cm (1 inch) dbh bins and computing the proportions of all trees in the dbh classes.In the case of imputed tree lists, weighted dbh densities were computed using the distance weights from the imputed plots.The bin proportions were then smoothed with a 3-bin moving average centered on the target bins.The smoothing function was applied to emphasize major trends, and de-emphasize fine-scale fluctuations.Individual plot densities can have spikes, pits, and other characteristics that we did not wish to examine.

Measures of performance
Evaluation of k-NN predictions strategies were performed using (LOO) validation in combination with indices.In LOO, models are iteratively fit to the data while omitting one plot at a time.After fitting a given model, the data are then tested against the omitted plot.The errors in prediction for the omitted plots then serve as the basis for indices of performance.Our first suggested measure of performance is index H, Index H is equivalent to the coefficient of determination (or, commonly, R 2 ) which has a straight-forward interpretation -the proportion of variability explained.The index has an advantage over alternative measures of performance that we examined in that it provides a relative measure of performance.The baseline level of variability comes from plot variation in dbh densities around the mean dbh density within a dbh class for all of the plots.As with R 2 , smaller prediction error relative to baseline variability will yield index H values closer to one.Larger prediction residuals will in general cause index H to approach zero, and negative values of H are possible if the prediction strategy is inferior to prediction with the means model.Our inferences for this index are similar to those that would be made from R 2 .We rely heavily on index H for inferences about different k-NN dbh prediction strategies.While the H values are suggestive of general trends in performance, the index is not suited for identifying a single best prediction strategy.
A limitation of index H is that it is only meaningful in comparisons if the baseline variability is similar between compared strategies.In many cases this property will not hold.We expect, for example, to have greater variability amongst variable radius plots than amongst fixed radius plots for the same area; comparing their index H values then would be meaningless.To compare prediction strategies from two different inventory designs or two study areas with different levels of baseline variability, it is important to have an absolute measure of performance.A second limitation to index H is that the index is unitless, and it is often desirable to have an index in the same units as the attribute of interest.To support inferences from multiple designs in the units of the response, we propose a second index, I, which is an absolute measure of performance: Peer-reviewed version available at Forests 2017, 8, 444; doi:10.3390/f8110444better prediction performance.Index I can be used to compare performance across sites, designs, and project areas.
While we do not use p-values in this analysis, we recognize that some users will wish to use p-values.Pvalue can be fairly easily generated for the suggest indices with simulations.One simulation-based approach to obtain p-value is to randomly assign tree-lists to plots several thousand times, and compute index values for each randomization.This will yield a distribution of index values for the null model where lidar and k-NN provide no explanatory power.The distribution can then be compared to the observed index value for a particular lidar and k-NN configuration.The proportion of values which are as extreme as the observed index value will serve as the simulation-based p-value.

Results
Results are divided into two sections.In the first section -Index properties -we demonstrate the behavior of H under a variety of prediction scenarios.Our demonstration of index H gives a sense of the behavior of H under various conditions, and the degree of sensitivity of the index to disagreement between predicted and observed dbhs.Simulation results are shown only for index H (not I) for the sake of brevity, since the behavior of the two indices is nearly identical (inversely), although the interpretations are quite different: index H provides a measure of relative improvement over the mean model (higher is better), and index I provides a measure of absolute error in the units of the response that is portable between designs and studies (lower is better).
In the second section -K-NN strategies -we use index H to suggest superior dbh prediction strategies, then conclude with a table of H and I values for the best prediction strategies.We investigate number k of nearest neighbors, which distance metric is used, which sets of predictors and response variables are used for k-NN imputation, and how are predictions for individual species.As with R 2 , higher index H values suggest better prediction performance, but are not necessarily suited for model selection.Instead, they are meant to help interpret general trends in performance for different prediction configurations.

Index properties
To provide a sense of the behavior of our indices as a function of prediction performance, we first provide a visual calibration image which shows H values for various levels of departure from agreement between a sample and a prediction (Figure 3).Our quantitative examination of the properties of H relative to prediction properties used simulated dbhs.Our simulated population is a mixture distribution composed of two normal distributions (Figure 4).For our examination, we took 100 clustered sample plots of 50 trees from the simulated population, and compared these with "predictions" for the samples (Figure 5).The predictions were obtained by taking the original sample data and introducing Gaussian noise with parameters ( , ).In our simulations, we look at various dbh bin widths and four types of errors in our In Figure 6 we can see that the effect of bin width on H was observed similarly for each type of prediction error.In each instance both small and large dbh bin widths have reduced values of H, with the highest (best) values of H typically occurring around 4 to 5 cm for our simulations.We can expect lower values of H for small bins because there are few observations in narrow bins, resulting in higher sampling variability in both the plot sample and the plot predictions for each bin.We can also expect lower values of H for larger dbh bins because the shape of the dbh distribution approaches the average density for the plot.In this case (large dbh bins), plot mean densities have greater likelihood of falling near the population mean density, which means there is little variation to be explained by predictions -causing the H values to decline.
Figure 6 also shows that adding errors of increased size to our predictions causes H values to decline.For example, when we introduce Gaussian errors with no bias and a standard deviation of 1.0 ( =0.0, =1.0), H values hover around 0.9.When we increase the error by adding a 1 cm bias to predictions, as in the case of the second line in Figure 5 ( =1.0, =1.0) it causes all of the H values decline.Interestingly, the magnitude of decline in H values for Gaussian errors ( =1, =1.0) is similar to the H values when errors have parameters ( =0, =2.0).When we add errors with 2.0 cm bias and 2.0 cm standard deviation ( =0.0, =2.0) we see a more severe downturn in performance: the H values at best explain 60% of variability, and at worst do a poorer job of prediction than simply using the mean density from all of the plots combined.

K-NN strategies
Of the four distance metrics examined, the distance metric which had the greatest sensitivity to configuration was MSN distance.Excluding MSN distance, there was little difference in performance amongst the distance metrics used to impute tree lists (Table 5).For a given number of neighbors, H only varied by a few percent.The range of values for any number of neighbors, k, is sufficiently small to suggest that there is no practical difference in performances amongst distances (excluding MSN).The effect of number of neighbors, k, was larger, e.g.ranging from 0.50 to 0.76 for Euc., and the decline from using a sub-optimal k was greatest for MSN.Performances were generally best for 3 neighbors relative to fewer or more neighbors, with little differences observed in the vicinity of 3 neighbors.
To test the effect of auxiliary variables on performance, a suite of auxiliary variables was initially selected which would reflect different types of information (Table 6).The variables were then added or removed to isolate the influences of individual predictors -essentially a manual variable selection approach.Table 6 shows the sorted performances of the various predictor sets.There were only marginal differences among performances for predictor sets 1 through 9, when excluding MSN distance.Prediction performances were clearly sensitive to the predictor sets, although, excluding MSN distance, declines in performance from using inferior predictors sets were fairly modest (from H = 0.65 to H = 0.80).
We also examined the sensitivity of the k-NN density imputation strategies to differences in the response sets for MSN and RF.Euc. and Mah., in contrast, do not use response variables when computing distances.
We evaluated two sets of predictor variables with five sets of response variables.The results in Table 7 indicate that MSN was sensitive to the choice of response variables, while RF was fairly insensitive to the choice of response variables.Index H values for MSN in Table 7 declined from 0.81 to 0.58, a 28.4% reduction in performance.In contrast, index H values for RF distances varied by less than 4% for the combinations shown.
Our final evaluation of k-NN components was on the effect of post-stratification.As can be seen in Table 8, post-stratification on forest type resulted in slightly poorer prediction performance in most cases.Most notably, stratification on the dominant species in a stand did not consistently improve either species group predictions (hardwood or conifer) or individual species predictions.

Comparative performance
In Table 9 we provide H and I values for simple cases of prediction and estimation with each of the distance metrics.Although most of our inferences were based on index H, Table 9 demonstrates the relationship between the two indices -larger values of H, and smaller values of I suggest better performance, much as with the coefficient of determination and RMSD.The values of H can also provide a baseline for others to use in comparisons and in inventory planning.

Indices H and I
The indices demonstrated in this study facilitate inferences about dbh distribution predictions.The indices were essential to our analysis, and enabled us to demonstrate the behavior of diameter predictions with k-NN and lidar in an easily interpreted fashion.The results can also be compared with other regions, and prediction strategies through the use of index I.The portability of index I, and to a lesser extent H, should help to clarify the ability of lidar-based methods to provide diameter predictions for forest inventory.While we do not compare the performance of lidar-based methods with a traditional inventory system in this study, such comparisons are a natural extension of this research.
In their current implementation, the indices we proposed are based on tree counts by diameter bin, however they are not limited to this formulation.uses maximum bin deviations.These could, respectively, be used in applications where errors in larger trees are more problematic, or in applications where the maximum bin error is of primary concern.

k-NN imputation strategies
We observed a number of useful trends with respect to the performance of dbh distribution predictions using nearest neighbor imputation methods and lidar.Our first observation agreed with that of other studies [37,38] in that lidar and nearest neighbor methods were able to provide meaningful predictive power for plot level dbh distributions.We were also able to identify patterns in the behavior of prediction performance with respect to the number of neighbors, k, the nearest neighbor distance type, use of strata in prediction, and the selection of variables used for imputing dbh distributions at the plot level.These results may prove indicative of performances in other areas with similar datasets.Our results are also in rough agreement with those of other k-NN studies, although there are few with which direct comparisons are feasible as studies describing dbh distribution prediction with lidar and k-NN are not common.
Our results with respect to the number of neighbors are fairly similar to those observed elsewhere, although not necessarily in the context of dbh distribution prediction.Most studies have observed that prediction performance improves for k greater than 1, with maximal performance usually falling somewhere in the range of 2-7 neighbors (a more detailed discussion of selection of k is provided by Eskelson et al., 2009).
Our results also agree with other studies in that prediction performance is not sensitive to a specific number of neighbors in the indicated range.
With respect to distance metric, while MSN distance achieved equal performance with other metrics in the best case, it was very sensitive to the configuration used for prediction, at time faring much poorer than alternate distance metrics.Euc., Mah., and RF were all fairly robust to configuration, and as a result are preferable to MSN.These results are in contrast to another study which found generally good performance with MSN distance [37] for dbh predictions.Our findings with respect to MSN were surprising given that we hypothesized that there would be an advantage to leveraging the empirical relationship between predictor and response variables.MSN distance did not bear out this hypothesis for dbh prediction, although RF distance, which also relies on response metrics, performed the best according to the indices.
Even though it performance the best in terms of the indices, a limitation of RF was that it took a much loinger time to calculate distances.While RF had the best results in terms of top performance and stability, the required additional computational time may not merit the effort.Mah.distance had nearly the same performance as RF (for the configurations tested), was faster, and eliminated the need to select a response set for k-NN -simplifying the analysis process.
The choice of a set of predictor variables also influenced performances, but the results were fairly stable with respect to changes so long as a reasonable set of predictors was provided.Height metrics such as P30 and P90 appeared to be more important than the canopy cover metric.This is a fairly intuitive result as the vertical height metrics are likely to better reflect the vertical forest structure, and thereby, indirectly, the all of the plot.The choice of a response set was only important for MSN distance, which was shown to be fairly sensitive to all aspects of the k-NN configuration.
The results from stratification with k-NN suggest that more prevalent species were predicted better without stratification than with stratification.For less common species there was no evidence that one strategy worked better than the other.Previous studies have differed in their conclusions with respect to the effects of stratification on k-NN predictions (e.g.Eskelson, Temesgen, and Barrett 2008; Wilson, Lister, and Riemann 2012), although the studies did not look at dbh predictions.Differing sample sizes between studies likely played a role in the different findings between studies.The number of suitable donors likely also played a role in the observed trend that performances were better for more common species.For dominant subgroups, it is likely dbh densities were simply very similar in form to the combined density from all species, and therefore also effectively predicted without strata.

Conclusions
Tree dbhs are a common requirement of forest inventory systems, but few studies document dbh prediction performance.This study proposes two interpretable indices and uses them to evaluate various lidar and k-NN dbh prediction strategies.K-NN with lidar was shown to effectively predict a tree dbh distribution for an 80,267 ha pine dominated study area in South Carolina.While the results were fairly insensitive to changes, we identified that Mahalanobis distance, k=3 neighbors, and no stratification was preferable to other strategies.The proposed indices will facilitate others to make comparisons between prediction strategies, and our findings will enable evaluations of lidar and k-NN as inventory tools.We should note that this was an intensively managed pine forest plantation, and the results may vary greatly from results for other forest types.
equivalent to the Root Mean Squared Deviation (RMSD) by plot (rather than by bin).The units for this index are the same as for the attribute of interest.As with RMSD, a smaller value of index I indicates Preprints (www.preprints.org)| NOT PEER-REVIEWED | Posted: 30 September 2017 doi:10.20944/preprints201709.0168.v1

30 September 2017 doi:10.20944/preprints201709.0168.v1
Andersen et al., 2009)available at Forests 2017, 8, 444; doi:10.3390/f8110444belessthan 1 meters in the pine forest types at the Savannah River Site, based upon our previous experience with positional accuracy using these receivers in a variety of forest types (e.g.Andersen et al., 2009).
1-second-epoch satellite fixes were collected and differentially corrected.We expect the horizontal RMSE for surveyed plot center positions to Preprints (www.preprints.org)| NOT PEER-REVIEWED | Posted:

Preprints (www.preprints.org) | NOT PEER-REVIEWED | Posted: 30 September 2017 doi:10.20944/preprints201709.0168.v1
Peer-reviewed version available at Forests 2017, 8, 444; doi:10.3390/f8110444predictions, where the four types of prediction errors are represented as four separate lines and labeled with the parameters of their error distributions.Larger values of index H suggest better prediction performance, where H is bounded by one on its upper end.A value less than zero indicates that the mean by bins (as obtained from all sample plots) is a better predictor of the sample distribution than the prediction strategy under examination.

Preprints (www.preprints.org) | NOT PEER-REVIEWED | Posted: 30 September 2017 doi:10.20944/preprints201709.0168.v1
Peer-reviewed version available at Forests 2017, 8, 444; doi:10.3390/f8110444 The proposed indices can be easily tweaked to suit various applications.For example, one could weight bins by basal area, or use a completely different strategy which

Preprints (www.preprints.org) | NOT PEER-REVIEWED | Posted: 30 September 2017 doi:10.20944/preprints201709.0168.v1
Peer-reviewed version available at Forests 2017, 8, 444; doi:10.3390/f8110444dbh density.Unlike other potential response variables such as Vol., dbh densities do not measure the quantity of vegetation, they measure the distribution of sizes.It doesn't matter if they cover a portion, or