LiDAR-Based Estimates of Canopy Base Height for a Dense Uneven-Aged Structured Forest

: Accurate canopy base height (CBH) information is essential for forest and ﬁre managers since it constitutes a key indicator of seedling growth, wood quality and forest health as well as a necessary input in ﬁre behavior prediction systems such as FARSITE, FlamMap and BEHAVE. The present study focused on the potential of airborne LiDAR data analysis to estimate plot-level CBH in a dense uneven-aged structured forest on complex terrain. A comparative study of two widely employed methods was performed, namely the voxel-based approach and regression analysis, which revealed a clear outperformance of the latter. More speciﬁcally, the voxel-based CBH estimates were found to lack correlation with the reference data ( R 2 = 0.15, rRMSE = 42.36%) while most CBH values were overestimated resulting in an rbias of − 17.52%. On the contrary, cross-validation of the developed regression model showcased an R 2 , rRMSE and rbias of 0.61, 18.19% and − 0.09% respectively. Overall analysis of the results proved the voxel-based approach incapable of accurately estimating plot-level CBH due to vegetation and topographic heterogeneity of the forest environment, which however didn’t affect the regression analysis performance.


Introduction
Canopy base height (CBH)-although not universally defined-commonly refers to the vertical distance between ground surface and the base of the continuous live crown (i.e., lowest branch of the tree crown with live foliage) [1][2][3][4][5]. Either on stand or individual tree level, CBH constitutes one of the most important forest structure parameters in several forestry applications. It can be used as an indicator of tree foliage amount and seedling growth [6][7][8], for wood quality estimation [9,10], for crown conditions and forest health evaluation [6,11] as well as in fire behavior prediction systems such as FARSITE, FlamMap and BEHAVE [12][13][14][15][16].
Despite being essential for forest and fire managers, the challenge of CBH information acquisition still remains unresolved. Field surveys, while the most reliable and accurate method for obtaining CBH information, are particularly labor-intensive, costly, time-consuming and sometimes even restricted when the study area is inaccessible [17,18]. Remote Sensing technology has provided an alternative solution for monitoring forest parameters with higher time and cost-efficiency. In particular, Light Detection and Ranging (LiDAR) sensors can penetrate forest canopies and has been proven to successfully retrieve vertical structure information including measures of individual and plot level CBH [18][19][20][21][22][23].
(i.e., presence of points from the lower part of the canopy) which strongly depend on the sensor characteristics and several vegetation, topography and weather conditions such as canopy cover, presence of steep slopes and relative air moisture [18,31]. Predictive models constructed by regression analysis are not as sensitive to pulse penetration as the former approach but essentially rely on field data. In fact, the quantity and quality of sample plots may significantly affect the estimation accuracy. Furthermore, such models tend to be species and site specific which means that their applicability to different areas may be limited [16].
To the best of our knowledge, very few researches have focused on the comparative study of these two approaches in a specific study area. Sumnall et al., (2016) and Maltamo et al., (2018) performed regression and vertical profile analysis for CBH estimation at the plot and individual tree level in pine-dominated forests of USA and Finland respectively [18,32]. Both studies concluded that regression models performed better compared to the vertical distribution analysis.
The aim of the present study was to examine the potential of airborne LiDAR data to accurately estimate CBH at the plot-level in a dense hybrid fir (Abies borisii regis) forest characterized by uneven-aged structure on complex terrain. The study investigates and compares the afore-described widely used approaches through the implementation of high-density LiDAR point clouds and area-based metrics. The specific objectives were to: • apply a voxel-based approach to estimate CBH directly from LiDAR data at the plot-level, • perform a plot-level regression analysis to estimate CBH through the implementation of LiDAR-derived metrics and • evaluate the accuracy of the CBH estimates and compare the applied methods.
Taking all the surrounding literature into account, we acknowledge that both methods can theoretically achieve high estimation accuracies in our study area and that the regression-based method is supposed to outperform the voxel-based approach. Nevertheless, previous studies have also suggested the exclusive use of the voxel-based approach for CBH estimation [1,3,28,29]. In addition, the reported studies focus mainly on pine species over different topographic conditions which can reflect on the acquired LiDAR point cloud structure and eventually facilitate or hinder accurate CBH predictions. Considering that each method has advantages and limitations, it is of utmost importance to quantify the level of their accuracy and the difference between the results, if any, even if they prove to be consistent with the already reported ones. For instance, the regression model may have had a better performance but the RMSE derived from the voxel-based method could be still sufficiently low for practical applications. Under these circumstances, a cost benefit analysis could lead to the application of the voxel-based approach in case the area under examination was inaccessible or economic constraints did not allow field measurements. It should be noted that such challenges are common in Greece since it constitutes one of the most mountainous countries in Europe.
Consequently, despite the highly accurate results reported in the literature, there is still a profound need to reach the aim of this study, the results of which can form the basis for future work and development of more sophisticated and accurate CBH prediction methods.

Study Area
The present study was conducted at the Pertouli University Forest located in the Pindos Mountain Elevation ranges from 1100 m to 2073 m above mean sea-level and the climate is characterized as transitional (i.e., Mediterranean-Mid-European) with rainy and cold winters and relatively warm and dry summers. The forest predominantly consists of natural hybrid Abies borisii regis (Abies alba Mill. X Abies cephalonica Loud.) pure stands. The understorey vegetation (i.e., natural regeneration) comprises mainly Abies borisii regis species as well. The forest presents an uneven-aged structure due to the shade-tolerance of this species, which enables the survival of young regeneration under the shade of adjacent trees for a prolonged period of time and penetration into the midstory once light and growing space conditions become favorable.

Airborne LiDAR data
LiDAR data were acquired in October 2018 over the entire area of the Pertouli University Forest using a RIEGL VQ-1560i-DW laser scanner sensor mounted on an airplane (detailed sensor specifications can be found at http://www.riegl.com/nc/products/airborne-scanning/produktdetail/ product/scanner/55/) . An average flying altitude of 2243 m above the mountainous terrain and a maximum scan angle rank of ±32 • off nadir were used. The scanner includes two laser channels of different wavelengths, namely the Green (532 nm) and Near Infrared (1064 nm).

Field Measurements
Field measurements were conducted during the fall of 2019. The one-year temporal gap between the LiDAR data acquisition and field work is considered negligible as the difference of plot-level CBH is insignificant. CBH was measured in 32 plots (Figure 1) of 1000 m 2 each (rectangle 40 × 25 m) covered by pure dense Abies borisii regis stands. The selection of the specific plot dimensions was based on the sampling method which is applied in the study area for the forest management plan development and constitutes one of the two methods (fixed-area plots of size 40 × 25 m or Bitterlich plots) employed in the framework of all forest management plans in Greece. The distribution of plots was performed in a way to represent a variety of elevations, aspects, slopes and tree densities over the study area (Table 1), although the accessibility of the area and vegetation cover (partly wooded stands were avoided) was also taken into account. Plots subjected to any natural or human interventions during the time period between the LiDAR flight and field work were not included in the sample dataset.
The plots' location was recorded using a handheld GPS with an average horizontal accuracy of 3 m. Within each plot, the diameter at breast height (DBH) of all trees (i.e., 54 trees per plot on average) was recorded and the CBH of ten trees from all DBH classes (i.e., I: 4-20 cm, II: 21-34 cm and III/IV: 35-82 cm) was measured. The height of lowest branches with live foliage which were isolated from the main tree crown was not considered as crown base [2,3,5]. The plot-level CBH was subsequently calculated as the weighted average considering the proportion of trees of each DBH class in the plot. Figure 2 presents an illustration example of the field-measured average CBH of plot 29 (Table 1) on the respective height-normalized LiDAR point cloud.
It should be noted that prior to adopting the above-described sampling method, we measured all trees in eight of the plots and the plot-level CBH was computed as the arithmetic mean of all CBH values. The comparison of the plot-level CBH derived from both sampling methods revealed an average difference of 0.33 m which can be actually attributed to the precision (i.e., 0.5 m) of the instrument used for the measurements (i.e., Blume-Leiss altimeter) and/or measurement errors of random nature, such as leaning trees or branches obscureness due to dense canopy [33]. To this end, we concluded that the measurement of a sample of trees per plot is sufficient for the derivation of reliable reference data. Although we acknowledge the additional uncertainty that the specific sampling method may introduce in some plots, the measured data are as reliable as they could reasonably be.    Table 1). The horizontal dashed line represents the plot-level CBH measured on the field.

Lidar Data Preprocessing
Prior to delivery, the LiDAR data were preprocessed by the contractor, namely GEOSYSTEMS HELLAS S.A. (https://www.geosystems-hellas.gr/en/home/). The wavelengths were initially transformed to discrete returns (maximum of seven returns per transmitted pulse) resulting in point density of approximately 12 points m −2 for each channel (Green and NIR). The point clouds were subsequently georeferenced, atmospherically corrected and the noise was reduced (e.g., multiple-time-around echoes elimination). The final point cloud was produced by merging the data of the two channels (without maintaining the information about the channel each point belongs to), removing any duplicated points and classifying the remaining ones into six classes, namely ground, low, medium and high vegetation, buildings and low points-noise.
The overlap between adjacent strips, which ensured a spatially continuous coverage of the surveyed area, along with the afore-described preprocessing resulted in an average point density of 22.23 points m −2 with a scan angle rank from −29 • to 32 • and a point spacing of 0.4 m.

Voxel-Based CBH Estimation
The voxel-based method applied in this study is based exclusively on the vertical point height distribution ( Figure 3). The concept was to analyze the points frequency in the vertical profile and identify the height where it rises considerably over the average frequency value of the entire profile.
First, the rectangle vector file of each plot buffered by 5 m was used for the extraction of the respective point clouds. The 5 m area around the plot was used to avoid the vertical cut of tree canopies located on the plot edges which create considerable gaps and eventually cause CBH overestimation (also known as edge effect) ( Figure 4). The point cloud of each plot was height-normalized using k-nearest neighbor approach with an inverse-distance weighting (kNN-IDW) and ground points were eliminated.
The removal of understorey vegetation points has been adopted in other voxel-based methods (e.g., [1,20]) in order to eliminate the impact of shrubs and small trees on CBH estimation accuracy. Nevertheless, considering the forest's structural characteristics, this step was considered inappropriate for two reasons. First, as mentioned in Section 2.1, the forest is characterized by an uneven-aged structure leading to an overlap of the top of high understorey vegetation-where present-with the CBH. Additionally, a lack of points from very low vegetation was observed in the majority of the sample plots. Based on the above, additional analysis for the removal of understorey vegetation would either have no impact on the CBH prediction accuracy or cause overestimation since points from the actual CBH would be removed.  The preprocessing was followed by the voxelization of the point clouds at a resolution of 10 × 10 × 0.2 m (x × y × z). Each voxel (3D pixel) contained a single value of the number of included LiDAR returns. The selection of the optimal voxel size was influenced by the point cloud spacing and spatial distribution and was achieved through the examination of a variety of voxel sizes and their effect on the resulting CBH prediction accuracies [22]. It was observed that horizontal voxel dimensions lower than 10m resulted in a very low number of points in multiple voxels leading to the identification of false abrupt increases in the point frequencies of the vertical point cloud profile. Furthermore, the presence of considerable within canopy gaps (clearly visible in Figure 2 of the revised manuscript) led to the generation of a series of vertically adjacent zero-frequency voxels resulting in a considerable CBH overestimation. On the contrary, when a larger horizontal voxel size was applied, small gaps and significant increases in the vertical point frequency distribution were omitted. The vertical voxel size of 0.2 m was selected based on the desired precision of the estimated CBH values. In particular, a vertical size lower than 0.2 m created multiple voxels of zero frequency, similarly to the use of small horizontal voxel size, but heights greater than 0.2 m resulted in coarse precision of the estimated values.
After point cloud voxelization, voxels that fell outside the actual plot edges were removed. In case a vertical voxel stack over a 10 m cell (hereafter referred to as bin) lacked point frequency information (i.e., only one or none of the bin voxels contained points), they were retained from further analysis. Point frequencies of each bin were smoothed using the Nadaraya-Watson kernel regression method [34] ( Figure 5: grey points connected by line) and local minima and maxima were subsequently detected ( Figure 5). CBH for each bin was defined at the height of the local minimum followed by the first significant local maximum (i.e., above average point frequency of the bin) ( Figure 5: vertical green line). Finally, the plot-level CBH was calculated as the average of all bin CBH values.

Plot-Level Regression Analysis
Similarly to the voxel-based method, LiDAR data were preprocessed prior to regression analysis ( Figure 6). First, the point cloud covering each plot area was extracted and normalized by kNN-IDW method. Points covered by multiple flightlines were subsequently eliminated and a suite of LiDAR structural descriptive statistics (hereafter referred to as metrics) was calculated using the LAStools software and rLiDAR package implemented in R. In particular, height-normalized last-of-many and single (ls), last-of-many (l), first-of-many and single (fs), first-of-many (f) and total returns were extracted from the region of each plot. 47 plot-level metrics were computed from each extracted point cloud (i.e., ls, l, fs, f, all) including percentiles and bincentiles at relative heights as well as height value statistics ( Table 2).  Table 2. The LiDAR metrics used in this study for the construction of the plot-level CBH prediction regression model. The metrics refer to the pooled first-of-many and single echoes, pooled last-of-many and single echoes, last-of-many, first-of-many and total echoes. Linear regression was subsequently used for the construction of a plot-level CBH prediction model. The LiDAR metrics were initially examined for their correlation with field CBH individually and the most highly correlated ones were considered as candidate predictors in the model. Following the general rule of thumb of ten samples per one predictor [35], we tried not to exceed three predictors in the model and simultaneously minimize RMSE and avoid overfitting. The incorporation of additional predictors would increase model's complexity and the risk of overfitting, due to the limited number of sample plots. The selection of the optimal predictor variables and appropriate transformations was manually performed trying to achieve maximum fit between the observed and estimated CBH values.

LiDAR
Once the predictive model was constructed, it was applied to the entire forest area in order to generate the respective CBH map.
where y i is the observed CBH for plot i, x i is the predicted CBH for plot i, y is the mean observed CBH and n the number of observations (i.e., number of sample plots). The performance of the developed plot-level regression model was evaluated through Leave-One-Out Cross Validation (LOOCV) analysis. The predicted CBH values were assessed for their accuracy using the afore-described statistical measures (Equations (1)-(4)). Due to the log-transformation of the response variable in the model, the residual variance (σ 2 ) (Equation (5)) was calculated in order to perform bias correction (Equation (6)) and calculate the actual estimated CBH values in the original scale.
where σ 2 is the residual variance, y i is the observed CBH for plot i, x i is the predicted CBH for plot i, X i is the actual predicted CBH value in the original scale for plot i and n the number of observations (i.e., number of sample plots). Table 3 and Figure 7 present a statistical summary and comparison of the field measured and voxel-based estimated CBH respectively. The results showcase a significant overestimation (i.e., error > 30%) in 12 of the sample plots (red points of Figure 7) and underestimation in two of them (blue points in Figure 7). Overall, no correlation was found between the field measured and predicted CBH values. In particular, R 2 , RMSE, rRMSE and rbias were 0.15, 1.71 m, 42.36% and −17.52% respectively.

Plot-Level Linear Regression Model
The examination of the individual LiDAR metrics correlation with the field measured CBH revealed that the skewness of height for the last-of-many echoes was the most important predictor with R 2 = 0.42 ( Figure 8). The final model (Equation (7)) developed via regression analysis includes three independent variables and five of the computed LiDAR metrics (Table 2), namely skewness of height (Hske l ), 85th and 99th height percentiles (p85 l and p99 l respectively), 50th bincentile (b50 l ) and 90th bincentile (b90). Metrics with l suffices refer to the last-of many echoes while no presence of suffix means that the metric was calculated including the entire point cloud.
log(CBH) = 54.24 − 1.298 × Hske l + 1.057e − 06 × (p85 l × b50 l ) 2 − 11.27 × log(p99 l + b90) (7) The fitted model has an R 2 , RMSE, rRMSE and rbias of 0.75, 0.66 m, 16.38% and −0.18% respectively. The statistical measures derived by the estimated CBH values after the LOOCV process are R 2 = 0.61, RMSE = 0.73 m, rRMSE = 18.19% and rbias = −0.09%. A scatterplot of the observed vs. the predicted CBH values and the residuals plot are presented in Figures 9 and 10 respectively. Figure 11 illustrates the CBH map produced at a spatial resolution of 32 m for the study area by employing the developed model to the entire LiDAR point cloud.

Discussion
In this study, plot-level CBH was estimated for a dense uneven-aged structured forest over complex terrain using high-density LiDAR data. The implemented estimation methods, namely the voxel-based and regression analysis approach, led to highly variate results. The former method failed to provide accurate CBH predictions for a significant number of sample plots. On the contrary, regression analysis resulted in the development of a prediction model with 3 independent variables which led to considerably more accurate predictions. The results of both methods are discussed in detail in Sections 5.1-5.3.

Voxel-Based Estimation
As reported in Section 4.1, a lack of correlation between measured and estimated CBH values was found (i.e., R 2 = 0.15). 18 of the 32 plots were accurately estimated in contrast to the remaining 14 plots which presented estimation errors higher than 30%. The latter plots were thoroughly examined in terms of vegetation structure and topography-information obtained from the field work-and visual inspection of the LiDAR point cloud was additionally performed.
Major underestimation errors occurred in two of the plots (i.e., plots 10 and 11 of Table 1). Field records of the specific areas indicated the presence of understorey vegetation which is close or overlapping with the dominant layers. These understorey conditions were confirmed by visual examination of the respective LiDAR point clouds. Figure 12 illustrates the vertical profile of plot 10 where CBH was severely underestimated (error 49.26%). In this case, the high understorey vegetation creates a continuous vertical point distribution extending from nearly the ground up to the top of the canopy. As a result, no vertical gaps are created which would facilitate the discrimination of the CBH. Although we agree with other authors stating that understorey vegetation renders the vertical LiDAR profile too noisy [3], the discrimination and elimination of understorey vegetation from the point cloud would not be able to benefit the results accuracy (related details in Section 3.2), at least within the scope of the current context.
Considerable overestimation errors were encountered in 12 of the 14 miscalculated plot-level CBH values. The majority of these plots are characterized by a low CBH (<3 m) compared to those of higher CBH values. In fact, eight out of the 12 plots have a reference plot-level CBH ranging between 1.72 m and 2.99 m respectively. A more detailed analysis of the total number of even slightly overestimated plots revealed a positive correlation between the estimation error and the distance of the reference CBH from the average canopy height. This type of correlation implies that the point cloud is not sufficiently dense in the lower parts of the canopy. The low number of returns from these layers is a strong indicator of the insufficient penetrating characteristics of the laser pulse at the specific areas which impacts the ability to accurately estimate the CBH or any other forest structural parameter related to the understorey vegetation.
A variety of potential factors may influence laser pulse penetration through the canopy including sensor settings, flight conditions, season of data acquisition and canopy characteristics. The pulse repetition frequency (PRF) (i.e., number of emitted pulses per second) is an important factor affecting pulse penetration and has been examined-among others-by Chasmer et al., (2006) and Massaro et al., (2012) [36,37]. Both studies reported that pulses of lower frequencies have greater penetration capabilities while Chasmer et al., (2012) stated that PRF has the opposite effect in stands with significant understorey presence. Scan angle also constitutes a substantial influence factor. According to Qin et al., (2017), foliage vertical profiles aren't necessarily represented best with shallow scan angles but with 20 • instead since increased angle leads to enhanced return energy from the lower canopy [38]. Weather considerably affects penetration rates as well, as reported by Hsu et al., (2015). The study indicates that pulse penetration decreases when wet and/or humid conditions are present because infrared light does not penetrate water vapor. Other studies have reported that a low pulse penetration rate can further be attributed to increased canopy height, density and cover levels [39,40]. In case of CBH estimation by means of vertical profile analysis, a uniform return density is essential as it prevents the acquisition of decreased number of returns in some parts of the obtained LiDAR data. It can be achieved by maintaining a constant aboveground flying height which however becomes challenging over steep mountainous terrain especially during windy days [41] (Figure 13). Additionally, steep slopes may impact the laser scanning swath leading to a similar point density variation. Plots located on hilly terrain may partially lack points from one side due to the resulting change in scanning swath between adjacent flightlines ( Figure 13). The number of echoes from the low part of the canopy is also substantially affected by crown length which variates among different tree species. For example, mature spruce trees are generally characterized by longer crown lengths compared to pines which may result in more accurate CBH estimation of the latter [42].
Although each of the above-mentioned factors can explain CBH miscalculations, it is very challenging to define particular reasons. The results' examination revealed that the lack of correlation between field measured and estimated CBH values may be simultaneously attributed to multiple causes. While we could not explore the effect of certain components on the estimation accuracy of our study (e.g., effects of difference in scanning angle or PRF), we examined the factors that present variation among the sample plot areas. A prominent example is the effect of canopy height, cover and slope on estimation accuracy. Although the individual correlation of these forest parameters with CBH accuracy was rather low (i.e., R 2 of 1.8e −5 , 6.6e −3 and 0.16 for canopy cover, slope and mean canopy height respectively), their interactive impact was observed. More specifically, the negative effect of high canopy (i.e., increased crown length) on the quantity of low points and the subsequent estimation accuracy was reduced in cases where canopy cover and slope were relatively low compared to other areas with high cover and steep slopes. The same applies to areas of intense relief but lower canopy height and cover.
Taking all under consideration, the voxel-based analysis approach applied in this study was not able to deliver accurate CBH estimations over a forested land which is highly diverse and complex in terms of forest structure and terrain.

Regression Analysis
Three statistically significant predictors were used in the developed predictive model (Equation (7)) including five LiDAR-retrieved metrics, namely the Hske l , p85 l , p99 l , b50 l and b90 ( Table 2). All were calculated using the last-of-many returns of the point cloud except for the b90 which was derived by the total number of echoes. The fact that metrics of last-of-many returns were mostly chosen as predictors indicates the direct interaction of LiDAR with the lower part of the canopy while the use of the entire point cloud shows an indirect CBH estimation through its association with the canopy conditions. Skewness and percentiles of height have been successfully used in the literature for describing CBH in various vegetation and topography conditions such as pine plantations in SW Spain, forested watershed and mountain pine stands at very high elevations in the US [4,15,21,43]. In our study, Hske l was the variable mostly correlated (i.e., negative relationship) with the reference data ( Figure 8) as it is a measure of the height distribution shape from which CBH is derived. The use of the percentiles of last-of-many returns is a logical selection for describing CBH since these metrics describe the bottom of the canopy better than percentiles of other echo types. In fact, visual examination of the LiDAR data revealed that ground returns were seldom obtained from beneath the canopy but mostly from canopy openings. Thus, last-of-many echoes were dominated by reflection from within the canopy, including both bottom and top layers.
Due to the variance in pulse penetration depth along the study area (explained in Section 5.1), the exclusive use of height percentiles was considered insufficient for CBH prediction in our research. Height bincentile metrics were found to complement this insufficiency since they depict the percentage of points below an indicated height (e.g., 85th bincentile = % points below the 85% of the point cloud's maximum height). In other words, height bincentiles (i.e., b50 l and b90) were used as an indirect proxy of the pulse penetration depth through the canopy. A higher bincentile value means that more returns are registered below a certain elevation, thus the pulse has penetrated deeper in the canopy compared to an area characterized by a lower bincentile value. In fact, canopy cover and mean canopy height, which influence the pulse penetration depth (described in Section 5.1), were calculated from the LiDAR data and individually examined for their correlation with the field-measured CBH. The results showcased an R 2 of 0.11 and 0.17 for the canopy cover and mean height respectively. To this end, the individual use of height and density related LiDAR metrics in the prediction model would lead to a rather lower predictive performance.
While it is difficult to interpret the use of sums and interactions between the metrics employed in the produced regression model, the combination of a percentile and a bincentile metric in one predictive variable seems logical since, as mentioned above, percentile metrics could not encompass the high variability of the forest in terms of vegetation density and topography. The logarithmic transformation of CBH was used to deal with variance instability, mitigating heteroscedasticity of the residual plot.
The employment of the afore-described LiDAR metrics led to the construction of a regression model (Equation (7)), which-according to the cross validation results reported in Section 4.2-can reliably predict plot-level CBH. In particular, the model can explain 61% of the variability in the linearized space derived from the variables transformations. The models fit to compare the predicted with the reference data is similar to the hypothetical 1:1 relationship (Figure 9). Plot-level CBH was predicted by the model with 18.19% of rRMSE and a negative rbias of −0.09%.
Significant caution is required when directly comparing regression results derived from different studies due to differences in forest types, LiDAR sensors and flight specifications, topography, response variables and employed transformations in the regression model. Nevertheless, it is worth noting that RMSE in our study may be higher compared to other reported earlier. The reason may possibly encompass many of the above mentioned factors but also may just be related to the generally lower reference CBH values of our study area.
The predictive model was applied to the LiDAR point cloud covering the entire study area generating a CBH map (Figure 11), which provides spatial plot-level CBH information at a resolution of 32m. Visual examination of the map CBH values along with aerial photo-interpretation revealed evident estimation errors in limited areas of the forest. CBH values approximating zero were calculated either over forest openings covered by few trees or very sparse stands while extremely high values-eliminated from the final CBH map-were located at the forest boundaries due to edge effects reported in other relevant studies as well [18,19,21,23,38]. This is an expected outcome since the model was constructed with the use of sample plots located exclusively over dense forested land with no sparsely wooded areas or forest openings involved.

Methods Comparison and Limitations
An overall analysis of the results (Sections 4.1 and 4.2) revealed that regression-based estimation provides more reliable CBH predictions in structurally complex forests over variant topographic conditions. In fact, the advantages of the voxel-based approach over the regression method (described in Section 1) proved unable to compensate for the heterogeneity in structure and terrain characterizing the study area.
Despite certain limitations presented by the regression analysis method (described in Section 1), the results showcased that it is less sensitive to understorey conditions compared to the voxel-based method. This is of utmost importance in our study area where the understorey (where present) may overlap with the canopy leading to absence of vertical gaps and a subsequent difficulty in eliminating the understorey points, which could not be overcome in the context of our work. On the contrary, structural point cloud analysis based on voxels resulted in substantial underestimation errors in plots with intense presence of understorey growth.
Being sensitive to any structural characteristics of the LiDAR point cloud, the inconsistent accuracy results derived by the voxel-based estimation method reflect the respective deviation in terms of forest and terrain contexts which enhanced point density variation in the lower parts of the canopy. Nevertheless, the method gave accurate estimates in the majority of sample plots where CBH was not low (i.e., >3 m), indicating that accurate estimation could be achieved for forest species of generally higher CBH (e.g., mature pines).
Although the present study has reached its aims, there were some limitations affecting both methods. First, field measurement accuracy is essential for acquiring reliable predictions especially in regression analysis approach. In addition, uncertainties in GPS measurements were present. Although the GPS horizontal accuracy was just 3 m, the possible geographic misplacement of the rectangular plots may greatly influence the estimation outcomes. This problem has been probably alleviated by the selection of large sample plots (1000 m 2 ) which may have mitigated a possible inherent variability and the subsequent variation in predictions derived especially by the regression-based method. Moreover, the results of the study indicate that both employed methods require very high-density LiDAR in order to better represent vegetation structure from the lower part of the canopy, at least within the existing forest contexts. The small number of sample plots constitutes an additional limitation of this work as most related studies recommend the use of more than 40 samples in regression analysis implementations [44].
Finally, the predictive model developed in this study should not be considered as transferable to other forest ecosystems with differences in species composition, phenology and topography characteristics. Although performing well in our study area, a new model should be developed for a different area using new reference CBH values since variation in LiDAR data acquisition specifications, vegetation characteristics and/or topography negatively influences the model's transferability to other forested areas.

Conclusions
The present work focused on the CBH estimation of an uneven-aged structured forest located over complex terrain. Two widely used methods, namely the voxel-based approach and regression analysis, were applied and accordingly evaluated. The results showcased a substantial outperformance of the regression-based estimation method. More specifically: • The voxel-based method implementation led to CBH prediction with no adequate correlation with the reference data (R 2 = 0.15). In most of the plots CBH was overestimated (rbias = −17.52%) with the rRMSE reaching 42.36%. • Regression analysis resulted in a predictive model of high performance, given the heterogeneity of the study area. Cross-validation revealed an R 2 and rRMSE of 0.61 and 18.19% respectively and a limited overestimation trend (rbias = −0.09%).

•
The voxel-based approach could not successfully respond to variations characterizing the structural and topographic conditions within the forest. Its' estimation accuracy is determined by the LiDAR pulses penetration properties, thus highly susceptible to the quantity of points from low canopy parts, which are used to define CBH. • The use of LiDAR-retrieved metrics in regression analysis compensated for the variance in vegetation structure and terrain across the study area. • The developed predictive model can be used to generate a CBH map over the entire study area ( Figure 11) for direct employment by forest and fire managers.
Given the above mentioned conclusions, further research requires the investigation of the potential of full-waveform or discrete-return LiDAR data of higher point density to estimate plot-level CBH over the same study area, which could not be achieved in the framework of this research due to legal restrictions. Such LiDAR data would probably provide more structural information from the low parts of the canopy resulting in more reliable CBH predictions. Moreover, the probable acquisition of spaceborne LiDAR data would provide a cost-effective alternative and the examination of their potential capability to accurately predict CBH in complex forests would be of high interest and importance.

Conflicts of Interest:
The authors declare no conflict of interest.