1. Introduction
Insect disturbances are natural processes in forest ecosystems to maintain healthy and heterogeneous forests [
1]. However, forest ecosystems can experience increased magnitude and frequency insect disturbances due to interaction from increasing temperatures and drought [
2,
3]. One of the most significant threats to the temperate forests of Central Europe are climate-change driven bark beetle Ips typographus L. (hereafter bark beetle) outbreaks [
4]. Prolonged periods of warmth, longer vegetation seasons, and drought combined with an increased frequency of windstorms allow for the establishment of additional generations of the Ips typographus and a population overgrowth [
4,
5]. Moreover, urban forests may be affected more seriously because an urban environment produces additional stressors [
6]. The stressors include changes in forest soil properties, tree species diversity, higher temperatures, and carbon dioxide content [
7,
8]. The stressed trees are then optimal material for a bark beetle feeding. Therefore, pest insect infestation is a serious threat to urban forests that can affect forest health and economic losses [
9]. On the other hand, urban forests are under active forest management, which allows rapid individual tree sanitation. Therefore, it is necessary to use an appropriate method for the detection of individual infested trees.
A terrestrial survey of symptoms on the trunk and foliage discoloration or defoliation is traditionally used for bark beetle disturbance detection and assessment on a single tree level [
10,
11]. The infested tree goes through three-needle discoloration [
5,
12,
13]. These stages are commonly known as green, red, and grey attacks. In the green attack stage, the infested trees have not shown distinct symptoms visible to the human eye, and therefore, infestation is not easy to detect at this stage at leaf and canopy levels [
5,
14]. However, the green attack can be better seen by remote sensing in the shortwave and thermal infrared spectra because of the changes in water content [
15]. A field survey is also relatively time-consuming and costly, making this approach suitable only for a plot scale. Therefore, forest management has widely used conventional multi- or hyperspectral remote sensing to detect bark beetle infestation’s spectral symptoms.
Medium resolution satellites, especially Landsat data employing subpixel analysis, are more frequently used in the of study bark beetle disturbances compared to coarser resolution sensors, i.e., MODIS or AVHRR, indicating that spatial resolution is crucial feature for mapping such infestations [
16]. However, the spatial resolution (>10 m) of these satellites is not suitable for the detection of bark beetle infestation at the individual tree level because the pixel size is more extensive than the mapping unit (a single tree crown). This decametric spectral resolution is more appropriate for mapping larger infestations at an epidemic population scale using an area-based (pixel-based) approach only, because of the pixel size [
17]. High-resolution imagery (spaceborne or airborne) with a metric resolution (1–9 m) is very suitable for mapping spatially dispersed infestations of small groups of trees [
17]. However, the area-based approach is still applied in recent studies using metric imagery to detect the bark beetle damage [
11,
18,
19]. Nevertheless, for targeted individual tree sanitation of infested trees, it is essential to use imagery with a submeter (<1 m) spatial resolution and a tree-based approach based on automatic individual tree crown delineation (ITCD) from a submetric imagery to substitute the traditional terrestrial survey [
6,
17,
20,
21]. These prerequisites have been met only by airborne remote sensing.
Three main methods (Valley following, Region growing, and Watershed segmentation) have been developed for an automatic ITCD from aerial imagery [
22]. These methods worked only with image brightness values. The algorithms searched for regions surrounded by local minima (shaded gaps between tree crowns), spectral similarity to the seed pixels, or local minima within inverted grayscale imagery [
23,
24,
25]. However, automatic ITCD is difficult to perform with no vertical canopy structure [
16,
22].
Modern remote sensing technologies are now available for providing information about vertical canopy structure, enabling the tree-based approach. Airborne laser scanning (ALS) or stereophotogrammetry is now a standard operational tool for forest inventory and producing 3D canopy models [
26,
27]. The most used strategy for ITCD from airborne laser scanning is currently searching for local maxima (LM) for treetops detection followed by more common watershed segmentation or a more advanced region growing algorithm for crown delineation [
28]. These methods are applicable either to photogrammetrically derived point clouds (PPCs) or from light detection and ranging (LiDAR) [
22]. Despite this, the area-based approach is still more frequently used for bark beetle disturbance detection [
6,
29]. Only two recent studies that are focused on the pest insect disturbance detection, Nasi et al. [
6] and Windrim et al. [
30], have applied an essential individual treetop detection using image brightness or a canopy height model (CHM) from RGB data.
A disadvantage of airborne sensing is the high operational costs, weather conditions, and extensive mission planning. This is problematic for bark beetle disturbance management and, in general, for all pest insect infestations in the small-scale urban forest because these applications require an elevated level of spatial detail, frequent, intra-annual monitoring, and rapid processing. Nevertheless, recent developments in remote sensing and robotics have enabled the production of unmanned aircraft systems (UAS) featuring miniature multi- or hyperspectral sensors. In addition to ultra-high spatial resolution, the main advantages of employing UAS for insect disturbance detection are low operational costs (≤2000 Euro), the ability to undertake repeated image acquisition, when pest insects start to attack the trees, or when the damage is most likely visible and high operability under varying weather conditions, e.g., drizzling and moderate wind. Moreover, thanks to modern photogrammetric techniques based on Structure from Motion (SfM), it is possible to produce more cost-effective 3D photogrammetric canopy PPC and orthomosaics from UAS data as an alternative to airborne LiDAR to perform ITCD [
31,
32]. This enables processing precision, which cannot be achieved using LiDAR. Widespread practice involves applying an automatic ITCD from CHM derived from RGB or UAS LiDAR data [
33]. Local maxima are used for raster-based treetop detection, while for tree crown delineation, watershed segmentation is commonly used [
34].
The current state of the art in the UAS literature is to apply a tree-based approach for deriving individual tree parameters, such as tree height [
35,
36] or crown diameter [
37], using an automatic ITCD from CHM. Studies focused on bark beetle driven disturbance detection have used the pixel-based approach [
38]. Only two studies, Näsi et al. [
39] and Klouček et al. [
40], have been applied to a basic ITCD based on an automatic treetop identification combined with a buffer zone around the treetops for a bark beetle damage classification in the mature even-aged spruce forest. Both studies used the CHM derived photometrically. All of these studies implemented only spectral based features (original spectral bands and vegetation indices) to classification, which is sufficient to detect infested trees within a monoculture. Nonetheless, this is problematic in a highly diverse mixed forest when spectral response of different species is similar. For example, Sothe et al. [
41] implemented UAS PPC-derived elevation tree metrics to tree species classification of tropical forests resulting in an overall accuracy increase of 15% compared to classification-based only on spectral features.
Despite this, studies combining spectral and elevation features for bark beetle damage detection are rare [
42]. Little research has examined the segmentation of tree crowns directly from a PPC, especially in terms of multispectral analysis, because the accuracy of the LM method is linked intricately to point density, and multispectral cameras have been characterized by lower resolution [
43]. Moreover, minimal investigations have performed ITCD within an uneven-aged, mixed forest [
44]. Canopies of a distinct size may cause problems to set the correct spatial resolution when using CHM for ITCD, because smaller crowns may disappear in a coarser raster [
22]. Therefore, it would be beneficial to use an original dense PPC to perform ITCD in a mixed forest.
This study focuses on the detection of individual spruce trees infested with bark beetles. We investigated whether it is possible to substitute laborious and commonly used manual ITCD with automatic ITCD without increasing extracted feature uncertainty of disturbance classes due to inaccurate ITCD. This aspect has not been thoroughly investigated in the scientific literature. The objectives of this study are (1) to examine and compare current methods for an individual tree crown spectral and elevation feature extractions from UAS multispectral data to clearly distinguish between the forest disturbance classes, (2) to statistically compare extracted features among reference crowns and automatically delineated tree crowns and (3) to discuss the uncertainties of the automatic crown delineations based on a used method and the extracted feature statistics for the bark beetle disturbance at individual tree level.
The methodology of this study was designed to address the abovementioned limitations. We use a multispectral PPC with a high point density to ensure accurate treetop identifications and ITCD in an uneven-aged mixed forest. The CHMs were derived from the high point density PPCs to guarantee a sufficient spatial resolution for ITCD. Moreover, we hypothesized that individual tree-based extraction of spectral and elevation features from a multispectral PPC and orthomosaic would improve the detection of individual bark beetle infested trees in a heterogeneous urban mixed forest. Hence, the suggested methodology for detecting bark beetle infestation in the urban forest aims to introduce a more efficient approach to forest health monitoring and damage reduction.
3. Results
3.1. Vegetation Masks
The effect of vegetation mask application before the LI segmentation technique to the PPC is illustrated in
Figure 3. The mask’s application resulted in a perfect reduction in deciduous tree crown segments and the better delineation of the disturbed trees.
The effect of vegetation mask applications before the MCWS and DAL treetop’s detection to the PPC is illustrated in
Figure 3b,c. When no mask was applied, it resulted in many false-positive treetop identifications because all trees of distinct species (larches and deciduous trees) were included in the
lmf searching algorithm. Moreover, the larger the decimation grid that was applied, the more false-positive treetops were detected regardless of the variable size window function. The reason for this was that deciduous trees and larches had been shed due to the time of imaging. This resulted in multiple false-positive treetop detections in the crowns because the larger decimation grid formed the false local maxima from their main branches. These local maxima were more likely to be detected within the less dense PPC by the
lmf algorithm.
Nevertheless, these false-positive treetops were strongly reduced by applied masks to the PPC. The ExG threshold mask separated the conifers from the ground, grass, deciduous trees, and partial shadows. The main proportion of the points was removed using the ExG mask. The additional NIR mask completely reduced the shadowed pixels. All the reduced points represented false positive multiple treetops of deciduous trees and larches.
3.2. Automatic Treetops Detection Accuracy
The resulting accuracies of treetop detections are presented in
Table 8. The lowest accuracy was obtained using the original PPC with the extreme point density of 701 points/m
2 regardless of the applied variable window size function. The accuracy was decreased due to many FN treetop identifications formed by multiple seed points. Therefore, the PPC with original point density was not use for further analysis. However, the detection accuracy was enormously improved using the decimated PPCs.
These results have shown that the density of the PPC affects the accuracy of treetops detection. The larger decimation grid was applied, the more FP and the fewer FN treetops were detected regardless of the variable size window function. These trends were caused by keeping the highest point in each decimation cell and the principle of the implemented lmf algorithm. The larger crown’s FP treetops were formed because the more extensive decimation grid selected more than one highest point within the crown. These more isolated points were more likely to have been selected as the false local maxima because there were no higher points in the search radius of the lmf algorithm after the combi mask application.
The moving window size function also affected the accuracy of treetops detection. The application of larger moving windows (function f2) resulted in fewer FP but more FN detections than function f1, because the reference crown area data are positively skewed. Therefore, the smaller trees were not detected by the variable moving window fitted to median crown size. These two features had a common effect on the treetop detection accuracy. The larger decimation grid increased FP detections, while the larger moving window increased FN detections.
The three most accurate datasets had a dense cloud with a point density from 10 to 85 points/m2 and a search window size function corresponding to the minimum diameter of the reference tree crowns. However, the accuracy between these three dense PPCs showed only minor variance. The more dense or sparse PPC affects the accuracy of treetop detections negatively. Therefore, the density from 10 points/m2 to 85 points/m2 can be considered sufficient for treetop detections using the lmf algorithm.
3.3. Automatic Individual Tree Crown Delineation Accuracy
The resulting accuracies of automatic ITCD methods are presented in
Table 9. The accuracy of all methods showed a minor variance between methods within one PPC, confirmed by subsequent statistical testing. Statistical tests revealed insignificant differences among delineation methods in the densest PPCs las01 and las02. In the case of more decimated PPCs, the accuracies varied significantly. The results of statistical testing were expressed by accuracy ranking within each dataset in
Table 9. Tied ranks expressed insignificant differences. Higher overall accuracy was obtained by applying traditional MCWS, followed by the LI method regardless of the PPC density. The accuracy of the DAL method was slightly lower compared to MCWS and, sometimes, LI.
Moreover, the statistical testing revealed insignificant differences in each method’s accuracy among decimated datasets, except for the las1 datasets with the point density 1 point/m2, because the low point density reduced detailed crown shapes in a dense forest. It resulted in significantly less accurate oversegmentation (DAL and LI) or undersegmentation (MCWS) of the automatically delineated crowns.
The oversegmentation or undersegmentation is caused by the low point density and the type of the algorithm. In the case of DAL and LI oversegmentation, the region-growing algorithm selected only a few topmost points around the treetop that met the height conditions. The MCWS algorithm also included points of the neighboring crowns because the height gradient magnitudes between the crowns were reduced due to the low point density. That resulted in undersegmentation. The same effect of starting oversegmentation was also detectable in another sparse las05 PPC (4 points/m2) using DAL and LI methods, but the accuracy decrease was not significant within this dataset.
It appears that the main effect of automatic ITCD accuracy is the density of the PPC because the significant decrease in accuracy was obtained using the most decimated PPC (1 point/m2). A point density greater than 10 points/m2 can be considered sufficient with the margin of our results. If the point density is sufficient, the selection of the segmentation method is a minor issue. Every recent ITCD method based on point cloud or CHM segmentation can be used. However, the MCWS was the most accurate method, even in the case of too little point density.
Because statistical tests revealed significantly lower accuracy of all delineation methods using the most decimated las1 dataset compared with other datasets, we decided to compare extracted feature statistics of the reference dataset only with extracted statistics of las02 and las1.
3.4. Feature Extraction
The statistical testing revealed significant differences among disturbance classes within the reference dataset (
Table 10). Therefore, the distinguishing of disturbance classes and bark beetle disturbance detection is possible. Multiple comparisons using spectral features exposed problematic distinguishing between pines and seriously infested trees (sbbd) using NDVI and ENDVI indices, consistent with the preliminary results. However, these two classes were well separated, employing the NDRE index. Another problematic distinction was revealed between noninfested trees (sh) and the trees under the green attack using NDRE, but significant differences between classes were detected applying NDVI and ENDVI indices. This makes the NDRE index less suitable for detection of infested spruce trees.
The excellent separability of pines was reached using elevation features and the Mann–Whitney U test. That is consistent with the study’s hypotheses documenting the supporting role of elevation features for tree species separation. However, the crown area’s application could be limiting for tree species separation if the crown area of these species were equal. This makes the feature less applicable. On the other hand, the quartile features are universally applicable because the different crown shapes of these tree species are more likely to remain stable regardless of crown size.
The Kruskal–Wallis test revealed some significant differences among extracted feature statistics of RTC and automatically delineated crowns of las02 and las1 within disturbance classes (
Figure 4). The statistical testing results were similar, although the delineated crowns of las1 showed a higher variance of extracted statistics within the group and also between groups. These variations are visible mainly for spectral features that are key to separate disturbance classes of spruce trees (
Figure 4b,d,f). They are caused by less accurate ITCD of las1 and they increase the uncertainty of disturbance classes separation.
Figure 4 also illustrates the result presented in
Table 10. If the RTC classes’ boxplots highly overlap (NDVI and ENDVI boxplots of pine-sbbd NDRE boxplots of sbbg and sh), there is no significant difference. Moreover, the NDRE and crown area features are less resistant to inaccurate ITCD than NDVI, ENDVI and quartile elevation features resulting in higher variance. This lack of robustness may potentially limit these features for bark beetle disturbance detection. These findings are consistent with the results of statistical testing presented in
Table 10. All disturbance classes were separable using at least one of the selected spectral or elevation features. However, no VI was able to distinguish between all disturbance classes by itself. The NDVI and ENDVI indices were able to separate all stages of bark beetle attack which makes them more appropriate for bark beetle disturbance detection. However, these indices are not able to separate pines from spruces. Nevertheless, the application of elevation features alone is barely relevant for bark beetle infestation detection. Therefore, we recommend the mixture of spectral and elevation features. This set of features can serve as a starting point for similar studies.
The significant differences among delineation methods were revealed only within sbbd and sbbg class of spectral features. The significant variances among delineation methods were obtained within each tree species class of all elevation features. These significant differences were further investigated using post-hoc treatment (BUFFER, DAL, LI, MCWS) versus control (RTC) multiple comparisons tests of each dataset for spectral (
Table 11) and elevation (
Table 12) features.
The results showed that extracted features from AITC polygons of the BUFFER method significantly differed from RTC in both PPCs because the polygon buffers covered only the crowns’ parts. BUFFER’s extracted spectral feature statistics differed significantly from RTC within sbbd and sbbg classes (
Figure 4 and
Table 11).
The limited extent of buffer polygons was more significant for elevation features than spectral features, especially in more massive crowns. It resulted in a much lower crown area compared to RTC and other delineation methods (
Figure 4g–h). Moreover, the BUFFER method’s application resulted in higher quartile values of the tree heights because the buffer covered the topmost part of the trees. This, in fact, was more evident in the case of zq25 and median quartiles of spruce trees because no lower parts of tree crowns were present (
Figure 4i–n,
Table 12), increasing the quartile values.
In contrast, the automatic delineation methods based on PPC segmentation (DAL, MCWS, and LI) were more accurate than the BUFFER method. This resulted in almost no significant differences between the extracted spectral features of RTC and AITC polygons within disturbance classes, regardless of PPC density. The results revealed only the significant differences among RTC and LI NDRE values of more decimated las1 inside the sbbg class.
Due to the lower ITCD accuracy of the las1 dataset, there were significant differences among crown areas causally related to delineation’s geometrical accuracy. The MCWS method tended to slightly undersegment the crowns. The undersegmentation was more evident in las1 (
Table 9), but it was also significant using las02. The undersegmentation resulted in significantly different (larger) crown areas compared to RTC (
Table 12). In contrast, DAL and especially LI tended to oversegment the crowns in the case of strongly decimated las1 PPCs (see
Table 9). This led to significant differences in these methods’ elevation features in the case of las1 because the AITC was oversegmented (smaller) compared to RTC (
Figure 4h and
Table 12). However, there were almost no significant differences in extracted elevation statistics among RTC and AITC of las02 regardless of the applied delineation method.
To fully understand the effect of PPC decimation on feature extraction and consequent differences between las02 and las1 within treatment versus control tests (
Table 11 and
Table 12, highlighted in yellow), we additionally performed a Kruskal–Wallis test between treatments (RTC, las02, and las02) of each method within each class for all features. The results were entirely in line with the changes presented in
Table 11 and
Table 12. The changes in treatments versus control test results of las02 and las1 within the same disturbance class and feature were caused by significant differences between the tested statistics of las02 and las1. Therefore, the lower PPC density negatively and significantly affected the feature extraction in these cases. Moreover, it could be concluded that the method most sensitive to the decimation of the PPC is the LI method, because the ITCD was performed directly using a decimated PPC instead of CHM.
Based on the experimental results of this subsection, several conclusions were made. The selected spectral and elevation features are suitable for bark beetle disturbance detection documented by excellent disturbance class separability of RTC. The BUFFER method is less suitable for detecting bark beetle disturbance in the mixed forest using the individual tree-based approach because of the simplicity of crown delineation around treetop. The buffers’ application led to significant differences in extracted feature statistics compared to RTC using the Kruskal–Wallis test and post hoc treatment versus control multiple comparisons tests. Therefore, these significant dissimilarities reduce the separability of disturbance classes, especially tree species, using the BUFFER method. The other methods (DAL, MCWS, and LI), which delineated the full crown based on PPC or subsequently derived CHM, led to insignificant differences of extracted feature statistics compared to RTC. However, these methods are affected by PPC density. The more decimated PPC las1 revealed significant differences in extracted feature statistics (especially elevation features) compared to RTC due to significantly lower crown delineation accuracy caused by low point density. If the PPC is sparse (less than 4 points/m2), it results in undersegmentation (MCWS) or oversegmentation (DAL and LI) of delineated crows, which negatively affect the feature extraction. The PPC-based delineation method (LI) is more sensitive to the cloud’s point density than CHM-based methods. Therefore, we assume that the most accurate and the most robust to point density is the watershed segmentation for detecting bark beetle infestation using the individual tree-based approach. Moreover, the spectral features are more robust to the ITCD variability of applied methods and the lower PPC density because the significant differences between RTC and delineated crowns were less frequent than for elevation features. It is an advantage for bark beetle infestation detection.