1. Introduction
Forest responses to environmental change in the coming decades could have major impacts on the global terrestrial carbon cycle. Tree mortality rates have increased steadily in many regions as a result of drought, heat stress, and insect outbreaks [1], and there is growing concern that many forest regions could soon become net carbon sources [2]. Future projections are limited, however, by our incomplete understanding of tree mortality, a highly complex process [3] that has a large influence on forest dynamics. Data and models that can help quantify annual rates of tree mortality and predict how mortality responds to environmental conditions are therefore needed. To our knowledge, [4] is the only study to date that has used vegetation indices to predict tree mortality; in that case, the indices were derived from satellite images.
Many efforts have used ground-based data, such as a tree’s diameter at breast height (DBH) or stand basal area, for predicting tree mortality [3,5], but this approach is not suitable for predicting mortality across larger areas. As an alternative, remote sensing data can be used to predict tree mortality from visible (RGB), red edge, and near-infrared (NIR) reflectance [6]. Spectral data provide information on leaf traits such as photosynthetic capacity, water, soluble carbon, or nitrogen content [6,7] and have been used as indicators of tree health and overall vitality [6,8]. A number of studies have successfully used remotely sensed spectral data to detect insect outbreaks [9,10,11,12,13,14,15,16], fire [17,18,19], and tree mortality across large forest landscapes [1,20,21].
Unmanned aerial vehicles (UAVs) can provide high-resolution aerial imagery that allows for the identification and analysis of individual tree crowns. Recent studies have used crown-level UAV data for species identification [22] and to measure size [23], competition [24], forest health [10,11,25,26], insect attacks [9,15,16,27], and fire damage [18] at the tree level. Additionally, UAV imagery has been used in combination with satellite images to combine the benefits of a high-resolution UAV image and a large-scale satellite image [28,29,30]. UAV imagery also offers considerable potential for predicting tree death based on the spectral features of individual crowns. For example, [31] estimated the canopy cover of different species from UAV imagery to investigate its impact on tree mortality.
In the present study, we predict tree mortality based on spectral indices derived from RGB, red edge, and NIR reflectance. We extracted band values for individual tree crowns of three tree species using high-resolution UAV imagery and developed predictive models for the probability of individual trees dying within the next year. Our specific research objectives were to determine whether: (1) spectral indices for tree crowns can effectively predict tree mortality; (2) multispectral indices such as the normalized difference vegetation index (NDVI) or the normalized difference red edge index (NDRE) contribute additional predictive value beyond RGB data; and (3) the accuracy of mortality predictions varies among different tree species.
2. Materials and Methods
2.1. Study Area
Field data were collected in Cypress Hills Interprovincial Park, a 35,000 ha protected area located on the southern Alberta–Saskatchewan border in western Canada (49°40′N, 110°15′W). The climatic zone of the park is sub-humid with a mean annual temperature of approximately 2 °C and annual precipitation of 550 mm [24]. The Cypress Hills landscape comprises a mix of fescue prairie and forests dominated by three tree species: Pinus contorta (lodgepole pine), Picea glauca (white spruce), and Populus tremuloides (trembling aspen). Most of the present forest originated between 1880 and 1890 following large fires [32]. Average tree density at our study sites is approximately 500 trees/ha, with a mean DBH of 25 cm and a mean canopy height of 15 m.
2.2. Data Acquisition
In the summers of 2019 and 2020, we flew a DJI Matrice 200 v2 (Beijing, China) equipped with a Sentera AGX-710 sensor to obtain multispectral aerial imagery of 38 forest stands (Figure 1). We stratified the 38 stands by both dominant species and elevation, such that they contained approximately equal representation of the three tree species and spanned the elevation range for each. The sensor provided reflectance values for five separate bands: blue (446 nm), green (548 nm), red (650 nm), red edge (720 nm), and NIR (840 nm). We also collected RGB imagery of these stands during leaf-off conditions in earlier flights (spring 2018 and 2019). Image acquisition flights were planned according to the recommendations of [33]. During each flight, the UAV flew to a predetermined altitude 60 m above the canopy. The UAV then repeatedly traversed a 150 m by 200 m rectangular area in a series of parallel flight lines that maintained >90% side and forward overlap between adjacent images. As it flew over the site, the UAV captured downward-facing photos approximately every 2 s, with a single flight lasting 24 min on average. Most flights were conducted at midday in bright light conditions.
2.3. Data Processing
We used photogrammetric software (Agisoft Metashape v. 1.6.4, St. Petersburg, Russia) to generate 3D point clouds (55 points ) and orthophotos (5.1 ± 0.9 cm per pixel) for each stand. We used orthophotos from the spring of 2018 and 2019, taken during leaf-off conditions, to manually identify the center of individual tree crowns (n = 56,231) and to identify species for a subset of the trees (n = 24,303). We then used orthophotos from the summers of 2019 and 2020 to mark all trees as alive or dead in those years, based on their leaf cover and vertical position. Trees that were leafless in summer or had fallen over were labeled as dead.
To determine the accuracy of the labeling process, we compared a subset of the tree tops (n = 355) to ground-truthed data collected in 2019. We excluded small trees with either DBH < 35 cm or height < 15 m because it was not always possible to identify them in orthophotos. Among the remaining ground-truthed trees, 326 trees were true positives (live trees marked as alive) and 20 trees were true negatives (dead trees marked as dead). Only one tree was a false negative (a live tree marked as dead), and eight trees were false positives (dead trees marked as alive). In total, 97% of trees were correctly labeled, with a Kappa value of 0.80.
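The agreement statistics above follow directly from the four reported counts. The sketch below is a worked check rather than the code used in the study; the function name is hypothetical, and "positive" is taken to mean a live tree, as in the text.

```python
def accuracy_and_kappa(tp, tn, fp, fn):
    """Return (overall accuracy, Cohen's kappa) for a 2x2 confusion matrix."""
    n = tp + tn + fp + fn
    observed = (tp + tn) / n
    # Chance agreement computed from the reference and label marginals.
    actual_pos, actual_neg = tp + fn, tn + fp
    labeled_pos, labeled_neg = tp + fp, tn + fn
    expected = (actual_pos * labeled_pos + actual_neg * labeled_neg) / n**2
    kappa = (observed - expected) / (1 - expected)
    return observed, kappa

# Counts reported for the ground-truthed subset (positive = live tree).
acc, kappa = accuracy_and_kappa(tp=326, tn=20, fp=8, fn=1)
print(f"accuracy = {acc:.3f}, kappa = {kappa:.3f}")
```

With these counts, the calculation agrees with the reported 97% accuracy and Kappa of 0.80 after rounding.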
Next, we extracted the mean values of red, green, blue, red edge, and NIR bands in a 1 m-diameter circle centered on each tree’s crown in the 2019 and 2020 orthophotos. Average crown widths for lodgepole pine, trembling aspen, and white spruce varied between 1 m and almost 6 m, depending on the trees’ diameter [24]. We could not be sure that each tree was marked in the center of its crown, and so we adopted a conservative circle size to minimize the chance of including pixels from outside a given tree’s crown. We used these band values to calculate eleven spectral variables for each tree’s crown: percent greenness (PG), excessive red (ER), normalized difference index (NDI), excessive green index (EGI), excessive green minus red index (EGMRI), visible atmospherically resistant index (VARI), green leaf index (GLI), normalized color intensities (NCI), crown brightness (Bright), NDVI, and NDRE (Table 1). These indices were chosen because they capture the relative reflectance of green, NIR, and red edge bands, which are related to factors such as the number of bare branches in a crown, leaf chlorophyll content, and active photosynthesis [34,35,36,37,38,39]. Tree mortality has been linked to all of the above variables through their relationships with leaf discoloration (i.e., chlorosis) and defoliation [40,41,42].
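As an illustration of how crown-level band means translate into indices, the sketch below computes five of the eleven variables. The formulas are the commonly used definitions of these indices and are an assumption here, since Table 1 is not reproduced in this section; the function name and the reflectance values are hypothetical.

```python
def spectral_indices(red, green, blue, red_edge, nir):
    """Compute a few crown-level indices from mean band reflectance values.

    Formulas are the commonly used definitions (assumed, not taken from Table 1).
    """
    return {
        "PG": green / (red + green + blue),
        "GLI": (2 * green - red - blue) / (2 * green + red + blue),
        "NDI": (green - red) / (green + red),
        "NDVI": (nir - red) / (nir + red),
        "NDRE": (nir - red_edge) / (nir + red_edge),
    }

# Hypothetical mean reflectance values for two crowns (illustration only).
healthy = spectral_indices(red=0.04, green=0.08, blue=0.03, red_edge=0.25, nir=0.45)
stressed = spectral_indices(red=0.08, green=0.07, blue=0.05, red_edge=0.20, nir=0.28)

for name in healthy:
    print(f"{name}: healthy = {healthy[name]:.2f}, stressed = {stressed[name]:.2f}")
```

In this toy example, the crown with lower green and NIR reflectance relative to red has lower GLI and NDVI values, which is the kind of contrast the indices are intended to capture.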
2.4. Classification
We used the spectral indices from 2019 and 2020 to predict the species of individual trees with a random forest model [44] that included the 11 spectral indices listed above. We selected all trees whose species had been manually labeled (n = 24,303; 23% lodgepole pine, 26% trembling aspen, 51% white spruce), trained the model using 80% of those crowns, then tested the goodness-of-fit on the remaining 20%. The classification model had an accuracy of 80% and a Kappa value of 0.66. Because of the moderate Kappa score, we proceeded with three different approaches. In the first approach, we used only the trees whose species had been manually labeled. In the second approach, we used the random forest model to predict the species of all remaining unlabeled trees in the data set, which resulted in 52,845 labeled trees (20% lodgepole pine, 23% trembling aspen, 57% white spruce). In the third approach, we applied a cutoff value of 0.6 to the estimated probability of correct species identification and excluded cases where the maximum class probability was below this value. With this setting, 16,969 trees were assigned a species label from the model, producing a total of 41,272 trees classified to species in our data set (21% lodgepole pine, 23% trembling aspen, 56% white spruce). Using this cutoff value resulted in an accuracy of 87% and a Kappa value of 0.79 for the independent testing data.
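The third approach amounts to a simple filter on the classifier's class probabilities. A minimal sketch is given below, assuming the probabilities are available as one dictionary per tree; the function name, tree identifiers, and probability values are hypothetical.

```python
def confident_labels(class_probs, cutoff=0.6):
    """Keep only crowns whose maximum class probability reaches the cutoff."""
    kept = {}
    for tree_id, probs in class_probs.items():
        species, p_max = max(probs.items(), key=lambda kv: kv[1])
        if p_max >= cutoff:  # cases below the cutoff are excluded
            kept[tree_id] = species
    return kept

# Hypothetical class probabilities for three unlabeled crowns.
example_probs = {
    "t1": {"pine": 0.70, "aspen": 0.20, "spruce": 0.10},
    "t2": {"pine": 0.40, "aspen": 0.35, "spruce": 0.25},
    "t3": {"pine": 0.15, "aspen": 0.25, "spruce": 0.60},
}
labels = confident_labels(example_probs)
print(labels)
```

Raising the cutoff trades coverage for label reliability: fewer trees receive a species label, but the labels that remain are more likely to be correct.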
2.5. Statistical Analysis
We sought to predict the death of individual trees based on the values of their crown’s spectral indices in 2019. Our data comprised 52,845 trees that were alive in 2019, 977 (1.8%) of which were dead in 2020. We modeled tree mortality using two different algorithms, logistic regression and random forest, with aliveness as the response variable and spectral indices as predictor variables. We split our data into training (80%) and testing (20%) partitions and evaluated model performance using different goodness-of-fit statistics on the testing set. We tested for collinearity among predictor variables using their pairwise Pearson correlation coefficients. For the logistic regression model, we excluded variables that were highly correlated (r > 0.9) with others, then used forward selection to build a parsimonious model with relevant predictor variables. We verified the importance of variables in the final model with the Boruta algorithm (package Boruta in R), which is a variable selection algorithm that classifies predictor variables as “important” and “not important” [45]. Random forest models are more robust to correlation among predictor variables because only a random subset of predictors is considered at each split. Therefore, we included all of the spectral indices in the random forest model, as suggested by the Boruta algorithm. To test for overfitting, we trained a series of random forest models, starting with a single predictor and adding predictors one at a time, and tested each model on our independent testing data set. During this process, there were no signs of overfitting. Parameter tuning for random forest was performed using the caret package [46], with a grid-based search for the optimal number of variables randomly sampled as candidates at each split and the optimal number of trees to grow.
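The collinearity screen can be sketched as a search for predictor pairs whose Pearson correlation exceeds the 0.9 threshold. The example below is a plain-Python illustration, not the R code used in the study; the function names and the toy index values are hypothetical, and comparing the absolute value of the coefficient to the threshold is an assumption.

```python
from itertools import combinations
from math import sqrt

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def highly_correlated_pairs(columns, threshold=0.9):
    """Return (name_a, name_b, r) for every pair with |r| above the threshold."""
    pairs = []
    for a, b in combinations(sorted(columns), 2):
        r = pearson(columns[a], columns[b])
        if abs(r) > threshold:
            pairs.append((a, b, r))
    return pairs

# Hypothetical index values for five crowns (illustration only).
toy = {
    "GLI": [0.10, 0.20, 0.30, 0.40, 0.50],
    "PG": [0.34, 0.37, 0.41, 0.44, 0.48],
    "NDVI": [0.80, 0.50, 0.90, 0.60, 0.70],
}
flagged = highly_correlated_pairs(toy)
print(flagged)
```

The function only flags pairs; deciding which member of each pair to drop is a separate step that depends on a criterion such as variable importance.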
Because the number of class observations (alive or dead in 2020) was highly unbalanced, we tested different sampling methods with both algorithms, including undersampling, oversampling, and a combination of the two. For undersampling, we set the sample size to 1500, which is approximately twice the number of dead trees in our default training set. For oversampling, we set the sample size to 81,000, which is approximately twice the number of alive trees in our default training set. The sample size for a combination of oversampling and undersampling was set to 41,500. For the oversampling and undersampling approaches, the probability of sampling from the rare class was chosen automatically to achieve the target number of samples from each class. For the combination of both methods, it was set to 0.5. We also used the random oversampling examples (ROSE) technique and the synthetic minority oversampling technique (SMOTE) (R packages ROSE [47] and DMwR [48]). These sampling algorithms produced six different data sets (one default data set and five data sets using different sampling methods), each of which was used to fit a logistic regression and a random forest model. In addition, we created one random forest model by using stratified sampling (randomForest package [49]), with a sample size of 300 for both dead and living trees. To obtain the best classification results, we used Youden’s index [50] to find the optimal classification threshold probability for each model (pROC package [51]).
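Youden’s index selects the threshold that maximizes J = sensitivity + specificity − 1. The sketch below illustrates the idea in plain Python and is not the pROC implementation; the function name and toy data are hypothetical, "alive" is treated as the positive class, and both classes are assumed to be present.

```python
def youden_threshold(probs_alive, is_alive):
    """Return (threshold, J) maximizing J = sensitivity + specificity - 1.

    A tree is classified as alive when its predicted probability of being
    alive is at least the threshold. Assumes both classes are present.
    """
    n_pos = sum(is_alive)
    n_neg = len(is_alive) - n_pos
    best_t, best_j = None, -1.0
    for t in sorted(set(probs_alive)):  # candidate thresholds
        tp = sum(1 for p, y in zip(probs_alive, is_alive) if y and p >= t)
        tn = sum(1 for p, y in zip(probs_alive, is_alive) if not y and p < t)
        j = tp / n_pos + tn / n_neg - 1
        if j > best_j:
            best_t, best_j = t, j
    return best_t, best_j

# Hypothetical predicted probabilities of being alive and true outcomes.
probs = [0.99, 0.98, 0.97, 0.95, 0.90, 0.60]
alive = [1, 1, 1, 0, 1, 0]
threshold, j_stat = youden_threshold(probs, alive)
print(threshold, j_stat)
```

When one class is rare, as here, the selected threshold can sit far from 0.5, because most trees receive a high predicted probability of survival.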
We compared the different models using the area under the curve (AUC) of the receiver operating characteristic curve, as well as the values for sensitivity and specificity. The best models for the random forest and logistic regression algorithms were retained. To determine which algorithm provides the most accurate and stable model, we randomly assigned observations to the training (80%) and testing (20%) data sets. The models were fit to all the training data, and predictions were tested on the testing set. This process was then repeated 200 times, with a different random assignment to the training or testing sets each time. Afterwards, we compared the results and chose the model with the better performance according to the AUC, sensitivity, specificity, and balanced accuracy metrics. After selecting the final model, we evaluated it for the three different tree species using: (1) the data set comprising trees that were manually labeled as a specific species; (2) the data set comprising all trees that could be labeled with a minimum classification probability of 0.6; and (3) the data set in which all trees were labeled as a specific species. We compared the AUC values, sensitivity, specificity, and balanced accuracy to determine if the final model performed well on all species.
To test whether multispectral indices provide additional information for predicting tree mortality, we repeated the process of model selection without using multispectral indices (RGB models). We chose the best models for both algorithms and then, again, randomly assigned observations to the training (80%) and testing (20%) data sets, repeating this process 200 times, with a different random assignment to the training or testing sets each time. We calculated variable importance using the mean decrease accuracy (MDA) and mean decrease AUC (MDAUC) (randomForest [49] and party packages [43]), following the recommendations of [43]. We also considered whether the forward stepwise model and Boruta algorithm included multispectral indices in the final model and therefore labeled those predictor variables as important. As additional model comparison tools, we compared the logistic regression models using the Akaike information criterion (AIC) and random forest models using out-of-bag (OOB) errors.
After fitting the mortality models, we predicted the mortality rates for each of the 38 stands and used Spearman rank correlations to evaluate their relationships with observed mortality, with tree density, and with the percentage of trembling aspen (defined as the number of trees labeled as trembling aspen at a specific stand divided by the total number of trees at this stand). We used these stand-level correlations to assess the model’s ability to identify stands with high and low mortality rates, as well as how mortality varied with simple measures of structure and composition.
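The stand-level comparison relies on Spearman rank correlation, which is the Pearson correlation of the ranked values. A minimal plain-Python sketch is shown below; the function names and the five stand-level rates are hypothetical and serve only to illustrate the calculation.

```python
def ranks(values):
    """Average ranks (1-based); tied values share their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    r = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            r[order[k]] = mean_rank
        i = j + 1
    return r

def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    rx, ry = ranks(x), ranks(y)
    n = len(x)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    vx = sum((a - mx) ** 2 for a in rx)
    vy = sum((b - my) ** 2 for b in ry)
    return cov / (vx * vy) ** 0.5

# Hypothetical observed and predicted mortality rates (%) for five stands.
observed = [0.5, 1.0, 2.0, 3.5, 1.5]
predicted = [8, 10, 15, 30, 9]
rho = spearman(observed, predicted)
print(round(rho, 2))
```

Because only ranks are compared, a model that is biased in absolute terms can still achieve a high correlation if it orders the stands correctly.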
Figure 2 summarizes the workflow used to process the remotely sensed data.
3. Results
There was a high correlation between the spectral indices NDI and VARI, between EGI and EGMRI, and between GLI and PG in our data set. We therefore excluded the variables VARI, EGI, and PG from the logistic regression models, as their variable importance values were lower compared to the other variable in each highly correlated pair (
Table 1). The forward selection and Boruta algorithms both included all remaining predictor variables in the model. The logistic regression models that used multispectral indices for predicting aliveness therefore included NDI, ER, EGMRI, GLI, Bright, NCI, NDVI, and NDRE as predictor variables. For the logistic regression, the default data set produced a similar AUC score (88.7%) to the data sets that were filtered using different sampling methods (range 88.1–88.7%). The specificity (76–80%) and sensitivity (86–90%) values were similar for all data sets. As we focused on predicting dead trees correctly, we compared the two models with the highest specificity values, which were the combination of undersampling and oversampling, as well as the default data set with a specificity of 80% and a sensitivity of 86%. As the combination of undersampling and oversampling overestimated tree mortality slightly more than the model using the default data set (false negatives: 1458 vs. 1449 cases), we chose the default data set as our best logistic regression model. Youden’s index indicated that the optimal classification probability threshold was 0.98 for the default data set.
For the random forest algorithm, the stratified data sample including all variables produced a higher AUC score (89.8%) than models using other or no sampling methods (range 86.8–89.0%). In contrast to the logistic regression models, the specificity (11–80%) and sensitivity (83–100%) values were variable. The highest specificity values were obtained with the stratified sample (specificity: 78%, sensitivity: 89%) and undersampling (specificity: 80%, sensitivity: 83%). As stratified sampling had higher AUC and sensitivity values and resulted in a less-severe overprediction of tree mortality (false negatives: 1173 vs. 1810 cases) compared to undersampling, we chose it as our most promising random forest model. Comparing the best logistic regression and random forest models using the 200 randomly assigned training and testing data sets, the stratified random forest slightly outperformed the default logistic regression model with a mean AUC of 89.8% compared to 88.7% (
Figure 3). Furthermore, the issue of the overprediction of tree mortality was less severe for random forest (false negatives: 1173 cases) in contrast to logistic regression (false negatives: 1449 cases), while the sensitivity and specificity values were similar. The balanced accuracy for random forest was equal to logistic regression (0.83). We therefore adopted the stratified random forest model as our final model (
Table 2). This model had an AUC of 0.92, an accuracy of 0.88, a sensitivity of 0.89, and a specificity of 0.78. However, all models overestimated the annual mortality rate due to false negatives (trees predicted to die that survived). The predicted mortality rate was 13%, whereas the actual mortality rate was only 1.8%. This issue could not be solved by using the mean probabilities for deaths instead of a binary classification. For falsely classified trees, it was often the case that the lighting conditions were sub-optimal (
Figure 4). False negative classifications (trees predicted to die that survived) sometimes occurred when a particular tree crown was in the shadow of another tree, and false positive cases (trees predicted to survive that died) often occurred when a tree crown was overexposed. Despite this large bias, across the 38 stands, there was a clear positive relationship between the reference mortality rate and predicted mortality rate (Spearman rank correlation r = 0.77) (
Figure 5). There was a moderate negative correlation between tree density and predicted mortality rates (Spearman rank correlation r = −0.51) and a weak positive correlation between the percentage of trembling aspen at each stand and predicted mortality rates (Spearman rank correlation r = 0.19;
Figure 5).
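The size of the overprediction can be approximated from the reported sensitivity, specificity, and observed mortality rate. The sketch below is a back-of-the-envelope check, not part of the original analysis; the function name is hypothetical, and "alive" is the positive class, so sensitivity refers to survivors and specificity to trees that died.

```python
def predicted_dead_fraction(mortality_rate, sensitivity, specificity):
    """Share of all trees flagged as dead, with "alive" as the positive class."""
    # Trees that died and were correctly flagged as dead.
    flagged_and_died = mortality_rate * specificity
    # Trees that survived but were flagged as dead (false negatives).
    flagged_but_survived = (1 - mortality_rate) * (1 - sensitivity)
    return flagged_and_died + flagged_but_survived

rate = predicted_dead_fraction(mortality_rate=0.018, sensitivity=0.89, specificity=0.78)
print(f"{rate:.3f}")
```

With the rounded values reported here, about 12% of trees would be flagged as dead, close to the 13% predicted mortality rate. Nearly all of that share comes from the 11% of survivors that are misclassified, which shows why a small error rate in the large class dominates when mortality is rare.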
Variable importance calculations produced similar results using MDA and MDAUC. GLI and PG were rated as the most important variables, followed by a large drop in importance to the next three variables (EGMRI, EGI, ER) (Table 1). NDVI and NDRE were the least important predictor variables, but they still contributed to the model’s predictive ability. With multispectral indices excluded, our default logistic regression model used NDI, ER, EGMRI, GLI, Bright, and NCI as the predictor variables. Excluding the multispectral indices decreased model performance slightly, with mean AUC values dropping from 88.7% to 87.2% for logistic regression and from 89.8% to 87.9% for stratified random forest (Figure 3). For logistic regression, the sensitivity and specificity values dropped by 1%, and for random forest, the specificity values decreased from 78% to 76%, while the sensitivity values did not change. The balanced accuracy dropped by 1% when excluding multispectral indices for both models. AIC worsened from 5368 to 5608, and the OOB error rate increased from 12.18% to 12.92% when NDVI and NDRE were not used.
The mean red, green, blue, red edge, and NIR reflectance values in 2019 and 2020 were extracted for the trees that were alive in 2020 and for those that died between 2019 and 2020 (
Table 3). Trees that died had a higher absorbance of green, NIR, and red edge bands and a higher reflectance of red and blue bands, in both years. In 2019, trees that later died had lower GLI and NDVI values than those that survived. Both indices decreased in 2020 among the trees that died, but were unchanged for trees that survived (
Figure 6).
Our final model, the stratified random forest model, was used to evaluate the performance at the species level. As there were only minor differences in model performance among data sets that used different species classification approaches (trees with manually labeled species only; all trees using predicted species; trees where the predicted species probability exceeds 0.6), we evaluated model performance for individual species using all tree crowns classified as a certain species in our data set. Lodgepole pine and trembling aspen both had the highest AUC scores (92%), whereas that of white spruce was somewhat lower (87%). However, for lodgepole pine, the sensitivity was 0.97, but the specificity was only 0.50, which indicates that half of the trees that died were falsely classified as alive. Although the sensitivity values for trembling aspen and white spruce were somewhat lower than that of lodgepole pine (0.85 and 0.87, respectively), they each had higher specificity values (0.85 and 0.68, respectively) (Table 2). Balanced accuracy was 0.85 for trembling aspen, 0.78 for white spruce, and 0.73 for lodgepole pine.
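Balanced accuracy is the mean of sensitivity and specificity, so the species-level values can be checked from the numbers above. The sketch below is a worked check with a hypothetical function name; because the inputs are already rounded to two decimals, the results may differ from the reported values in the last digit.

```python
def balanced_accuracy(sensitivity, specificity):
    """Mean of sensitivity and specificity."""
    return (sensitivity + specificity) / 2

# Sensitivity and specificity as reported for each species.
reported = {
    "trembling aspen": (0.85, 0.85),
    "white spruce": (0.87, 0.68),
    "lodgepole pine": (0.97, 0.50),
}
for species, (sens, spec) in reported.items():
    print(f"{species}: {balanced_accuracy(sens, spec):.3f}")
```

The metric weights the two classes equally regardless of their frequency, which is why lodgepole pine scores lowest despite having the highest sensitivity.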
4. Discussion
Between 2019 and 2020, our study area experienced a tree mortality rate of 1.8%. Although it was not possible to attribute specific causes of death, both biotic and abiotic disturbances occur in forests of the Cypress Hills. Mountain pine beetle (Dendroctonus ponderosae) can kill lodgepole pine trees on a regular basis [52], and both forest tent caterpillar (Malacosoma disstria) and fungal diseases can attack aspen trees [53]. Moisture limitation is also present, and a 2017 summer drought may have had a long-lasting negative effect on tree health. Although forests in the Cypress Hills were regularly disturbed by fire in the past, no major fires have occurred since the late 19th century. The 1.8% mortality rate we observed is not unusual for this area.
Spectral indices for individual tree crowns correctly predicted 78% of tree deaths and 89% of trees that survived over the following year. As our data set contained relatively low numbers of dead trees (977), we used different sampling techniques to compensate for imbalanced classes. The resulting random forest and logistic regression models performed similarly well (AUC ranging from 86.8–89.8%) and were successfully validated with independent test data. However, all models overestimated tree mortality roughly sevenfold (a predicted rate of 13% versus an observed rate of 1.8%).
A random forest model based on stratified sampling of the data had slightly greater predictive power than a logistic regression model, but scores for AUC, sensitivity, specificity, and balanced accuracy showed that both types of models performed well. Selecting a high classification probability threshold helped to minimize predictive errors that arose from unbalanced classes (more than 98% of all trees survived to the following year). Although the stratified sampling approach and classification threshold that we used resulted in mortality being overpredicted, our models showed that aerial imagery from UAVs has strong potential for identifying trees and stands with high mortality risk.
In our models, sensitivity was always higher than specificity, which meant that the model was able to predict trees that survived more accurately than trees that died. One reason could be that the model has much more data available to learn patterns in spectral indices for live trees. The model tends to classify more trees as dead than there are in reality, but in doing so, it correctly identifies more than three-quarters of the trees that die. This leads to a substantial overestimation of tree deaths. Despite this bias, the model can successfully predict which stands are expected to have high or low mortality over the following year (Figure 5). We also explored simple associations between predicted mortality and stand structure and composition. Low-density stands tended to have greater predicted mortality rates, consistent with the negative correlation between tree density and predicted mortality, but predicted mortality rates were only weakly related to the percentage of trembling aspen (Figure 5).
There are several potential sources of prediction error in our models. For example, our model could not easily differentiate trees with signs of decline that survived from those that died. White spruce and lodgepole pine are resilient to a certain degree of defoliation, but may eventually die after several years [54,55]. Long-term monitoring would help in predicting the eventual fate of trees that did not die within one year. Another major source of error is incorrect reference labeling, in which trees are labeled as dead but are actually alive, or vice versa. As we saw during ground-truthing, such errors occurred and could lead to a major prediction bias. Some errors appeared to be related to poor illumination, as numerous trees were incorrectly predicted to die when their crown was in the shadow of neighboring trees (as shown for white spruce in Figure 4). Aspen trees sometimes had exposed branches without leaves (Figure 4), and the model tended to predict these trees would die even if they were otherwise healthy. Conversely, trees with overexposed crowns were often incorrectly predicted to survive. The light saturation of these crowns likely reduced the predictive value of their spectral indices. Furthermore, the method used to extract band values may not be optimal for some individual trees. By extracting data from a 1 m-diameter circle at a manually interpreted crown center, we may not have captured the spectral characteristics of the entire crown. If the manually interpreted crown center does not align with the actual crown center, then, depending on the crown width, the extracted band values may include non-crown pixels or represent only a small part of the crown.
4.1. Evaluating Different Spectral Indices
Reflectance differed between trees that died and those that survived, especially for the red, blue, and NIR bands. The greater reflectance in the red and blue bands of trees that later died could be because of reduced chlorophyll and carotenoids, which is often referred to as “blue shift” [56]. The difference in the red band can be explained by reduced chlorophyll relative to anthocyanin concentrations [56,57]. Lower reflectance values in the red edge band likewise indicate lower levels of chlorophyll in trees that later died [58]. These patterns are signs of reduced tree health. The very low NIR reflectance of trees that later died compared to trees that survived could be a sign of water stress [59], which is consistent with the decrease of the NIR band values for dying trees between 2019 and 2020. These findings are consistent with [42], who found increased spectral reflectance of red and a decrease in green wavelength reflectance.
GLI (which is sensitive to leaf chlorophyll content [38]) and PG were the top predictor variables in our models. Chlorophyll loss may therefore be the dominant signal of tree mortality, as GLI values were low in trees that later died (Figure 6). GLI and PG were highly correlated, and therefore rated as almost equally important. Surprisingly, the multispectral indices NDVI and NDRE did not show high predictive power, as they were rated as the least important predictor variables. Studies have found NDVI to be useful for predicting vegetation growth, as it correlates well with defoliation [38]. It appears that most of the information in NDVI that indicates a tree is near death is better expressed through GLI. Nevertheless, our results suggest that NDVI contains useful information for predicting tree mortality, which is consistent with [1], who found clear early warning signals in NDVI regarding tree mortality. NDRE was also used successfully to detect early warning signals for specific tree species, in particular for earlier stages of stress [39].
The forward stepwise model and the Boruta algorithm both considered NDVI and NDRE to be important predictor variables and consequently included them in the model. The slight improvements in AIC and OOB error when using multispectral indices also justified their inclusion in the final classification model. However, when evaluating variable importance, NDVI and NDRE were found to have less predictive power than any of the RGB-derived indices. In many situations, the marginal increase in model performance may not be sufficient to justify the expense of obtaining multispectral imagery. Other spectral indices, such as those that capture moisture availability, may have greater value in cases where hyperspectral data are available [56]. Nevertheless, we should be cautious when interpreting variable importance for models that overpredict the target class and rely on error-prone reference data. We suggest that further research is necessary, for example by using different learning algorithms such as cost-sensitive algorithms, to evaluate the benefit of multispectral predictor variables for tree mortality.
4.2. Comparing Results for Different Tree Species
We found some differences in model performance across the three tree species, but the model still had good predictive ability for all three (AUC = 87.2–91.9%; Table 2). The model performed best for trembling aspen if correctly predicting trees that die and trees that survive are considered equally important. This is shown by the balanced values for sensitivity and specificity and by the higher balanced accuracy. Lodgepole pine and white spruce both showed lower specificity rates and balanced accuracy values, which means that the model is somewhat less effective in predicting individual tree mortality for these species (Table 2).
The differences in model performance among species could arise for three reasons. Firstly, white spruce and lodgepole pine are both coniferous species, for which it may be more difficult to predict tree mortality based on spectral indices. Trembling aspen is more likely to exhibit signs of decline (defoliation and dead branches) prior to death, which are easily detected in aerial imagery. Secondly, the data set contained few instances of lodgepole pine trees that died. A greater number of observations for lodgepole pine could lead to better performance for this species. Thirdly, the spectral values of tree crowns prior to the onset of mortality may vary among species [60]. Spectral index values that accurately predict the mortality of one species might not be as relevant to others.
5. Conclusions
Our study demonstrated the promise of using aerial imagery collected with a UAV to predict tree mortality. Both the random forest and logistic regression approaches were effective for predicting mortality over the following year using spectral indices for individual tree crowns, producing AUC scores of 89.8% and 88.7%, respectively. Including multispectral indices produced a small increase in model performance over RGB indices alone.
Further improvements are needed in order to avoid overprediction and evaluate the use of specific spectral indices. Such improvements could include the use of hyperspectral data, training the model with a larger and more balanced data set, using cost-sensitive learning algorithms, and testing the performance in new regions. Long-term monitoring could also yield additional information regarding variable importance and overprediction of tree deaths. Accurate tree mortality models based on spectral information would provide valuable information for understanding how tree mortality varies with environmental conditions and in modeling stand dynamics within forest ecosystems.