Next Article in Journal
Spatiotemporal Dynamics and Driving Mechanism of Aboveground Biomass Across Three Alpine Grasslands in Central Asia over the Past 20 Years Using Three Algorithms
Next Article in Special Issue
Large-Scale Monitoring of Potatoes Late Blight Using Multi-Source Time-Series Data and Google Earth Engine
Previous Article in Journal
Spatiotemporal Changes of Vegetation Growth and Its Influencing Factors in the Huojitu Mining Area from 1999 to 2023 Based on kNDVI
Previous Article in Special Issue
GVC-YOLO: A Lightweight Real-Time Detection Method for Cotton Aphid-Damaged Leaves Based on Edge Computing
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Assessing Sirex noctilio Fabricius (Hymenoptera: Siricidae) Damage in Pine Plantations Using Remote Sensing and Predictive Machine Learning Models

by
Andrés Hirigoyen
1,* and
José Villacide
2
1
Sistema Forestal, Instituto Nacional de Investigación Agropecuaria, INIA Las Brujas, Ruta 48 km 10, Rincón del Colorado, Canelones 90100, Uruguay
2
Grupo de Ecología de Poblaciones de Insectos, IFAB-INTA Bariloche, Modesta Victoria 4450, San Carlos de Bariloche 8400, Rio Negro, Argentina
*
Author to whom correspondence should be addressed.
Remote Sens. 2025, 17(3), 537; https://doi.org/10.3390/rs17030537
Submission received: 18 November 2024 / Revised: 10 January 2025 / Accepted: 3 February 2025 / Published: 5 February 2025
(This article belongs to the Special Issue Plant Disease Detection and Recognition Using Remotely Sensed Data)

Abstract

:
Early detection and monitoring of invasive forest pests are crucial for effective pest management, particularly in preventing large-scale damage, reducing eradication costs, and improving overall control effectiveness. This study investigates the potential of machine learning models and remote sensing at various spatiotemporal scales to assess forest damage caused by the woodwasp Sirex noctilio in pine plantations. A Random Forest (RF) model was applied to analyze Planetscope satellite images of Sirex-affected areas in Neuquén, Argentina. The model’s results were validated through accuracy analysis and the Kappa method to ensure robustness. Our findings demonstrate that the RF model accurately classified Sirex damage levels, with classification accuracy improving progressively over time (overall accuracy of 87% for five severity categories and 98% for two severity categories). This allowed for a clearer distinction between healthy and Sirex-infested trees, as well as a more refined categorization of damage severity. This study highlights the potential of machine learning models to accurately assess tree health and quantify pest damage in plantation forests, offering valuable tools for large-scale pest monitoring.

1. Introduction

Invasive forest pests are a major problem for plantation forests, drastically affecting the provision of goods and services worldwide [1]. The increasing arrival pressure of non-native species, coupled with changes in the dynamics of established pests, demands considerable management efforts to minimize their impacts on both private owners and government agencies. Among these efforts, early detection and monitoring of pest infestations are essential for effective pest management, particularly in preventing large-scale damage, reducing the costs associated with eradication and containment, and improving overall pest control effectiveness [2].
To assess forest health, a combination of in situ field methods and remote sensing techniques is typically used [3,4]. Field data provide local and specific biological information but can be costly and struggle to capture broad spatial patterns. In contrast, remote sensing (RS) offers cost-effective and efficient options for assessing forest vegetation health based on physical–chemical parameters, covering larger regions and overcoming the challenges of monitoring in remote locations [5]. Remote sensing techniques have proven to be valuable for identifying forest disturbances such as windstorm damage and pest infestations at different spatial scales [3,6,7]. These methods are indispensable for acquiring accurate data on pest distribution, spread, and intensity, facilitating the development of improved management strategies adapted to vulnerable forest areas.
Recent advances in remote sensing technologies, including unmanned aerial vehicles (UAVs) and satellite-based platforms, offer valuable tools for forest pest monitoring. Several studies have explored the potential of UAV-based hyperspectral and multispectral imagery for detecting pest-damaged trees. UAVs are mainly implemented at the stand scale due to the costs of image acquisition and processing, as well as the potential to create a long time series (TS) of images for the same areas of interest, but when monitoring programs need to cover larger areas, satellite-based RS approaches become essential [8]. Processing images by means of Machine Learning (ML) algorithms, including K-nearest neighbors, Random Forest (RF), Decision Trees, Artificial Neural Networks (ANN), and Support Vector machines (SVM), have been successfully employed in forest damage assessments, achieving accuracies ranging from 55% to 90% [6,9,10]. The use of these RS technologies in conjunction with these advanced techniques provides broader spatial and temporal resolutions, which are critical for mapping changes in forest pests over time.
One of the most promising satellite platforms is Planet, which operates a constellation of over 175 standardized CubeSat 3U nanosatellites, capturing high-resolution multispectral images daily. Planet’s satellites offer a spatial resolution of 3–4 m and capture data in red, green, blue, and near-infrared (NIR) spectral bands. These high-resolution images have been used successfully to detect forest disturbances, such as bark beetle infestations, by differentiating healthy trees from those in early stages of infestation [8,11]. The ability to create time series of such data makes satellite-based RS a powerful tool for assessing forest health and facilitating timely decision making in pest management programs. Remote sensing studies conducted by Villaverde (2020) [12] using Landsat and Sentinel platforms showed an upward trend in the values of MSI and NDVI spectral indices over a five-year period. These changes indicate a gradual deterioration of plant health and a decrease in plant vigor. The correlation between these results and the increase in water stress levels, coupled with the absence of adequate silvicultural practices, suggests an increase in the susceptibility of forests to attack by Sirex, a pest that finds optimal development conditions in weakened trees [12].
Time series analysis allows for the early detection of stress or damage to forests. By tracking the temporal evolution of spectral indices, researchers can identify early signs of decline before visible symptoms appear. TS analysis involves processing images acquired monthly to detect changes in the land surface caused by the infestation damage and distinguish them from changes due to other environmental factors, such as the natural behavior of light reflectance during the year. Several studies [7,13,14,15] use TS analysis to assess the impact of specific disturbances, such as insect outbreaks or climate change, on forest ecosystems. By comparing the temporal patterns of spectral indices before and after a disturbance, researchers can quantify the extent of damage and the rate of recovery. To capture the fine-scale spatial details necessary for symptom identification, high-resolution remote sensing data are required. Consequently, researchers have utilized sensors such as IKONOS, QuickBird, RapidEye, WorldView-2, ADS FieldSpec ProFR, and LIDAR to map insect-related forest disturbances [14]. By analyzing TS of vegetation indices or biophysical variables derived from satellite data, we can investigate changes in vegetation dynamics [7].
The invasive woodwasp Sirex noctilio Fabricius (Hymenoptera: Siricidae) is one of the most damaging insect pests affecting planted pine forests worldwide [16,17,18]. Severe outbreaks have been recorded in several invaded regions, with tree mortality rates reaching up to 75% in some areas of southern Argentina [10,18]. The population dynamics of S. noctilio are characterized by eruptive, pulse-like patterns, driven by both density-independent factors—such as the availability of weakened trees due to prolonged droughts—and density-dependent mechanisms [10,17,18]. While populations can remain at low, endemic levels for extended periods, attacking mostly stressed trees, outbreak events can lead to widespread mortality of healthy trees, resulting in significant economic consequences.
In non-native regions, management strategies for S. noctilio aim to reduce tree mortality through a combination of legal, biological, and silvicultural measures. Early detection of infestation outbreaks is critical for facilitating timely decision-making and improving the success of these management strategies [18,19]. While local-scale monitoring methods, such as visual surveys, trap trees, or kairomone intercept traps, can effectively detect the presence of Sirex [19], large-scale monitoring poses significant challenges. For government agencies and large forestry operations, having robust data on the pest’s distribution and damage over large areas is essential for quantifying the problem’s spatio-temporal extent and developing an appropriate intervention protocol.
In this study, we aimed to develop a remote sensing framework to identify tree mortality patterns caused by S. noctilio in pine plantations. Our underlying hypothesis was that spectral profile comparisons across different Sirex damage categories could serve as the basis for developing AI-based predictive machine learning models capable of differentiating between levels of infestation severity from remote sensing images. Specifically, our objectives were to (a) characterize Sirex damage categories based on spectral signatures, (b) apply machine learning algorithms to classify field plots into damage categories, (c) create time series of damage categories using selected indices, and (d) evaluate and validate the classification models across the study area. Given the increasing impact of forest pests worldwide, we hope this study will contribute to the development of new monitoring tools that facilitate the acquisition and comparison of data on pest damage across large regions, ultimately supporting the design and implementation of effective management strategies.

2. Materials and Methods

2.1. Study Areas

We used the regional S. noctilio aerial classification damage database maps, generated in February 2019 and 2023 as part of the official S. noctilio Monitoring and Control Programme led by the General Directorate of Plant Health and Agricultural Emergency, Ministry of Production and Industry, Neuquén Province, Argentina (see open data in: https://produccioneindustria.neuquen.gov.ar/programa-de-control-de-avispa-sirex/, accessed on 24 April 2024). This monitoring was extended over the entire planted area of Neuquén in northern Patagonia, Argentina, covering nearly 75 thousand hectares of both public and private pine plantations forest. Damage classification was carried out through visual assessments of Sirex-affected trees by qualified personnel during aerial helicopter surveys. Each pine plantation was assigned an attack level based on tree mortality ranges (i.e., proportion of Sirex-affected trees) representative of the different stages of Sirex population dynamics [17]. The categories of Sirex damage were classified as follows: no attack (no visible signs of Sirex-affected trees); incipient (affecting less than 5% of trees); mild (affecting 5–10% of trees); moderate (affecting 10–50% of trees); severe (affecting more than 50–70% of trees); and extreme (affecting more than 70% of trees) (see aerial survey categories in Appendix A).
To our study, we selected two Sirex-infested pine plantation areas for field data collection, characterized by similar tree ages and forest management approaches, including only partial early pruning and thinning. Firstly, to train our machine learning algorithm we chose a forest area located in Las Ovejas, a rural town in the northern region of Neuquén Province (hereafter referred to as the Las Ovejas plantation; 36.87°S–70.88°W). This area is planted with 6053 hectares of pure stands of Pinus contorta and Pinus ponderosa, ranging from 28 to 32 years old, with all levels of Sirex damage severity (from no damage to extreme), as identified in the 2019 aerial survey.
Additionally, to generate field validation to apply the ML algorithm selected in the previous step, we chose a second pine plantation area affected by Sirex, located in Junín de Los Andes city, in the southern region of Neuquén Province (hereafter referred to as the Junín plantation; 39.92°S–71.09°W—Figure 1). This area covers nearly 220 hectares, predominantly planted with 28-year-old lodgepole pines (Pinus contorta), with some patches of ponderosa pine (Pinus ponderosa). We generated an orthomosaic of the entire Junín planted area from images captured by a drone equipped with multispectral camera. Using QGIS [20], we generated circular sampling plots of 400 square meters regularly spaced by 100 m, in which we identified and counted in the field all healthy and Sirex-infested trees.
In both Las Ovejas and Junín areas, according to the proportion of Sirex-infested trees, we labeled each sampling plot according to the damage levels to compare with the result obtained from ML algorithm classification. We assume that all pixels within a plot are homogeneous and representative of the assigned label. Within each plot spectral variability is minimal, ensuring accurate classification. For instance, in a plot labeled as no damage, all pixels exhibit spectral values in RGB+NIR bands that are distinctly different from those labeled as severe, incipient, or moderate.
To address potential inaccuracies in visually assignment of intermediate damage categories, we expanded the labeling scheme beyond the original aerial monitoring approach (with five original categories—5LA) by implementing a Combined Labels Approach (CLA). Firstly, to produce classification maps of Sirex damage categories, we applied the Four-Labels Approach (4LA), which included the following labels: no damage, incipient, moderate (a combination of mild and moderate), and severe. Given the absence of areas with severe damage in the 2023 aerial survey data, we adopted a three-group categorization (3LA) that included the labels: no damage, incipient, and moderate (a combination of mild and moderate). Additionally, to generate presence/absence Sirex maps, we employed a Two-Labels Approach (2LA) to differentiate between no damage and affected, encompassing all categories of Sirex damage levels (see all labeling approaches in Appendix A).

2.2. Remote Sensing Data Acquisition and Processing

Remote sensing data were obtained from PlanetScope Ortho Tile products [21] with less than 20% cloud coverage for the Las Ovejas plantation. Images acquired between January 2019 and December 2023 (excluding the months from May to September due to the presence of snow) were used to create an orthomosaic by merging multiple scenes, which were orthorectified into a single strip and divided into a 25 × 25 km grid. All Images were subjected to radiometric, geometric, and sensor corrections. The analytical format consisted of a 4-band, 16-bit multispectral image (RGB-NIR) with an orthorectified spatial resolution of 3.125 m. Daily review was conducted, and 32 spectral indices (SI’s) were calculated for each image to assess vegetation greenness and stress levels (Appendix B). A similar approach was implemented for the Junín plantation, using images acquired in April 2023. Image pre-processing and processing were performed using ENVI (version 5.7) [22], and post-processing was performed with QGIS (version 3.16.3) [20] and R software (version 3.3.1) [23].

2.3. Approach to Land Cover and Forest Density Classification

For land cover and forest density classification, we utilized the approach proposed by Roy et al. (1996) [24], which isolates the vegetation feature space using the Canopy Shadow Index (SI), Bare Soil Index (BI), and Advanced Vegetation Index (AVI). These indices are the most useful for detecting and estimating canopy density, as they reduce the effects of atmospheric disturbances and vegetative influences [25]. The separation of canopy density and bare soil is significant in the color composite of the AVI + SI + BI combination for the orthomosaic. Pixels showing values of 0 or null for some of the components of this combination are eliminated, resulting in pure pixels with vegetation values.

2.3.1. Bare Soil Index

Bare soil index (Equation (1)) enhances the detection of bare soil areas, fallow lands, and vegetation with distinct background responses. Like the AVI, BI is a normalized index that differentiates between reflective (IR and B) and absorption (MIR and R) bands. This index aids in distinguishing vegetation with varying background conditions, including completely bare, sparse canopy, and dense canopy [24].
BI = {(MIR + R) − (IR + B)}/{(MIR + R) + (IR + B)}

2.3.2. Canopy Shadow Index

Crown arrangement in forest stands influences shadow patterns, affecting spectral responses. Young, even-aged stands have lower canopy shadow indices (Equation (2)) compared to mature natural forest stands. Mature forest stands exhibit a flatter and lower spectral axis compared to open areas. SI is directly proportional to forest canopy density, resulting in higher SI values for denser forests. The vegetation feature space is further stratified based on SI, considering the textural variations introduced by canopy shading in forest stands [25].
SI = [(255 − B) ∗ (255 − G) ∗ (255 − R)]1/3
where B is the blue band, G is the green band, and R is the red band

2.3.3. Advanced Vegetation Index

The NDVI has been shown to be sensitive to canopy foliage activity. Anon et al. (1993) suggested that subtle differences can be enhanced by using a power transformation of the infrared response. The resulting index, termed the advanced vegetation index (Equation (3)), has proven to be more sensitive to forest density and physiognomic vegetation classes [13].
AVI = [IR ∗ (255 − R) ∗ (IR − R)]1/3
where IR is the infrared band response and R is red band response.
We conducted a pixel-by-pixel classification of the image based on damage labeling generated during the 2019 aerial survey, which was further refined using a computer workstation. In this process, we created 776 circular plots with a radius of 20 m (1.256,64 m2), positioned on the orthomosaic using QGIS to capture representative regions within the study area. To enhance the accuracy of our analysis, we eliminated pixels from each plot that lacked values in any of the combined indices (AVI + SI + BI) (Figure 2). This step was crucial for reducing noise caused by ground refraction in areas exhibiting some degree of damage, ensuring that only pixels representing vegetation cover were retained. Furthermore, to maintain the integrity of our labels, we excluded plots where more than 10% of the pixels contained missing data and reassigned the remaining data as necessary. Since the final pixels were labeled according to plot damage category, a pixel-level model was fitted, providing pixel-specific results.
Mean values of spectral reflectance bands (4) and SI’s (32) for each plot class were extracted using the “extrac” function from the terra package in R.

2.4. Data Analysis

2.4.1. Feature Selection: Variable Importance Selection

The number of variables useful as predict input was reduced using permutation importance (PIMP) proposed by Altmann et al. (2010) [26] for reducing bias, based on a permutation test, returning significant p-values for each feature (Appendix B). PIMP is available in the vita package of R software [23]. The PIMP-derived spectral indices used as input for ML models was reduced using the Jeffries–Matusita (J–M) separability test [7]. Based on IS’s derived from PIMP, the J-M value was calculated for each severity class using the training set. J-M ranges from 0 to 2; values between 0 and 1 corresponds to very poor separability, and values between 1 and 1.5 suggest a poor separability, and high separability is associated with values in the range of 1.5 to 2. [27] allowed for precise selection of spectral bands that best discriminated between different damage classes. Only SI’s with J-M values above a predefined threshold (1.5) were included in the ML models.

2.4.2. Machine Learning Classification Methods

Traditional machine learning algorithms, such as Random Forest, Artificial Neural Networks, and Support Vector Machines, have consistently demonstrated high accuracy in modeling tree-level pest and disease infestations [28,29]. Their ability to automate detection and accurately discriminate subtle spectral changes associated with potential outbreaks makes them invaluable tools for forest insect pest and disease monitoring [28]. These algorithms excel at distinguishing healthy trees from affected ones, even in complex environments [13,30].
Machine learning algorithms were trained with 5LA (2019) in the Las Ovejas plantation, using 80% (n = 620) of a randomly selected holdout sample; final accuracy assessments were determined using the remaining 20% of the data (n = 156). To evaluate ML algorithms performance in estimating damage categories, a comparative analysis was undertaken.

Random Forest

Random Forest is a machine learning algorithm widely employed in forest research due to its robustness in addressing modeling challenges, particularly by mitigating overfitting and reducing model variance [7]. Because of its high accuracy and reliability, the RF classifier has become a key tool in remote sensing applications [29]. Random Forest models were trained using the randomForest package [23] in R software setting the number of trees, ntree, to 500. Internal evaluation through leave-one-out cross-validation assessed the model’s performance across different number of variables to randomly sample as candidates at each split (mtry) values. For each model, the mtry value that minimized the out-of-bag error was selected.

Support Vector Machines

Support Vector Machines are powerful machine learning algorithms adept at both classification and regression. For classification, SVM finds the optimal hyperplane that maximizes the margin between classes, ensuring robust decision boundaries. In regression, SVMs predict continuous values by fitting a function within a specified tolerance zone [13]. Built on the principle of structural risk minimization, SVMs balance model complexity and generalization. By maximizing the margin, they excel in handling high-dimensional data and mitigating overfitting. SVM assume a unique relationship between input features and the target variable, enabling the construction of accurate decision boundaries [13]. They have proven robust to outliers, dimensionality issues, and overfitting [3]. A polynomial function was used as the kernel function; it showed better behavior than lineal and sigmoidal kernel functions. The optimal cost and gamma parameters were tuned using svm and tune.svm functions from the e1071 R package.

Artificial Neural Networks

Artificial Neural Networks offer an approach to model complex, nonlinear phenomena in forest science. Typically composed of input, output, and one or more hidden layers, ANNs mimic human learning by establishing and strengthening connections between input and output data [3,31]. These connections enable data linkage without explicit programming. While various ANN architectures exist, including Radial Basis Function, Elman Recurrent, and Hopfield Neural Networks, Multilayer Perceptron Neural Networks (MLPNNs) with backpropagation have gained widespread popularity [31]. MLPNNs consist of interconnected layers of neurons, transforming input data into meaningful outputs. In this study, the MLPNN architecture comprised 35 input neurons, corresponding to the predictor variables by ‘monmlp” “neuralnet” R packages. The optimal number of hidden neurons was determined through training and validation, with a three-neuron configuration yielding the lowest error.

2.4.3. External Classification Accuracy Analysis

For external validation, the selected ML model was applied on a pixel-by-pixel basis within the Junín plantation. Model predictions were compared with a field assessment conducted by experts. We use the Kappa, overall accuracy, and user accuracy to measure the agreement between ML model predictions and field data. Kappa is a statistical measure used to assess the agreement between two raters or classifications, quantifying the extent to which classifications agree beyond what would be expected by chance [23]. A Kappa value of 1 indicates perfect agreement, while a value of 0 suggests that any agreement is due to chance alone; a negative Kappa value indicates worse-than-chance agreement. When evaluating classification models such as RF, SVM, and ANN, a comprehensive assessment requires going beyond overall accuracy [23]. While overall accuracy is a widely used metric that quantifies the proportion of correct predictions made by the model compared to the total number of observations, it provides only a basic understanding of model performance. User accuracy measures the reliability of the model’s class assignments, where a high user accuracy indicates that when the model assigns an observation to a class, it is typically correct. In contrast, producer accuracy assesses the model’s ability to identify all instances of a given class, with a high producer accuracy suggesting effective detection of most instances. By considering both user and producer accuracy, a more complete idea of the model’s performance can be obtained [23].

2.5. Temporal Variation in Spectral Indices

Changes in vegetation phenology are a key aspect of environmental change due to their sensitivity to climate change and significant impacts on ecosystem functions. Satellite-derived time series of vegetation indices can capture the seasonal dynamics of leaf development and are essential for spatially continuous observations of vegetation phenology, such as in forest plantations [15,32]. A time series analysis of spectral indices was conducted to identify and quantify changes in vegetation health over time induced by Sirex damage. Monthly image acquisitions were used to derive these indices, capturing variations in spectral reflectance across different months. To account for natural seasonal fluctuations in reflectance, seasonal adjustments and normalization techniques were applied, ensuring that the observed changes in spectral indices were primarily attributed to Sirex damage rather than seasonal effects [11]. Radiometric calibration is often essential for long-term image time series to account for sensor degradation. In this study, the short temporal extent minimized the impact of potential radiometric drift, rendering calibration unnecessary [11,15]. Despite variable image availability due to weather and satellite coverage, a fixed number of images was used for each smoothing. Snowy months with missing data were excluded.

3. Results

We generated and analyzed a comprehensive dataset of 35 composite orthomosaics collected annually from 2019 to 2023 in the Las Ovejas plantation (seven per year). Using a ML algorithm, we classified tree sample areas based on Sirex damage levels (i.e., proportion of infected trees) as no damage (without detectable Sirex damage) or Sirex-infested (with different damage levels), based on 32 spectral indices (Appendix B), and we developed a time-series of damage categories using selected indices. Additionally, we evaluated and validated the selected classification models in a new study area (Junín plantation) to assess their generalization and predictive accuracy.

3.1. Variable Importance Selection

According to importance inference and cross-validated permutation importance, the spectral indices revealed distinct patterns in their importance for classification tasks across the labeling approaches of Five-Labels, Four-Labels, Three-Labels, and Two-Labels Sirex-damage categories between the years 2019 and 2023. To confirm and remove non-informative feature selection, we employed the J-M separability index. For each image, we calculated the J-M value for all possible combinations of damage classes. By averaging these values, we obtained a single overall J-M value for each SI. This metric reflects the SI’s ability to differentiate among the three damage classes [7]. SI’s with J-M values exceeding 1.5 were selected as input features for the ML models.
In February 2019, the most influential indices for classifying Sirex damage severity classes were VARI, TGI, NDWI, GNDVI, and the green and NIR bands. Conversely, in February 2023, the indices shifted to VARI, TGI, NDWI, GNDVI, HUE, and the green and red bands.

3.2. Seleted Machine Learning Algorithms

Our results from the confusion matrices for the RF, SVM, and ANN indicate that classification accuracy varies across labeling approaches. A comparison of the overall accuracy for 5LA of the three artificial intelligence algorithms, RF, SVM, and ANN, yielded accuracy scores for the 2023 dataset of 88%, 70%, and 78%, and for the 2019 dataset, the scores were 57%, 47%, and 48%, respectively. In the 5LA (for RF, SVM, and ANN classification), we found that in 2023, the mild damage category exhibits the lowest accuracy compared to other categories, primarily due to the misclassification of moderate and mild cases in 2019. Notably, the no damage category achieves both user and producer accuracies exceeding 92%, 90%, and 89% for RF, SVM, and ANN, respectively. Given its superior performance, the Random Forest algorithm was selected to classify S. noctilio damage levels in this study. The confusion matrix detailing the classification results for 5LA is presented in Table 1.
In the Four- and Three-Labels classification, the incipient category maintains consistent accuracy when comparing the 5LA across both years (Table 1). Additionally, there is a slight improvement in the no damage category’s accuracy, along with a substantial increase in overall model accuracy, indicating that the RF model has effectively enhanced its classification capabilities over time (Table 2). Lastly, the Two-Labels (2LA) classification shows a marked improvement in accuracy across all categories compared to the previous classification approach (Table 3), with user accuracy rising from 87.8% to 98% in 2023 and from 57% to 99% in 2019, underscoring the improved performance of the Random Forest model in distinguishing between Sirex-affected and non-affected trees.

3.3. Random Forest Interpolation

The results obtained from the interpolation process using the selected Random Forest models indicate effective estimations of category values for every pixel within the study zone. The spatial pattern maps generated for the 2019 data (Figure 3) reveal clear delineations in the Five-Labels (5LA), Two-Labels, and Four-Labels classification approaches. In the assessment of interannual predictions, applying the Random Forest model trained on the 2019 labels to the 2023 plot raster dataset demonstrated reliable classification, particularly with the selected 4LA and 2LA classifiers (Figure 4). This approach enabled the classification into a severe category, which was absent in the 2023 ground truth, resulting in a high percentage of agreement between the predicted and observed field values. Figure 4 illustrates the spatial pattern maps generated by the selected Random Forest models for 2023 data, utilizing 5LA, 3LA, and 4LA (without severe class) classifications. Table 4 presents the calculated percentage of pixels covered by each category within each label approach. Additionally, the RF models’ predictions were validated in the Junín plantation (n = 140 plots) against expert-assigned labels, further supporting the robustness of the model’s predictions.

3.4. RF Interannual Extrapolation

To assess the quality of interannual predictions, a Random Forest model trained on 2019 labels was applied to the 2023 plot raster dataset (Table 5 and Table 6). Due to its superior model fit, a Two-Labels approach and a Four-Labels classifier were chosen (4LA enables classification into a severe class, which was absent from the 2023 ground truth). The evaluation was conducted by calculating the percentage of agreement between the predicted and observed field values.

3.5. Extrapolation of RF to Another Field of Study

The validation of the Random Forest 2023 model predictions in the Junín plantation (n = 140 plots) shows a strong correlation with the labels assigned by a field expert. The results indicated a high level of accuracy in the RF models for identifying tree attack caused by S. noctilio in nearby areas (Table 7).

3.6. Damage Categories Based on Spectral Signatures

The select spectral indices reveal distinct patterns in density distributions across Sirex-damage categories within the study area for February 2019 and February 2023 (Figure 5 and Figure 6). These density curves represent variations in index values across different levels of damage, from no damage to severe. The separation observed between the density curves for each damage level suggests that these indices can effectively differentiate between varying intensities of Sirex damage. However, some overlap between curves indicates potential challenges in discriminating certain damage categories. Additionally, the shape of these density curves reveals patterns in the distribution of index values; for instance, more symmetrical curves with lower variance may indicate a more consistent vegetation response within a specific damage category.

3.7. Temporal Variations in the Selected IS’s

Figure 7 illustrates the temporal trends of 2019 and 2023 spectral indices (VARI, TGI, NDWI, GNDVI, and HUE) and bands (green, red, and NIR) within the study area, categorized differently based on the Four-Labels Approach (incipient, moderate, no damage, and severe). The central tendency (or mean) of an index over time for a specific damage category is presented, and the shaded areas indicate the confidence interval of this trend, that is, the range of values within which the true value is expected to fall with a certain level of probability. Trend lines allow us to visualize how the indices evolve over time for each damage category [11]. This is useful for identifying seasonal patterns, long-term trends, or abrupt changes associated with specific events. The confidence intervals provide a measure of the uncertainty associated with the trend estimates. Narrower confidence intervals indicate a higher precision in the estimate, while wider intervals suggest greater variability in the data. By comparing trends and confidence intervals across different damage categories, we can assess the discriminatory power of the indices used to differentiate between various levels of pest infestation. Although some indices exhibit superior performance, the selection carried out by the RF algorithm ensures their complementarity. Changes in the slope or amplitude of the trend lines may indicate changes in tree health or environmental conditions. In this study, climatic variables that could affect these trends were not considered.

4. Discussion

Early detection and damage monitoring of forest pests are crucial for forest health programs [33,34]. The development of tools that improve the spatial and temporal quantification capabilities of trees affected by pests, with sensitive methods that identify their physiological changes, allows for the obtaining of data to allocate resources and implement targeted control measures more effectively. Furthermore, the applicability of these approaches to the management of plantations over large areas, integrated with other data generated through methods such as aerial monitoring, can provide valuable data in regional surveys for both companies and public forest health agencies. Accurate and timely forest health monitoring is indispensable for mitigating forest degradation and enhancing ecosystem resilience [7]. By detecting and quantifying the extent and severity of pest and pathogen infestations, forest managers can implement targeted interventions to minimize damage and reduce the risk of future outbreaks. Additionally, such monitoring enables the assessment of forest vulnerability and the evaluation of the efficacy of management strategies [7]. Our study demonstrates the effectiveness of integrating remote sensing and machine learning to detect and assess damage caused by S. noctilio in forest plantations, both at stand and large scales and over time. Similarly to other research [11], we identified the most effective spectral bands and spectral indices from Planetscope data to differentiate between various stages of S. noctilio infestation.
SI’s have proven to be valuable tools for monitoring plant health and stress. By mitigating the influence of environmental factors, such as geometry, background, and lighting conditions, SI’s enable accurate and reliable large-scale assessments of vegetation health [3]. This study compared the performance of RF, ANN, and SVM classifiers on equivalent training and test datasets of Sirex damage levels. In their work with Sirex and ML algorithms, Abdel-Rahman et al. (2014) [13] determined that RF and SVM were particularly effective due to their inherent ability to handle multidimensional and noisy hyperspectral data [13,35,36]. Their robustness to outliers and overfitting further enhances their suitability for this classification task [36].
To characterize spectral signatures associated with varying levels of S. noctilio damage in this study, we selected Random Forest classification models. RF demonstrated a higher accuracy in differentiating between no damage, incipient, moderate, and severe damaged categories compared to SVM and ANN. This finding is consistent with the results reported by Abdel-Rahman et al. (2014) [13] and Long et al. (2023) [35] regarding the superior classification performance of RF.The high accuracy of these models, validated across a new plantation area, provides a powerful tool for early detection and monitoring of spatio-temporal patterns of S. noctilio damage at both stand and large-area scales. The results indicate that the 2LA, focusing on no damage/affected tree detection, outperformed 5LA, 3LA, and 4LA, which aimed to classify specific Sirex damage categories. Our 2LA findings, consistent with the reserch of Gao et al. (2022) [3] on the red turpentine beetle, Dendroctonus valens, in Pinus trees, demonstrate the potential of SI’s to distinguish between early-infested and healthy trees (overal accuracy 89–98% and 84%, respectively).The RF model results obtained by [13,14] in their research of Sirex in South African pine plantations with three infection scales were comparable to our finding (2023 overall accuracy 79–81%). The high level of disagreement in 2019 between the 3LA, 4LA, and 5LA, as shown in the confusion matrices, highlights the challenges of accurately classifying pixels that belong to multiple classes, especially when considering spatial patterns in the observed and predicted data. Additionally, the model’s accuracy improved for 2023 data compared to 2019, suggesting better performance in detecting established infestations.
However, our study highlights the need for further refinement of the model to improve the classification of different damage severities. Time series analysis was found to be crucial in differentiating damage-related changes from natural variations [35], and prediction at a regional scale can be realized by using long time series remote sensing data [35]. Host water loss, although not always visually apparent, is a critical indicator of stress; remote sensing studies have revealed significant differences in spectral responses between canopy discoloration, characterized by chlorosis or reddening and insect-induced defoliation [12].
Our analysis revealed that specific spectral indices, such as VARI, GNDVI, TGI, and NDWI, were crucial for discriminating among the different S. noctilio damage categories. These indices serve as vital indicators of vegetation health, allowing us to detect subtle changes that may precede visible symptoms on trees. Spectral indices demonstrated sensitivity to changes in tree mortality caused by S. noctilio, capturing various aspects of plant stress and enabling a more comprehensive damage assessment. Despite the overall trends, the significant variability observed over time in index values can be attributed to fluctuations in S. noctilio population dynamics, climatic factors, and other uncontrolled variables. Recently, Villaverde (2020) [12] identified a significant relationship between Sirex infestation levels and various spectral indices within the same geographic region. By leveraging high-resolution satellite imagery from Landsat 8 and Sentinel-2, the study analyzed correlations between the Normalized Difference Vegetation Index (NDVI), Normalized Difference Water Index (NDWI), Modified Soil Index (MSI), and Soil-Adjusted Vegetation Index (SAVI) and the extent of insect-caused damage. According to Villaverde (2020) [12], the combination of different spectral indices provides a more comprehensive and robust understanding of tree stand status, offering deeper insight into the mechanisms underlying Sirex-induced damage. In our study, indices such as GNDVI and HUE are useful for detecting early signs of Sirex damage before symptoms become visible in aerial imagery or field surveys. Additionally, indices like VARI, GNDVI, NIR, and NDWI have proven to be effective indicators of overall tree health, helping to identify areas at higher risk. The decline in VARI values in Sirex-affected areas highlights its utility for monitoring damage progression.
GNDVI, with its strong correlation to Sirex damage, is particularly sensitive to changes in chlorophyll content, making it a valuable tool for assessing vegetation health. While NDWI shows a less direct relationship with Sirex damage, it provides valuable insights into plant water content, helping to identify water stress. The TGI, initially developed to correlate with chlorophyll content at the canopy level, exhibits its highest sensitivity to changes in Leaf Area Index [36]. This index’s strong correlation with chlorophyll content makes it a useful tool for detecting subtle variations in plant health directly related to green foliage losses caused by Sirex. According to Magalhães et al. (2024) [36], as the canopy becomes denser, indices strongly associated with green reflectance, such as VARI and TGI, accurately separate categories with less damage from those with more damage. Finally, indices such as TGI, NIR, green, red, and HUE offer supplementary information on various aspects of Sirex-induced damage, including reduced biomass, changes in pigment composition, and alterations in leaf structure (additional details can be found in Table 8).
In addition to identifying early signs of S. noctilio damage, our analysis also highlighted how these spectral indices fluctuated over time, reflecting the evolving dynamics of the infestation in pine plantations. Interestingly, we observed that the significance of these indices shifted between 2019 and 2023, likely reflecting changes in the dynamics of the Sirex population and its impact on tree mortality. As the infestation progressed, early signs of damage became increasingly pronounced, underscoring the growing impact of this invasive pest on plantation forest health. Furthermore, while the spectral indices of all damage categories exhibited similar temporal trends, indicating a generalized spectral response of vegetation to the infestation, inter-category variability was observed. Some damage categories showed higher or lower values in specific spectral bands, suggesting differential physiological responses to the Sirex infestation. By incorporating these temporal trends into our classification models, we achieved accurate mapping of damage levels across the study area, enabling visualization of the extent and severity of Sirex damage over time. This enhanced mapping capability is essential for guiding targeted pest management efforts, ensuring that interventions can be implemented promptly and effectively to mitigate further damage to forest resources.
Given that the accuracy and consistency of our approach largely depend on the field data used for training, future research should explore the integration of additional remote sensing data sources, such as hyperspectral imagery or LiDAR, which could provide greater spectral and structural detail to enhance the discrimination of subtle stress symptoms. In our study, we relied on training data obtained from aerial visual surveys of the damage categories generated by S. noctilio over large areas, which may introduce inherent biases. These biases can arise from subjective interpretations of Sirex damage levels, variations in visibility due to environmental conditions, and the limitations of aerial surveys in capturing fine-scale damage details. Additionally, classification results may be influenced by atmospheric conditions, sensor calibration, and variability in vegetation responses to stress factors. However, when evaluating the model’s performance in a new plantation affected by Sirex, where damage levels were quantified in smaller sampling units, its accuracy remained consistent. This suggests that our model can adapt to different assessment contexts and continues to be valuable for detecting Sirex tree mortality.

5. Conclusions

Our study highlights the potential of machine learning models in accurately classifying tree health status, distinguishing between healthy and infested trees, and assessing varying degrees of damage in forests plantation plots. Time series analysis of spectral indices revealed distinct temporal patterns associated with different infestation levels, facilitating the development of early warning systems. The generated maps provide valuable spatial information on infestation distribution and severity. These insights can guide targeted management strategies, such as selective treatments or the removal of infested trees, optimizing resource allocation and minimizing environmental impacts. The spectral indices employed in this study have proven useful. However, the development of more sensitive indices, based on a deep understanding of the spectral and biochemical properties of the plant and pathogen, represents a promising avenue for advancing early detection of S. noctilo infestations through remote sensing.
Although this study underscores the advantages of remote sensing and predictive machine learning models for forest health assessment, certain limitations must be addressed. Factors such as atmospheric variability, vegetation heterogeneity, and data quality remain challenges. Future research should focus on refining the methodology and incorporating additional data sources, including hyperspectral imagery and LiDAR. Such advancements could significantly enhance the accuracy and reliability of this approach, broadening its applicability for large-scale forest management.

Author Contributions

Conceptualization: A.H. and J.V.; Methodology: A.H. and J.V.; Formal Analysis: A.H.; Writing—Original Draft Preparation: A.H. and J.V.; Writing—Review and Editing: A.H. and J.V.; Visualization: J.V. and A.H.; Funding Acquisition: A.H. and J.V. Both authors read and approved the final manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This study was funded by the National Research Institute of Agricultural Research (INIAsupported by grants: INIA Sistema Forestal and INTA PDI074-2023 and PEI033-2023 (Argentina).

Data Availability Statement

The data are available and can be obtained by contacting the author.

Acknowledgments

We would like to thank Axel Von Muller and Santiago Berh for their assistance in acquiring and processing drone-based images of the Junín plantation. We also would like to thank Gonzalo Martinez Crossa for his support and Demian Gomez for his feedback on draft versions of the manuscript.

Conflicts of Interest

The authors declare no competing interests.

Appendix A

Table A1. Baseline Aerial Classification Labels and Combined Labeling Approaches.
Table A1. Baseline Aerial Classification Labels and Combined Labeling Approaches.
Original LabelsCombined Labels Approach
Original Labels (OLA)Five-Labels Approach
(5LA)
Four-Labels Approach (4LA)Three-Labels Approach (3LA)Two-Labels Approach (2LA)
No damage (0%)No damageNo damageNo damageNo damage
Incipient (<5%)IncipientIncipientIncipientAffected
Mild (5–10%)MildModerateModerate
Moderate (10–50%)Moderate
Severe (50–70%)SevereSevere
Extreme (>70%)

Appendix B

Table A2. Spectral indices used as input variables in machine learning algorithms.
Table A2. Spectral indices used as input variables in machine learning algorithms.
IndexDescription
AVIAdvanced vegetation index
BIBare soil index
CIColoration Index
CLGGreen-band Chlorophyll Index
CTVICorrected Transformed Vegetation Index
DVI *Difference Vegetation Index
EVIEnhanced Vegetation Index
EVI2Two-band Enhanced Vegetation Index
GEMI *Global Environmental Monitoring Index
GLIGreen leaf index Vis
GNDVI *Green Normalized Difference Vegetation Index
GRVIGreen-Red Vegetation Index
HI *Hue Index
HUE *Overall Hue Index
MSAVI *Modified Soil Adjusted Vegetation Index
MSAVI2Modified Soil Adjusted Vegetation Index 2
NDVINormalized Difference Vegetation Index
NDWI *Normalized Difference Water Index
NGRDINormalized green red difference index
NRVINormalized Ratio Vegetation Index
RIRedness Index
RVIRatio Vegetation Index
SAT *Saturation Index
SAVI *Soil Adjusted Vegetation Index
SCISoil Color Index
SHP *Shape index
SIShadow index
SRSimple Ratio Vegetation Index
TGI *The triangular greenness index
TTVIThiam’s Transformed Vegetation Index
TVITransformed Vegetation Index
VARI *Visible Atmospherically Resistant Index
* SI’s selected after PIMP.

References

  1. Lewis, S.L.; Edwards, D.P.; Galbraith, D. Increasing Human Dominance of Tropical Forests. Science 2015, 349, 827–832. [Google Scholar] [CrossRef] [PubMed]
  2. Buzatu, A.; Nețoiu, C.; Apostol, B.; Badea, O. The Use of Remote Sensing Indices Derived from Sentinel 2 Satellite Images for the Defoliation Damage Assessment of Lymantria Dispar. Ann. For. Res. 2023, 66, 123–138. [Google Scholar] [CrossRef]
  3. Gao, B.; Yu, L.; Ren, L.; Zhan, Z.; Luo, Y. Early Detection of Dendroctonus valens Infestation with Machine Learning Algorithms Based on Hyperspectral Reflectance. Remote Sens. 2022, 14, 1373. [Google Scholar] [CrossRef]
  4. Calamita, F.; Imran, H.A.; Vescovo, L.; Mekhalfi, M.L.; La Porta, N. Early Identification of Root Rot Disease by Using Hyperspectral Reflectance: The Case of Pathosystem Grapevine/Armillaria. Remote Sens. 2021, 13, 2436. [Google Scholar] [CrossRef]
  5. Sumiya, N.N.; Hasan, M.U.; Nusrat, M.; Hossain, M.S. A Review on Remote Sensing Based Forest Vegetation Health Assessment: Bangladesh Perspective. Dhaka Univ. J. Earth Environ. Sci. 2024, 12, 107–121. [Google Scholar] [CrossRef]
  6. Bozzini, A.; Francini, S.; Chirici, G.; Battisti, A.; Faccoli, M. Spruce Bark Beetle Outbreak Prediction through Automatic Classification of Sentinel-2 Imagery. Forests 2023, 14, 1116. [Google Scholar] [CrossRef]
  7. Sebastiani, A.; Bertozzi, M.; Vannini, A.; Morales-Rodriguez, C.; Calfapietra, C.; Vaglio Laurin, G. Monitoring Ink Disease Epidemics in Chestnut and Cork Oak Forests in Central Italy with Remote Sensing. Remote Sens. Appl. 2024, 36, 101329. [Google Scholar] [CrossRef]
  8. Trubin, A.; Kozhoridze, G.; Zabihi, K.; Modlinger, R.; Singh, V.V.; Surový, P.; Jakuš, R. Detection of Green Attack and Bark Beetle Susceptibility in Norway Spruce: Utilizing PlanetScope Multispectral Imagery for Tri-Stage Spectral Separability Analysis. For. Ecol. Manag. 2024, 560, 121838. [Google Scholar] [CrossRef]
  9. Lausch, A.; Erasmi, S.; King, D.J.; Magdon, P.; Heurich, M. Understanding Forest Health with Remote Sensing-Part I—A Review of Spectral Traits, Processes and Remote-Sensing Characteristics. Remote Sens. 2016, 8, 1029. [Google Scholar] [CrossRef]
  10. Lantschner, M.V.; Aukema, B.H.; Corley, J.C. Droughts Drive Outbreak Dynamics of an Invasive Forest Insect on an Exotic Host. For. Ecol. Manag. 2019, 433, 762–770. [Google Scholar] [CrossRef]
  11. Dalponte, M.; Cetto, R.; Marinelli, D.; Andreatta, D.; Salvadori, C.; Pirotti, F.; Frizzera, L.; Gianelle, D. Spectral Separability of Bark Beetle Infestation Stages: A Single-Tree Time-Series Analysis Using Planet Imagery. Ecol. Indic. 2023, 153, 110349. [Google Scholar] [CrossRef]
  12. Villaverde Fontana, R.A. Análisis de Daños de Sirex Noctilio en Plantaciones de Pinus spp. en la Patagonia Argentina Mediante Sensores Multiespectrales; Universidad de Córdoba: Córdoba, Spain, 2020. [Google Scholar]
  13. Abdel-Rahman, E.M.; Mutanga, O.; Adam, E.; Ismail, R. Detecting Sirex noctilio Grey-Attacked and Lightning-Struck Pine Trees Using Airborne Hyperspectral Data, Random Forest and Support Vector Machines Classifiers. ISPRS J. Photogramm. Remote Sens. 2014, 88, 48–59. [Google Scholar] [CrossRef]
  14. Xulu, S.; Mbatha, N.; Peerbhay, K.; Gebreslasie, M.; Agjee, N. Comparison of Different Spectral Indices to Differentiate the Impact of Insect Attack on Planted Forest Stands. Remote Sens. Appl. 2024, 33, 101087. [Google Scholar] [CrossRef]
  15. Zhou, X.; Zhao, C.; Li, N.; Cui, Y.; Yi, X.; Liu, M. Spatiotemporal Variation of Snow and Its Response to Climate Change in Northeast China. Plateau Meteorol. 2021, 40, 875–886. [Google Scholar] [CrossRef]
  16. Carnegie, A.J.; Matsuki, M.; Haugen, D.A.; Hurley, B.P.; Ahumada, R.; Klasmer, P.; Sun, J.; Iede, E.T. Predicting the Potential Distribution of Sirex noctilio (Hymenoptera: Siricidae), a Significant Exotic Pest of Pinus Plantations. Ann. For. Sci. 2006, 63, 119–128. [Google Scholar] [CrossRef]
  17. Villacide, J.M.; Corley, J.C. Ecology of the Woodwasp Sirex noctilio: Tackling the Challenge of Successful Pest Management. Int. J. Pest. Manag. 2012, 58, 249–256. [Google Scholar] [CrossRef]
  18. Corley, J.C.; Lantschner, M.V.; Martínez, A.S.; Fischbein, D.; Villacide, J.M. Management of Sirex noctilio Populations in Exotic Pine Plantations: Critical Issues Explaining Invasion Success and Damage Levels in South America. J. Pest. Sci. 2019, 92, 131–142. [Google Scholar] [CrossRef]
  19. Martínez, A.S.; Villacide, J.; Fernández Ajó, A.A.; Martinson, S.J.; Corley, J.C. Sirex noctilio Flight Behavior: Toward Improving Current Monitoring Techniques. Entomol. Exp. Appl. 2014, 152, 135–140. [Google Scholar] [CrossRef]
  20. QGIS Development Team. Geographic Information System. 2019. Available online: https://www.scirp.org/reference/referencespapers?referenceid=2631129 (accessed on 24 April 2024).
  21. Planet Labs Precision Ag Insights from Frequent Imaging Smarter Farming Throughout the Season. 2018. Available online: https://learn.planet.com/rs/997-CHH-265/images/planet-precision-ag-ebook-2018-smarter-farming.pdf (accessed on 24 April 2024).
  22. Exelis VIS. ENVI 5.3; Exelis VIS: Boulder, CO, USA, 2015. [Google Scholar]
  23. R Core Development Team. A Language and Environment for Statistical Computing; R Core Development Team: Vienna, Austria, 2013; Volume 1. [Google Scholar]
  24. Roy, P.S.; Sharma, K.P.; Jain, A. Stratification of Density in Dry Deciduous Forest Using Satellite Remote Sensing Digital Data—An Approach Based on Spectral Indices. J. Biosci. 1996, 21, 723–734. [Google Scholar] [CrossRef]
  25. Mondal, I.; Thakur, S.; Juliev, M.; Kumar De, T. Comparative Analysis of Forest Canopy Mapping Methods for the Sundarban Biosphere Reserve, West Bengal, India. Environ. Dev. Sustain. 2021, 23, 15157–15182. [Google Scholar] [CrossRef]
  26. Altmann, A.; Toloşi, L.; Sander, O.; Lengauer, T. Permutation Importance: A Corrected Feature Importance Measure. Bioinformatics 2010, 26, 1340–1347. [Google Scholar] [CrossRef] [PubMed]
  27. Romero Ramirez, F.J.; Navarro-Cerrillo, R.M.; Varo-Martínez, M.Á.; Quero, J.L.; Doerr, S.; Hernández-Clemente, R. Determination of Forest Fuels Characteristics in Mortality-Affected Pinus Forests Using Integrated Hyperspectral and ALS Data. Int. J. Appl. Earth Obs. Geoinf. 2018, 68, 157–167. [Google Scholar] [CrossRef]
  28. Mngadi, M.; Germishuizen, I.; Mutanga, O.; Naicker, R.; Maes, W.H.; Odebiri, O.; Schroder, M. A Systematic Review of the Application of Remote Sensing Technologies in Mapping Forest Insect Pests and Diseases at a Tree-Level. Remote Sens. Appl. 2024, 36, 101341. [Google Scholar] [CrossRef]
  29. Miraki, M.; Sohrabi, H.; Fatehi, P.; Kneubuehler, M. Coupling UAV and Satellite Data for Tree Species Identification to Map the Distribution of Caspian Poplar. Landsc. Ecol. 2024, 39, 30. [Google Scholar] [CrossRef]
  30. Zhang, J.; Cong, S.; Zhang, G.; Ma, Y.; Zhang, Y.; Huang, J. Detecting Pest-Infested Forest Damage through Multispectral Satellite Imagery and Improved UNet++. Sensors 2022, 22, 7440. [Google Scholar] [CrossRef]
  31. Matiza, C.; Mutanga, O.; Odindi, J.; Mngadi, M. The Utility of Planetscope Spectral Data in Quantifying Above-Ground Carbon Stock in an Urban Reforested Landscape. Ecol. Inform. 2024, 80, 102472. [Google Scholar] [CrossRef]
  32. Ding, C.; Huang, W.; Meng, Y.; Zhang, B. Satellite-Observed Spatio-Temporal Variation in Spring Leaf Phenology of Subtropical Forests across the Nanling Mountains in Southern China over 1999–2019. Forests 2022, 13, 1486. [Google Scholar] [CrossRef]
  33. Allison, J.D.; Paine, T.D.; Slippers, B.; Wingfield, M.J. (Eds.) Forest Entomology and Pathology: Volume 1: Entomology; Springer Nature: Cham, Switzerland, 2023; 810p. [Google Scholar] [CrossRef]
  34. Villacide, J.M.; Gomez, D.F.; Perez, C.A.; Corley, J.C.; Ahumada, R.; Rodrigues Barbosa, L.; Furtado, E.L.; González, A.; Ramirez, N.; Balmelli, G.; et al. Forest Health in the Southern Cone of America: State of the Art and Perspectives on Regional Efforts. Forests 2023, 14, 756. [Google Scholar] [CrossRef]
  35. Long, L.; Chen, Y.; Song, S.; Zhang, X.; Jia, X.; Lu, Y.; Liu, G. Remote Sensing Monitoring of Pine Wilt Disease Based on Time-Series Remote Sensing Index. Remote Sens. 2023, 15, 360. [Google Scholar] [CrossRef]
  36. de Magalhães, L.P.; Rossi, F. Use of Indices in RGB and Random Forest Regression to Measure the Leaf Area Index in Maize. Agronomy 2024, 14, 750. [Google Scholar] [CrossRef]
Figure 1. Location of the Las Ovejas study sites and the Junín validation zones in the Neuquén, Argentina. Pine plantations are in green.
Figure 1. Location of the Las Ovejas study sites and the Junín validation zones in the Neuquén, Argentina. Pine plantations are in green.
Remotesensing 17 00537 g001
Figure 2. Mosaic images showing a plot from study area: (right) an RGB orthomosaic; (left) AVI + SI + BI combination symbology; pixels lacking color have values of zero or NaN.
Figure 2. Mosaic images showing a plot from study area: (right) an RGB orthomosaic; (left) AVI + SI + BI combination symbology; pixels lacking color have values of zero or NaN.
Remotesensing 17 00537 g002
Figure 3. Model Predictions of the Five-Labels, Four-Labels, and Two-Labels approaches (5LA, 4LA, and 2LA, respectively) in the entire area for February 2019.
Figure 3. Model Predictions of the Five-Labels, Four-Labels, and Two-Labels approaches (5LA, 4LA, and 2LA, respectively) in the entire area for February 2019.
Remotesensing 17 00537 g003
Figure 4. Model Predictions of the Five-Labels, Three-Labels, and Two-Labels approaches (5LA, 3LA, and 2LA, respectively) in the entire area for February 2023.
Figure 4. Model Predictions of the Five-Labels, Three-Labels, and Two-Labels approaches (5LA, 3LA, and 2LA, respectively) in the entire area for February 2023.
Remotesensing 17 00537 g004
Figure 5. Density distribution for each band and spectral index according to Sirex noctilio damage category for 2023.
Figure 5. Density distribution for each band and spectral index according to Sirex noctilio damage category for 2023.
Remotesensing 17 00537 g005
Figure 6. Density distribution for each band and spectral index according to Sirex noctilio damage category for 2019.
Figure 6. Density distribution for each band and spectral index according to Sirex noctilio damage category for 2019.
Remotesensing 17 00537 g006
Figure 7. Temporal trend analysis and characterization of spectral indices over the 2019–2023 period.
Figure 7. Temporal trend analysis and characterization of spectral indices over the 2019–2023 period.
Remotesensing 17 00537 g007
Table 1. Five-Labels approach RF-based classification results of 2023 and 2019.
Table 1. Five-Labels approach RF-based classification results of 2023 and 2019.
2023 IncipientMildModerateSevereNo DamageUser’s AccuracyOverall AccuracyKappa
Incipient11,23932478703400.8987.9%0.79
Mild185680104060.70
Moderate6559648670230.86
Severe000000
No damage2837017780.98
Producer’s accuracy0.930.60.8400.83
2019 IncipientMildModerateSevereNo DamageUser’s AccuracyOverall AccuracyKappa
Incipient1803100435549160.53
Mild186961104652517480.56
Moderate920132810420.5157%0.42
Severe39315734182706150.53
No damage03009311.00
Producer’s accuracy0.440.690.260.460.92
Table 2. Three- and Four-Labels approach RF-based classification results of 2023 and 2019.
Table 2. Three- and Four-Labels approach RF-based classification results of 2023 and 2019.
2023 ModerateIncipientNo DamageUser’s AccuracyOverall AccuracyKappa
Moderate5946898280.8789%0.81
Incipient913111813580.90
No damage92817610.98
Producer’s accuracy0.870.920.82
2019 ModerateIncipientNo DamageSevereUser’s AccuracyOverall AccuracyKappa
Moderate738119923129760.6059%0.45
Incipient1139155626080.47
No damage0097201.00
Severe1727326723820.54
Producer’s accuracy0.720.400.960.40
Table 3. Two-Labels approach RF-based classification results of 2023 and 2019.
Table 3. Two-Labels approach RF-based classification results of 2023 and 2019.
2023 AffectedNo DamageUser’s AccuracyOverall AccuracyKappa
Affected18,9504040.9898%0.89
No damage2617430.99
Producer’s accuracy0.990.81
2019 AffectedNo DamageUser’s AccuracyOverall AccuracyKappa
Affected20,08891.0099%0.99
No damage010031.00
Producer’s accuracy1.000.99
Table 5. Four-Labels approach RF-based classification results for 2023 based on 2019 RF model.
Table 5. Four-Labels approach RF-based classification results for 2023 based on 2019 RF model.
IncipientMildModerateNo Damage User’s Accuracy
Incipient25075185951163.82
Mild103192136251.89
Moderate2162963132752.45
No damage 100314542262.99
Producer’s accuracy85.769.838.331.9
Table 6. Two-Labels approach RF-based classification results for 2023 based on 2019 RF model.
Table 6. Two-Labels approach RF-based classification results for 2023 based on 2019 RF model.
AffectedNo DamageUser’s Accuracy
Affected1714105262.0
No damage25034157.7
Producer’s accuracy87.324.5
Table 7. Three-Labels approach RF-based (2023 RF model) classification results of the Junín plantation.
Table 7. Three-Labels approach RF-based (2023 RF model) classification results of the Junín plantation.
MildModerateNo DamageUser’s Accuracy
Mild254183.33
Moderate1044081.48
No damage 8123563.64
Producer’s accuracy58.173.397.2
Table 4. Distribution of pixel classification percentages by category, approach (Five-Labels, Four-Labels, Three-Labels and Two-Labels approaches: 5LA, 4LA, 3LA, and 2LA, respectively) and year.
Table 4. Distribution of pixel classification percentages by category, approach (Five-Labels, Four-Labels, Three-Labels and Two-Labels approaches: 5LA, 4LA, 3LA, and 2LA, respectively) and year.
YearApproachLabelsPercentage
20192LAAffected99.71
No damage0.29
4LAIncipient17.2
Moderate62.2
Severe20.1
No damage0.5
5LAIncipient17.7
Mild55.2
Moderate3.1
Severe23.5
No damage0.5
20232LAAffected99.68
No damage0.32
3LAIncipient58.13
Moderate41.35
No damage0.52
5LAIncipient60.11
Mild7.84
Moderate31.52
No damage0.52
Table 8. Selected Remote Sensing indices for assessing vegetation damage.
Table 8. Selected Remote Sensing indices for assessing vegetation damage.
Indices/BandConcise Index DescriptionObserved ResultsInterpretation
VARI = (Green − Red)/(Green + Red − Blue) Visible Atmospherically Resistant Index highlights vegetation in the visible spectrum, mitigating illumination differences and atmospheric effects. It is ideal for detecting water stress, pests, and damage.Evidence a marked seasonal pattern in all categories, with annual peaks and valleys. A clear differentiation is observed between the categories, with “No damage” being the one with the highest value and the least variability, while “Severe” presents the lowest value and the greatest dispersion. Presents an annual cyclical behavior, which suggests a strong influence of climatic factors. Higher values indicate healthy vegetation; lower values suggest stress or damage. Useful for identifying areas at risk of damage
GNDVI = (Green − Red)/(Green + Red)The Green Normalized Difference Vegetation Index is a measure of vegetation greenness and vigor. Higher values indicate a higher concentration of chlorophyll, suggesting healthy and vigorous vegetationAlthough there is a seasonal pattern, a general downward trend in GNDVI values is evident for all categories throughout the study period. This may suggest an overall decline in tree health or alterations in environmental conditionsAlthough exhibiting seasonal patterns, the GNDVI values for all categories demonstrate a general declining trend throughout the study period. This may suggest an overall deterioration of tree health or alterations in environmental factors
NDWI = (Green − NIR)/(Green + NIR)The Normalized Difference Water Index is a measure of water content in vegetation. Lower values indicate lower water content, suggesting water stress.Provides a comprehensive view of temporal variations in tree water content, facilitating the identification of seasonal patterns and categorical differences. No clear trend of decreasing values is observed as damage increases. It should be combined with other indices to obtain a more comprehensive view of vegetation health. Sensitive to changes in water content
TGI = (Green − Red)/(Green + Red + Blue)The Triangular Vegetation Index is a measure of vegetation greenness, using red, green, and blue bands. Higher values indicate better plant health and higher chlorophyll contentA clear and consistent trend related to increasing damage severity is not evident in the data. There is significant interannual variability in the observed values.The TGI serves as a proxy for tree health. Elevated TGI values are indicative of healthy, robust trees, whereas reduced values signal stress and potential decline. Trees experiencing mild to moderate stress exhibit intermediate TGI values.
NIRThe near-infrared region is sensitive to water and biomass content.A decrease in reflectance is expected as damage severity increases NIR serves as a proxy for tree health. Elevated NIR values are indicative of healthy, robust trees, whereas reduced values signal stress and potential decline. Trees experiencing mild to moderate stress exhibit intermediate NIR values.
Green bandGreen spectral band. Utilized for quantifying and assessing vegetation health.Decrease in values as damage increases.It shows lower chlorophyll content in damaged trees; with considerable variability over time
Red bandRed spectral band. Crucial for vegetation analysis due to chlorophyll’s strong absorption in this region.These bands allow differentiation between elements with diverse levels of affliction or state. Elevated metrics are indicative of a favorable condition, while reduced metrics suggest progressive degradationExhibits variability across different categories. “No damage” category displays the highest values, indicative of an optimal condition. In ‘Incipient’, ‘Moderate’, and ‘Severe’ categories, values decrease, suggesting progressive degradation. Notwithstanding these disparities, overall trends within each category remain relatively stable over time
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Hirigoyen, A.; Villacide, J. Assessing Sirex noctilio Fabricius (Hymenoptera: Siricidae) Damage in Pine Plantations Using Remote Sensing and Predictive Machine Learning Models. Remote Sens. 2025, 17, 537. https://doi.org/10.3390/rs17030537

AMA Style

Hirigoyen A, Villacide J. Assessing Sirex noctilio Fabricius (Hymenoptera: Siricidae) Damage in Pine Plantations Using Remote Sensing and Predictive Machine Learning Models. Remote Sensing. 2025; 17(3):537. https://doi.org/10.3390/rs17030537

Chicago/Turabian Style

Hirigoyen, Andrés, and José Villacide. 2025. "Assessing Sirex noctilio Fabricius (Hymenoptera: Siricidae) Damage in Pine Plantations Using Remote Sensing and Predictive Machine Learning Models" Remote Sensing 17, no. 3: 537. https://doi.org/10.3390/rs17030537

APA Style

Hirigoyen, A., & Villacide, J. (2025). Assessing Sirex noctilio Fabricius (Hymenoptera: Siricidae) Damage in Pine Plantations Using Remote Sensing and Predictive Machine Learning Models. Remote Sensing, 17(3), 537. https://doi.org/10.3390/rs17030537

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop