1. Introduction
Remote sensing (RS) is an essential tool in precision agriculture, with RGB and multispectral images widely used to estimate agricultural productivity and adjust management strategies, especially in intercropping systems [
1,
2]. The evolution of RS techniques has brought increasing relevance to Unmanned Aerial Vehicles (UAVs) equipped with multispectral cameras, which provide greater spatial resolution and data acquisition frequency compared with satellite images [
3,
4,
5,
6]. Silva et al. [
7] highlighted the potential of biophysical indices derived from RGB images captured by UAVs for monitoring the vegetative state of forage cactus, as well as for detecting bare soils and salinized areas, factors that contribute to more sustainable agricultural practices [
8,
9].
Biophysical indices derived from UAV images offer robust support for agricultural monitoring and environmental diagnostics. Among the vegetation indices, the Normalized Difference Vegetation Index (NDVI) is one of the most widely used due to its sensitivity to photosynthetically active biomass, exploiting the contrast between red and near-infrared reflectance [
10]. The Soil-Adjusted Vegetation Index (SAVI) and the Enhanced Vegetation Index (EVI) are also commonly applied to minimize the influence of atmospheric and soil background conditions, respectively, and these adjustments are particularly relevant in canopies with sparse or heterogeneous vegetation, such as those found in forage cactus systems [
11,
12,
13]. In addition, the Modified Green–Red Vegetation Index (MGRVI) and the Visible Atmospheric Resistance Index (VARI), which use only the visible spectral bands, are effective under variable illumination and in conditions where near-infrared (NIR) data are limited [
14].
To assess salinity, the Normalized Difference Salinity Index (NDSI) was adopted for its ability to highlight spectral differences caused by salt crusts or saline moisture on the soil surface, using the red and NIR bands [
15]. Studies such as [
16] have demonstrated strong correlations between NDSI and soil salinity in irrigated areas. The Generalized Difference Vegetation Index (GDVI) was also considered due to its flexibility in different spectral configurations and its accuracy in estimating vegetation density [
17]. Collectively, these indices provide a comprehensive spectral framework that enhances spatial modeling capabilities when integrated with geostatistical techniques.
Recent studies reinforce the value of RGB-based indices in estimating parameters such as leaf area index (LAI) and chlorophyll content via machine learning [
18,
19,
20]. Santos et al. [
18] and [
12,
21] demonstrated the relevance of these indices for productive and environmental monitoring, while [
2] showed that multispectral images, when including bands beyond the visible spectrum, outperform RGB images in the classification of forage cactus using artificial intelligence algorithms. According to [
12], the integration of biophysical indices from visible and non-visible spectral bands improves the mapping of field conditions. Feng et al. [
22] also reported a higher model accuracy with multispectral images compared with RGB-only data.
These indices allow the indirect estimation of soil attributes through regression models with high coefficients of determination [
8] and assist in spatialization by interpolation methods, such as kriging. Among these methods, cokriging stands out for incorporating secondary variables (e.g., spectral indices) correlated with the primary variable, thus reducing the uncertainty of the estimate. Studies have shown that cokriging produces smaller prediction errors than conventional techniques, such as ordinary kriging and inverse distance weighting (IDW) [
23,
24,
25].
At the same time, the Brazilian semiarid region presents unique environmental challenges due to its unpredictable precipitation patterns, shallow and compactable soils, and frequent salinization [
26]. Marengo et al. [
27] describes it as the wettest semiarid region in the world, with annual precipitation of up to 800 mm, but with high evapotranspiration (~2000 mm/year), which restricts groundwater recharge and compromises water security. The limited sanitation infrastructure, present in only 243 of the 1135 municipalities [
28], contributes to the risks of contamination and reduced water quality [
29]. Initiatives that use treated domestic wastewater for irrigation have emerged as a sustainable alternative. However, their reach is still limited, especially in rural areas [
30]. Although positive results have been observed in cactus pear irrigated with treated wastewater [
31], there are still concerns about environmental risks, such as microbial contamination and, especially, soil salinization [
32]. On the other hand, the controlled reuse of wastewater can increase soil organic matter content [
33].
Forage cactus is a strategic plant for semiarid regions, characterized by intense radiation, water scarcity, and high temperatures, affecting the availability of forage food [
34]. The productivity of this crop is strongly correlated with the physical–chemical quality of the soil, with organic carbon content (TOC) being one of the main indicators of fertility in semiarid environments [
35]. TOC contributes to greater water retention, aggregate stability, and nutrient cycling, both of which are essential for the development of crops under a water deficit. These benefits are enhanced by the use of forage species adapted to semiarid ecosystems, which generally accumulate more carbon when compared with agricultural crops due to the high density of their fasciculate root system and their ability to use small amounts of water available in the soil [
36].
Soil electrical conductivity (EC), which is often high in areas irrigated with reclaimed water, acts as a salinization agent, limiting water absorption by agricultural crops, affecting their productivity. In this context, the spatial mapping of these attributes, especially with the support of remote sensing techniques, allows more effective interventions in irrigation management and the use of conservation practices, providing increased productive resilience of forage cactus. Cheng et al. [
37] found that remote sensing can be used to map crop water productivity, leading to significant water savings while maintaining or increasing crop yields. Pizarro et al. [
38] demonstrated that multispectral indices derived from UAVs can improve the prediction of soil properties, contributing to the more efficient and effective management of agricultural crops.
Despite the advances in RS and geotechnologies, there is still a scientific gap regarding the integration of RGB and multispectral indices derived from UAVs as auxiliary variables in cokriging models aimed at estimating soil attributes in areas irrigated with reused water. This approach remains little explored in forage cactus cultivation, especially in the Brazilian semiarid region. Therefore, this study aimed to evaluate the effectiveness of biophysical indices in reducing the uncertainties associated with the geostatistical mapping of organic matter and electrical conductivity in a forage cactus production area irrigated with reused water, in the semiarid region of Pernambuco, Brazil. In addition, we sought to investigate the sensitivity of cokriging to the spatial resolution of secondary variables, comparing different interpolation scales. This research offers a practical and scientific contribution to precision agriculture by proposing an innovative, relatively low-cost approach based on the integration of spectral data from UAVs to improve soil monitoring and increase the efficiency of wastewater irrigation in regions of high environmental vulnerability.
3. Results and Discussion
3.1. Descriptive Analysis of Soil Attributes and Spectral Indices
Table 3 presents the descriptive statistics for soil electrical conductivity (EC), organic matter (OM), and the indices derived from UAV imagery, using three different sampling grid resolutions. The mean organic matter content was 33.831 g kg
−1, while the mean electrical conductivity was 1.83 dS m
−1. Based on the coefficients of variation (CVs), EC was classified as highly variable (85.4%), whereas organic matter showed moderate variability (27.6%). The high variability observed in EC may be related to localized factors such as variations in irrigation management with reclaimed water and topographic differences across the field [
62].
Regarding the biophysical indices, the Enhanced Vegetation Index (EVI) and the Generalized Difference Vegetation Index (GDVI) showed the highest mean values (0.426 and 0.398, respectively). In contrast, the Normalized Difference Salinity Index (NDSI) and the Visible Atmospherically Resistant Index (VARI) exhibited negative values, which is a common pattern in areas with low vegetation cover or exposed soils [
12,
63]. The variability in the indices ranged from low to moderate, with coefficients of variation (CV) between 17.5% (NDVI) and 22% (EVI), indicating good consistency in the data, even when using a lower-resolution sampling grid.
Considering grid 2, a slight rise in the mean values of EVI and GDVI was observed. The data range remained stable, with minimum and maximum values similar to those of the previous grid, and the CVs remained comparable. This suggests that increasing the sampling density did not significantly affect the variability in the extracted indices. The lower (LL) and upper (UL) limits confirmed that most data were concentrated within a statistically acceptable confidence interval, with few outliers. In the highest-resolution grid (grid 3), a further increase in the mean values of EVI (0.440) and GDVI (0.419) was recorded, suggesting improved spatial detail in detecting variability in soil spectral parameters. However, the standard deviation values remained similar to those of the previous grids—for example, EVI showed standard deviations of 0.094 (grid 1), 0.084 (grid 2), and 0.092 (grid 3); for GDVI, the values were 0.072 (grid 1), 0.073 (grid 2), and 0.079 (grid 3)—indicating that increasing the sampling density did not significantly reduce data dispersion or uncertainty.
3.2. Correlations Between Soil Attributes and Spectral Indices
Table 4 presents the Spearman correlations between soil attributes and spectral indices across the three sampling resolutions (grids 1, 2, and 3), allowing the consistency of biophysical and physicochemical variables to be assessed under varying sampling densities. Overall, strong positive correlations were observed among the spectral indices EVI, NDVI, GDVI, and SAVI, particularly in grids 2 and 3. GDVI and SAVI exhibited near-perfect correlations across all grids (≈0.999), suggesting that both capture similar spectral patterns associated with green biomass. NDVI and GDVI also showed high correlations (>0.95), reinforcing their robustness in vegetation detection in irrigated areas. Conversely, NDSI showed a strong negative correlation with vegetation indices such as GDVI and NDVI (<–0.95), indicating that increased salinity tends to be associated with reduced vegetation cover [
64].
Soil electrical conductivity (EC) showed moderate correlations with the spectral indices of NDVI, GDVI, EVI, and SAVI, with values of 0.209, 0.172, 0.17, and 0.162, respectively. The NDSI exhibited a negative correlation (−0.16), suggesting an indirect relationship between soil salinity and the presence of forage cactus in the study area. In contrast, organic matter (OM) showed weaker correlations. Overall, OM displayed weak or nearly null correlations with the spectral indices, corroborating the findings of Belenok et al. [
65], who reported a low linear correlation between the OM and NDVI values derived from Landsat 5 and 8 imageries, based on Pearson’s test. The correlation patterns were consistent across all three sampling grids, indicating that changes in sampling density did not significantly affect the strength of the relationships among variables.
Principal Component Analysis (PCA) was applied to the dataset, excluding outliers, to identify correlation patterns between soil attributes and spectral indices. The resulting biplots for the 2 × 1 m, 1 × 1 m, and 0.5 × 0.5 m grids are presented in
Figure 3. Dimension 1 (Dim1) explained between 40.6% and 42.2% of the variance in all three grids, while Dimension 2 (Dim2) accounted for 27.5% to 30.6%, together representing over 70% of the total variability explained by the first two components.
The biplots reveal that EC and OM contributed minimally to the variance explained by the first two principal components. However, in the 2 × 1 m grid, soil electrical conductivity showed a stronger correlation with the spectral indices NDVI, SAVI, GDVI, and EVI, suggesting that these indices have the potential for estimating soil salinity patterns in areas irrigated with reclaimed water. A similar behavior was observed in the higher resolution grids (1 × 1 m and 0.5 × 0.5 m), where EC maintained a certain degree of proximity to the spectral vectors, albeit with reduced intensity. The consistency of this pattern across different grids reinforces the robustness of PCA in capturing the structural relationships between spectral data and soil chemical attributes. Nevertheless, this result contrasts with the findings of Silva et al. [
8], who reported inverse correlations between NDVI, SAVI, GDVI, and EVI and soil EC in irrigated areas of the semiarid region. This divergence can be attributed to the fact that the present study focused solely on the spectral response of the soil, whereas Silva et al. [
8] evaluated the spectral response of the plant. Variations in soil type, the degree of salinization, irrigation management, soil cover, and image acquisition period also directly affect spectral responses and their relationship with soil properties.
3.3. Geostatistical Modeling of Soil Attributes and Spectral Indices
Table 5 and
Figure 4 present the parameters of the fitted semivariogram models for the physicochemical soil variables (OM and EC) and the spectral indices (MGRVI, NDSI, NDVI, VARI, EVI, GDVI, and SAVI). All variables exhibited spatial dependence, with fitted exponential or Gaussian models and high coefficients of determination (R
2), indicating good model performance.
Organic matter (OM) exhibited a strong spatial dependence (DSD = 11.15%), while electrical conductivity (EC) showed a moderate dependence (47.98%). The range values varied from 1.08 to 3.36 m, with EC presenting the highest range, likely due to its greater spatial heterogeneity, often influenced by irrigation practices and salt accumulation on the soil surface [
33]. The spatial dependence observed for organic matter is similar to that reported by Fu et al. [
66], who evaluated soil organic matter at depths of 0–20 cm and 20–40 cm and found models with DSD values of 10% and 13%, respectively.
Among the spectral indices, EVI presented the highest coefficient of determination (R
2 = 0.993), followed by NDSI and NDVI (0.987), GDVI (0.937), and SAVI (0.930), indicating an excellent fit of the theoretical model to the experimental semivariances. In contrast, VARI showed the lowest R
2 (0.671), likely due to its higher sensitivity to atmospheric interference and illumination variability, which is common in indices based solely on visible bands [
23]. In terms of spatial dependence, VARI exhibited a strong spatial dependence (DSD < 25%), while the other indices—MGRVI, NDSI, NDVI, EVI, GDVI, and SAVI—demonstrated a moderate spatial dependence (DSD between 38.38% and 48.03%). The observed range values varied between 3.95 m (VARI) and 56.25 m (MGRVI), reflecting different spatial extents.
Table 6 and
Figure 4 present the parameters of the semivariograms adjusted for the spectral indices in both grids 2 (1 × 1 m) and 3 (0.5 × 0.5). The adjustments of the semivariograms of the spectral indices showed different patterns of spatial structure, with a predominance of the Gaussian and exponential models.
The spectral indices MGRVI, NDSI, NDVI, EVI, and GDVI were best represented by the Gaussian model, with high coefficients of determination (R2) ranging from 0.986 (GDVI) to 0.850 (NDVI and NDSI), indicating an excellent fit of the data to the modeled spatial structure. On the other hand, the SAVI and VARI indices showed a better fit to the exponential model. The VARI stood out for having the highest range (R = 23.04 m) along with a good R2 (0.886), while EVI (R = 16.47 m) and MGRVI (R = 12.30 m) also exhibited ranges above 10 m, indicating significant spatial continuity. Regarding the degree of spatial dependence (DSD), SAVI showed a strong dependence (13.67%), while the other indices exhibited a moderate dependence, with MGRVI (49.93%), NDVI (49.76%), and NDSI (49.76%) standing out.
In grid 3 (0.5 × 0.5 m), the model fitting patterns remained consistent, particularly for indices based on near-infrared (NIR) bands, such as NDVI, GDVI, SAVI, and EVI, reinforcing the robustness of these indices in representing the soil’s spectral response. GDVI showed the highest R2 (0.928), followed by SAVI (0.927), and both NDVI and NDSI (0.910), indicating their strong capacity to represent the spatial structure of the data even under a higher sampling density. Although VARI (0.515) and MGRVI (0.556) presented lower R2 values, all indices passed the leave-one-out cross-validation test, reinforcing the accuracy of the spatial models. Furthermore, the DSD values in grid 3 followed patterns similar to the previous grids: VARI (11.59%) and MGRVI (13.90%) maintained a strong spatial dependence, while EVI, GDVI, SAVI, NDVI, and NDSI showed a moderate dependence, ranging from 49.78% to 57.52%. The spatial range varied from 2.64 m (VARI) to 10.62 m (EVI), indicating that the spatial dependence structure was preserved even at higher spatial resolution.
The parameters of the cross-semivariogram models between organic matter (OM) and the spectral indices are presented in
Table 7 and
Figure 5. The Gaussian models were predominant, with EVI showing the highest spatial dependence (DSD = 26.36%), followed by SAVI with a very low DSD (0.17%) and a range of 16.09 m, indicating a well-defined spatial structure. NDVI and NDSI presented similar low DSD values (0.18%), suggesting a strong spatial correlation with OM. GDVI had a range of 12.66 m and DSD of 8.33%, indicating a moderate spatial dependence. MGRVI and VARI did not yield valid semivariogram models at this scale, indicating limited applicability in coarser sampling grids.
The cross-semivariogram adjustments between electrical conductivity (EC) and the spectral indices revealed different levels of spatial dependence, as shown in
Table 8 and
Figure 6. For this soil attribute, the SAVI exhibited the highest degree of spatial dependence (DSD = 41.91%) with a range of 11.58 m, followed by NDVI and NDSI (DSD = 34.41%), which both fitted with exponential models and showing a broader spatial range (21.09 m). EVI also showed a significant spatial relationship with EC (DSD = 29.85%) and a range of 12.97 m, indicating its potential for mapping soil salinity variations over larger scales. GDVI exhibited the lowest range (7.39 m) and a DSD of 23.24%, suggesting a less continuous spatial structure. MGRVI and VARI did not yield valid model fits at this scale.
3.4. Uncertainty Analysis and Mapping Performance
The frequency distributions of the standard deviations associated with ordinary kriging and cokriging between organic matter and biophysical indices are presented in
Figure 7. In grid 1, with the lowest resolution (2 × 1 m), the ordinary kriging of organic matter (OM) yielded a mean standard deviation of 15.55. Cokriging with EVI reduced this value to 9.317, while GDVI, NDVI, and NDSI each reduced it to 13.19. In grid 2 (1 × 1 m), cokriging with EVI, NDSI, and NDVI produced the lowest deviations (9.409, 8.981, and 8.981, respectively), indicating improvements in prediction accuracy. In grid 3 (0.5 × 0.5 m), the lowest deviations were observed with NDSI and NDVI (both at 8.783), followed by EVI (9.317) and MGRVI (15.073).
The estimation of soil organic matter (OM) based on the EVI, GDVI, NDSI, NDVI, and SAVI indices resulted in a reduction in the mean standard deviation in all cases evaluated, except for the GDVI of the 0.5 × 0.5 m grid, where an increase in uncertainty was observed. This improvement in estimates is supported by Gou et al. [
67], where the authors highlighted the effectiveness of indices such as NDVI in predicting organic carbon in small properties using UAV-derived images. Overall, it was observed that some indices, such as NDVI and NDSI, performed better as the extraction grid was refined, with successive reductions in mean standard deviations: 13.19, 8.918, and 8.783 for the 2 × 1, 1 × 1, and 0.5 × 0.5 grids, respectively. These results are consistent with those found by Lundgren et al. [
68] who evaluated the sensitivity of grid resolution and the inverse relationship with standard deviations. On the other hand, indices like GDVI showed an increase in standard deviation as the grid was refined, suggesting a greater sensitivity to the interpolation scale. Still, all indices evaluated contributed to reducing uncertainty in OM estimates, except for GDVI at the densest grid, indicating the importance of selecting appropriate indices based on the spatial scale of the analysis.
For electrical conductivity, the frequency distributions of the standard deviations associated with ordinary kriging and cokriging are presented in
Figure 8. Kriging showed a mean standard deviation of 1.39. In the first grid, cokriging with EVI provided the best result, reducing the standard deviation to 0.666, followed by GDVI (0.686). In grid 2, GDVI showed the lowest deviation (0.396), followed by NDSI and NDVI (both 0.674). In grid 3, EVI again exhibited the lowest deviation among the indices, with a standard deviation of 0.67, followed by SAVI (0.773) and NDVI and NDSI (0.785 and 0.785, respectively).
The frequency distribution of the standard deviations obtained from the mapping shows improvements in the estimates when biophysical indices are used as secondary variables. The best results for estimating EC and OM are shown in
Figure 9A,B. OM values were achieved with the 0.5 × 0.5 grid, using the NDVI and NDSI indices to reduce uncertainty in organic matter estimation, with an approximate reduction of 43.52% in the mean standard deviation compared with kriging performed using only the primary variable. However, the frequency distribution of the standard deviations observed in the estimates indicates a significant improvement for the evaluated indices.
Regarding electrical conductivity, the EVI and GDVI indices stood out for having the lowest mean standard deviations, indicating a higher precision in the estimates. These results corroborate previous studies, such as those by Taghadosi and Hasanlou [
64], who demonstrated the effectiveness of both indices as indirect indicators of salinity in vegetated areas. Similarly, Silva et al. [
8] highlighted GDVI as a promising index for predicting soil electrical conductivity. These findings reinforce the potential of using spectral indices in salinity modeling, especially when applied to environments with vegetative cover.
3.5. Spatial Distribution Maps of Soil Attributes
To complement the uncertainty analysis, spatial distribution maps were generated to visualize the improvement in soil attribute predictions achieved by incorporating spectral indices as secondary variables in the cokriging models (
Figure 10).
The spatial distribution maps of OM and EC illustrate the performance of kriging and cokriging approaches. While ordinary kriging (
Figure 10A,B) produced more generalized spatial patterns, the integration of spectral indices in the cokriging models (
Figure 10C–H) enhanced the spatial detail, particularly for EC, where zones of higher salinity were more sharply delineated. These improvements reflect the ability of multispectral indices to capture the spatial variability associated with vegetation vigor and salinity effects, contributing to more accurate and spatially consistent predictions of soil attributes in wastewater-irrigated systems.
4. Conclusions
This study highlighted the potential of using UAV-derived multispectral images integrated with geostatistical techniques for mapping the spatial variability in soil salinity and total organic carbon in areas irrigated with wastewater in the Brazilian semiarid region. The use of cokriging models incorporating spectral indices as secondary variables enhanced the accuracy of soil attribute predictions and reduced associated uncertainties.
Regarding the indices analyzed, NDVI, EVI, and NDSI stood out as the most promising, particularly NDVI and NDSI, which contributed significantly to reduce uncertainties in spatial interpolation, especially in grids with a higher resolution. These findings emphasize the importance of integrating remote sensing and geostatistics as a strategic tool for monitoring soil quality and supporting more sustainable agricultural practices, thereby enhancing water use efficiency in regions vulnerable to water scarcity. Moreover, the sensitivity analysis across different sampling grids provided valuable insights into the balance between sampling density and model performance, which can guide future operational decisions.
Additionally, the incorporation of NDVI and NDSI into the cokriging models led to reductions of up to 43% in prediction uncertainty for total organic carbon under higher sampling densities. The consistency of the correlations across different grid resolutions further supports the robustness of the proposed approach for practical field applications. By improving soil attribute mapping and supporting more efficient wastewater irrigation management, the proposed methodology contributes to more sustainable agricultural practices and enhances the resilience of forage cactus production systems in vulnerable semiarid environments.