1. Introduction
Agriculture is currently facing significant challenges due to the need to ensure food security in the context of a growing global population and climate change. The sustainability in production systems is a priority, especially in key crops such as wheat, which accounts for 21% of global food requirement [
1,
2]. In this context, direct seeding (DS) has emerged as a promising practice for improving agricultural productivity, reducing energy costs, and mitigating environmental impacts compared to conventional seeding (CS) [
3].
Wheat (Triticum
aestivum L.) is one of the world’s major crops and plays a crucial role in food security. In Spain, this crop covers an area of approximately 1,967,485 hectares, producing 7,116,771 tons in 2024, with the Castilla and León being the main producing region [
4]. Agricultural management practices, such as DS and CS, significantly influence crop yield, grain quality, and sustainability. Researchers have shown that DS on previous crop residues significantly increases yields compared to seeding on tilled soil where an equivalent amount of residue has been incorporated. This effect results from the benefits of minimal soil disturbance. The structure generated by the root canals of the previous crop, combined with the biological activity of earthworms and other forms of soil fauna, promotes deep rooting while improving rainwater infiltration and percolation. These conditions contribute to creating a more favorable environment for crop development [
5].
Previous studies have shown that wheat productivity is sensitive to soil preparation, climatic conditions, and agronomic management. Zero-tillage practices have significantly increased soil organic carbon (SOC) levels, especially in the top 0 to 10 cm layer. This performance highlights the effectiveness of reducing soil disturbance and encouraging the incorporation of organic matter to improve SOC reserves [
6]. The implementing of a no-till system promotes better water retention in the soil, improves its physical and biological properties, and reduces erosion, often resulting in increased yields and net income for producers. This system also strengthens future food security by providing greater resilience to extreme weather events such as prolonged droughts and heat waves. These phenomena are projected to increase in the coming years [
7].
The integration of remote sensing technologies has revolutionized the study of agricultural systems, enabling non-destructive monitoring of physiological variables and spectral indices. The NDVI and NDRE indices are widely used to observe sensitivity to plant vigor variation, providing key information on the physiological status of crops [
8]. In addition, the SAVI minimizes the influence of soil brightness, which is particularly useful in DS systems. Studies have shown that applying unmanned aerial systems (UAS) improves the accuracy of estimating key indicators such as plant height and chlorophyll content [
9].
Spectral indices derived from remote sensing can contribute to agricultural yield estimation [
10].
Regarding data analysis, Principal Component Analysis (PCA) and machine learning algorithms, such as Random Forest (RF), are practical tools for identifying patterns in multidimensional data and distinguishing between different treatments. Researchers have highlighted the usefulness of PCA for selecting key variables in studies on genetic diversity and crop management [
11]. In complex systems such as wheat, multivariate analysis is crucial for interpreting interactions between agronomic, environmental, and management factors. The GWP method is one of the most widely used statistical procedures for data dimension reduction [
12]. In addition, statistical tests such as ANOVA and Levene are robust tools for detecting significant differences between treatments, as reported by Katral et al. [
13] in rice experiments.
On the other hand, combining remote sensing data, meteorological data, field observations, and machine learning algorithms such as RF can improve crop yield estimation [
14]. Xu et al. [
15] used RF to classify crops with high precision and accuracy, highlighting its robustness in agricultural scenarios. Han et al. [
16] emphasized the importance of considering structural and spectral information jointly rather than analyzing them independently when estimating crop biophysical parameters combined with machine learning.
This research identifies robust indicators that distinguish between agricultural management systems in order to optimize farming practices. Spearman’s correlations and ROC analysis are key techniques for establishing relationships between physiological variables and spectral indices [
17]. These tools validate the selected indicators, ensuring their applicability in different agronomic and climate contexts.
Despite advances in remote sensing and machine learning, significant gaps that remain are as follows: scarcity of studies that jointly integrate physiological variables and spectral indices to distinguish management systems; limited evidence under long-term conditions (e.g., fields with decades of DS) where surface residues and soil structure modulate the optical signal; and little consideration of discrimination measures alongside yield prediction in a unified analytical framework.
This study aims to predict wheat yield in DS and CS systems based on spectral indices selected through statistical analysis. In addition, the yield variable was evaluated to determine whether it has a greater ability to discriminative between management systems, using a classification model to develop an appropriate tool for estimating crop productivity.
Unlike studies that typically evaluate these systems separately, this study integrates UAS-derived spectral indices and physiological variables into a single multivariate framework to predict yield and distinguish between DS and CS. It also explicitly evaluates discrimination ability alongside predictive performance, providing operational indicators for early management decisions in precision agriculture.
2. Materials and Methods
2.1. Study Area and Experimental Design
This study was carried out in the Autonomous Community of Castilla and León, in the province of Palencia, in the Integrated Vocational Training Centers Viñalta, located on the outskirts of Palencia, 2 km west. The area lies at 709 m.a.s.l., at 42°0′16.35″ North latitude and 4°34′8.70″ West longitude, with average temperature is 11.6 °C, the coldest month is January with average temperatures of 3.4 °C, the warmest month is July with average temperatures of 20.9 °C, annual precipitation of 397 mm, average annual solar radiation 6.07 GJ/m
2.year, climate classification according to Köppen is temperate and type of Atlantic climate, annual potential evapotranspiration of 699 mm and type of háplico luvisol soil [
18].
Figure 1 illustrates the ubication in Spain and the distribution of the plots in the field comprising an area of one hectare for each treatment.
The experimental design consisted of divided plots, each with an approximate area of 0.42 ha, implemented under two planting systems. In the first plot, the Viñalta Integrated Vocational Training Centers have practiced a DS system for 30 years, actively applying conservation agriculture techniques that prioritize the protection of ecological integrity; among the measures adopted, the prohibition of the use of agricultural machinery stands out, two applications of non-selective herbicide (glyphosate) were made before planting to control weeds. In the second plot, the CS was used, which included activities such as two passes with fast disk stands (Minidisc), cultivator, and vibrating cultivator. In both plots of investigation, the previous crop was Veza, in rainfed, and the wheat variety evaluated was Andino, which was sown with a 200 kg/ha density.
2.2. Field Data Collection
2.2.1. Measurements of Physiological Responses
To ensure representativeness in a 1 ha plot, a geolocated grid of points with a 5 m spacing was generated in QGIS version 3.28.15. In each plot, nine sampling points per treatment were selected using spatially balanced sampling based on the conditional Latin hypercube (cLHS) methodology [
19], which maximizes covariate space coverage and avoids preferential sampling. The selection was implemented in RStudio version 4.4.1, and the design was chosen because it homogeneously distributes the locations across the entire area and reduces variance compared to simple random sampling with equal effort (
Figure 2). In this study, reliability was addressed a priori through a spatially balanced design and the use of blocks, which ensures uniform coverage of within-field gradients across the one-hectare plot. The selected points were exported to the QField version 3.7.9 application to facilitate the location and installation in the field of 18 sampling quadrants, each with dimensions of 0.5 × 0.5 m. At these sites, the corresponding samples were collected during the phenological stage of flowering on 13 May 2024.
At each sampling point, information corresponding to the crop, contained in
Table 1, was collected.
The harvest was carried out on 6 July 2024. Therefore, sampling was carried out one day earlier. The variables to be measured are presented in
Table 1.
2.2.2. Acquisition of Aerial Images
The images were captured using a DJI Mavic 3 Enterprise UAS, model M3M, DJI, Shenzhen, China (
Figure 3a), classified as rotor equipment with four propellers, which has integrated a 4/3 CMOS RGB camera, effective 20 MP pixel, 84° field of view, 24 mm equivalent format, f/2.8–f/11 aperture and 1 m to ∞ focus (with autofocus); the CMOS 1⁄2.8’ multispectral camera, 5 MP effective pixel, 73.91° field of view, 25 mm equivalent format and f/2.0 aperture. The multispectral sensor makes it possible to collect data in four spectral bands: green (G 560 ± 16 nm), red (R 650 ± 16 nm), red (RE 730 ± 16 nm), and near-infrared (NIR 860 ± 26 nm) (
Figure 3b).
The aerial reconnaissance was carried out on the same day the phenological field responses were evaluated and carried out between 10:00 a.m. and 1:00 p.m. The mission was carried out at an altitude of 50 m and an approximate speed of 4.4 m/s. The images had a ground sampling distance (GSD) of 2.31 cm per pixel, incorporating 80% frontal and 80% lateral overlap. The flight path was automatically created using DJI Pilot 2 software version 02.01.07.12 integrated into the remote control. In addition, the radiometric sensor was calibrated before and after image capture, aiming to compensate for variations in incident light conditions. This procedure made it possible to obtain accurate quantitative data by using a reference plate and adjusting the image capture to the fluctuations in sunlight recorded during the flight.
The UAS has a real-time kinematic (RTK) sensor to ensure the images’ georeferencing. This centimeter-precision positioning sensor connects to the national geodetic network of GNSS reference stations (ERGNSS); for the study area, the PALE3M sensor from the Castile and León GNSS station network was used.
For radiometric calibration, the images obtained from the UAS were adjusted to convert reflectance signals into physical values using a reflectance calibration panel (a surface calibrator with known reflectance) that was placed in the field during acquisition. This panel allowed for the adjustment of the images to the atmospheric reflectance values of each spectral band, ensuring that the measurements were comparable to reference conditions [
23]. Regarding geometric calibration, a georeferencing process was applied to correct possible geometric distortions caused by UAV movement or camera tilt. To do this, geospatial control points and a GPS coordinate system were used, which allowed the images to be adjusted to a standard cartographic projection. Data acquisition took place at 10:00 AM, with a temperature of 15 °C, wind speed of 16.7 km/h from the southwest, visibility of over 10 km, and clear skies, ensuring optimal conditions for image capture.
2.3. Aerial Image Processing
The images collected were processed using the Pix4Dmapper software (version 4.4.12) to obtain the orthophoto and vegetation indices. During this procedure, additional products, such as elevation point clouds, digital elevation models, digital surface models, and three-dimensional meshes (3D), can also be generated [
24].
Processing in Pix4Dmapper was carried out using the standard workflows for RGB and multispectral images, applying the “3D Maps” and “Ag Multispectral” configurations, respectively, to generate the corresponding ortho mosaics. Once the ortho mosaics were generated, the vegetation indices were calculated and exported in TIFF format, facilitating their subsequent analysis.
The vegetation indices (
Table 2) were selected according to the potential relationship with crop biophysical attributes—such as canopy health and sensitivity to chlorophyll content, nitrogen status, leaf structure, and senescence [
25]—as well as their agronomic relevance, robustness to illumination and soil-background variability, and high predictive accuracy with the on-board sensor bands and the UAS RGB camera [
26]. Specifically, NDVI and RVI were chosen for their long track record and were used as established metrics of vigor and biomass, widely reported in precision agriculture in recent years [
27]. GNDVI and the green chlorophyll index (GCL) were included due to their higher sensitivity to chlorophyll and nitrogen status and their lower saturation relative to NDVI, as <corroborated by recent reviews and studies showing a strong correlation between GNDVI and chlorophyll [
28]. NDRE and the red-based chlorophyll index (RECL) leverage the red-edge region to estimate chlorophyll and diagnose nitrogen with better performance at high canopy cover; their utility has been reinforced by the availability of red-edge bands [
29]. SAVI and AVI were used to mitigate soil and illumination effects (early stages or low cover), with AVI documented in current operational guides [
30]. Finally, the RGR ratio (R/G) provides sensitivity to senescence and pigments [
31]. A buffer of 2.5 m was generated around each point to calculate the vegetation index values at the sampling points. All available values were collected within this area, obtaining 672,862 data. Subsequently, the analysis used the mean and median as representative measures. To differentiate them, they were labeled as NDVI_P (associated with the average) and NDVI_M (those corresponding to the median).
2.4. Statistical Analysis
Identifying the most relevant factors to differentiate the treatments is essential to visualize the contributions of the original and modeled variables. This process involves a workflow that integrates dimension reduction and predictive modeling techniques, enabling the simplification and analysis of complex data while ensuring interpretability and accuracy in parcel differentiation.
First, data related to crop physiological responses and vegetation indices were processed. A component reduction technique was applied using PCA to simplify the large amount of multidimensional information [
40]. This procedure made it possible to identify the most relevant variables, facilitating their understanding, evaluation, and interpretation. As part of the analysis, the data were standardized to scale and center the numerical variables to an average of 0 and a standard deviation of 1, ensuring that all variables had equal importance in the analysis, regardless of their units or ranges. PCA was applied to the numerical variables, combining them linearly to obtain new components. The number of components (k) was defined as the minimum required to achieve ≥90% cumulative variance, calculated from the eigenvalue spectrum (percentage of variance explained by each component and its cumulative total), in order to retain at least 90% of the information. The data were then projected into the k-dimensional PCA space and split into training (70%) and test (30%) sets using a reproducible random partition (fixed seed: 123) to ensure reproducibility.
The differentiation model was trained, evaluated, and validated using the RF algorithm to predict the DS and CS treatment variables from the training data set. The model was adjusted by cross-validation of 5 partitions on the training set, exploring values of the hyperparameter mtry ∈ {2, 4, 6} to optimize performance; the number of trees was set to the package default (500), the split rule was Gini, and sampling was performed by bootstrap with replacement (enabling estimation of the out-of-bag, OOB, error), allowing a robust performance evaluation. Once trained, predictions were made on the test data set, and performance was evaluated using a confusion matrix, obtaining metrics such as accuracy, sensitivity, and specificity.
Finally, the algorithm’s performance and differentiation quality were analyzed using the ROC curve and the area under the curve (AUC), fundamental metrics to understand the model’s capacity for predicting and differentiating observations in the categories defined by the treatment variables DS and CS.
In addition, to identify if there are significant differences between treatments, an average test was performed using vegetation indices. For this purpose, the normality of the data was initially verified graphically and using the Anderson-Darling test [
41]. For data with normal distribution, ANOVA with 5% probability (
p < 0.05) was applied, and the Levene test was based on means for data without normal distribution with 5% probability (
p < 0.05) [
42].
The indices that presented significant differences were selected to apply the Spearman correlation and the ROC curve, indicating that the lowest correlations and the highest value of ROC correspond to the most effective vegetative indices and correlated with the physiological variables of interest. With the indices that demonstrated a more significant relationship with performance, we proceeded with RF to model the performance prediction and compare it with the actual data; 70% of the data was used to train the model, and the remaining 30% was used for validation [
43]. The model’s accuracy was assessed by calculating the coefficient of determination R
2. All procedures were performed using RStudio software (Version 2024.04.2+764 “Chocolate Cosmos”).
2.4.1. Model Validation
This study uses data from a single season and site. Accordingly, we performed internal validation via a 70/30 train–test split, stratified fivefold cross-validation on the training set, and RF out-of-bag (OOB) error estimation. We report accuracy, Cohen’s kappa, sensitivity/specificity, and AUC, with confidence intervals where appropriate.
ROC/AUC criteria. For the binary discrimination (CS vs. DS), ROC curves were computed from model class probabilities and performance was summarized by AUC. We interpreted AUC using conventional thresholds widely adopted in the diagnostic-test literature: AUC < 0.60 = no/poor, 0.60–0.69 = poor, 0.70–0.79 = acceptable, 0.80–0.89 = good/excellent, ≥0.90 = outstanding. Under this scale, AUC > 0.80 indicates a good classifier.
Note on the use of AI Tools: ChatGPT version 5.0 (OpenAI, San Francisco, CA, USA) and Grammarly were used solely to improve the clarity, grammar, and English language of the manuscript. These tools were not employed to generate, analyze, or interpret any methodological or scientific content. All descriptions, analyses, and conclusions were written, reviewed, and validated by the authors, who take full responsibility for the final content.
3. Results
3.1. Principal Component Analysis and Random Forest
Applying PCA, dimensionality reduction is observed to identify the linear combinations of original variables that capture most of the variance, facilitating visualization and interpretation.
Figure 4 shows the percentage of variance explained by each principal component and determines how many components are needed to capture most of the information in the original data. The first two components combined account for approximately 76% of the cumulative variance.
Figure 5 represents the measured quality values of each variable in the different principal components (dimensions), where dimensions 1 and 2 are the ones that most contribute to explaining the variability in the data.
Principal components (PCs) were interpreted using factor loadings (sign and magnitude), variable contributions (% contrib), and squared cosines (cos2) to assess the quality of representation. A variable was considered relevant in a principal component when its contribution exceeded the uniform expectation (100/p) or when |loading| > 0.40. Positive factor loadings indicate covarying traits, while opposite signs suggest trade-offs. Indices designed to reduce soil effects (e.g., SAVI, AVI) aligned with canopy cover/soil gradients, while chlorophyll-sensitive metrics (e.g., GNDVI, NDRE, GCL/RECL) correlated with greenness/chlorophyll gradients. This framework guided the biological interpretation of the PCA figures and influenced the RF model inputs.
For RF modeling, the minimum number of principal components (k) required to reach ≥90% cumulative variance was retained, and the data were projected into the k-dimensional PC space. On the held-out test set, RF achieved 93.75% accuracy (95% CI: 69.77–99.84%) with Kappa = 0.875; in stratified fivefold cross-validation, accuracy was 94.29% (Kappa = 0.8831). Methodological details are provided in Methods (
Section 2.4.1).
The model demonstrates perfect sensitivity, with a value of 1.0 for the positive class corresponding to the CS variables, indicating that it correctly differentiates all cases in this category. With a specificity of 0.8750, it correctly identifies most of the DS negatives, largely avoiding their misclassification as CS. The predictive value for CS was 0.8889, equivalent to 88.89% accuracy, while for DS, the positive predictive value reached 1.0, indicating perfect accuracy.
Furthermore, the balanced accuracy value of 0.9375 reflected balanced performance between both classes and the low p-value (0.0002594) associated with the significance test indicates that the model significantly outperforms the performance expected by chance. The McNemar’s Test p-Value (1.000) result shows no evidence of imbalance in differentiation errors, reinforcing the model’s robustness and confidence.
The above result is confirmed by the ROC curve analysis, where the AUC value reached 0.9375, indicating that the model has the ability to correctly differentiate the classes in an average of 93.75% of the cases on average.
After adequately differentiating the treatments, we proceeded to analyze the importance of the RF model’s variables. According to the most relevant dimension presented in
Figure 6a, Dim. 2 is the most important within the model, with a value close to 100. This indicates that this dimension contributes significantly to the accuracy of the predictions generated by the model.
For its part,
Figure 6b highlights the variables with greater relevance in the model in Dim.2. Among these, yield, nitrogen, and protein were identified as those with the highest contribution, suggesting that they should receive priority attention, given that they significantly impact the analysis’s results.
3.2. Vegetation Index Analysis
Vegetation indices vary depending on the crop’s phenological status. In this study, wheat monitoring was carried out in two stages of development, observing that significant differences were concentrated in the flowering phase. After carrying out the normality analysis and the means comparison tests, the results were represented by box diagrams that illustrate the distribution of the values of each index under the different treatments.
Considering average and median values, eight of the nine indices evaluated showed significant differences according to the means tests. Additionally, Spearman correlation values were included, which allowed the identification of the most relevant vegetation indices (
Figure 7). The results indicate that indices such as AVI and SAVI. However, they have low or even negative R
2 values and show a greater capacity for differentiation between the CS and DS classes, suggesting a stronger relationship between these indices and the variable of interest. Moreover, they outperform other indices because they integrate the canopy signal (high NIR and low R) and correct for soil reflectance, thereby better capturing canopy closure and structure, as well as the greater shadow and residue characteristic of DS. Additionally, in
Figure 8, a differentiation is visually observed between the treatments based on the SAVI and AVI, evidencing distinctive patterns in the spectral response of wheat under DS and CS.
The ROC curve complements and reinforces the results obtained by the Spearman correlation, showing that the vegetation index with the highest predictive capacity is the AVI, presenting the highest value of AUC = 0.8025, followed by the SAVI with AUC = 0.7778. These results highlight that both vegetation indices have the best discriminative capacity within the model, which indicates that they are the most effective in distinguishing between the treatments evaluated (
Table 3).
Based on these findings, it was decided to work with the median values since these values present slightly higher performance than the averages, although with a minimal difference. This approach ensures greater consistency and robustness in interpreting the results obtained [
44].
The Spearman correlation was performed again with the vegetation indices that obtained the best performance in the differentiation of treatments. This analysis made it possible to compare these indices with the most important variable identified by the RF model to determine their relationship (
Figure 9). The AVI_CS and SAVI_CS indices have significant positive correlations with CS performance (0.69), indicating a close relationship between these indices and the performance variable in the CS class. This result suggests that as the values of these indices increase, the performance associated with this class also increases. On the other hand, AVI_DS and SAVI_DS show strong negative correlations with performance in DS (−0.79), which implies that these indices are helpful in differentiating the characteristics of the DS class. In this case, performance in the DS class tends to increase as index values decrease.
3.3. Prediction of Performance
The wheat yield prediction was carried out using the vegetation indices AVI and SAVI, which were selected for their ability to capture variability in tillage systems. These indices provide crucial information about the physiological state of the crop and the differences generated by the treatments applied. The RF algorithm was used to ensure an accurate performance prediction in the CS and DS systems, using the spectral indices mentioned as predictive variables. The model was trained with 70% of the available data, reserving the remaining 30% for validation. As a result, only the five-point representation is observed.
Additionally, linear regression graphs were generated to visualize the relationship between the actual performance and the performance predicted by the model. The model’s accuracy was evaluated by calculating the coefficient of determination (R2) as a key performance metric. This coefficient was estimated independently for the AVI and SAVI spectral indices in both tillage systems, providing a detailed assessment of the model’s efficacy under different crop management conditions.
In
Figure 10 and
Figure 11, high predictive capability is demonstrated with the AVI and SAVI, respectively. With AVI, the coefficient of determination (R
2) values were 0.92504 in CS and 0.93948 in DS, reflecting an excellent fit between predicted and observed values. With SAVI, R
2 reached 0.91142 in CS and 0.9398 in DS, confirming comparable performance. In both cases, the proximity of the points to the identity line indicates low discrepancy and strong agreement between predictions and actual data. Overall, these results validate the reliability of the approach and support AVI and SAVI as robust predictors of yield, with particularly high performance in DS.
4. Discussion
The combination of PCA and RF proved effective in modeling wheat yield in DS and CS areas based on physiological variables and spectral indices. In the multivariate analysis, the PCA results revealed that the first two dimensions explained 76% of the cumulative variance. This level is comparable to that reported by Liu et al. [
45], indicating that PCA is highly effective in identifying discriminant patterns in complex management systems. It reduces dimensionality without losing relevant information and facilitates data interpretation and key variable selection. In biological terms, the first principal component (PC1) summarizes a gradient of vegetation cover closure relative to soil influence, characterized by positive loads in NIR and indices that attenuate soil brightness (SAVI/AVI), as well as negative loads in the red band (R) and ratios related to greater soil exposure (e.g., RGR). The second component (PC2) captures a chlorophyll/greenness gradient, with high loads in GNDVI, NDRE, and GCL/RECL, which are associated with higher pigment content and better nitrogen status. Higher nitrogen availability is linked to an increase in chlorophyll content, which directly influences the health of the plant cover and, consequently, yield [
46]. According to the loads and correlation models, yield, nitrogen, and protein are positively associated with these axes, confirming their agronomic relevance.
In the evaluation of the confusion-matrix, the differentiation model achieved an overall accuracy of 93.75% and a Kappa index of 0.875, reflecting a high level of agreement. These results coincide with the findings of Zhao et al. [
47], who demonstrated the high effectiveness of the RF method for predicting wheat yield in the Northern plains of China. Nevertheless, our study focuses on physiological variables and spectral indices in DS treatments (a plot with 30 years of conservation agriculture) compared to CS, while the previous work centered on accumulated biomass and various climatic indices.
The RF method identified yield, nitrogen, and protein as the most relevant variables for distinguishing between treatments, in line with previous studies. Esaulko et al. [
48] report that DS can stabilize productivity in arid regions by improving infiltration and reducing evaporation. On the other hand, Colecchia et al. [
49] observed higher yields in CS under conditions of good water availability. Our findings reinforce the usefulness of spectral indices for quantifying and modeling crop yield, highlighting the long-term benefits of DS. However, the effectiveness of each system depends mainly on soil and climate conditions and the agronomic management.
Furthermore, Levene’s test and ANOVA confirmed significant differences in the variables evaluated between treatments. These tests are essential for validating the robustness of the results and corroborating that the observed differences are not due to chance. Houšt et al. [
50] reported similar findings when applying these tests in wheat experiments, highlighting their relevance in comparative agricultural management studies.
Vegetation indices showed significant differences between treatments, particularly during the flowering phase. Walsh et al. [
51], reported that spectral indices based on UAS accurately capture phenological variations in crops. In our case, ROC analysis and Spearman correlations highlighted AVI and SAVI as the most informative indices for distinguishing treatments at flowering (AUC = 0.8025 and 0.7778, respectively). In CS, the Spearman correlation with yield was positive (ρ = 0.69), indicating that higher index values are associated with greater productivity. Conversely, in DS, correlations with yield were negative (ρ = −0.79). This pattern is consistent with the presence of residue cover and higher soil moisture, which reduce soil brightness/reflectance and increase canopy shade. At the pixel level, this decreases the NIR–R contrast and can reduce SAVI/AVI even when crop condition is good, producing negative correlations with yield (i.e., lower index values associated with higher yield). This coincides with the findings of Cheng et al. [
52], who highlighted the usefulness of SAVI in minimizing the effect of soil brightness in management systems such as DS.
SAVI and AVI not only demonstrated their ability to distinguish between treatments, but also their ability to model performance in tillage systems. Using RF, coefficients of determination (R
2) greater than 0.9 were obtained in DS and CS, providing accurate performance prediction. The relationship between yield and AVI highlights its usefulness as a key indicator in agricultural systems, integrating information on plant cover and photosynthetic efficiency. This is consistent with the study by Yang et al. [
53], who noted that spectral indices capture yield variation through non-destructive monitoring. In this context, AVI not only distinguishes DS from CS, but also provides essential information for estimating yield under different management strategies. This behavior reinforces the importance of spectral indices as essential tools in precision agriculture, facilitating the distinction of treatments and optimizing management practices.
5. Conclusions
The analysis identified that the vegetation indices AVI and SAVI were the most relevant in identifying the treatments, standing out for their discriminative capacity, especially between CS and DS. The ROC curve indicated its efficacy, with AVI (AUC = 0.8025) and SAVI (AUC = 0.7778) as the best for classification and discrimination.
The high predictive capacity of the AVI and SAVI stands out, with R2 higher than 91%. Both indexes showed excellent performance in CS and DS, with the highest values of R2 in DS (0.93948 for AVI and 0.93980 for SAVI). The closeness of the points to the identity line confirms the reliability of the predictions, consolidating AVI and SAVI as precise tools to estimate agricultural yield. In practical terms, the results enable anticipatory, zone-specific fertilization, prioritized irrigation and monitoring in low-vigor areas, and postharvest scheduling according to spatial variability, thereby optimizing costs, inputs, and logistics.
This study demonstrates the feasibility of applying remote sensing and machine learning to precision agriculture in wheat. Integrating UAS, spectral indices, and Random Forest enabled us to discriminate between DS and CS and to estimate yield with high accuracy, providing operational indicators for more efficient and sustainable management. However, the models were trained with data from a single site and a single season; broader external validation across multiple seasons and regions with different edaphoclimatic conditions is required to assess generalizability. Likewise, although AVI and SAVI proved robust for wheat under DS and CS, their direct applicability to other crops or climates should be verified before operational use. Future work will expand multi-site, multi-year validation and explore the transferability of both the models and the indices to diverse production contexts.
Since yield, nitrogen, and protein were identified as the most relevant variables, it is recommended to further investigate their analysis through studies exploring their interaction with other agronomic and climatic factors, to refine the understanding of their impact and improve predictive accuracy. Additionally, it is advisable to compare RF with alternatives such as Gradient Boosting, Support Vector Machines, and deep neural networks, evaluating precision, robustness, and efficiency. Also, validating models using external datasets is critical to ensure their applicability and generalization in real-world scenarios.