1. Introduction
Precision agriculture (PA) in Europe and Serbia faces several agronomic, economic, and demographic challenges. Small to medium-sized farms dominate Central and Eastern Europe, limiting economies of scale and raising production costs per hectare [
1]. Serbia reflects this trend, with an average farm size of 5.4 ha and pronounced land fragmentation [
2]. Concurrently, rural depopulation, population aging, and youth migration contribute to a shortage of agricultural labor [
3]. As a result, PA offers a promising strategy to boost productivity and cut input costs, even for small and fragmented farms. Technologies like automated soil sampling, satellite- and UAV-based remote sensing, proximal sensing, and advanced analytics are becoming essential for modern, sustainable agriculture [
4]. Moreover, the concept of digital twins and the fusion of real-time field and satellite data have gained prominence for zone delineation and input optimization [
5].
While several pilot programs have introduced PA technologies to larger farms—particularly in parts of Western Serbia—broader adoption remains limited among smallholders. Reported yield increases of 12%, and even up to 100% for specific high-value crops, suggest the agronomic potential of these technologies; however, uptake is often constrained by high initial costs, insufficient advisory support, and limited technical capacity [
6]. In regions like Vojvodina, where digital infrastructure is somewhat more developed, empirical studies still highlight that PA tools such as drones, online platforms, and remote sensing are underutilized due to financial and educational barriers [
7]. While the EU promotes PA through the Common Agricultural Policy (CAP), Serbia is gradually embracing it through pilot initiatives and national rural development strategies [
8]. Even so, further investment in digital infrastructure, broadband connectivity, and targeted education is needed to enable wider adoption, particularly on fragmented and resource-limited farms [
9]. The integration of digital and geospatial technologies is thus becoming a critical driver of sustainable agricultural production, offering a pathway to address structural limitations and reduce the rural–urban digital divide.
PA is a modern farming approach that combines various technologies to improve yields while conserving natural resources [
10]. Recent studies emphasize that the integration of site-specific data sources, even on smallholder farms, is increasingly supported by open access remote sensing and low-cost proximal sensors, enabling broader PA adoption across Europe and transitional regions [
5,
11,
12]. It is grounded in the acquisition and analysis of spatial data on soil and crop variability, facilitating the delineation of site-specific management zones (MZs). This zonation supports targeted application of fertilizers, water, and other inputs based on localized needs [
13]. The effectiveness of this approach depends on the combined use of complementary data sources, including proximal sensors, laboratory soil analyses, and satellite imagery, which together provide a comprehensive understanding of field heterogeneity [
14,
15].
Methods relying on apparent soil electrical conductivity (EC), particularly those using the EM38-MK2 sensor, have become increasingly important for detecting spatial variability in key soil physico-chemical properties in a fast and cost-effective manner [
16]. The EM38-MK2 performs continuous, non-invasive measurements at depths of 0–0.5 m and 0–1.5 m, using horizontal and vertical dipoles to generate four data channels per point [
17,
18]. These readings indirectly reflect soil texture, compaction, moisture, salinity, and, in some cases, organic matter content [
17,
19], thereby supporting efficient management zone (MZ) delineation and potentially lowering input costs [
20]. The approach offers high sampling density, broad coverage, and rapid data acquisition without direct soil contact. However, accuracy may be reduced in excessively moist conditions, where EC is dominated by water content [
21], and interference from nearby metallic structures or rocks can require additional data correction.
Satellite-based data, particularly from the Sentinel-2 platform, are increasingly utilized in precision agriculture due to their ability to deliver frequent and wide-area observations of vegetation and soil conditions [
22,
23]. Vegetation indices (VIs) such as NDVI, SAVI, and LCI provide estimates of biomass and photosynthetic activity, while the Bare Soil Index (BSI) helps detect exposed soil and monitor land degradation [
24,
25]. These indicators reflect phenological development and support informed decisions throughout the growing season—from sowing to harvest [
26,
27]. Key advantages include large spatial coverage, high revisit frequency, and access to multi-year time series at up to 10 m resolution. However, satellite data are affected by atmospheric conditions and mixed pixels, and VIs may be influenced by factors unrelated to soil status, such as disease or mechanical damage [
25,
28]. When combined with field and laboratory data, these indices enhance the robustness of multi-source models for delineating stable and agronomically relevant management zones [
28].
Basic soil property (BSP) analysis is a critical element of precision agriculture, offering direct insight into soil fertility through measurements of nitrogen, phosphorus (P2O5), potassium (K2O), pH, and humus content. These parameters are essential for planning fertilization, selecting suitable crops, and assessing productive potential. BSP is valued for its analytical precision and agronomic relevance, yet it remains constrained by the localized nature of sampling, higher per-sample costs, and laboratory processing time. Consequently, BSP data are often integrated with other sources to support spatial interpolation and produce continuous soil property maps that better reflect within-field variability.
The past few years have witnessed a methodological shift from single-source to multi-layer zonation strategies, where EM38-derived soil structure, Sentinel-2-based vegetation dynamics, and laboratory-measured fertility indicators are combined to form robust models [
12,
29]. These approaches are increasingly favored for their capacity to capture the full spectrum of soil–plant interactions and improve zone stability across different seasons and crop types.
Although previous research highlights the importance of evaluating management zone (MZ) delineation methods—particularly regarding temporal stability and the potential to replace costly field analyses with remote sensing—few studies have jointly examined EM38-MK2, basic soil properties (BSPs), and satellite-derived indices [
30]. Moreover, limited attention has been given to how classification techniques such as kriging, fuzzy k-means, or percentile-based methods influence spatial reliability. Integrating these three data sources offers a more comprehensive representation of field variability: EM38-MK2 captures physical soil traits, BSP reflects chemical composition, and satellite indices characterize vegetation dynamics and biological activity.
This study employed kriging as the principal interpolation method for mapping the spatial distribution of soil properties. As a geostatistical approach, kriging leverages spatial autocorrelation to produce continuous prediction surfaces and is commonly used in delineating management zones in precision agriculture. However, its accuracy is sensitive to sampling density and spatial configuration [
31], with errors increasing when sampling falls below 0.5 points per hectare or when field conditions are heterogeneous [
14]. The method also assumes normally distributed data with stable variance—assumptions often unmet in practice—and relies on robust variogram estimation, which becomes challenging with limited data [
32]. In such cases, kriging may yield overly smoothed outputs that obscure within-field variability. Integrating finer-resolution datasets, such as vegetation indices or apparent electrical conductivity (EC) maps, has been shown to enhance the spatial precision of interpolated zones [
15].
Recent research highlights the value of multi-year satellite data—particularly NDVI during peak vegetation periods—for assessing the temporal consistency of field performance. Excluding outlier years that deviate from typical agroecological patterns improves management zone reliability [
33]. Simplified zoning methods based on a few seasonal observations are gaining traction, especially for small or resource-limited farms [
11]. Integrating proximal data (e.g., EM38-MK2) with Sentinel-2 imagery and applying kriging can reduce sampling demands without sacrificing spatial resolution [
5]. Clustering algorithms like fuzzy c-means and k-means, using vegetation (VI) and bare soil indices (BSIs), have proven effective with high-resolution time series. NDVI correlates well with soil properties such as organic matter, texture, and pH [
12], and its combination with EC data in unsupervised classification has shown promise, although further agronomic validation is needed [
30]. Multivariate geostatistical models that integrate dense and sparse data—such as Vis–NIR, EC, and NDVI—are increasingly employed to delineate zones capturing the full spectrum of soil variability [
34].
This study aims to evaluate various methods for delineating agricultural management zones using data from EM38-MK2 measurements, basic soil properties (BSPs), and satellite-derived indices (VI/BSI), with a focus on assessing their spatiotemporal stability over a five-year period (2018–2022). It further seeks to determine the optimal number of zones and the minimal yet effective set of input parameters for practical application in precision agriculture.
The paper is structured as follows:
Section 2 details the data sources and methodology;
Section 3 presents the comparative results and stability assessments;
Section 4 discusses key findings and their practical implications; and
Section 5 concludes with recommendations for future research and operational deployment.
2. Materials and Methods
This study investigates soil spatial variability in the context of precision agriculture by integrating proximal sensing data, satellite-derived indices, and laboratory-based soil analyses to delineate stable and agronomically relevant management zones. The methodology combines several classification approaches—such as quantile-based partitioning and weighted overlay—with spatial validation metrics including the Cohen Kappa, Jaccard, and Dice coefficients.
This section outlines the applied methodology (
Figure 1), focusing on multi-source data integration, selection of informative input layers, and determination of the optimal number of management zones. Fuzzy k-means clustering was the primary classification method. Its quality was evaluated using the Fuzzy Performance Index (FPI) and Normalized Classification Entropy (NCE). These tools supported informed decision-making in defining functionally relevant management zones.
2.1. Study Area
The study was conducted in the Komareva Humka cadastral municipality, located in southeastern Vojvodina, Serbia. As part of the Belgrade agricultural basin—formerly managed by PKB and now operated by Al Dahra Serbia [
35,
36]—the area is characterized by flat, fertile terrain dominated by alluvial and chernozem soils. This landscape supports intensive field crop production, including cereals, industrial, and forage crops. Long-standing agricultural practices and well-developed irrigation and drainage infrastructure make the region one of Serbia’s key arable zones.
The experimental study covered 10 agricultural parcels in Vojvodina, Serbia, ranging from 16 to 92 hectares in size, with a total area of 378 ha (44°56′ N, 20°29′ E) and elevations between 64 and 80 m a.s.l. (
Figure 2). Field data were collected in July 2022—on the 11th, 15th, and 19th—shortly after wheat and barley harvests. All parcels were under intensive crop production, primarily focused on field crops.
The parcels were chosen based on the availability of synchronized multi-source data, which is rarely accessible under typical field conditions. Specifically, both soil chemical properties (BSPs) and EM38-MK2 electromagnetic measurements were collected on the same day, at nearly the same time, across all fields (
Figure 3). This level of spatial and temporal alignment ensured consistency in data integration and allowed for reliable comparative analysis. Although the parcels included in the study range from 16 to 92 ha, the applied methods—including BSP, EM38-MK2 scanning, interpolation, fuzzy clustering, and weighted overlay—are scalable and applicable to much smaller agricultural plots. Based on practical experience with this workflow, the approach can be reliably applied to plots as small as 0.5 ha. However, due to the 10-m spatial resolution of Sentinel-2 imagery, we recommend using this workflow for parcels above that threshold to avoid degradation of classification accuracy.
2.2. Data Collection
2.2.1. EM38-MK2 Data
Soil apparent electrical conductivity (EC) was measured using the EM38-MK2 sensor (Geonics Ltd., Mississauga, ON, Canada) at four configurations (C1, C0.5, L1, L0.5), corresponding to two depths and dipole modes. The sensor records both conductivity and magnetic susceptibility components, enabling assessment of soil texture, moisture, salinity, and, under certain conditions, organic matter and nutrient distribution [
17]. Thanks to its dual-coil design and enhanced sensitivity, EM38-MK2 offers improved accuracy over earlier models [
37], and it has proven effective in capturing spatial variability and delineating management zones in agricultural fields [
38].
The EM38-MK2 sensor was mounted on a custom sled towed by a vehicle moving at 20 km/h, with transect spacing ranging from 30 to 40 m, depending on parcel shape. This configuration yielded an average point spacing of 8.5 m. On the smallest plot (16 ha), approximately 8000 measurements were recorded, while the largest (90 ha) generated around 27,000 readings (
Figure 3). Although not fully optimized [
38], the sampling ensured consistent and reliable data acquisition across all parcels.
2.2.2. Sampling and Analysis of Basic Soil Properties
Soil sampling followed standardized protocols defined by ISO 18400-102:2017, which recommends systematic or stratified approaches based on field heterogeneity, typically at one composite sample per 1–5 ha [
39]. International practices such as the LUCAS survey [
40], FAO guidelines [
41], and national protocols in Germany [
42] and the U.S. [
43] emphasize spatial representativeness. In Serbia, similar recommendations apply, with 3–5 ha per sample being standard in Vojvodina.
We collected soil samples at a depth of 0–30 cm using automated probes (Duoprob 60-UP and Multiprob 120, Soil Probetechnik Peters GmbH, Quakenbrück, Germany). Each ~5 ha section contained 15–25 evenly distributed points, forming one composite sample. For example, a 16 ha parcel yielded 3 samples, while a 90 ha parcel resulted in 18 composite samples (
Figure 3). Laboratory analysis included pH (KCl), humus (%), total nitrogen (%), available phosphorus (P
2O
5 mg/kg), and potassium (K
2O mg/kg), in line with accredited procedures. Sampling was performed post-harvest, ensuring timely results for fertilizer planning. The methodology aligns with both national recommendations and regional agronomic practice [
44,
45].
2.2.3. Sentinel-2 Satellite Imagery and Indices
In addition to soil sampling and measurements of apparent electrical conductivity, satellite data from the Sentinel-2 platform were collected for all parcels included in this study. To complement field measurements, Sentinel-2 imagery from 2018 to 2022 was used to derive vegetation indices—NDVI, SAVI, LCI—and the Bare Soil Index (BSI) for all studied parcels [
46,
47,
48]. For each season, NDVI time series were interpolated to identify phenologically relevant dates, resulting in five temporal configurations: one (NDVI peak), two (before + peak), three (before, peak, after), four (two before, peak, after), and five dates per year (
Figure 4).
BSI was calculated using three configurations: pre-sowing (minimum NDVI), post-harvest (minimum NDVI), and a combination of both (
Figure 5). These temporally stratified layers supported further zonation and stability analysis.
To address the well-documented issue of NDVI saturation under dense vegetation, we included two additional indices—SAVI and LCI—that mitigate this limitation. SAVI incorporates a soil brightness correction factor to reduce background noise in medium-density vegetation, while LCI leverages Sentinel-2 red-edge bands, which are less prone to saturation and more sensitive to chlorophyll variation. This complementary index selection was intended to enhance the robustness of zonation across different canopy densities.
2.3. Spatial Analysis
Following data acquisition from EM38-MK2 and laboratory soil analysis, spatial interpolation was performed using Ordinary Kriging or Support Vector Machines (SVMs). We applied kriging using five isotropic semivariogram models—linear, linear with threshold, exponential, spherical, and Gaussian—with the optimal model selected via cross-validation [
49]. All raster outputs were normalized to a 0–1 scale and resampled to 10 m spatial resolution to ensure compatibility with Sentinel-2-based VI and BSI layers.
Figure 6 includes a total of twenty-five interpolated and normalized raster layers, grouped into five categories: four EM38-MK2 layers (C0.5, C1, L0.5, L1), five basic soil property layers (humus, pH, P
2O
5, K
2O, nitrogen), one BSI index layer, and five layers each for NDVI, SAVI, and LCI derived from different seasonal dates. All layers were resampled to 10 m spatial resolution and normalized to a 0–1 scale to enable consistent comparison.
Following cloud filtering and date selection, satellite data processing included the generation of temporally integrated vegetation and bare soil index (BSI) layers. For each of the five selected dates, NDVI, SAVI, and LCI were computed and averaged into a composite raster:
Each raster was assigned a temporal weight, depending on the number of available dates—for example, [0.3, 0.4, 0.3] for three dates (VI) or [0.6, 0.4] for BSI (pre- and post-season). Final composites were calculated using the following:
where
is the weight and
the date-specific raster.
Post-processing included a majority filter to reduce noise and normalization to the [0, 1] scale. The resulting temporally balanced rasters served as inputs for subsequent classification and management zone delineation.
2.4. Cluster Analysis
Beyond interpolation, we classified all datasets into management zones (MZs) using fuzzy k-means clustering. To determine the optimal number of classes, Fuzzy Performance Index (FPI) and Normalized Classification Entropy (NCE) were calculated, following established recommendations [
50,
51,
52]. Clustering was performed with a maximum of 100 iterations and a fuzzy exponent of 1.25. Across parcels and data types (BSP and EC), the optimal number of classes typically ranged from 2 to 8, with occasional suggestions up to 10. Consequently, classifications were conducted using 2–8 classes per dataset to analyze spatial variability and identify viable management strategies.
Figure 7 illustrates an example of how the same dataset is classified using an increasing number of management zones. The figure includes 14 sub-images, which display clustering results for the EM38-MK2 C0.5 layer and a BSI layer corresponding to a selected date. Each example shows zonation maps derived using 2 to 8 classes, allowing for a visual comparison of how classification complexity affects the spatial delineation of management zones.
Three classification approaches were applied across all 10 parcels: fuzzy k-means clustering, Weighted Overlay Analysis (WOA), and percentile-based classification. For each data combination, raster layers were generated using 2 to 8 classes, resulting in eight classified rasters per configuration.
For BSP data, two sets of fuzzy classifications were produced: (a) using all chemical properties and (b) humus only. For EM38-MK2, fuzzy classification included all individual depths (C1, C0.5, L1, L0.5), selected combinations (e.g., C1+C0.5), and each parameter separately.
The weighting coefficients used in WOA were based on agronomic relevance and adapted to the specific combination of parameters. For BSP-only configurations, two weighting schemes were used: the first assigned humus, phosphorus (P2O5), and potassium (K2O) a ratio of 0.65:0.23:0.12; the second incorporated pH with adjusted weights (humus 0.63, P2O5 0.21, K2O 0.11, pH 0.05).
In configurations combining BSP with EM38-MK2 ECa layers (C1 and C0.5), a portion of the total weight was allocated to ECa inputs (typically 0.2 each), while the remainder was proportionally distributed across BSP parameters. These included humus, P2O5, K2O, pH, and in some schemes, total nitrogen, depending on the configuration.
This flexible approach ensured that agronomically important variables remained well represented, while incorporating physical soil structure data into the zonation models.
Percentile-based classification was applied to VI and BSI layers across five growing seasons (2018–2022), using one to five selected dates per year. For BSI in 2018, three configurations were analyzed: pre-sowing, post-harvest, and their combination. Each configuration generated 8 classified rasters (2–8 classes) per parcel (
Figure 7).
2.5. Zonation Stability Analysis
To assess the reliability of spatial classifications and delineated management zones (MZ), we conducted a multi-metric stability analysis. Given the importance of spatial consistency in precision agriculture, three complementary methods were applied: (1) systematic grid sampling to examine the effect of sampling resolution [
53,
54,
55,
56]; (2) Cohen’s Kappa coefficient to measure agreement between classified layers while correcting for chance [
55]; and (3) the Jaccard and Dice coefficients to quantify spatial overlap in binary zonations [
57,
58]. This framework enables a comprehensive evaluation of zone robustness, capturing both internal consistency across methods and sensitivity to input configurations and class number. A concise overview of the underlying geostatistical and clustering methods used in this study (e.g., fuzzy k-means, kriging, and agreement indices) can be found in [
50,
51,
55], which offer accessible introductions for readers less familiar with these techniques.
2.5.1. Systematic Grid Sampling
Zone stability was evaluated using systematic grid sampling at intervals of 100 m, 50 m, 20 m, and 10 m across all fields. On the largest parcel (90 ha), this corresponded to 88, 354, 2216, and 8857 sampling points, respectively. Values were extracted from all classified raster layers—including EC, BSP, VI, BSI, and their composites—at each grid point. This approach allowed for consistent comparison of classification outputs across methods and resolutions, minimizing random sampling effects. The extracted data served as input for computing agreement metrics such as the Cohen Kappa, Jaccard, and Dice coefficients [
55,
57,
58].
2.5.2. Kappa Coefficient
We employed Cohen’s Kappa coefficient (κ) as the primary metric to assess agreement between classified raster layers, accounting for chance-level concordance. The Kappa coefficient (
) is defined as follows:
where
is the observed agreement and
the expected agreement under random classification. Values range from −1 (total disagreement) to 1 (perfect agreement), with values near 0 indicating randomness. The Kappa value was calculated across all raster group combinations (EC, BSP, VI/BSI, and their weighted variants), with stratification by class number (2–8) to evaluate the impact of classification granularity on consistency.
2.5.3. Jaccard and Dice
To complement the Kappa analysis and assess spatial similarity of binarized management zones, the Jaccard and Dice coefficients were applied. The Jaccard index (J) quantifies overlap as the ratio of intersection to union:
The Dice coefficient (D) gives more weight to shared areas and is defined as follows:
These metrics were used to evaluate spatial agreement between zones produced by different methods and data combinations (e.g., EC vs. BSP, BSP & WOA vs. VI & BSI), offering additional insight into zone consistency and robustness.
4. Discussion
4.1. Interpretation of Results in the Context of Existing Literature and Practice
The obtained results clearly demonstrate that different data sources and zonation approaches yield distinct patterns of stability, consistency, and spatial uniformity. Consistent with previous studies [
15,
17,
32], classifications using a smaller number of zones (ranging from two to four) provide a higher degree of spatial similarity and lower inter-annual and inter-method variability. This is particularly important for practical implementation in precision agriculture.
The high stability observed in zones derived from the integration of multiple data sources (EC and BSP) through Weighted Overlay Analysis reinforces earlier findings on the effectiveness of multi-criteria approaches in soil zonation [
17]. The results also show that giving more weight to humus leads to better alignment with reference zones, supporting earlier findings [
19] that highlight the importance of soil organic matter.
Vegetation indices derived from Sentinel-2 data demonstrated good consistency, particularly in multi-year zonation analyses. This suggests that, with careful selection of acquisition dates and the combination of multiple indices, Sentinel-2 data can serve as a valid alternative for assessing the spatial structure of agricultural land—an observation also supported by previous studies [
11,
15,
18].
Additionally, similar approaches to multi-year zonation evaluation have been applied by Heidari and Samavati [
33], who introduced the Zoning Dissimilarity Metric (ZDM) to identify years that significantly disrupt the temporal consistency of zones derived from Sentinel-2 NDVI time series. Their analysis shows that excluding so-called “abnormal years,” identified through statistical deviations in their impact on zonation, can substantially enhance the accuracy and robustness of resulting management zones. This underscores the importance of careful date selection and temporal window definition when using satellite-derived indices for zonation purposes.
Although this study primarily employed the Kappa, Jaccard, and Dice coefficients to assess zone stability, future research could incorporate metrics such as ZDM to provide a more quantitative measure of the impact individual years have on zonation patterns.
4.2. Advantages and Limitations of Different Zonation Methods
The use of interpolation methods such as kriging enables the generation of smooth, continuous surfaces suitable for spatial analysis, but their performance is highly sensitive to the distribution and density of sampling points. In contrast, percentile-based classification provides a straightforward categorization of values but does not account for spatial context.
Fuzzy k-means proved to be flexible, letting each pixel belong partly to more than one class while still preserving spatial continuity. Its key advantage lies in its ability to express gradual class membership, though it requires longer processing times and careful parameter tuning.
Weighted Overlay Analysis (WOA) offers a high degree of adaptability and the capacity to integrate multiple data sources. However, the resulting zones are highly sensitive to the selection of weight coefficients, which, if not properly calibrated, can lead to significant variation in the final outputs. For this reason, cautious application and, where possible, validation using reference laboratory data or field measurements are recommended.
Vegetation and bare soil indices (BSIs) enable rapid and cost-effective zonation but are subject to seasonal and interannual variability, as well as the influence of cloud cover and atmospheric conditions. Therefore, their use is best complemented with additional data sources to enhance reliability.
4.3. Recommendations for the Optimal Selection of Data Sources and Methods
Based on the conducted analysis, the following recommendations can be made:
For practical implementation in precision agriculture, an optimal number of management zones lies between three and four, as this provides a balanced trade-off between spatial informativeness and zone stability.
The use of EM38-MK2 data—particularly channel C1 or the combination of C1 and C0.5—proved to be the most effective in detecting soil variability.
BSP data, especially when all analyzed parameters are integrated, yield reliable results and can serve as a reference for calibrating other methods.
VI and BSI derived from Sentinel-2 imagery are useful as complementary inputs, particularly when generated from multiple acquisition dates and weighted appropriately.
The Weighted Overlay Analysis (WOA) method demonstrated high flexibility and produced the most robust results when humus played a dominant role in the weighting scheme.
Raster normalization and the use of spatial resolutions between 20 and 50 meters provide an optimal balance between spatial detail and zone stability.
These recommendations offer concrete guidelines for selecting appropriate methodologies based on specific objectives, data availability, and budget constraints in the context of precision soil management.
4.4. Added Value of Multi-Source Correlation Analysis
The correlation matrices between raster layers—particularly those integrating EM38, BSP, and VI/BSI data—revealed important synergies and limitations in the spatial structure captured by each source. While normalized and interpolated EM38 layers consistently showed strong internal coherence, their correlation with satellite-derived indices was the highest when both data types were aggregated over multiple temporal images. Although time series data require more processing, they help identify stable biophysical patterns beyond just seasonal effects.
A critical observation was that correlation matrices can inform not only zonation stability, but also highlight complementary data sources, as low cross-correlations may reflect distinct underlying spatial processes. Careful matrix interpretation is thus essential to avoid over-integration or misrepresentation of spatial coherence.
4.5. Toward More Interoperable and Transferable Zonation Models
An open question arising from this study is the transferability of the delineation approach across geographies and farming systems. Given the strong internal agreement in weighted overlays (especially humus-dominant schemes), there is an opportunity to adapt this methodology to other regions, such as Dutch polder landscapes, where organic matter and moisture gradients also play a key agronomic role.
To support future replication, the methodological pipeline presented here—combining proximal sensing, remote sensing, and fuzzy clustering—could be adapted into a decision-support module.
4.6. Broader Implications for Decision-Making in Data-Rich and Data-Poor Contexts
This study reinforces the emerging understanding that precision agriculture requires context-specific approaches rather than universal models. In data-rich environments, fully integrated models using all three sources (EC, BSP, VI/BSI) can generate nuanced and robust zones. Conversely, in data-scarce settings, carefully selected VI/BSI composites or single-depth EC readings (e.g., C1) can still yield sufficiently stable zonation maps, enabling broader adoption in smallholder or transitional farming systems.
4.7. Limitations of the Study
While the results presented here demonstrate consistent and interpretable management zone delineation across multiple data sources, several limitations must be acknowledged. First, Sentinel-2-based indices are sensitive to atmospheric interference, cloud cover, and vegetation phenology, which can introduce variability in inter-annual comparisons. Although multi-date compositing helps reduce these effects, some residual noise may persist. Second, interpolation of BSP data is constrained by the spatial density and distribution of samples. In areas with heterogeneous soil structure or sparse sampling (e.g., >5 ha per sample), kriging may oversmooth the data or misrepresent local variability. Finally, classification results depend on selected parameters (e.g., number of classes, weights in WOA), which may limit transferability to other contexts unless locally calibrated. Future work should further explore uncertainty quantification and sensitivity analysis in zonation models.
4.8. Overcoming Operational and Methodological Constraints
This study addressed several key operational and methodological challenges commonly encountered in precision agriculture. By combining data from EM38-MK2 sensors, basic soil properties (BSPs), and Sentinel-2-derived indices, we demonstrated that stable and interpretable management zones can be delineated without the need for high-density sampling or complex ground campaigns. The use of optimized weighting schemes, fuzzy clustering, and temporal composites further enhanced the reproducibility and cost-efficiency of the zonation process.
Moreover, this study identified that a zonation scheme using three to four classes represents an optimal trade-off between spatial detail and operational feasibility. This level of complexity aligns with field-level management capacity, ensuring that resulting maps are both agronomically informative and practically implementable. These findings support the development of practical, scalable methods for soil and crop management that can be adapted to different production settings.
5. Conclusions
This study evaluated multiple approaches for delineating within-field management zones by integrating geophysical (EM38-MK2), chemical (basic soil properties), and spectral (VI/BSI from Sentinel-2) data. Through a multi-step analytical framework—encompassing interpolation, classification, and validation using the Kappa, Jaccard, Dice, and correlation metrics—we demonstrated that the most stable and internally consistent zonations were achieved using weighted combinations, particularly those dominated by humus content. Two- and three-class configurations generally provided the best trade-off between spatial coherence and agronomic interpretability.
A critical observation was that correlation matrices can inform not only zonation stability but also highlight complementary data sources, as low cross-correlations may reflect distinct underlying spatial processes. Careful matrix interpretation is thus essential to avoid over-integration or misrepresentation of spatial coherence.
The findings also reinforce the evolving perspective that precision agriculture must be tailored to specific agronomic and data contexts. In data-rich environments, robust multi-source models are feasible and desirable, allowing for complex, high-resolution zonation. In contrast, in data-scarce scenarios—such as smallholder or transitional farming systems—simpler inputs like two-date BSI composites or single-depth EC readings (e.g., C1) still offer viable pathways toward spatial differentiation and site-specific management.
Additionally, the strong internal agreement within humus-weighted overlays suggests potential for adapting this framework beyond the current study area. The approach may be transferable to other regions with similar soil-climate conditions, such as lowland agricultural regions characterized by high groundwater tables, organic-rich soils, and structured drainage systems common in parts of Northwestern Europe. To support future replication, the presented methodology—based on proximal sensing, remote sensing, and fuzzy clustering—can be modularized into open-source decision-support tools, enabling calibration across sites and farming systems.
Overall, this research highlights the value of integrating heterogeneous data sources and applying scalable analytical workflows for generating actionable management zones. Continued development of flexible, context-aware zonation models remains essential for advancing precision agriculture across diverse landscapes.