1. Introduction
In agricultural optical remote sensing, vegetation indices (VIs) have also been proposed to align the data with the target property better. This is mainly because the VIs can provide an apparent measure of the vegetation cover and can be correlated with critical physical measurements [
5]. Other data acquired within the other parts of the optical band can be merged to get different VIs. One of the most used and the oldest indices is the Normalised Difference Vegetation Index (NDVI) [
6].
Sentinel-2 for precision agriculture has various roles, including monitoring plant health, soil moisture, and nutrient levels. Sentinel-2 also assists with crop rotation and pest control. All these help to monitor various growth phases of plants carefully and make informed interventions to maintain healthy plants and boost agricultural production [
7]. Sentinel-2 has the ability to take imagery in 13 various spectral bands (
Table 1), which are used to calculate various vegetation indices (VIs) [
8].
Given the large number of papers mentioning Sentinel-2, only a small and limited number of studies use the AS7263 sensor. In these studies, authors employ this sensor for non-destructive quality assessment of Siamese oranges by measuring acidity, total soluble solids, and vitamin C [
9] for measuring spectral reflectance of plant leaves and soilless substrates under greenhouse conditions [
10], for developing a low-cost ground-based NDVI sensor for manual and automated monitoring of crop conditions [
11] and for estimating chlorophyll levels in different types of leaves, such as banana, jasmine, mango, rice, and sugarcane [
12]. Unlike other works, which typically use only one sensor node, often with several integrated sensors of the same or different types (such as a combination of AS7262, AS7263, and AS7265x) to optimize measurements at a single point, we advanced by using five, or effectively four, separate sensor SPD nodes as shown in
Figure 1a deployed at different locations as shown in
Figure 1b,c. Despite the limited data set size (
), this approach provided an important spatial distribution of samples for the development and validation of the PLSR model.
Sentinel-2 is successful in a wide range of precision agriculture applications. Studies have shown that Sentinel-2 data can be integrated with multispectral imagery from unmanned aerial vehicles (UAVs) to accurately predict the Green Area Index (GAI) in winter wheat. GAI predictability is highly useful for crop condition monitoring and yield estimation. Despite the potential of Sentinel-2 data for describing average GAI trends, limitations due to the low red-edge band and temporal resolution lag caused by cloud cover hinder the correlation of site-specific variability with single-image datasets [
13].
The development of applications such as AgroShadow [
14], improves the quality of Sentinel-2 data through enhanced shadow detection.
Although Sentinel-2 plays an important role in precision agriculture, its effectiveness is limited by critical constraints, primarily cloud cover and data acquisition delays. Cloud cover significantly reduces the satellite’s ability to collect timely and accurate data on crop conditions [
15], resulting in information gaps needed for agricultural decision-making. Given these challenges, research into alternative technologies that enable faster data acquisition and are less susceptible to environmental influences, such as cloud cover, is underway. One such tool is the use of SPDs, which are cost-effective and offer faster response times than traditional satellite imagery [
16]. SPDs are small spectral sensors that can be mounted directly in fields to detect specific wavelengths of light reflected by vegetation. This approach enables real-time ground monitoring that is not interrupted by clouds. Because they are small and affordable, they can be deployed in large numbers, providing precise and continuous monitoring at the field level.
The introduction of Sentinel-2 imagery has significantly advanced precision agriculture through its multispectral, high-spatial-resolution data. However, the application of Sentinel-2 remains limited due to unpredictable cloud cover and data latency [
7]. Research on new detectors, such as the AMS AS7263 (
https://cdn.sparkfun.com/assets/1/b/7/3/b/AS7263.pdf (accessed on 24 March 2025)), provides an alternative method for extending real-time sensing capabilities in agriculture.
SPDs offer low cost and fast response capability. Limited spatial coverage of SPDs means they can’t provide the full view of big agricultural fields that satellites can. Effective utilization may require a hybrid approach that combines the broad coverage of Sentinel-2 with the cloud coverage, independent real-time data of SPDs. This hybrid technology can enhance the performance of both individual systems, delivering broad-area coverage and high-resolution detail while minimizing the effects of cloud cover and data latency. Our work demonstrates the value of incorporating SPDs into precision agriculture systems to complement Sentinel-2 satellite data. While SPDs lack broad spatial coverage, they offer high temporal resolution and real-time detection independent of environmental conditions. SPDs are useful for the immediate detection of plant spectral responses and for providing instantaneous feedback that satellites can’t offer due to data delays and atmospheric limitations. With a network of such SPDs, we can gather accurate, continuous data that achieves similar spatial coverage as satellites while supplementing their limitations in temporal and environmental fields. In this paper, we are proposing and developing such a network of SPDs that complements satellite data using a hybrid system, where the strengths of both technologies are combined. The integration of these technologies enables an active and better understanding of plant health and environmental conditions, leading to timely and efficient decision-making in agriculture.
Related Work
In the era of remote sensing using satellite data, challenges remain, such as timely data acquisition and cloud cover interference during satellite overpasses. Although previous studies have significantly advanced precision agriculture and vegetation monitoring, limitations connected to temporal resolution and environmental conditions remain. Our research addresses these gaps by proposing a hybrid approach that combines satellite imagery with a low-cost, high-quality SPD featuring six spectral channels. This method enables faster and more reliable data collection, even under bad weather conditions, and is expected to have results comparable to those from satellite-based observations alone. In the following section, we will provide an overview of related research that is similar to ours.
In [
17] different chlorophyll-related VIs from simulated Sentinel-2 images are presented to estimate the fraction of absorbed photosynthetically active radiation (FPAR) of wheat and maize. The study identifies the modified Simple Ratio-2 (mSR2) and the Red-Edge Normalised Difference Vegetation Index (ND705) as the most effective VIs, as they exhibit strong linear correlations with FPAR. Results suggest that high-resolution FPAR mapping can be improved in agriculture, highlighting the advantage of VI that utilise near-infrared and red-edge reflectance. The study contributes to the development of crop-independent FPAR estimation methods, but is limited by not accounting for atmospheric effects in the simulations.
The model proposed in [
18] uses Sentinel-2 VIs in combination with apparent electrical conductivity (ECa) to predict barley yields in spring 2017 in north-east Spain. The results show that the Sentinel-2 and ECa data much improve yield estimates, with the imagery taken in early April showing maximum predictive power with an
of 0.75. The results suggest that such techniques could be applied in agriculture to improve yield estimates and enable more targeted nutrient application.
The paper [
19], proposes a hybrid learning model that uses Random Forests (RF) and Support Vector Machines (SVM) to classify plants from Sentinel-2 imagery. After evaluating 82 VIs, this model outperforms individual RF and SVM models in identifying six distinct plants.
The paper [
20] introduces NDVIRE, a new vegetation index constructed from Sentinel-2 bands. The random forest model is used to develop NDVIRE for predicting forest stand volume (FSV) in the Helan Mountains region of China. NDVIRE is positively correlated with FSV and outperforms traditional vegetation indices.
The authors in [
21] compare the correlations between NDVI and the Soil Adjusted Vegetation Index (SAVI) from Sentinel-2 for vegetation monitoring in Shrirampur, classifying vegetation as dead, stressed, or healthy based on data from November and December 2019.
The study [
22] demonstrates how NDVI relates to nine SAR characteristics, including backscatter coefficients and VI. The results indicate that the strongest association with NDVI is found in VH backscatter, suggesting that SAR data can complement conventional multispectral techniques for crop monitoring.
Monitoring crop growth using NDVI time series from Sentinel-2 and Landsat Operational Land Imager (OLI) [
23] demonstrates how satellite remote sensing techniques, combined with ground data verification, can produce accurate data on plant life phases. Applying NDVI helps determine plant growth and its relationship to climatic factors.
A Deep Neural Network (DNN) is used to predict maize biomass using Sentinel-2 vegetation indices (VIs) and Leaf Area Index (LAI) data [
24]. This study aims to identify the optimal VIs for biomass prediction, clarify the relationship between biomass and LAI, and validate the biomass model using these techniques.
The authors in [
25] examined the land cover patterns of Vellore district using NDVI, multispectral remote sensing, and digital elevation models (DEMs) derived from Landsat TM images. NDVI was used to identify changes in land-cover characteristics over five years, showing significant shifts in land cover, including the conversion of forest and uncultivated areas to other uses.
In this paper [
26] describe how satellite-based vegetation indices (VIs) are implemented in the Agricolus platform to support agriculture. They highlight the use of VIs, such as NDVI, G-NDVI, and SAVI, to monitor crops, mitigate pollution, and optimize crop management.
NDVI and NDMI data analysis in Kyiv, 2017–2021 [
27] examines the relationship between plant health and moisture levels using NDVI and NDMI vegetation indices from Sentinel-2 images. Over five years, the study finds a strong correlation (r = 0.73, r
2 = 0.55) between these indices in urban areas.
The research [
28] introduces two new measures: the cotton Boll Area Ratio Index (BARI) and the cotton Boll Opening Rate Index (BORI). These indices were developed using Sentinel-2 data to improve the estimation of cotton boll opening. The BARI index, in particular, is more accurate than previous methods, enabling more effective and timely resource allocation.
As proposed by [
29], vegetation indices from Sentinel-2 and GPS data on sheep movements can be used to control overgrazing. The study demonstrates how vegetation condition is linked to sheep grazing behavior at different stocking densities and explains how selective grazing leads to increased overgrazing.
The work in [
30] uses vegetation index-derived texture features from Sentinel-2 satellite images to map and estimate growing stock volume (GSV).
The authors work in [
31] suggests an assimilation system validating the decision by using biophysical parameters from Sentinel-2, fAPAR, to irrigate vineyards automatically. Such an assimilation system uses the regulated deficit irrigation (RDI) technique to irrigate efficiently.
As shown in the previous section, and summarized in
Table 2, the use of Sentinel-2 and vegetation indices is a widely researched topic, but none of the existing works address a low-temporal and energy-efficient method for predicting end-of-season crop state.
3. Results
The research integrates remote sensing and IoT-based multispectral SPDs to enhance agricultural monitoring using Sentinel-2 data to derive vegetation indices, particularly NDVI. Field data were collected using customized, low-cost SPDs deployed at multiple locations, as shown in
Table 3, to capture site-specific changes in real time, while Sentinel-2 data were used to validate the field data. Sentinel-2 MSI data are delivered as atmospherically corrected surface reflectance bottom-of-atmosphere products [
33,
34]. This ensures that Sentinel-2 MSI data are not affected by atmospheric conditions and can be directly compared with ground values. SPD measurements were performed directly above crop canopies, without the influence of atmospheric absorption or scattering. Non-additional atmospheric or BRDF correction was necessary for SPD data. Absolute consistency between Sentinel-2 NDVI data and SPD measurements was achieved for comparison purposes by using standard reference targets during SPD data acquisition, following methods described in the literature [
33,
34,
35].
The research examined the correlation between Sentinel-2 vegetation indices and multispectral sensor measurements to assess the effectiveness of these indices in assessing plant health across diverse environmental conditions. The results showed a high correlation (r > 0.85) between the NDVI values from the AS7263 sensor and those from Sentinel-2 imagery, demonstrating the effectiveness of the sensor in closing temporal gaps in the satellite imagery caused by cloud cover, as shown in
Figure 5.
After the field trial concluded, harvesting was completed, and actual agronomic attributes such as yield, grain moisture, and plant height were measured
Table 7. Using these records, along with reflectance attributes and VI extracted by the SPD sensors during the field trial, predictive relationships were established to assess the potential of using ground-based spectral data for non-destructive estimation of major crop attributes. The initiative aimed to assess the potential of SPD-extracted information for accurate forecasting of yield and plant attributes under various growth conditions. The method significantly reduced monitoring latency and provided better spatial and temporal resolution, emphasising the benefits of the hybrid system for precision agriculture. The SPD BE7A00000000304A has been excluded from model training and testing because the data were found to be unreliable during the satellite overpass instances.
Prediction models were developed using partial least squares regression (PLSR) [
36] and validated by five-fold cross-validation, selecting the model with the lowest root mean square error of prediction (RMSEP) [
37]. Predictions were computed as the dot product of the measurement indices and the model coefficients, including an intercept [
38], as shown in Equation (
5). In this equation,
is the intercept term, and
are the regression coefficients from the PLSR model for each predictor variable. The
values are the input variables measured on the SPDs. Six variables are the directly measured spectral bands from AS7263, and the remaining fifteen are normalized vegetation indices calculated according to Equation (
4). The summation thus extends to twenty-one, creating predictions for yield, kernel moisture, and plant height.
Due to the dynamic nature of moisture content in corn kernels, predictions are most useful as relative deviations rather than absolute values, especially during the harvest period when kernel moisture content is below 20%. Predictions outside the realistic moisture content range of 5% to 15% were adjusted by limiting them to these values. The measurements used for model calibration were limited to data collected between 10 a.m. and 12 a.m. to enhance the quality of the predictions.
The data set, consisting of six directly measured spectral variables and fifteen sensor-based predictor variables, was first preprocessed for modeling using linear methods. Each predictor was normalized and filtered to remove outliers caused by false readings or uneven lighting. The analysis focused on three target variables as follows: yield, kernel moisture, and plant height, which were evaluated separately using partial least squares regression (PLSR).
The top four vegetation index coefficients derived from visible and near-infrared wavelengths are shown in
Figure 6. These represent the four new VI-based vegetation indices with the strongest correlation to NDVI_S2. The newly proposed VIs, T (730 nm)/R (610 nm), V (810 nm)/R (610 nm), U (760 nm)/R (610 nm), and W (860 nm)/R (610 nm) can be directly computed from the SPD sensor node data.
The optimal number of components for each target variable was determined using cross-validation (CV), using the six wavelengths measured by the SPD sensor node and the fifteen calculated VIs mentioned in Equation (
4). Model performance was assessed by comparing the true and estimated values of yield, moisture, and height, along with the calculation of Pearson’s correlation coefficient (
Figure 5).
The correlation analysis between and across the individual measurement devices showed strong positive relationships with Pearson correlation coefficients, indicating consistent and reliable agreement of NDVI measurements across the sensor network. The resulting regression coefficients for each optimal model were exported for further analysis.
This difference in the number of data points occurred because satellite overpasses are rare and highly dependent on weather conditions, as mentioned earlier. Regression models were trained on the full dataset of 2781 NDVI-SPD measurements collected using four SPD devices. However, the number of days when Sentinel-2 overpasses coincided with field measurements was very limited, resulting in a low ratio of valid NDVI-SPD to NDVI-S2 pairs. Only measurements that were fully spatially and temporally aligned and passed all quality-control filters were used for independent validation. Despite the limited number of valid, the correlation analysis showed that the aligned data yielded promising relationships. Algorithm 1 presents the implementation of the 5k-fold cross-validation procedure applied to the training dataset, ensuring transparent documentation of the internal validation process for the PLSR models.
| Algorithm 1 PLSR Model Training and Cross-Validation |
Require: Dataset D with predictors , responses Ensure: Optimal PLSR model with components
- 1:
plsr “CV”) - 2:
selectNcomp - 3:
summary - 4:
fitted - 5:
cor - 6:
if then - 7:
coef - 8:
exportCoefficients - 9:
end if
|
Table 8 presents the intercepts and regression coefficients of the developed PLSR models for plant height, moisture, and yield. These are the final predictive equations used to estimate each trait from the spectral input data. Each model in
Table 8 was individually optimized for a specific plant attribute height, moisture content, and yield before being implemented in the SPD devices.
The analysis shows that the predictions for the yield and kernel moisture closely matched the measured data, as indicated by the relatively low MAE and RMSE values in
Table 9. Predictions for the final plant height were less accurate, with higher error values and lower model accuracy than for the other traits. This is likely due to the small sample size and the limited spectral response to end-of-season variations in canopy height.
MAE and RMSE in
Table 10 describe the average magnitude of deviation between the model predictions and the measured values, with RMSE placing greater weight on larger errors and therefore being more sensitive to extreme deviations. R
2 was not calculated because individual treatments and populations contain only a single observation, which prevents a statistically valid assessment of the variance required for determining the coefficient of determination.
Figure 7,
Figure 8 and
Figure 9, and
Table 7 present an application of PLSR to predict yield, moisture content, and canopy height using spectral and sensor data.
Comparative analysis between the anticipated values and actual measurements in
Table 9 indicates that the PLSR models effectively identified the primary patterns in yield, kernel moisture, and plant height across various treatments. Although there were slight variations, more pronounced when plant heights from irrigated and rainfed trials were considered, the individual models performed well in generating predictions within the constraints of time and sample size. Such outcomes indicate that the modeling technique is viable for monitoring crops during the vegetative season and predicting yields.
For the yield shown in
Figure 7, overall model performance was good MAE = 2.52 t ha; RMSE = 2.64 t ha. The worst performance occurred in the Irrigated trial, where the model underestimated the final yield, but estimates under Rainfed conditions and for the nitrogen treatments N75 and NFull closely matched the measured end-of-season values. This indicates that the model accurately captured management-induced variations in yield but may require further calibration under high water availability.
Model predictions for moisture (
Figure 8) generally agreed well with measurements showing MAE = 2.17% and RMSE = 2.39% under rainfed and moderate nitrogen conditions. Overpredictions for the irrigation and NFull treatments showed that the model slightly overresponded to the effects of water and nitrogen input, likely due to the relatively small training sample size used for model calibration.
For plant height (
Figure 9), the model consistently underestimated measurements in both irrigated and rainfed plots; however, predictions for nitrogen treatments (N75 and NFull) closely matched observed values. The relatively high overall error (MAE = 20.35 cm; RMSE = 20.51 cm) may be due to the limited number of calibration samples and the spectral insensitivity observed during late-season canopy development in response to further vegetative growth.
The results of this study show that individual PLSR models were very effective at distinguishing treatment-specific patterns as described in
Table 7. However, the findings also suggest that future studies could benefit from incorporating conservative or regularized models to reduce the risk of overfitting and improve robustness under changing environmental conditions. The PLSR was successful in making reliable end-of-season predictions with high consistency for yield and moisture and moderate precision for plant height. These outcomes validate the model’s ability to track maize growth and to predict key agronomic characteristics across diverse environmental settings.
These field devices are responsible for real-time pre-processing of the raw spectrometric data received via the SPD and apply the preset coefficients to provide real-time predictions. The implementation protocol for these models and the computational tasks performed through the devices are captured in Algorithm 2.
| Algorithm 2 Processing of the SPD data in the field |
- 1:
procedure ProcessSPDData (, ) - 2:
1 Data acquisition and preparation - 3:
extracts the red channel from - 4:
extracts the near-infrared channel from - 5:
Calibrate () - 6:
2nd NDVI calculation from SPD data - 7:
- 8:
3. correlation with satellite NDVI - 9:
Pearson () - 10:
4. calculation of the prediction for height, moisture, and yield values using the dot product - 11:
- 12:
return , - 13:
end procedure
|
The energy analysis in
Table 11 provides a detailed assessment of current consumption, power, and energy in each phase of system operation, shown in
Figure 10. Measurements were taken at a nominal voltage of 3 V, with each phase defined by its duration and current consumption. Based on these data, power values (P = I × U) and energy values (E = P × t) were calculated. During the total active time of 10.141 s, the system consumed approximately 0.8208 W/s of energy, while in the idle state of 600 s, only 0.0046 W/s was consumed, which is less than 0.6% of the total consumption. The average power during the entire period is 0.00023 W, and the average current, calculated from the active phase, is 69.6 µA.
The results confirm that the SPDs have excellent energy efficiency due to minimal consumption in sleep mode and short active intervals of higher energy consumtion during evaluating measured data and running proposed Algorithm 2 every 10 min during daytime. The ratio of idle time to total duration is approximately 98.4%, indicating a dominant influence of the low-power phase on the overall energy balance. This consumption profile is suitable for the proposed SPDs, where long-term operation with minimal energy loss is expected.
The energy analysis indicates the expected lifetime of a system powered by two VARTA Industrial Pro AA batteries connected in series. Under ideal conditions, without significant losses, the system can operate for approximately 4.8 years, as shown in
Figure 11. In real-world situations, accounting for 20% losses and occasional increases in consumption, the expected lifetime ranges from 3 to 4 years. A conservative estimate for a cold environment with increased current consumption gives a minimum duration of about 2 years, which still ensures long-term autonomy for typical IoT and sensor applications. With this life expectancy, the proposed system provides a real-time, cost-effective, and scalable solution for crop monitoring during the vegetation season.