Next Article in Journal
Comprehensive Quality Analysis of Common Vetch (Vicia sativa L.) Varieties Using Image Processing Techniques and Artificial Intelligence
Previous Article in Journal
Removal of Neonicotinoid Residues from Beeswax Using an Eco-Friendly Oxalic Acid Treatment: A Sustainable Solution for Apicultural Decontamination
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

How Do Spectral Scales and Machine Learning Affect SPAD Monitoring at Different Growth Stages of Winter Wheat?

1
College of Intelligent Manufacturing, Anhui Science and Technology University, Chuzhou 233100, China
2
College of Resources and Environment, Anhui Science and Technology University, Chuzhou 233100, China
3
Anhui Engineering Research Center of Smart Crop Planting and Processing Technology, Anhui Science and Technology University, Chuzhou 233100, China
4
Anhui Province Key Laboratory of Functional Agriculture and Functional Food, Anhui Science and Technology University, Chuzhou 239000, China
*
Authors to whom correspondence should be addressed.
Agriculture 2025, 15(23), 2410; https://doi.org/10.3390/agriculture15232410
Submission received: 1 October 2025 / Revised: 10 November 2025 / Accepted: 20 November 2025 / Published: 22 November 2025
(This article belongs to the Section Artificial Intelligence and Digital Agriculture)

Abstract

Chlorophyll serves as a crucial indicator for crop growth monitoring and reflects the health status of crops. Hyperspectral remote sensing technology, leveraging its advantages of repeated observations and high-throughput analysis, provides an effective approach for non-destructive chlorophyll monitoring. However, determining the optimal spectral scale remains the primary bottleneck constraining the widespread application of hyperspectral remote sensing in crop chlorophyll estimation: excessively fine spectral scale readily introduces redundant information, leading to dramatically increased data dimensions and reduced computational efficiency; conversely, overly coarse spectral scale risks losing critical spectral features such as absorption peaks and reflection troughs, thereby compromising model accuracy. Therefore, establishing an appropriate spectral scale that effectively preserves spectral feature information while maintaining computational efficiency is crucial for enhancing the accuracy and practicality of chlorophyll remote sensing estimation. To address this, this study proposes a three-dimensional analytical framework integrating “spectral scale—machine learning algorithm—crop growth stage” to systematically solve the scale optimization problem. Ground-truth measurements and hyperspectral data from five growth stages of winter wheat in Fengyang County, Anhui Province, were collected. Spectral bands sensitive to chlorophyll were analyzed, and four modeling methods—Ridge Regression (RR), K-Nearest Neighbors (KNN), Random Forest (RF), and Support Vector Regression (SVR)—were employed to integrate data from different spectral scales with respective bandwidths of 2, 3, 5, 7, 10, 20, and 50 nanometers (nm). The results evaluated the response characteristics of raw band reflectance to chlorophyll values and its impact on machine learning-based chlorophyll estimation across different spectral scales. Results indicate: (1) Canopy spectra significantly correlated with winter wheat chlorophyll primarily reside in the red and red-edge bands; (2) For single-scale analysis, larger spectral scales (10, 20 nm) enhance monitoring accuracy compared to 1 nm high-resolution data, while medium and small scales (5, 7 nm) may degrade accuracy due to redundant noise introduction. (3) Integrating growth stages, spectral scales, and machine learning revealed optimal monitoring accuracy during the jointing and heading stages using 1–5 nm spectral scales combined with the KNN algorithm. For the booting, flowering, and grain filling stages, the highest accuracy was achieved using 20–50 nm spectral scales combined with either the KNN or RF algorithm. The results indicate that high-precision chlorophyll inversion for winter wheat does not rely on a single fixed model or scale, but rather on the dynamic adaptation of the “scale-model-growth stage” triad. The proposed systematic framework not only provides a theoretical basis for chlorophyll monitoring using multi-platform remote sensing data, but also offers methodological support for future crop-sensing sensor design and data processing strategy optimization.

1. Introduction

Wheat is China’s second-largest staple crop and ranks among the world’s most widely distributed, extensively cultivated, and highest-yielding food crops. China’s wheat production constitutes a vital component of national food security [1], and ensuring stable and high yields has remained a key research focus. Chlorophyll is a crucial pigment enabling crops to absorb solar radiation for photosynthesis [2,3,4]. Chlorophyll content is intrinsically linked to leaf photosynthetic capacity; its concentration, distribution, and variation directly or indirectly reflect plant photosynthetic ability, nutrient status, and vegetation productivity. It serves as a key indicator determining crop growth status, yield levels, and quality [5,6,7]. Consequently, timely and accurate acquisition of chlorophyll content forms the foundation for monitoring crop growth and predicting yield [8,9]. Traditional chemical methods for monitoring chlorophyll content rely on destructive measurements, which are time-consuming, labour-intensive, and limited in scope, making large-scale monitoring and rapid application challenging [10,11,12]. There is therefore an urgent need for a low-cost, efficient, and accurate monitoring method.
In recent years, remote sensing technology has demonstrated extensive applications and promising prospects in crop monitoring and forecasting due to its advantages of large-area simultaneous observation and instantaneous imaging [13,14]. Satellite-derived spectral imagery enables wide-area monitoring but is susceptible to cloud cover, features long revisit periods, and exhibits low spectral and spatial resolution [15]; Unmanned aerial vehicle (UAV) remote sensing, as a significant method for monitoring crop growth, typically employs multispectral sensors at relatively low cost. However, the spectral bands captured are limited in their relevance to crop development. Both approaches suffer from constraints in spectral scale, compromising the accuracy of retrieved variables and hindering the timely and effective detection of early signals of crop stress [16]. Hyperspectral remote sensing technology offers high spectral scale and detailed data. Compared to traditional panchromatic and multispectral remote sensing techniques, it features multiple imaging bands capable of capturing diverse spectral characteristics associated with crop biochemistry and physiological traits [17]. This enables the identification of sensitive bands closely related to crops, facilitating precise and efficient monitoring of spectral changes in crops [18].
Currently, extensive research employs specific spectral band information to construct vegetation indices (VIs) for monitoring crop physiological and biochemical trait parameters, enabling qualitative and quantitative assessments of vegetation vitality and growth dynamics [19]. However, during the integration of multiple spectral bands into VIs, some detailed information from the original bands may be lost [20]. Furthermore, certain VIs exhibit heightened sensitivity only to specific crop types, growth stages, environmental conditions, and monitoring scenarios. They may fail to accurately reflect crop physiological states for other vegetation types or differing ecological environments. Studies such as that by Thenkabail et al. [21] indicate that VIs readily exhibit saturation during later crop growth stages, rendering them ineffective for estimating crop traits. Given these considerations, raw data modelling approaches significantly reduce computational costs and enhance efficiency by selecting characteristic bands strongly correlated with target variables as independent variables. Compared to VIs, raw spectral data minimises error propagation and avoids saturation effects [22].
Direct utilisation of reflectance information from raw spectral bands avoids loss of original spectral characteristics. However, hyperspectral data, comprising hundreds or thousands of narrow spectral bands, faces challenges of data redundancy and the curse of dimensionality [20]. Machine learning algorithms possess the capability to handle complex relationships between predictors and target variables, demonstrating excellent redundancy mitigation effects when addressing multimodal, high-dimensional data. Compared to traditional regression algorithms, they offer significant improvements and have been widely applied in crop phenotyping monitoring [23,24]. Shen et al. [25] monitored winter wheat SPAD values using ground-based hyperspectral data and measured agronomic data. Their findings indicated that partial least squares regression combined with specific preprocessing methods enhanced the accuracy and stability of SPAD predictions. Bhadra et al. [26] combined multiple machine learning algorithms with derivative processing to estimate sorghum chlorophyll content, concluding that support vector machine regression models delivered optimal performance. Xu et al. [27] employed four machine learning algorithms alongside feature selection methods to monitor liquorice chlorophyll content using UAV hyperspectral data. Their findings indicated that the combination of genetic algorithms and random forest models yielded the most favourable results. Zhu et al. [28] employed three machine learning algorithms alongside multivariate linear regression for monitoring maize and wheat LCC, observing superior estimation performance from SVM and RF. Whilst machine learning methods demonstrate robust nonlinear fitting and feature learning capabilities in chlorophyll inversion, spectral scale directly influences both the strength of correlation between feature bands and chlorophyll content, and the model’s feature extraction capacity.
As a primary parameter influencing spectral information, variations in spectral scale hinder direct comparison and fusion of data from different sources [29], thereby limiting assessments of model cross-platform applicability. Jiang’s [30] study employed PROSPECT simulations with stepwise bandwidths from 5 nm to 35 nm, finding that indices such as VIUPD, NDVI, and SRI achieved chlorophyll monitoring accuracies with R2 > 0.88 at a 10 nm spectral resolution. However, Murugan’s [31] research suggested that data with spectral bandwidths below 8 nm yield superior accuracy in estimating chlorophyll a in aquatic environments. Dian [32] proposed that employing spectral scales between 10 and 30 nm yields superior results for chlorophyll inversion at leaf or canopy levels, a scale already comparable to satellite or UAV multispectral data. In summary, although studies have explored optimal scales, they face challenges in determining suitable scales. For instance, Jiang and Dian both set bandwidths in 5 nm increments, while Murugan used multiples of 2 nm. These scale settings are disconnected from actual sensors, featuring a wide range of scale choices that are poorly configured and fail to account for the characteristics of different sensors. Furthermore, they neglect the dynamic changes throughout the entire crop growth period. At different growth stages, vegetation exhibits changing relationships between spectral characteristics, chlorophyll content, and other biochemical/structural parameters. Consequently, the optimal spectral scale may not remain constant throughout the growth cycle. Thus, determining the appropriate scale warrants further investigation. Currently, research systematically examining how spectral scales themselves influence crop chlorophyll inversion accuracy remains scarce. Although studies have assessed differences in how various spectral scales affect crop biochemical parameters, these analyses primarily rely on vegetation indices. Few investigations utilise single-band reflectance data to evaluate SPAD response characteristics across different spectral scales, particularly lacking systematic coupling with machine learning methods for quantitative evaluation across distinct growth stages.
This study utilised winter wheat as its subject, employing 1 nm resolution canopy reflectance data acquired via hyperspectral-ground object spectrometers. Raw spectral data were resampled to seven scales—2, 3, 5, 7, 10, 20, and 50 nm—using Gaussian spectral response functions. Four machine learning methods—ridge regression (RR), K-Nearest Neighbours (KNN), Random Forest (RF), and Support Vector Regression (SVR)—were used to establish chlorophyll content monitoring models for winter wheat across its growth stages. The research framework is illustrated in Figure 1. Compared with existing studies, the innovation of this research lies in the first systematic construction of a three-dimensional analytical framework integrating “spectral scale-growth stage-machine learning algorithm.” This framework aims to: (1) systematically reveal the influence patterns of spectral scale variations on SPAD values; (2) deeply explore the accuracy differences of SPAD inversion models under the coupling of different spectral scales and machine learning algorithms; (3) Dynamically explore the growth stage specificity of optimal “scale-model” combinations. By elucidating the dynamic evolution of optimal spectral scales across growth stages and their coupling mechanisms with different algorithms, this study aims to provide a theoretical basis for high-precision monitoring of wheat chlorophyll using multi-platform remote sensing data, and to offer references for future sensor design and data processing strategy optimization.

2. Materials and Methods

2.1. Experimental Design

The study area is located in Shanhou Village, Fengyang County, Anhui Province (32°53′32″ N, 117°33′55″ E), situated within the Jianghuai Plain and belonging to the hilly terrain of the Jianghuai watershed (Figure 2). The region has an average annual temperature of 14.9 °C and average annual precipitation of 904.4 mm. The study area comprised 45 plots, with five nitrogen gradient treatments (N0: 0 kg/ha, N1: 90 kg/ha, N2: 180 kg/ha, N3: 270 kg/ha, N4: 360 kg/ha), each replicated three times. To mitigate the potential influence of single-variety specificity on the generalisability of assessment results, three representative winter wheat varieties were selected: V1: high-gluten Huai Mai 44, V2: high-gluten Yan Nong 999, and V3: low-gluten Ning Mai 13. Each plot measured 6 m in length and 4.4 m in width, with a plant spacing of 30 cm. Seeds were sown using a drill method. Sample data were collected at different growth stages: Jointing stage (T1: 15 March 2024), heading stage (T2: 9 April 2024), full heading stage (T3: 17 April 2024), flowering stage (T4: 25 April 2024), and grain filling stage (T5: 7 May 2024).

2.2. Data Acquisition

2.2.1. SPAD Data Collection

Using the TYS-4N Plant Nutrient Analyser (Zhejiang Topyun Agricultural Technology Co., Ltd., Hangzhou, China), obtain Soil and Plant Analysis Development (SPAD) data for canopy leaves across five growth stages. Within each plot, ten plants representative of the overall growth condition of the winter wheat were selected. Measurements were taken at the midpoint of the second leaf (the second leaf from the base) on each plant, avoiding the main leaf vein. The average of the ten samples was recorded as the SPAD value for that plot. The second-from-top leaf occupies the mid-canopy region, exhibiting stable physiological conditions. This position avoids the excessive sensitivity of the flag leaf and the premature senescence of basal leaves, thereby reliably reflecting the wheat’s nutritional status.

2.2.2. Canopy Spectral Data Acquisition

Spectral data acquisition was performed using the ASD FieldSpec HandHeld2 handheld ground-based spectrometer (Analytical Spectral Devices, Boulder, CO, USA). Spectral measurements spanned 325 to 1075 nm with 1 nm resolution. Experiments were conducted between 10:00 AM and 1:00 PM local time under clear, windless conditions. Calibration was performed using a standard white plate prior to data collection, followed by subsequent calibrations every three plots to mitigate environmental noise. During measurements, three representative sampling points exhibiting uniform growth conditions were selected per plot. Data were collected once at each point to generate three spectral curves, yielding a total of nine spectral curves per plot. The average of these curves served as the raw spectral data for each plot.

2.3. Data Processing

2.3.1. Resampling of Canopy Hyperspectral Data

Spectral resolution refers to a sensor’s ability to distinguish between adjacent wavelength reflectance signals, typically expressed as bandwidth (Full Width at Half Maximum, FWHM). Significant variations exist in spectral scale characteristics across different remote sensing platforms and optical sensors, directly impacting the accuracy and reliability of crop biochemical parameter monitoring. Ground-based hyperspectral instruments typically feature concentrated spectral scale, with FWHM ranging between 3 nm and 10 nm [33,34,35]. As a vital method for monitoring crop growth, UAV remote sensing, constrained by payload limitations, predominantly employs broadband multispectral sensors. Examples such as the Parrot Sequoia+ and MicaSense Altum typically feature FWHM > 10 nm [36,37]. In contrast, UAV hyperspectral sensors generally exhibit FWHM values ranging from 1 nm to 10 nm [38,39], with certain models displaying lower FWHM values. For instance, the Cubert UHD 185 achieves 25 nm at 850 nm [40]. Satellite remote sensing offers the advantage of large-area observation with a broad spectral scale range. Most sensors exhibit FWHM values between 2.5 nm and 50 nm [41,42,43], with some high-resolution sensors achieving FWHM as low as 0.1 nm and low-resolution sensors reaching up to 1000 nm. Considering the characteristics of each sensor, this study employed Gaussian distribution spectral response functions in Python (3.12.4) to resample raw 1 nm spectral reflectance data at seven spectral scale intervals: 2, 3, 5, 7, 10, 20, and 50 nm. This approach aims to simulate a continuous resolution range from fine (approaching ground/UAV hyperspectral) to coarse (covering UAV multispectral and typical satellite sensors) to quantitatively reveal the influence patterns of spectral scale variations on the estimation accuracy of winter wheat SPAD values. This provides direct evidence for scale matching and model transfer among multi-source remote sensing data. Table 1 presents the resampled spectral scales and corresponding spectral band counts.
ρ j = λ s j λ e j ρ λ S j λ d λ | λ s j λ e j S j λ d λ
Here, ρj denotes the resampled canopy reflectance for band j, ρ(λ) represents the original spectral data, λ is the wavelength (nm), λs(j) and λe(j) denote the start and end wavelengths (nm) for band j, Sj(λ) is the spectral response function, and dλ denotes the integration step size. The Gaussian spectral response function is calculated as follows:
S j λ = e x p λ c j F W H M k 2 l n 2 2
where cj denotes the centre wavelength of band j (nm), and FWHMk (full width at half maximum) represents the simulated spectral scale, σ = F W H M k 2 l n 2 is the standard deviation.

2.3.2. Feature Band Selection

To reduce the dimensionality of hyperspectral data and select feature bands most correlated with winter wheat SPAD values, this study employed Pearson correlation coefficient analysis for feature selection. Pearson correlation analysis was conducted between reflectance data at different spectral scales and SPAD values, identifying the top 20 bands with the highest correlation coefficients at each scale. At the 50 nm spectral scale, only the top 11 bands were selected.

2.3.3. Inversion Model Construction

This study employs a repeated random partitioning validation method, randomly dividing the entire dataset into a training set (70%) and a test set (30%), and independently repeating this process 100 times. In each iteration, the model was trained on the training set and evaluated on the corresponding test set. The final model performance was represented by the mean of the 100 repeated experiments. Four distinct machine learning algorithms were employed to comprehensively assess their coupling effects with spectral scales:
The fundamental principle of the Support Vector Regression (SVR) model is to minimise the structural risk, seeking a balance between data approximation accuracy and the complexity of the approximation function to achieve optimal model generalisation capability. As a typical application of Support Vector Machines (SVM), SVR possesses the ability to prevent overfitting, with its advantage lying in conveying biophysical variables [44].
The Random Forest (RF) algorithm is an ensemble machine learning method based on regression trees, proposed by Leo Breiman [45]. As a Bagging ensemble algorithm utilising decision trees as its fundamental building blocks, RF relies on the assumption that different independent predictions will misclassify instances in distinct regions. By combining these independent predictions, overall prediction accuracy can be enhanced, demonstrating strong performance in training and learning high-dimensional datasets such as hyperspectral data [46,47,48].
K-nearest neighbours (KNN) constitutes a non-parametric supervised learning method. Within a multidimensional feature space, for any given point, the K nearest neighbouring points are first identified. Subsequently, the predicted value is calculated and derived through weighted estimation based on the feature values of these neighbouring points. The kNN method has become highly prevalent in data mining owing to its ease of implementation and notable classification performance [49].
Ridge regression (RR) is a biased estimation regression method designed to address multicollinearity issues among independent variables in small sample data [50]. This method addresses the substantial reduction in regression accuracy and stability caused by multicollinearity among independent variables in conventional regression analysis. Ridge regression abandons the unbiased estimation of ordinary least squares, sacrificing some information. However, its estimated biased regression coefficients often more closely approximate the true situation, thereby enhancing model stability and reliability. It demonstrates favourable effects in repairing and fitting pathological data.

2.3.4. Model Evaluation Criteria

Model performance is evaluated using Root Mean Square Error (RMSE), Coefficient of Determination (R2), and Relative Percent Deviation (RPD). A higher R2 value approaching 1 indicates greater predictive accuracy; a lower RMSE value signifies improved model stability. When RPD < 1.4, the model is deemed unusable; values between 1.4 and 2.0 indicate reasonable reliability; and RPD > 2.0 signifies high reliability. The calculation formulae are as follows:
R 2 = i = 1 n ( y y ¯ ) 2 i = 1 n ( y i y i ¯ ) 2
RMSE = i = 1 n ( y y i ) 2 n
RPD = SD RMSE
where y and yi denote the measured chlorophyll content and the model-inverted predicted value for the sample, respectively; y ¯ and y i ¯ represent the measured mean and the inverted mean value for the sample; n is the number of samples; SD denotes the standard deviation.

3. Results

3.1. Analysis of Canopy Spectral Reflectance Data

Figure 3a presents a box-and-whisker plot of SPAD values for winter wheat at different growth stages, while Figure 3b displays the normal distribution of SPAD values. Combined analysis indicates that SPAD values exhibit an overall trend of initial increase followed by decline as growth progresses, peaking during the heading stage. Figure 4 depicts canopy spectral curves for winter wheat at various growth stages, with spectral reflectance ranging between 0 and 0.5. Due to chlorophyll forming absorption bands in the blue and red regions with relatively weak absorption, the curve exhibits a distinct reflection peak in the green light band (550–600 nm), characteristic of the typical vegetation “green peak” [51]. An absorption trough appears around 680 nm, corresponding to chlorophyll’s strong absorption zone. Reflectance rises sharply between 680 nm and 750 nm, known as the “red edge” effect [52]. Canopy reflectance stabilises within the 750–900 nm range. To validate the spectral integrity after resampling, we compared the curves of typical objects before and after resampling (S1), confirming that their primary spectral features (such as red edges and absorption troughs) were fully preserved without significant reflectance distortion. Before utilizing raw spectral data for modeling, we performed rigorous band filtering to control noise. Due to known severe atmospheric interference and low sensor signal-to-noise ratios in the 325–400 nm and 900–1075 nm bands, these bands were excluded from analysis. Ultimately, all processed input features comprised spectral reflectance data within the 400–900 nm range.

3.2. Correlation Analysis

3.2.1. Single-Band Correlation Analysis at Different Growth Stages

Figure 5 illustrates the variation in correlation coefficients between single-band reflectance and SPAD values across five growth stages as a function of wavelength. Results indicate that the correlation between single-band reflectance and chlorophyll content exhibits consistent trends across all stages. Within the visible spectrum, correlation coefficients predominantly remain negative, with negative correlations within the blue light region (450–495 nm) progressively strengthening as wavelength increases. correlation coefficients fluctuated within the green light region (495–570 nm), with variations in amplitude and position across different growth stages; coefficients in the red light region (620–700 nm) exhibited stable changes, transitioning from negative to positive correlation in the red edge region (700–750 nm) before stabilising in the near-infrared region (760–900 nm). Within the visible spectrum, the maximum positive correlation between leaf SPAD values and spectral reflectance occurred at 750 nm, with a correlation coefficient of 0.696. Within the near-infrared spectrum, the maximum positive correlation between leaf SPAD values and leaf spectral reflectance occurred at 813 nm, with a correlation coefficient of 0.780. As indicated by the yellow data in the figure, the characteristic bands are distributed across different wavelength ranges. The bands exhibiting strong correlations are primarily concentrated between 570 nm and 710 nm, with correlation coefficients ranging from −0.9 to −0.3.
Figure 6 presents the characteristic bands identified at different spectral scales across growth stages alongside their corresponding reflectance values. Combining Figure 4 and Figure 5, the five growth stages exhibit three distinct trends at the 1 nm spectral scale: the jointing stage shows the weakest negative correlation, with correlation coefficients around −0.40, and characteristic bands primarily concentrated in the red light band; The booting stage and flowering stage exhibited similar concentrations of characteristic bands within the red and red-edge bands, though the negative correlation for booting was marginally stronger than for flowering, with correlation coefficients around −0.75; The correlation coefficients for the heading stage and filling stage fell within the same range, both exceeding −0.88. Characteristic bands for the heading stage primarily concentrated in the yellow and red-edge bands, while those for filling mainly focused on the red and red-edge bands.

3.2.2. Correlation Analysis Across Different Spectral Scales at Various Growth Stages

As shown in Figure 6, with increasing sampling width, the characteristic bands corresponding to each spectral scale expanded uniformly in both directions from the 1 nm characteristic band; Except for the heading stage, characteristic bands for all other stages predominantly resided within the red light region. As spectral scale increased, these bands expanded towards the green and blue light regions (leftward) and the red edge and near-infrared regions (rightward). Reflectance values were ranked sequentially as follows: heading stage, filling stage, booting stage, flowering stage, and jointing stage.
Table 2 presents the maximum correlation coefficients between SPAD readings and winter wheat spectral reflectance at different spectral scales, along with their corresponding spectral bands. The findings indicate that the absolute values of maximum correlation coefficients across the five growth stages remain comparable under varying spectral scales, consistent with Cho’s [53] research conclusions. As growth stages progress, the optimal spectral regions consistently reside within the red and red-edge bands. However, differences exist in the specific spectral scales: at 1 nm, 2 nm, 3 nm, and 10 nm scales, the maximum correlation band shifts from the red to the red-edge band before reverting to the red; at 5 nm, it shifts from red to red-edge; at 7 nm, it exhibits a cyclical pattern of red light-red edge-red light-red edge. The 20 nm scale follows a red-yellow-red distribution, whilst the 50 nm scale concentrates entirely on the 650 nm wavelength within the red light region. Across different spectral scales, the core band centre wavelengths primarily cluster around 700 nm in the red edge and 650 nm in the red light region, with the 20 nm and 50 nm scales focusing exclusively on the 650 nm band.

3.3. Machine Learning Model Estimation at Different Spectral Scales

This study employed feature bands selected via Pearson correlation analysis as model input variables. Combining four machine learning algorithms (RR, RF, KNN, SVR), a total of 192 SPAD estimation models were constructed across different growth stages. Model evaluation metrics comprised R2, RMSE, and RPD. Figure 7 presents the test set modelling results for the four machine learning models under varying spectral scales and growth stages. Overall, model accuracy for the entire growth period, booting stage, flowering stage, and filling stage generally increased with higher spectral scale, whereas accuracy declined during the jointing stage and heading stage. Models for the heading and filling stages exhibited higher and more consistent accuracy, with R2 values ranging from 0.70 to 0.95. Models for the jointing stage demonstrated the lowest accuracy, with R2 consistently below 0.50. The estimation accuracy of models for the entire growth period, boot stage, and flowering stage showed greater variability, with R2 values ranging from 0.4 to 0.9.
Comparing the estimation performance of four models for SPAD across different growth stages and spectral scales. The RR model demonstrated optimal performance across all scales during the entire growth period and most scales during the booting stage, with model accuracy significantly improving as the spectral scale increased. Optimal model accuracy was achieved at spectral scales of 20 nm and 10 nm, yielding R2 values of 0.774 and 0.622, respectively—representing increases of 0.108 and 0.034 compared to the 1 nm resolution. The RF model demonstrated superior performance during the filling stage, achieving an R2 of 0.939—an improvement of 0.217 over the 1 nm spectral scale. Its RPD of 3.46 significantly exceeded the reliability threshold. The model also performed well at larger scales (10–50 nm), with R2 values exceeding 0.91 and RPD exceeding 2.98. The KNN model demonstrated significant advantages during the heading and flowering stages (R2 > 0.83). Although the overall mean R2 of the flowering stage KNN model did not exceed that of the RR model, its RMSE and RPD data were superior. The SVR model demonstrated the weakest overall performance (mean R2 = 0.546), with the best R2 across all growth stages reaching only 0.565. During the grain filling stage, the RMSE was as high as 7.328, indicating that under the current experimental setup and data conditions, the predictive efficacy of the SVR model is suboptimal.
Comparing SPAD estimation performance across different growth stages using varying spectral scales and models (Figure 8). Throughout the entire growth period, the RR model at the 20 nm spectral scale demonstrated optimal estimation performance (R2 = 0.77, RMSE = 3.42, RPD = 2.06), followed by the RR model at the 50 nm spectral scale. The optimal model for both the jointing and heading stages was the KNN model at the 1 nm spectral scale. Despite consistent spectral scale and inversion models, significant differences emerged due to distinct growth stages, with R2 values at jointing being lower than those at heading (0.435). The optimal models for the heading and filling stages were both the RF model at a 20 nm spectral scale, yielding inversion accuracies of R2 = 0.617 and 0.939, RMSE = 2.189 and 2.725, and RPD = 1.596 and 3.457, respectively. Although the R2 at 20 nm during the heading stage was lower than that at 10 nm (0.622), it exhibited the lowest RMSE and highest RPD, making it the optimal choice when considering overall accuracy. The optimal model for the flowering stage was the KNN model at a 50 nm spectral scale, achieving an R2 of 0.837. Compared to the original 1 nm data, this resulted in an R2 improvement of 0.09, an RMSE reduction of 3.10, and an RPD increase of 1.33.
In summary, different models are suited to different growth stages, with distinct spectral scales matching specific growth stage characteristics. SPAD inversion must adhere to the triadic relationship of “scale-model-growth stage”.

4. Discussion

4.1. Correlation Analysis Between Canopy Spectral Data and SPAD at Different Growth Stages

Models for estimating SPAD values in winter wheat based on hyperspectral remote sensing data have yielded favourable results in previous studies and gained widespread application [54,55]. Consequently, in-depth analysis of the correlation between canopy spectral data and SPAD values aids in comprehensively understanding winter wheat growth conditions. This study conducted correlation analyses between spectral data and winter wheat SPAD values across different growth stages, revealing variations in the results (Figure 3). The overall trend in spectral data correlation with SPAD values across growth stages was upward. The weakest negative correlation with SPAD was observed during the jointing stage, while correlations strengthened during the booting and heading stages. The highest correlation in the visible spectrum was recorded during heading, with a subsequent decline during flowering. Correlation increased again during filling, reaching its peak in the near-infrared spectrum. The primary reasons are as follows: during the jointing stage, the plant canopy is not yet closed, with low coverage, leading to soil background noise affecting the spectral signal. During the booting and heading stages, the canopy gradually closes, reducing the influence of the soil background and enhancing the correlation. During flowering, floral organs exhibit high reflectance, potentially interfering with and attenuating chlorophyll absorption signals. At filling, leaf senescence occurs with chlorophyll degradation, and nitrogen and carbohydrates are transferred from leaves to grains [56]. Leaf cell structures dehydrate and shrivel, reducing scattering [57], thereby decreasing reflectance and increasing the correlation between canopy spectra and SPAD readings. As shown in Figure 5, the sensitive bands across all five growth stages are concentrated in the red and red-edge regions, primarily due to chlorophyll’s strong absorption of red light. This aligns with findings from Buscaglia [58] and Zhao [59]. Although this study successfully revealed the spectral response patterns of winter wheat SPAD values, the robustness of the model is inevitably influenced by specific years and natural conditions. Therefore, the universality of these conclusions across years and regions requires further validation. Future research will focus on collecting multi-year, multi-location data and integrating environmental variables into the model framework to develop a monitoring model with enhanced spatio-temporal generalization capabilities.

4.2. Interaction Between Spectral Scale and Model Performance

4.2.1. Evaluation of SPAD Model Performance at Different Spectral Scales

Spectral scale conversion aims to adapt spectral wavelength resolution or band response characteristics to suit different spectrometers or specific application requirements [60]. Utilising Gaussian response functions or sensor band characteristics, high-resolution spectral data can be converted into equivalent band reflectance for the target platform. This preserves core spectral features while matching the sensor’s data format. For instance, Li et al. [61] simulated the wide-band spectral reflectance characteristics of GF-1 satellite data based on measured winter wheat hyperspectral data, constructing a remote sensing monitoring model for winter wheat leaf SPAD values at different growth stages using GF-1 data. Z. Oumar et al. [62] resampled field-spectrometer (Fieldspec 3 Pro FR) measurements to the 10 nm calibration bands of the Hyperion sensor, successfully evaluating their predictive capability for Thaumastocoris peregrinus damage. Spectral scale conversion reduces data dimensions, decreases computational complexity and processing time, thereby enhancing analytical efficiency [63].
As discussed in Section 2.3, model accuracy (R2 and RPD) for the entire growth period, booting stage, flowering stage, and filling stage generally exhibited a trend of first decreasing and then increasing with increasing spectral scale, whereas RMSE showed the opposite pattern. Across these four stages, model accuracy changed slowly within the first five spectral scales, with a trough occurring around 3–7 nm and a marked inflection point near 10 nm, followed by accelerated changes thereafter. Model accuracy (R2 and RPD) for the jointing and heading stages exhibited an overall declining trend, with an upward peak around 10 nm, while RMSE showed the opposite variation. Overall, compared to 1 nm high-resolution data, larger spectral scales (10, 20 nm) enhanced the accuracy of SPAD monitoring for winter wheat, whereas medium and small resolutions (5, 7 nm) potentially reduced accuracy. This may occur because large-scale (10, 20 nm) spectral scale enhances signal-to-noise ratio (SNR) through spectral band aggregation, whereas extremely large-scale (>20 nm) resolution utilises only specific bands within the solar spectrum, potentially causing spectral information loss when integrating broader bandwidths [64]; Although medium- and small-scale (5, 7 nm) resolutions show improvement, they fail to effectively filter noise, leading to reduced inversion accuracy. This conclusion aligns with the findings of Thenkabail et al. [65], where excessively narrow bandwidths (<5 nm) significantly impair SNR, while overly broad bandwidths (>30 nm) result in loss of specific narrow-band information. Thenkabail et al. [21] found that for five crops, nine growth stages, and multiple traits, the vast majority of bandwidths closely associated with crop characteristics fell between 10 nm and 20 nm. Furthermore, Broge et al. [66] indicated that the optimal narrow bandwidth is approximately 15 nm.
This study employed only the 1–50 nm scale, with some sensors achieving spectral resolution of 100 nm. Future experimental designs should be more comprehensive, taking into account the characteristics of different sensors and platforms. Sampling should be performed based on their respective spectral response functions to achieve more accurate data simulation.

4.2.2. Performance Evaluation of Machine Learning Algorithms in SPAD Estimation

Previous studies have demonstrated the significant potential of machine learning in predicting chlorophyll content in winter wheat leaves [67,68,69]. This research employed four algorithms—RR, RF, KNN, and SVR—to construct SPAD prediction models for five growth stages of winter wheat based on eight spectral scale feature bands. Results indicate that RR, KNN, and SVR models achieved the highest R2 (0.89, 0.86, 0.88) and RPD (2.94, 2.57, 1.32) at 10 nm during the filling stage. The RF model attained the highest R2 (0.94) and RPD (3.46) at 20 nm during this stage.
A comprehensive comparison of 576 evaluation metrics across different spectral scales, machine learning algorithms, and growth stages identified optimal modelling approaches and accuracies for each scale and stage (Table 3). Results indicate KNN demonstrated superior and consistent performance, achieving the highest accuracy in 21 out of 48 models (43.75%) across scales and stages. The predictive accuracy of the KNN model remained above 0.70 in most cases, reaching a maximum of 0.90. This indicates that the SPAD values predicted by the KNN model exhibit a high correlation with actual values, rendering the predictions reliable. As an instance-based algorithm, KNN relies on the similarity of neighbouring samples in the feature space for prediction, effectively capturing the non-linear relationships between local samples in hyperspectral data [70]. Second was the RR model, achieving 15 best accuracies (31.25%). It demonstrated strong generalisation capability and stability, likely because RR is a linear regression model with L2 regularisation, making it less prone to overfitting and effectively addressing multicollinearity issues in high-dimensional spectral data [71]. Although the RF algorithm demonstrated favourable predictive accuracy at certain large scales (e.g., 10 nm, 20 nm) with R2 exceeding 0.92, it exhibited severe overfitting, resulting in unstable model performance and limited adaptability to the complex and variable conditions encountered in field environments. This may stem from excessive model complexity, leading to overfitting of training data details and noise, thereby compromising generalisation capabilities [72]. Compared to the former model, the SVR model yields overall lower inversion results. This may be attributable to the parameters (C, γ) and kernel function not being optimally tuned, thereby failing to effectively capture the complex relationships within the data [73]. System hyperparameter optimization is an essential approach to mitigate overfitting and enhance model generalization capabilities. Future research will prioritize hyperparameter tuning for systems, employing advanced strategies such as grid search or Bayesian optimization to identify optimal parameter configurations for different algorithms and scenarios. This aims to achieve more stable and robust inversion models. Simultaneously, effectively applying deep learning methods like convolutional neural networks (CNNs) for spectral analysis to enhance model performance and complex data processing capabilities will also be a key focus of future research [74].
When further exploring the application of spectral scales and machine learning methods, ensuring the quality and accuracy of raw data is essential. Hyperspectral data features numerous bands with complex structures and highly correlated information across bands, making it prone to data collinearity and redundancy issues [75]. Appropriate data preprocessing methods, such as first-order derivatives and SG smoothing, must be employed to enhance data precision.

5. Conclusions

This study addressed the core issue of spectral scale variations across multi-platform remote sensing data affecting crop physiological parameter inversion accuracy. By integrating Gaussian resampling techniques with machine learning algorithms, it systematically elucidated the influence mechanism of spectral scale on SPAD inversion for winter wheat. Key conclusions are as follows:
(1)
The correlation between winter wheat canopy spectral data and SPAD values exhibits distinct developmental stage dynamics, with sensitive spectral bands concentrated in the 570–710 nm range. The highest correlation was observed during the heading stage (r = −0.89), while the lowest was recorded during the jointing stage (r = −0.41), indicating that changes in vegetation physiological processes directly influence spectral response intensity.
(2)
Spectral resolution systematically influences model performance. Compared to 1 nm raw data, larger scales like 10 nm and 20 nm effectively suppress noise interference and enhance model robustness. Conversely, intermediate scales such as 5 nm and 7 nm reduce model accuracy due to information redundancy and noise amplification effects.
(3)
This study first clarifies the dynamic adaptation rules among “scale-model-growth stage”: During the jointing and heading stages where vegetative growth and reproductive growth coexist, the 1–5 nm fine scale combined with the KNN algorithm achieves optimal monitoring. In contrast, during the booting stage and the flowering and grain-filling stages dominated by reproductive growth, the 20–50 nm spectral scale combined with either the KNN or RF algorithm proves more suitable.
Spectral scale conversion achieves data comparison and fusion by standardizing multi-source spectral response functions, effectively resolving spectral baseline shifts between sensors. This provides foundational support for cross-platform data fusion and interoperability, playing a crucial role in collaborative satellite-UAV remote sensing analysis to significantly enhance data reliability and model generalization capabilities for large-scale agricultural monitoring [76,77]. Research indicates that when using UAVs for crop monitoring, sensors should be dynamically selected based on growth stages: hyperspectral sensors should be prioritized in early stages to preserve fine spectral information, while multispectral sensors with matched band centers and widths (e.g., 20–50 nm) can be used in later stages to optimize cost and accuracy. Beyond UAV-based near-surface remote sensing, our framework integrates with satellite-based spaceborne remote sensing to achieve precise, timely estimation of multi-scale crop traits. For instance, the Gaofen-5 satellite (VNIR: 5 nm, SWIR: 10 nm) enables direct application of research outcomes to guide satellite data preprocessing and monitoring algorithm selection. Future work will validate the optimal models developed in this study across ecological regions using China’s typical land cover spectral database and multi-regional independent datasets, advancing their integration into operational crop growth monitoring systems.

Author Contributions

Conceptualization, X.Z. and X.L.; methodology, X.Z.; software, X.Z., H.W. and H.Y.; validation, W.W., Y.S., H.W. and H.Y.; formal analysis, Y.S., W.W. and H.W.; investigation, X.Z., Y.N.; resources, X.Z.; data curation, X.Z., H.W.; writing—original draft preparation, X.Z.; writing—review and editing, X.Z., J.L. (Jun Li), J.L. (Jikai Liu), and X.L.; visualization, H.W.; supervision, J.L. (Jikai Liu), and X.L.; project administration, X.L.; funding acquisition, X.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Sub-Project of the National Key Research and Development Program (no. 2022YFD2301402-3); Scientific Research Projects in Higher Education Institutions of Anhui Province (no. 2023AH051855); and Anhui Province University Science and Engineering Teachers Enterprise Practice Program Project (no. 224jsqygz62).

Data Availability Statement

The data presented in this study are available on request from the corresponding author due to the need for follow-up studies.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Lv, Z.; Liu, X.; Cao, W.; Zhu, Y. Climate Change Impacts on Regional Winter Wheat Production in Main Wheat Production Regions of China. Agric. For. Meteorol. 2013, 171–172, 234–248. [Google Scholar] [CrossRef]
  2. Evans, J.R. Photosynthesis and Nitrogen Relationships in Leaves of C3 Plants. Oecologia 1989, 78, 9–19. [Google Scholar] [CrossRef]
  3. Niinemets, Ü.; Tenhunen, J.D. A Model Separating Leaf Structural and Physiological Effects on Carbon Gain along Light Gradients for the Shade-Tolerant Species Acer Saccharum. Plant Cell Environ. 1997, 20, 845–866. [Google Scholar] [CrossRef]
  4. Yoder, B.J.; Pettigrew-Crosby, R.E. Predicting Nitrogen and Chlorophyll Content and Concentrations from Reflectance Spectra (400–2500 Nm) at Leaf and Canopy Scales. Remote Sens. Environ. 1995, 53, 199–211. [Google Scholar] [CrossRef]
  5. Zhen, J.; Jiang, X.; Xu, Y.; Miao, J.; Zhao, D.; Wang, J.; Wang, J.; Wu, G. Mapping leaf chlorophyll content of mangrove forests with Sentinel-2 images of four periods. Int. J. Appl. Earth Obs. Geoinf. 2021, 102, 102387. [Google Scholar] [CrossRef]
  6. Li, W.; Pan, K.; Liu, W.; Xiao, W.; Ni, S.; Shi, P.; Chen, X.; Li, T. Monitoring maize canopy chlorophyll content throughout the growth stages based on UAV MS and RGB feature fusion. Agriculture 2024, 14, 1265. [Google Scholar] [CrossRef]
  7. Feng, Z.; Guan, H.; Yang, T.; He, L.; Duan, J.; Song, L.; Wang, C.; Feng, W. Estimating the canopy chlorophyll content of winter wheat under nitrogen deficiency and powdery mildew stress using machine learning. Comput. Electron. Agric. 2023, 211, 107989. [Google Scholar] [CrossRef]
  8. Yu, Y.; Yu, H.Y.; Li, X.K.; Wang, H.J.; Liu, S.; Zhang, L.; Sui, Y.Y. Hyperspectral inversion model for SPAD of rice leaves based on optimized spectral index. Spectrosc. Spectr. Anal. 2022, 42, 1092–1097. [Google Scholar] [CrossRef]
  9. Nile, W.; Rina, S.; Mula, N.; Ersi, C.; Bao, Y.; Zhang, J.; Tong, Z.; Liu, X.; Zhao, C. Inversion of Leaf Chlorophyll Content in Different Growth Periods of Maize Based on Multi-Source Data from “Sky–Space–Ground. ” Remote Sens. 2025, 17, 572. [Google Scholar] [CrossRef]
  10. Sonobe, R.; Hirono, Y. Carotenoid Content Estimation in Tea Leaves Using Noisy Reflectance Data. Remote Sens. 2023, 15, 4303. [Google Scholar] [CrossRef]
  11. Falcioni, R.; Antunes, W.C.; Demattê, J.A.M.; Nanni, M.R. A Novel Method for Estimating Chlorophyll and Carotenoid Concentrations in Leaves: A Two Hyperspectral Sensor Approach. Sensors 2023, 23, 3843. [Google Scholar] [CrossRef]
  12. Blackburn, G.A. Hyperspectral Remote Sensing of Plant Pigments. J. Exp. Bot. 2006, 58, 855–867. [Google Scholar] [CrossRef]
  13. Dong, T.; Shang, J.; Qian, B.; Liu, J.; Chen, J.; Jing, Q.; McConkey, B.; Huffman, T.; Daneshfar, B.; Champagne, C.; et al. Field-Scale Crop Seeding Date Estimation from MODIS Data and Growing Degree Days in Manitoba, Canada. Remote Sens. 2019, 11, 1760. [Google Scholar] [CrossRef]
  14. He, T.; Xie, C.; Liu, Q.; Guan, S.; Liu, G. Evaluation and Comparison of Random Forest and A-LSTM Networks for Large-Scale Winter Wheat Identification. Remote Sens. 2019, 11, 1665. [Google Scholar] [CrossRef]
  15. Tao, H.; Feng, H.; Xu, L.; Miao, M.; Long, H.; Yue, J.; Li, Z.; Yang, G.; Yang, X.; Fan, L. Estimation of Crop Growth Parameters Using UAV-Based Hyperspectral Remote Sensing Data. Sensors 2020, 20, 1296. [Google Scholar] [CrossRef] [PubMed]
  16. Ang, K.L.-M.; Seng, J.K.P. Big Data and Machine Learning With Hyperspectral Information in Agriculture. IEEE Access 2021, 9, 36699–36718. [Google Scholar] [CrossRef]
  17. Liu, Y. Development of Hyperspectral Imaging Remote Sensing Technology. zggx 2021, 25, 439–459. [Google Scholar] [CrossRef]
  18. Osco, L.P.; Ramos, A.P.M.; Moriya, É.A.S.; Bavaresco, L.G.; Lima, B.C.D.; Estrabis, N.; Pereira, D.R.; Creste, J.E.; Júnior, J.M.; Gonçalves, W.N.; et al. Modeling Hyperspectral Response of Water-Stress Induced Lettuce Plants Using Artificial Neural Networks. Remote Sens. 2019, 11, 2797. [Google Scholar] [CrossRef]
  19. Zhai, Y.; Zhou, L.; Qi, H.; Gao, P.; Zhang, C. Application of Visible/Near-Infrared Spectroscopy and Hyperspectral Imaging with Machine Learning for High-Throughput Plant Heavy Metal Stress Phenotyping: A Review. Plant Phenomics 2023, 5, 0124. [Google Scholar] [CrossRef]
  20. Liang, L.; Huang, T.; Di, L.; Geng, D.; Yan, J.; Wang, S.; Wang, L.; Li, L.; Chen, B.; Kang, J. Influence of Different Bandwidths on LAI Estimation Using Vegetation Indices. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2020, 13, 1494–1502. [Google Scholar] [CrossRef]
  21. Thenkabail, P.S.; Smith, R.B.; De Pauw, E. Hyperspectral Vegetation Indices and Their Relationships with Agricultural Crop Characteristics. Remote Sens. Environ. 2000, 71, 158–182. [Google Scholar] [CrossRef]
  22. Shu, M.; Shen, M.; Zuo, J.; Yin, P.; Wang, M.; Xie, Z.; Tang, J.; Wang, R.; Li, B.; Yang, X.; et al. The Application of UAV-Based Hyperspectral Imaging to Estimate Crop Traits in Maize Inbred Lines. Plant Phenomics 2021, 2021, 9890745. [Google Scholar] [CrossRef]
  23. Liakos, K.; Busato, P.; Moshou, D.; Pearson, S.; Bochtis, D. Machine Learning in Agriculture: A Review. Sensors 2018, 18, 2674. [Google Scholar] [CrossRef] [PubMed]
  24. Weiss, M.; Jacob, F.; Duveiller, G. Remote Sensing for Agricultural Applications: A Meta-Review. Remote Sens. Environ. 2020, 236, 111402. [Google Scholar] [CrossRef]
  25. Shen, L.; Gao, M.; Yan, J.; Wang, Q.; Shen, H. Winter Wheat SPAD Value Inversion Based on Multiple Pretreatment Methods. Remote Sens. 2022, 14, 4660. [Google Scholar] [CrossRef]
  26. Bhadra, S.; Sagan, V.; Maimaitijiang, M.; Maimaitiyiming, M.; Newcomb, M.; Shakoor, N.; Mockler, T.C. Quantifying Leaf Chlorophyll Concentration of Sorghum from Hyperspectral Data Using Derivative Calculus and Machine Learning. Remote Sens. 2020, 12, 2082. [Google Scholar] [CrossRef]
  27. Xu, M.; Dai, J.; Zhang, G.; Hou, W.; Mu, Z.; Chen, P.; Cao, Y.; Zhao, Q. Inversion of Glycyrrhiza Chlorophyll Content Based on Hyperspectral Imagery. Agronomy 2024, 14, 1163. [Google Scholar] [CrossRef]
  28. Zhu, W.; Sun, Z.; Yang, T.; Li, J.; Peng, J.; Zhu, K.; Li, S.; Gong, H.; Lyu, Y.; Li, B.; et al. Estimating Leaf Chlorophyll Content of Crops via Optimal Unmanned Aerial Vehicle Hyperspectral Data at Multi-Scales. Comput. Electron. Agric. 2020, 178, 105786. [Google Scholar] [CrossRef]
  29. Ma, R.F.; Chen, Y.; Hou, P.; Ren, X.Q. Progress on Hyperspectral Remote Sensing Inversion Method for Vegetation Chlorophyll Content. For. Grassl. Resour. Res. 2024, 5, 166–178. [Google Scholar] [CrossRef]
  30. Jiang, H.; Zhang, L.; Yang, H.; Chen, X.; Tong, Q. Research on Spectral Scale Effect in the Estimation of Vegetation Leaf Chlorophyll Content. Guang Pu Xue Yu Guang Pu Fen Xi 2016, 36, 169–176. (In Chinese) [Google Scholar]
  31. Murugan, P.; Sivakumar, R.; Bhat, S.; Annadurai, M. Optimal Bandwidth Selection for Remote Sensing Data to Estimate Chlorophyll-a in Inland Waters. Mangalore Inst. Technol. Eng. Moodbidri India 2017, 21–26. [Google Scholar]
  32. Dian, Y.; Le, Y.; Fang, S.; Xu, Y.; Yao, C.; Liu, G. Influence of Spectral Bandwidth and Position on Chlorophyll Content Retrieval at Leaf and Canopy Levels. J. Indian Soc. Remote Sens. 2016, 44, 583–593. [Google Scholar] [CrossRef]
  33. Stroppiana, D.; Boschetti, M.; Brivio, P.A.; Bocchi, S. Plant Nitrogen Concentration in Paddy Rice from Field Canopy Hyperspectral Radiometry. Field Crops Res. 2009, 111, 119–129. [Google Scholar] [CrossRef]
  34. Yao, X.; Ren, H.; Cao, Z.; Tian, Y.; Cao, W.; Zhu, Y.; Cheng, T. Detecting Leaf Nitrogen Content in Wheat with Canopy Hyperspectrum under Different Soil Backgrounds. Int. J. Appl. Earth Obs. Geoinf. 2014, 32, 114–124. [Google Scholar] [CrossRef]
  35. Wang, F.; Huang, J.; Wang, Y.; Liu, Z.; Peng, D.; Cao, F. Monitoring Nitrogen Concentration of Oilseed Rape from Hyperspectral Data Using Radial Basis Function. Int. J. Digit. Earth 2013, 6, 550–562. [Google Scholar] [CrossRef]
  36. Kopačková-Strnadová, V.; Koucká, L.; Jelének, J.; Lhotáková, Z.; Oulehle, F. Canopy Top, Height and Photosynthetic Pigment Estimation Using Parrot Sequoia Multispectral Imagery and the Unmanned Aerial Vehicle (UAV). Remote Sens. 2021, 13, 705. [Google Scholar] [CrossRef]
  37. Cottrell, B.; Kalacska, M.; Arroyo-Mora, J.-P.; Lucanus, O.; Inamdar, D.; Løke, T.; Soffer, R.J. Limitations of a Multispectral UAV Sensor for Satellite Validation and Mapping Complex Vegetation. Remote Sens. 2024, 16, 2463. [Google Scholar] [CrossRef]
  38. Tong, Q.; Xue, Y.; Zhang, L. Progress in Hyperspectral Remote Sensing Science and Technology in China Over the Past Three Decades. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2014, 7, 70–91. [Google Scholar] [CrossRef]
  39. Vane, G.; Goetz, A.F.H.; Wellman, J.B. Airborne Imaging Spectrometer: A New Tool for Remote Sensing. IEEE Trans. Geosci. Remote Sens. 1984, GE-22, 546–549. [Google Scholar] [CrossRef]
  40. Deng, L.; Yan, Y.; Gong, H.; Duan, F.; Zhong, R. The Effect of Spatial Resolution on Radiometric and Geometric Performances of a UAV-Mounted Hyperspectral 2D Imager. ISPRS J. Photogramm. Remote Sens. 2018, 144, 298–314. [Google Scholar] [CrossRef]
  41. Wu, C.; Niu, Z.; Tang, Q.; Huang, W. Estimating Chlorophyll Content from Hyperspectral Vegetation Indices: Modeling and Validation. Agric. For. Meteorol. 2008, 148, 1230–1241. [Google Scholar] [CrossRef]
  42. Zhao, X.; Xiao, Z.; Kang, Q.; Li, Q.; Fang, L. Overview of the Fourier Transform Hyperspectral Imager (HSI) Boarded on HJ-1A Satellite. In Proceedings of the 2010 IEEE International Geoscience and Remote Sensing Symposium, Honolulu, HI, USA, 25–30 July 2010; IEEE: Honolulu, HI, USA, 2010; pp. 4272–4274. [Google Scholar] [CrossRef]
  43. Praks, J.; Kestila, A.; Hallikainen, M.; Saari, H.; Antila, J.; Janhunen, P.; Vainio, R. Aalto-1-An Experimental Nanosatellite for Hyperspectral Remote Sensing. In Proceedings of the 2011 IEEE International Geoscience and Remote Sensing Symposium, Vancouver, BC, Canada, 24–29 July 2011; IEEE: Vancouver, BC, Canada, 2011; pp. 4367–4370. [Google Scholar] [CrossRef]
  44. Camps-Valls, G.; Bruzzone, L.; Rojo-Rojo, J.L.; Melgani, F. Robust Support Vector Regression for Biophysical Variable Estimation From Remotely Sensed Images. IEEE Geosci. Remote Sens. Lett. 2006, 3, 339–343. [Google Scholar] [CrossRef]
  45. Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
  46. Amirruddin, A.D.; Muharam, F.M.; Ismail, M.H.; Ismail, M.F.; Tan, N.P.; Karam, D.S. Hyperspectral Remote Sensing for Assessment of Chlorophyll Sufficiency Levels in Mature Oil Palm (Elaeis guineensis) Based on Frond Numbers: Analysis of Decision Tree and Random Forest. Comput. Electron. Agric. 2020, 169, 105221. [Google Scholar] [CrossRef]
  47. Lassalle, G.; Ferreira, M.P.; Cué La Rosa, L.E.; Del’Papa Moreira Scafutto, R.; De Souza Filho, C.R. Advances in Multi- and Hyperspectral Remote Sensing of Mangrove Species: A Synthesis and Study Case on Airborne and Multisource Spaceborne Imagery. ISPRS J. Photogramm. Remote Sens. 2023, 195, 298–312. [Google Scholar] [CrossRef]
  48. Soltanikazemi, M.; Minaei, S.; Shafizadeh-Moghadam, H.; Mahdavian, A. Field-Scale Estimation of Sugarcane Leaf Nitrogen Content Using Vegetation Indices and Spectral Bands of Sentinel-2: Application of Random Forest and Support Vector Regression. Comput. Electron. Agric. 2022, 200, 107130. [Google Scholar] [CrossRef]
  49. Zhang, S.; Li, X.; Zong, M.; Zhu, X.; Wang, R. Efficient kNN Classification With Different Numbers of Nearest Neighbors. IEEE Trans. Neural Netw. Learn. Syst. 2018, 29, 1774–1785. [Google Scholar] [CrossRef]
  50. McDonald, G.C. Ridge Regression. WIREs Comput. Stats 2009, 1, 93–100. [Google Scholar] [CrossRef]
  51. Chen, D.; Zhang, F.; Tan, M.L.; Chan, N.W.; Shi, J.; Liu, C.; Wang, W. Improved Na+ Estimation from Hyperspectral Data of Saline Vegetation by Machine Learning. Comput. Electron. Agric. 2022, 196, 106862. [Google Scholar] [CrossRef]
  52. Zhu, C.; Ding, J.; Zhang, Z.; Wang, J.; Wang, Z.; Chen, X.; Wang, J. SPAD Monitoring of Saline Vegetation Based on Gaussian Mixture Model and UAV Hyperspectral Image Feature Classification. Comput. Electron. Agric. 2022, 200, 107236. [Google Scholar] [CrossRef]
  53. Cho, M.A.; Skidmore, A.K.; Atzberger, C. Towards Red-edge Positions Less Sensitive to Canopy Biophysical Parameters for Leaf Chlorophyll Estimation Using Properties Optique Spectrales Des Feuilles (PROSPECT) and Scattering by Arbitrarily Inclined Leaves (SAILH) Simulated Data. Int. J. Remote Sens. 2008, 29, 2241–2255. [Google Scholar] [CrossRef]
  54. Silva-Perez, V.; Molero, G.; Serbin, S.P.; Condon, A.G.; Reynolds, M.P.; Furbank, R.T.; Evans, J.R. Hyperspectral Reflectance as a Tool to Measure Biochemical and Physiological Traits in Wheat. J. Exp. Bot. 2018, 69, 483–496. [Google Scholar] [CrossRef] [PubMed]
  55. Yang, X.; Yang, R.; Ye, Y.; Yuan, Z.; Wang, D.; Hua, K. Winter Wheat SPAD Estimation from UAV Hyperspectral Data Using Cluster-Regression Methods. Int. J. Appl. Earth Obs. Geoinf. 2021, 105, 102618. [Google Scholar] [CrossRef]
  56. Liu, S.; Hu, Z.; Han, J.; Li, Y.; Zhou, T. Predicting Grain Yield and Protein Content of Winter Wheat at Different Growth Stages by Hyperspectral Data Integrated with Growth Monitor Index. Comput. Electron. Agric. 2022, 200, 107235. [Google Scholar] [CrossRef]
  57. Zhao, Q.L.; Zang, H.C.; Li, G.Q.; Qang, M.; Zhao, W.; Zheng, G.Q. Study on wheat maturity monitoring model based on Lab color space. Jiangsu Agric. Sci. 2023, 51, 194–200. (In Chinese) [Google Scholar] [CrossRef]
  58. Buscaglia, H.J.; Varco, J.J. Early Detection of Cotton Leaf Nitrogen Status Using Leaf Reflectance. J. Plant Nutr. 2002, 25, 2067. [Google Scholar] [CrossRef]
  59. Zhao, D.; Raja Reddy, K.; Kakani, V.G.; Read, J.J.; Carter, G.A. Corn (Zea mays L.) Growth, Leaf Pigment Concentration, Photosynthesis and Leaf Hyperspectral Reflectance Properties as Affected by Nitrogen Supply. Plant Soil 2003, 257, 205–218. [Google Scholar] [CrossRef]
  60. Gao, C.; Li, H.; Wang, J.; Zhang, X.; Huang, K.; Song, X.; Yang, W.; Feng, M.; Xiao, L.; Zhao, Y.; et al. Combined Use of Spectral Resampling and Machine Learning Algorithms to Estimate Soybean Leaf Chlorophyll. Comput. Electron. Agric. 2024, 218, 108675. [Google Scholar] [CrossRef]
  61. Li, F.L.; Wang, L.; Liu, J.; Chang, Q. Remote sensing estimation of SPAD value for wheat leaf based on GF-1 data. Trans. Chin. Soc. Agric. Mach. 2015, 46, 273–281. [Google Scholar] [CrossRef]
  62. Oumar, Z.; Mutanga, O.; Ismail, R. Predicting Thaumastocoris Peregrinus Damage Using Narrow Band Normalized Indices and Hyperspectral Indices Using Field Spectra Resampled to the Hyperion Sensor. Int. J. Appl. Earth Obs. Geoinf. 2013, 21, 113–121. [Google Scholar] [CrossRef]
  63. Jin, X.; Song, K.; Du, J.; Liu, H.; Wen, Z. Comparison of Different Satellite Bands and Vegetation Indices for Estimation of Soil Organic Matter Based on Simulated Spectral Configuration. Agric. For. Meteorol. 2017, 244–245, 57–71. [Google Scholar] [CrossRef]
  64. Ollinger, S.V. Sources of Variability in Canopy Reflectance and the Convergent Properties of Plants. New Phytol. 2011, 189, 375–394. [Google Scholar] [CrossRef] [PubMed]
  65. Thenkabail, P.S. Optimal Hyperspectral Narrowbands for Discriminating Agricultural Crops. Remote Sens. Rev. 2001, 20, 257–291. [Google Scholar] [CrossRef]
  66. Broge, N.H.; Leblanc, E. Comparing Prediction Power and Stability of Broadband and Hyperspectral Vegetation Indices for Estimation of Green Leaf Area Index and Canopy Chlorophyll Density. Remote Sens. Environ. 2001, 76, 156–172. [Google Scholar] [CrossRef]
  67. Su, X.; Nian, Y.; Shaghaleh, H.; Hamad, A.; Yue, H.; Zhu, Y.; Li, J.; Wang, W.; Wang, H.; Ma, Q.; et al. Combining Features Selection Strategy and Features Fusion Strategy for SPAD Estimation of Winter Wheat Based on UAV Multispectral Imagery. Front. Plant Sci. 2024, 15, 1404238. [Google Scholar] [CrossRef]
  68. Wang, Q.; Chen, X.; Meng, H.; Miao, H.; Jiang, S.; Chang, Q. UAV Hyperspectral Data Combined with Machine Learning for Winter Wheat Canopy SPAD Values Estimation. Remote Sens. 2023, 15, 4658. [Google Scholar] [CrossRef]
  69. Wang, J.; Zhou, Q.; Shang, J.; Liu, C.; Zhuang, T.; Ding, J.; Xian, Y.; Zhao, L.; Wang, W.; Zhou, G.; et al. UAV- and Machine Learning-Based Retrieval of Wheat SPAD Values at the Overwintering Stage for Variety Screening. Remote Sens. 2021, 13, 5166. [Google Scholar] [CrossRef]
  70. Yan, N.; Xie, Q.; Qin, Y.; Wang, Q.; Lv, S.; Zhang, X.; Li, X. Inversion of SPAD Values of Pear Leaves at Different Growth Stages Based on Machine Learning and Sentinel-2 Remote Sensing Data. Agriculture 2025, 15, 1264. [Google Scholar] [CrossRef]
  71. Forrester, J.B.; Kalivas, J.H. Ridge Regression Optimization Using a Harmonious Approach. J. Chemom. 2004, 18, 372–384. [Google Scholar] [CrossRef]
  72. Zhu, T. Analysis on the Applicability of the Random Forest. J. Phys. Conf. Ser. 2020, 1607, 012123. [Google Scholar] [CrossRef]
  73. Üstün, B.; Melssen, W.J.; Buydens, L.M.C. Visualisation and Interpretation of Support Vector Regression Models. Anal. Chim. Acta 2007, 595, 299–309. [Google Scholar] [CrossRef] [PubMed]
  74. Pullanagari, R.R.; Dehghan-Shoar, M.; Yule, I.J.; Bhatia, N. Field Spectroscopy of Canopy Nitrogen Concentration in Temperate Grasslands Using a Convolutional Neural Network. Remote Sens. Environ. 2021, 257, 112353. [Google Scholar] [CrossRef]
  75. Zhang, Y.; Hui, J.; Qin, Q.; Sun, Y.; Zhang, T.; Sun, H.; Li, M. Transfer-Learning-Based Approach for Leaf Chlorophyll Content Estimation of Winter Wheat from Hyperspectral Data. Remote Sens. Environ. 2021, 267, 112724. [Google Scholar] [CrossRef]
  76. Omia, E.; Bae, H.; Park, E.; Kim, M.S.; Baek, I.; Kabenge, I.; Cho, B.-K. Remote Sensing in Field Crop Monitoring: A Comprehensive Review of Sensor Systems, Data Analyses and Recent Advances. Remote Sens. 2023, 15, 354. [Google Scholar] [CrossRef]
  77. Roth, K.L.; Roberts, D.A.; Dennison, P.E.; Peterson, S.H.; Alonzo, M. The Impact of Spatial Resolution on the Classification of Plant Species and Functional Types within Imaging Spectrometer Data. Remote Sens. Environ. 2015, 171, 45–57. [Google Scholar] [CrossRef]
Figure 1. Research framework diagram.
Figure 1. Research framework diagram.
Agriculture 15 02410 g001
Figure 2. Winter wheat experimental area.
Figure 2. Winter wheat experimental area.
Agriculture 15 02410 g002
Figure 3. Analysis of chlorophyll content in winter wheat at different growth stages. (a) presents the statistical description, and (b) shows the normal distribution of chlorophyll content across different growth stages.
Figure 3. Analysis of chlorophyll content in winter wheat at different growth stages. (a) presents the statistical description, and (b) shows the normal distribution of chlorophyll content across different growth stages.
Agriculture 15 02410 g003
Figure 4. Canopy spectral curves at different growth stages.
Figure 4. Canopy spectral curves at different growth stages.
Agriculture 15 02410 g004
Figure 5. Correlation analysis at different growth stages, with highlights indicating characteristic bands at different growth periods.
Figure 5. Correlation analysis at different growth stages, with highlights indicating characteristic bands at different growth periods.
Agriculture 15 02410 g005
Figure 6. Correlation coefficients between single-band spectral reflectance and SPAD values at eight different spectral scales following feature screening.
Figure 6. Correlation coefficients between single-band spectral reflectance and SPAD values at eight different spectral scales following feature screening.
Agriculture 15 02410 g006
Figure 7. Estimation accuracy of SPAD values of winter wheat based on machine learning at different growth stages. The gradient blue bar charts represent RMSE data, the dotted green lines denote R2 data, and the gradient orange bar charts indicate RPD data. The numbers 1, 2, 3, 5, 7, 10, 20, and 50 represent eight different spectral scale datasets.
Figure 7. Estimation accuracy of SPAD values of winter wheat based on machine learning at different growth stages. The gradient blue bar charts represent RMSE data, the dotted green lines denote R2 data, and the gradient orange bar charts indicate RPD data. The numbers 1, 2, 3, 5, 7, 10, 20, and 50 represent eight different spectral scale datasets.
Agriculture 15 02410 g007
Figure 8. Measured and predicted SPAD values estimated by the optimal spectral scale and machine learning algorithm at different growth stages. The blue line represents the 1:1 reference line (estimated value = measured value), while the red line represents the model-fitted regression line.
Figure 8. Measured and predicted SPAD values estimated by the optimal spectral scale and machine learning algorithm at different growth stages. The blue line represents the 1:1 reference line (estimated value = measured value), while the red line represents the model-fitted regression line.
Agriculture 15 02410 g008
Table 1. Resampled spectral scale and corresponding number of bands.
Table 1. Resampled spectral scale and corresponding number of bands.
Spectral Scale (nm)Number of BandsSpectral Scale (nm)Number of Bands
1501771
22511051
31672026
51015011
Table 2. Maximum correlation coefficients between single-band spectral reflectance and SPAD values across eight spectral scales following feature screening. Note: MAX represents the maximum correlation coefficient.
Table 2. Maximum correlation coefficients between single-band spectral reflectance and SPAD values across eight spectral scales following feature screening. Note: MAX represents the maximum correlation coefficient.
Spectral Scale(nm)JointingBootingHeadingFloweringFilling
BandMaxBandMaxBandMaxBandMaxBandMax
1412−0.520615−0.746701−0.889698−0.732651−0.877
2412−0.429696−0.744702−0.887642−0.731642−0.875
3411−0.408696−0.743702−0.887642−0.730651−0.875
5640−0.404695−0.741700−0.885695−0.730695−0.874
7644−0.403616−0.735700−0.885644−0.729693−0.874
10640−0.403620−0.734700−0.885640−0.729650−0.873
20640−0.401620−0.734580−0.884640−0.728640−0.873
50650−0.398650−0.728600−0.883650−0.726650−0.872
Table 3. The best model under different spectral scales and growth stages.
Table 3. The best model under different spectral scales and growth stages.
Spectral Scale
(nm)
Entire PeriodJointingBootingHeadingFloweringFilling
R2RMSERPDR2RMSERPDR2RMSERPDR2RMSERPDR2RMSERPDR2RMSERPD
10.674.241.660.415.731.210.592.561.370.852.062.490.623.741.450.754.801.96
20.674.241.660.375.811.190.582.601.340.832.192.340.633.731.450.754.801.96
30.684.201.670.345.891.180.582.631.330.822.242.290.633.731.450.754.761.98
50.694.101.710.296.001.150.562.331.500.822.252.270.633.731.450.774.572.06
70.723.931.790.236.281.100.562.331.500.822.252.280.623.861.410.754.771.97
100.763.531.990.305.971.160.622.541.380.822.252.280.633.911.390.932.753.42
200.773.412.060.196.391.080.622.191.600.802.312.210.823.171.710.942.723.46
500.773.492.010.186.411.080.602.371.470.782.412.120.842.332.330.913.172.98
Note: Grey cells denote the RR model, green cells denote the KNN model, and blue cells denote the RF model. Bolded text indicates the optimal model under different growth stages.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zhu, X.; Li, J.; Sheng, Y.; Wang, W.; Wang, H.; Yang, H.; Nian, Y.; Liu, J.; Li, X. How Do Spectral Scales and Machine Learning Affect SPAD Monitoring at Different Growth Stages of Winter Wheat? Agriculture 2025, 15, 2410. https://doi.org/10.3390/agriculture15232410

AMA Style

Zhu X, Li J, Sheng Y, Wang W, Wang H, Yang H, Nian Y, Liu J, Li X. How Do Spectral Scales and Machine Learning Affect SPAD Monitoring at Different Growth Stages of Winter Wheat? Agriculture. 2025; 15(23):2410. https://doi.org/10.3390/agriculture15232410

Chicago/Turabian Style

Zhu, Xueqing, Jun Li, Yali Sheng, Weiqiang Wang, Haoran Wang, Hui Yang, Ying Nian, Jikai Liu, and Xinwei Li. 2025. "How Do Spectral Scales and Machine Learning Affect SPAD Monitoring at Different Growth Stages of Winter Wheat?" Agriculture 15, no. 23: 2410. https://doi.org/10.3390/agriculture15232410

APA Style

Zhu, X., Li, J., Sheng, Y., Wang, W., Wang, H., Yang, H., Nian, Y., Liu, J., & Li, X. (2025). How Do Spectral Scales and Machine Learning Affect SPAD Monitoring at Different Growth Stages of Winter Wheat? Agriculture, 15(23), 2410. https://doi.org/10.3390/agriculture15232410

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop