Advancements in Non-Destructive Detection of Biochemical Traits in Plants Through Spectral Imaging-Based Algorithms: A Systematic Review
Abstract
:1. Introduction
2. Materials and Methods
2.1. Data Collection
2.1.1. Search Methodology
2.1.2. Selection Process
2.1.3. Table Contents
2.1.4. Risk of Bias
3. Sensor Technologies in Chemical and Elemental Detection
3.1. Hyperspectral Imaging
3.2. Multispectral Imaging
3.3. Spectrometers
3.4. Data Acquisition Methods in Spectral Imaging: Principles and Instrumentation
4. Results
- Low risk: Clearly described and well-defined methods, consistent procedures.
- High risk: Methods unclear, inconsistent, or potentially biased
- Unclear risk: Insufficient details to assess risk confidently.
4.1. Chlorophyll Content Estimation
4.2. Carotenoids Content Estimation
Plant | Spectral Range and Instrument | Dimensionality Reduction Techniques | Accuracy and Technique | Number of Samples | Ref. |
---|---|---|---|---|---|
Lettuce (baby leaf) | Hyperspectral Camera: 350 nm–2500 nm | Data dimensionality: First-order derivative (FDR) of reflectance was used to find the 13 most important principal components. | Mulitvariate PLS R = 0.9099 NRMSE = 0.4147. Multivariate PCR R = 0.9094 NRMSE = 0.4159. Multivariate ANN with normalized full HSI data R = 0.8424 NRMSE = 0.3737. | 300 | [22] |
Spinach | Dual-channel diode array spectrometer: 305 nm–1800 nm | Data dimensionality: Three methods: Competitive adaptive reweighed sampling (CARS) was used to eliminate the irrelevant variables of PLSR. VIs: PRI, PRI 500, PRI norm, RDVI, VOGREI3, NDVI, mrNDVI, TCARI, REIP, WI calculated from the spectrum Parameters based on inflection points (IPs) of the spectrum (first and second derivatives) | CARS-PLSR R2 = 0.89 RMSEP = 0.10 Best VI linear model: REIP R2 = 0.86 p < 0.001 Best IP linear model: IP1 R2 = 0.86 p < 0.001 | 1120 | [27] |
Lettuce | Fiberoptic spectrometer: 350 nm–2500 nm RGB Camera | Data dimensionality: Spectral vegetation indices (SVI) and color vegetation indices (CVI) were calculated from the hyperspectral data and used as inputs for the prediction models. | AutoML (XGBoost, All Indices)
R2 0.93 RMSE N/A AutoML (GRVI) R2 0.91 RMSE 5.50 AutoML (CIgreen) R2 0.90 RMSE 6.15 AutoML (MCARI) R2 0.88 RMSE 6.43 AutoML (NDVI) R2 0.89 RMSE 6.20 Random Forest (All Indices) R2 0.91 RMSE N/A Random Forest (GRVI) R2 0.89 RMSE 5.81 Random Forest (NDVI) R2 0.78 RMSE 8.00 PLSR (GRVI) R2 0.82 RMSE 7.40 PLSR (NDVI) R2 0.79 RMSE 7.44 BPNN (CIgreen) R2 0.87 RMSE 6.23 BPNN (NDVI) R2 0.84 RMSE 6.83 SVM (GRVI) R2 0.81 RMSE 8.81 SVM (NDVI) R2 0.77 RMSE 7.94 | 3600 fiberoptic measurement 800 RGB | [30] |
Lettuce | Hyperspectral camera: 390.57 nm–1008.6 nm | Data dimensionality:
Three Methods: - First derivative of the reflectance to identify band regions with the most differences - PLSR and PCA to identify the bands with more weight - VIP score of a predictor variable over “1” | Best results of each method over four lettuce species cultivated with two different systems each (8 datasets)
Results for hydroponically grown black-seeded Simpson lettuce: FDR: Rp2 = 0.95 RMSE = 1.05 PLSR/PCA: Rp2 = 0.94 RMSE = 1.171 VIP-Score Index Rp2 = 0.99 RMSE = 0.03 However, there is high performance variability depending on the dataset | Not specified | [37] |
Plant | Spectral Range and Instrument | Dimensionality Reduction Techniques | Accuracy and Technique | Number of Samples | Ref. |
---|---|---|---|---|---|
Lettuce | Hyperspectral Camera: 400 nm–1000 nm | Data dimensionality: Spectral index (CI700) from the HRI | Does not provide specific accuracy metrics. It discusses the relationships between hyperspectral indices (CI700, PSRI) and physiological indicators of senescence | Not specified | [21] |
Lettuce | Double-beam spectrophotometer: 340 nm–900 nm. | Not specified | Correlation between the measured reflectance and transmittance spectrum and the one estimated from PROSPECT-D model, which uses pigment quantities as input to see if a model inversion is possible. Only a graph is given for the result. | Not specified | [23] |
Spinach | Hyperspectral camera: 400 nm–1000 nm 900 nm–1700 nm | Not specified | Unpackaged Spinach Leaves
Best performing model
Singletask regression model VNIR: BPNN Chla R2 0.8263 RMSE 0.2321 CNN Chlb R2 0.8527 RMSE 0.1296 CNN Chlt R2 0.8053 RMSE 0.1560 NIR: CNN Chla R2 0.7525 RMSE 0.3319 CNN Chlb R2 0.8514 RMSE 0.1994 PLSR Chlt R2 0.8050 RMSE 0.3484 Multitask regression model Best performing model: MTCNN VNIR: Chla R2 0.8683 RMSE 0.2034 Chlb R2 0.8416 RMSE 0.1306 Chlt R2 0.8681 RMSE 0.3064 Best model: MTPLSR NIR: Chla R2 0.8046 RMSE 0.2268 Chlb R2 0.8123 RMSE 0.1291 Chlt R2 0.8188 RMSE 0.3426 Packaged Spinach Leaves Best performing model Singletask regression model VNIR: BPNN Chla R2 0.9616 RMSE 0.1827 BPNN Chlb R2 0.9358 RMSE 0.3229 BPNN Chlt R2 0.9576 RMSE 0.4516 NIR: PLSR Chla R2 0.8870 RMSE 0.2930 PLSR Chlb R2 0.8514 RMSE 0.1994 PLSR Chlt R2 0.8797 RMSE 0.4797 Multitask regression model Best performing model: MTPLSR VNIR: Chla R2 0.9452 RMSE 0.2852 Chlb R2 0.9221 RMSE 0.2062 Chlt R2 0.9396 RMSE 0.4856 Best model: MTPLSR NIR: Chla R2 0.8874 RMSE 0.3932 Chlb R2 0.8460 RMSE 0.2563 Chlt R2 0.8763 RMSE 0.6421 | Unpackaged spinach leaves 120 pictures. Packaged spinach leaves 150 pictures | [24] |
Plant | Spectral Range and Instrument | Dimensionality Reduction Techniques | Accuracy and Technique | Number of Samples | Ref. |
---|---|---|---|---|---|
Spinach | Hyperspectral camera: 470–900 nm | Data dimensionality: Attention embedded model | Spectral-feature model: PLS R2 0.8078 RMSE 3.0282 RPD 2.2903 SVM R2 0.8308 RMSE 2.8408 RPD 2.4414 1D ResNet R2 0.8457 RMSE 2.7128 RPD 2.5566 Spatial-feature model 2D ResNet R2 0.8495 RMSE 2.6789 RPD 2.5889 Spectral-spatial-feature 3D CNN models: C3D R2 0.8672 RMSE 2.5170 RPD 2.7554 3D ResNet R2 0.8786 RMSE 2.4204 RPD 2.8654 3D SqueezeNet R2 0.8740 RMSE 2.4519 RPD 2.8286 3D MobileNet R2 0.8729 RMSE 2.4621 RPD 2.8169 Attention embedded model 3D ResNet + channel + band R2 0.8998 RMSE 2.1865 RPD 3.1719 | 720 | [25] |
Lettuce | Fiber optic spectrometer: 200 nm–1100 nm (400 nm–980 nm considered) | Data Dimensionality: Three spectral indices were used (Pg, NDVI705, Pg/Pr) | Regression analysis Pg R2 = 0.677, F-value = 12.569 NDVI705 R2 = 0.789, F-value = 22.407 Pg/Pr R2 = 0.755, F-value = 40.009 | 45 | [26] |
Basil | Hyperspectral camera: 470 nm–900 nm | Not specified | Preprocessing (SG, SG + MSC, SG + SNV, SG + VSN) combined with machine learning methods (PLS, SVM, RF) Best Results: RF-SG-SNV R2 calibration: 0.905, RMSEC: 0.094 R2 prediction: 0.852, RMSEP: 0.120, RPD: 2.614 RF-SG-MSC R2 calibration: 0.878, RMSEC: 0.106 R2 prediction: 0.838, RMSEP: 0.126, RPD: 2.480 RF-SG-VSN R2 calibration: 0.857, RMSEC: 0.115 R2 prediction: 0.841, RMSEP: 0.124, RPD: 2.536 | 324 | [28] |
Basil | Hyperspectral camera: 470 nm–900 nm | Data dimensionality: The most important bands were selected by using an attention mechanism-based module in front of a 3D CNN (3D ResNet). For comparison, other band selection methods were used (SPA, GA, 2B-CNN). | The proposed model (attention band + 3D ResNet) was tested against other machine learning methods (SVM, 1D-CNN, 2D-CNN, 3D ResNet) using the reduced data (all method combinations). Best results obtained with the proposed model using the attention-based reduction: R2 = 0.912 RMSE = 2.046 | 540 hyperspectral images | [53] |
Plant | Spectral Range and Instrument | Dimensionality Reduction Techniques | Accuracy and Technique | Number of Samples | Ref. |
---|---|---|---|---|---|
Lettuce | Hyperspectral camera: 387 nm–1003 nm (437 nm–919 nm considered) | Data dimensionality: An attention module was put in front of a 1D-CNN to identify the most important spectral bands SPA algorithm was used as comparison band selection method. | One-dimensional-CNN Attention Module R2: 0.762 RMSE: 1.868 PLSR (Full spectrum) R2: 0.771 RMSE: 2.107 RF (Full spectrum) R2: 0.781 RMSE: 2.19 SPA + FDR + PLSR: R2: 0.742 RMSE: 2.440 SPA + FDR + RF: R2: 0.714 RMSE: 2.567 | 478 | [29] |
Lettuce | Hand-held spectrometer: 400 nm–1000 nm | Data dimensionality: The most important spectral bands were selected using uninformative variable elimination (UVE) combined with PLS. Results were compared with 30 SVI (MDATT especially) | UVE-PLS: R2 0.834 RMSE 38.58 mg m−2 MDATT: R2 0.736 RMSE 48.54 mg m−2 | 90 | [39] |
Lettuce | Multispectral camera: 475 nm (blue), 560 nm (green), 668 nm (red), 717 nm (red edge), and 840 nm (NIR). | Usage of VIs | Correlation between the most significant VIs and chlorophyll content. Chlorophyll a BNDVI R 0.89 GNDVI R 0.90 NDVI R 0.90 Chlorophyll b BNDVI R 0.89 GNDVI R 0.90 NDVI R 0.90 | Not specified | [41] |
Basil | Hyperspectral camera: 400 nm–1000 nm | Data dimensionality: Vegetative Indices (MND705, MND750/700) were calculated from the spectra. ANOVA analysis was used to see the significance of these parameters. | Weak correlation between the VIs (MND705, MND750/700) and total chlorophyll due to the uneven spectra of the purple basil. Results: MND705 R2 = 0.1822 MND750/700 R2 = 0.1539 | Not specified | [19] |
Lettuce | Fiberoptic spectroradiometer: 360 nm–1000 nm | Data dimensionality: Reflectance Indices (ChlRI, SIPI, R800, PRI, ARI) were calculated | Linear regression was performed for the significant RI Chl (a + b): ChlRI R2 = 0.910 p = 0.0002 Car/Chl SIPI R2 = 0.72 p = 0.0077 | 60 | [31] |
Lettuce | Hyperspectral camera: 400 nm–1000 nm | Data dimensionality: Four linear regression fit models were used to identify the 24 most significant wave bands (out of 448). The inputs of these models were reflectance, the first derivative of the reflectance, and two spectral indices (RSI, NDSI). After that, the bands were reduced to 22 due to the very small improvement of the regression models using 24 bands. | PLSR R2 = 0.97 RMSE = 2.34 PCA R2 = 0.96 RMSE = 2.53 | 28 | [32] |
Plant | Spectral Range and Instrument | Dimensionality Reduction Techniques | Accuracy and Technique | Number of Samples | Ref. |
---|---|---|---|---|---|
Lettuce | Hyperspectral Camera: 350 nm–2500 nm | Data dimensionality: First-order derivative (FDR) of reflectance was used to find the 13 most important principal components. | Multivariate PLS R = 0.655 NRMSE = 0.4254 Multivariate PCR R = 0.6229 NRMSE = 0.4301 Multivariate ANN with normalized full HSI data R = 0.7857 NRMSE = 0.611 | 300 | [22] |
Basil | Hyperspectral camera: 400 nm–1000 nm | Data dimensionality: Vegetative Index (CRI700) was calculated from the spectra. ANOVA analysis was used to see the significance of this parameter. | Weak correlation for CRI700 due to the shielding effect of anthocyanins. Results: CRI700 R2 = 0.0479 | Not specified | [19] |
Lettuce | Hyperspectral Camera: 400 nm–1000 nm | Data dimensionality: Spectral index (PSRI) from the HRI | The document does not provide specific accuracy metrics. It discusses the relationships between hyperspectral indices (CI700, PSRI) and physiological indicators of senescence | Not specified | [21] |
Lettuce | Double-beam spectrophotometer: 340 nm–900 nm. | Not specified | Correlation between the measured reflectance and transmittance spectrum and the one estimated from PROSPECT-D model, which uses pigment quantities as input to see if a model inversion is possible. Only a graph is given for the result. | Not specified | [23] |
Spinach | Hyperspectral camera: 400 nm–1000 nm 900 nm–1700 nm | Not specified | Unpackaged Spinach Leaves Best performing model Single-task regression model VNIR: BPNN Car R2 0.7110 RMSE 0.2482 NIR: CNN Car R2 0.7523 RMSE 0.0348 Multitask regression model MTCNN VNIR: Car R2 0.8133 RMSE 0.0397 MTPLSR NIR: Car R2 0.7375 RMSE 0.0360 Packaged Spinach Leaves Best performing model Single-task regression model VNIR: BPNN Car R2 0.8436 RMSE 0.0802 NIR: PLSR Car R2 0.7619 RMSE 0.0565 Multitask regression model Best performing model: MTPLSR VNIR: Car R2 0.8644 RMSE 0.0426 MTPLSR NIR: Car R2 0.8167 RMSE 0.0552 | Unpackaged spinach leaves
120 pictures per camera Packaged spinach leaves 150 pictures per camera | [24] |
Plant | Spectral Range and Instrument | Dimensionality Reduction Techniques | Accuracy and Technique | Number of Samples | Ref. |
---|---|---|---|---|---|
Lettuce | Multispectral camera: 475 nm (blue), 560 nm (green), 668 nm (red), 717 nm (red edge), and 840 nm (NIR). | Usage of VIs | Correlation between the most significant VIs and carotenoid content. BNDVI R 0.90 GNDVI R 0.89 NDVI R 0.90 | Not specified | [41] |
Lettuce | Fiberoptic spectroradiometer: 360 nm–1000 nm | Data dimensionality: Reflectance Indices (ChlRI, SIPI, R800, PRI, ARI) were calculated | Linear regression was performed for the significant RI Car/Chl SIPI R2 = 0.72 p = 0.0077 | 60 | [31] |
4.3. Anthocyanin Content Estimation
4.4. Nitrogen Nitrate and Nitrite Content Estimation
4.5. Additional Biochemical Traits Detected
- Glucose and sugar are energy storage and indicators of plant photosynthetic efficiency and are responsible for the sweetness and flavor profile of leafy greens [60]. Spectral measurements allow non-destructive quantification [37], which could improve understanding of metabolic processes and optimize quality traits in lettuce, basil, and spinach.
- Phosphorus is an important micronutrient needed by plants in large quantities. It helps with energy transfer through ATP (adenosine triphosphate) and with the formation of nucleic acid and cell membranes. It is essential in various tasks, such as plant growth, root development, and photosynthesis [61]. Adequate phosphorus levels are needed for maintaining metabolic balance and optimizing yield [61]. A non-destructive monitoring can lead to more precise management of nutrient inputs, which would lead to more efficient application of fertilizers. By doing so, costs and environmental impacts can be reduced.
- Potassium is a micronutrient that plays a role in regulating water balance in plants, activates enzymes, and has a significant role in photosynthesis [61]. It plays a crucial role in stomatal regulation, which helps plants efficiently control water loss and gas exchange; these processes are essential for maintaining plant hydration and nutrient transport. In leafy greens such as basil, lettuce, and spinach, potassium supports plant vigor, improves resistance to stress [61].
- Vitamin C (ascorbic acid) is a powerful antioxidant in plants; it is responsible for protecting them against oxidative stress that can be caused by excessive lighting, drought, and pollution [62]. By aiding in detoxification of reactive oxygen and maintaining cellular health, vitamin C is crucial for the plant defense mechanism [62,63]. In leafy greens such as lettuce, spinach, and basil, ascorbic acid contributes to the overall well-being of the plant and enhances the nutritional quality [62]. For plant development, it helps with enzymatic processes and growth regulations [62].
- Water Content is essential to determine plant hydration and freshness. Water supports photosynthesis and transports nutrients throughout the plant [64]. Insufficient amounts of water can lead to wilting, weak plant health, and reduced growth [65]. Excessive water can affect root functions and make the plant more susceptible to diseases [66].
- Calcium is a plant nutrient required in cell wall structure and membranes, which gives tissue integrity and strength [67]. Adequate calcium levels prevent physiological disorders and improve the structural quality of leaves, ensuring durability and freshness [61]. Calcium also contributes to the general resilience of plant tissues, helping them withstand mechanical stress [61].
- Soluble solid content (SSC) refers to the concentration of dissolved solids in plant tissues, primarily sugars, organic acids, and other soluble compounds
- Magnesium is a key component of chlorophyll and a cofactor for many enzymatic reactions [61].
- Sulfur is pivotal for protein synthesis, enzyme activity, and chlorophyll production [61].
- Cadmium and Lead are heavy metals that pose significant risks to plant health and therefore to human consumption.
5. Discussion
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Abbreviations
3D CNN | 3D Convolutional Neural Networks [68] |
3D CNN3D | Convolutional Neural Networks [68] |
ANN | Artificial Neural Network [69] |
BPNN | Back-Propagation Neural Network [70] |
BRNN | Bidirectional RNN [71] |
CARS | Competitive adaptive reweighed sampling [72] |
CARS-PLSR | Least Squares Regression |
(with Partial Competitive Adaptive Reweighted Sampling) [72,73] | |
CMT | Cubist model tree [74] |
CR | Continuum removal [75,76] |
DT | De-trending [77] |
EL | Ensemble Learning [78] |
EvalML | Automated Machine Learning for model evaluation and selection [79] |
FDR | First-order derivative of reflectance [80] |
GA | Genetic Algorithm [81] |
GRU | Gated Recurrent Unit [82] |
IM | Inception Module [83] |
IRIV | Iteratively retaining informative variables [84] |
KELM | Kernel-based extreme learning machine [85] |
LDA | Linear Discriminant Analysis [86] |
LSTM | Long Short-Term Memory [87] |
LR | Logistic Regression [88] |
MLP | Multilayer Perceptron [89] |
MPLS | Modified Partial Least Squares Regression [90] |
MSC | Multiplicative Scatter Correction [91] |
MVRMZ | Multivariate Regression Models [92] |
NB | Naive Bayes [93] |
OSTD | Optimal-Spec-TD Model |
PCA | Principal Components Analysis [94] |
PCR | Principal Components Regression [95] |
PLS-DAP | Partial Least Squares Discriminant Analysis [96] |
PLSR | Partial Least Squares Regression [73] |
R2 | Metrics of determination coefficient [97] |
R2cv | Coefficient of Determination for Cross Validation R2 (CV) [97] |
ReliefF | Feature Selection Algorithm [98] |
RNN | Recurrent Neural Networks [99] |
RM and AM | Residual Module and Attention Module [100] |
RMSE | Root mean Square Error [97] |
RPD | Residual Predictive Deviations [97] |
RPDcv | Ratio of the Standard Deviation [101] |
SGB | Stochastic Gradient Boosting [102] |
SG | Savitzky Golay Smoothing [103,104] |
SG 125 and 127 | Savitzky Golay Derivative [103] |
SA-1DCNN | Self-adjusted One-Dimensional Convolutional Neural Network [68] |
SDR | Second-order derivative of reflectance [80] |
SECV | Standard Error of Cross Validation [97] |
SNV | Standard Normal Variate [77] |
SPA | Successive projections algorithm [105] |
SVM | Support Vector Machine [106] |
SVR | Support Vector Regression [107] |
UVE | Uninformative variable elimination [108] |
UVE-PLS | uninformative variable elimination PLS [96,108] |
VIP | Variable Importance Projection VIP [109] |
VISSA | Variable iterative space shrinkage approach [110] |
VSN | Variable Sorting for Normalization [111] |
WT-SR | Wavelet transform combined with stepwise regression [112] |
Y(II) | Effective Quantum Yield of PSII Y(II) [113] |
Vegetation Indices: | |
ARI | Anthocyanin Reflectance Index |
CDM | Color Distance Model |
CF | Chlorophyll Fluorescence |
ChlRI | Chlorophyll Reflectance Index |
CVIs | Color Vegetation Indices |
DRS | Diffuse Reflectance Spectroscopy |
GI | Greenness Index |
MRNDVI | Modified Red Normalized Difference Vegetation Index |
NDSI | Normalized Difference Spectral Index |
NDVI | Normalised Difference Vegetation Index |
NDVI705 | Normalized Difference Vegetation Index at 705 nm |
PRI | Photochemical Reflectance Index |
REIP | Red Edge Inflection Point |
RSI | Spectral index |
SC | Spectral Characteristics (parameters) |
SVIs | Spectral Vegetation Indices |
VOGREI3 | Vogelman Red Edge Index 3 |
WI | Water Index |
References
- Mercanoglu Taban, B.; Halkman, A.K. Do leafy green vegetables and their ready-to-eat [RTE] salads carry a risk of foodborne pathogens? Anaerobe 2011, 17, 286–287. [Google Scholar] [CrossRef]
- UN General Assembly. Transforming Our World: The 2030 Agenda for Sustainable Development; United Nations: New York, NY, USA, 2015. [Google Scholar]
- Mishra, P.; Polder, G.; Vilfan, N. Close range spectral imaging for disease detection in plants using autonomous platforms: A review on recent studies. Curr. Robot. Rep. 2020, 1, 43–48. [Google Scholar] [CrossRef]
- Li, L.; Zhang, Q.; Huang, D. A review of imaging techniques for plant phenotyping. Sensors 2014, 14, 20078–20111. [Google Scholar] [CrossRef]
- Mishra, P.; Sadeh, R.; Ryckewaert, M.; Bino, E.; Polder, G.; Boer, M.P.; Rutledge, D.N.; Herrmann, I. A generic workflow combining deep learning and chemometrics for processing close-range spectral images to detect drought stress in Arabidopsis thaliana to support digital phenotyping. Chemom. Intell. Lab. Syst. 2021, 216, 104373. [Google Scholar] [CrossRef]
- Humplík, J.F.; Lazár, D.; Husičková, A.; Spíchal, L. Automated phenotyping of plant shoots using imaging methods for analysis of plant stress responses–a review. Plant Methods 2015, 11, 1–10. [Google Scholar] [CrossRef]
- Ouzzani, M.; Hammady, H.; Fedorowicz, Z.; Elmagarmid, A. Rayyan AI—Systematic Review Tool. 2024. Available online: https://www.rayyan.ai/ (accessed on 10 December 2024).
- Mishra, P.; Asaari, M.S.M.; Herrero-Langreo, A.; Lohumi, S.; Diezma, B.; Scheunders, P. Close range hyperspectral imaging of plants: A review. Biosyst. Eng. 2017, 164, 49–67. [Google Scholar] [CrossRef]
- Ceamanos, X.; Valero, S. Processing hyperspectral images. In Optical Remote Sensing of Land Surface; Elsevier: Amsterdam, The Netherlands, 2016; pp. 163–200. [Google Scholar]
- Hernandez-Palacios, J.; Randeberg, L.L.; Baarstad, I.; Løke, T.; Skauli, T. Hyperspectral low-light camera for imaging of biological samples. In Proceedings of the 2010 2nd IEEE Workshop on Hyperspectral Image and Signal Processing: Evolution in Remote Sensing, Reykjavik, Iceland, 14–16 June 2010; pp. 1–4. [Google Scholar]
- Goel, M.; Whitmire, E.; Mariakakis, A.; Saponas, T.S.; Joshi, N.; Morris, D.; Guenter, B.; Gavriliu, M.; Borriello, G.; Patel, S.N. HyperCam: Hyperspectral imaging for ubiquitous computing applications. In Proceedings of the 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing, Osaka, Japan, 9–11 September 2015; pp. 145–156. [Google Scholar]
- Kawakami, R.; Matsushita, Y.; Wright, J.; Ben-Ezra, M.; Tai, Y.W.; Ikeuchi, K. High-resolution hyperspectral imaging via matrix factorization. In Proceedings of the IEEE CVPR 2011, Colorado Springs, CO, USA, 20–25 June 2011; pp. 2329–2336. [Google Scholar]
- Pan, Z.; Healey, G.; Prasad, M.; Tromberg, B. Face recognition in hyperspectral images. IEEE Trans. Pattern Anal. Mach. Intell. 2003, 25, 1552–1560. [Google Scholar]
- Cao, X.; Yue, T.; Lin, X.; Lin, S.; Yuan, X.; Dai, Q.; Carin, L.; Brady, D.J. Computational snapshot multispectral cameras: Toward dynamic capture of the spectral world. IEEE Signal Process. Mag. 2016, 33, 95–108. [Google Scholar] [CrossRef]
- Cantrell, K.M.; Ingle, J.D. The SLIM spectrometer. Anal. Chem. 2003, 75, 27–35. [Google Scholar] [CrossRef]
- SM245: The Choice for High Speed Data Acquisition Applications—SM245 Highspeed CCD Spectrometer—Spectral Products—spectralproducts.com. Available online: https://www.spectralproducts.com/SM245 (accessed on 18 April 2025).
- Micasence. Available online: https://support.micasense.com/hc/en-us/articles/1500007828482-Comparison-of-MicaSense-Cameras (accessed on 18 April 2025).
- Specim. Available online: https://www.specim.com/technology/why-are-specim-cameras-line-scan-push-broom-cameras/ (accessed on 18 April 2025).
- Proshkin, Y.A.; Smirnov, A.A.; Semenova, N.A.; Dorokhov, A.S.; Burynin, D.A.; Ivanitskikh, A.S.; Panchenko, V.A. Assessment of ultraviolet impact on main pigment content in purple basil (Ocimum basilicum L.) by the spectrometric method and hyperspectral images analysis. Appl. Sci. 2021, 11, 8804. [Google Scholar] [CrossRef]
- Liu, C.; Yu, H.; Liu, Y.; Zhang, L.; Li, D.; Zhang, J.; Li, X.; Sui, Y. Prediction of Anthocyanin Content in Purple-Leaf Lettuce Based on Spectral Features and Optimized Extreme Learning Machine Algorithm. Agronomy 2024, 14, 2915. [Google Scholar] [CrossRef]
- Solovchenko, A.; Shurygin, B.; Kuzin, A.; Solovchenko, O.; Krylov, A. Extraction of Quantitative Information from Hyperspectral Reflectance Images for Noninvasive Plant Phenotyping. Russ. J. Plant Physiol. 2022, 69, 144. [Google Scholar] [CrossRef]
- Eshkabilov, S.; Simko, I. Assessing Contents of Sugars, Vitamins, and Nutrients in Baby Leaf Lettuce from Hyperspectral Data with Machine Learning Models. Agriculture 2024, 14, 834. [Google Scholar] [CrossRef]
- Cammarisano, L.; Graefe, J.; Körner, O. Using leaf spectroscopy and pigment estimation to monitor indoor grown lettuce dynamic response to spectral light intensity. Front. Plant Sci. 2022, 13, 1044976. [Google Scholar] [CrossRef]
- He, M.; Jin, C.; Li, C.; Cai, Z.; Peng, D.; Huang, X.; Wang, J.; Zhai, Y.; Qi, H.; Zhang, C. Simultaneous determination of pigments of spinach (Spinacia oleracea L.) leaf for quality inspection using hyperspectral imaging and multi-task deep learning regression approaches. Food Chem. X 2024, 22, 101481. [Google Scholar] [CrossRef] [PubMed]
- Zhu, F.; Cai, J.; He, M.; Li, X. Channel and band attention embedded 3D CNN for model development of hyperspectral image in object-scale analysis. Chemom. Intell. Lab. Syst. 2022, 224, 104537. [Google Scholar] [CrossRef]
- Zhou, L.; Zhou, L.; Wu, H.; Kong, L.; Li, J.; Qiao, J.; Chen, L. Analysis of Cadmium Contamination in Lettuce (Lactuca sativa L.) Using Visible-Near Infrared Reflectance Spectroscopy. Sensors 2023, 23, 9562. [Google Scholar] [CrossRef]
- Rubo, S.; Zinkernagel, J. Exploring hyperspectral reflectance indices for the estimation of water and nitrogen status of spinach. Biosyst. Eng. 2022, 214, 58–71. [Google Scholar] [CrossRef]
- Zhu, F.; Qiao, X.; Zhang, Y.; Jiang, J. Analysis and mitigation of illumination influences on canopy close-range hyperspectral imaging for the in situ detection of chlorophyll distribution of basil crops. Comput. Electron. Agric. 2024, 217, 108553. [Google Scholar] [CrossRef]
- Ye, Z.; Tan, X.; Dai, M.; Chen, X.; Zhong, Y.; Zhang, Y.; Ruan, Y.; Kong, D. A hyperspectral deep learning attention model for predicting lettuce chlorophyll content. Plant Methods 2024, 20, 22. [Google Scholar] [CrossRef]
- Taha, M.F.; Mao, H.; Wang, Y.; ElManawy, A.I.; Elmasry, G.; Wu, L.; Memon, M.S.; Niu, Z.; Huang, T.; Qiu, Z. High-throughput analysis of leaf chlorophyll content in aquaponically grown lettuce using hyperspectral reflectance and RGB images. Plants 2024, 13, 392. [Google Scholar] [CrossRef] [PubMed]
- Kanash, E.V.; Sinyavina, N.G.; Rusakov, D.V.; Egorova, K.V.; Panova, G.G.; Chesnokov, Y.V. Morpho-Physiological, Chlorophyll Fluorescence, and Diffuse Reflectance Spectra Characteristics of Lettuce under the Main Macronutrient Deficiency. Horticulturae 2023, 9, 1185. [Google Scholar] [CrossRef]
- Eshkabilov, S.; Lee, A.; Sun, X.; Lee, C.W.; Simsek, H. Hyperspectral imaging techniques for rapid detection of nutrient content of hydroponically grown lettuce cultivars. Comput. Electron. Agric. 2021, 181, 105968. [Google Scholar] [CrossRef]
- Boros, I.F.; Sipos, L.; Kappel, N.; Csambalik, L.; Fodor, M. Quantification of nitrate content with FT-NIR technique in lettuce (Lactuca sativa L.) variety types: A statistical approach. J. Food Sci. Technol. 2020, 57, 4084–4091. [Google Scholar] [CrossRef]
- Mahanti, N.K.; Chakraborty, S.K.; Kotwaliwale, N.; Vishwakarma, A.K. Chemometric strategies for nondestructive and rapid assessment of nitrate content in harvested spinach using Vis-NIR spectroscopy. J. Food Sci. 2020, 85, 3653–3662. [Google Scholar] [CrossRef]
- Torres, I.; Sánchez, M.T.; Pérez-Marín, D. Integrated soluble solid and nitrate content assessment of spinach plants using portable NIRS sensors along the supply chain. Postharvest Biol. Technol. 2020, 168, 111273. [Google Scholar] [CrossRef]
- Yu, S.; Fan, J.; Lu, X.; Wen, W.; Shao, S.; Liang, D.; Yang, X.; Guo, X.; Zhao, C. Deep learning models based on hyperspectral data and time-series phenotypes for predicting quality attributes in lettuces under water stress. Comput. Electron. Agric. 2023, 211, 108034. [Google Scholar] [CrossRef]
- Eshkabilov, S.; Stenger, J.; Knutson, E.N.; Küçüktopcu, E.; Simsek, H.; Lee, C.W. Hyperspectral image data and waveband indexing methods to estimate nutrient concentration on lettuce (Lactuca sativa L.) cultivars. Sensors 2022, 22, 8158. [Google Scholar] [CrossRef]
- Pandey, P.; Veazie, P.; Whipker, B.; Young, S. Predicting foliar nutrient concentrations and nutrient deficiencies of hydroponic lettuce using hyperspectral imaging. Biosyst. Eng. 2023, 230, 458–469. [Google Scholar] [CrossRef]
- Lu, F.; Bu, Z.; Lu, S. Estimating chlorophyll content of leafy green vegetables from adaxial and abaxial reflectance. Sensors 2019, 19, 4059. [Google Scholar] [CrossRef]
- Osco, L.P.; Ramos, A.P.M.; Moriya, É.A.S.; Bavaresco, L.G.; Lima, B.C.d.; Estrabis, N.; Pereira, D.R.; Creste, J.E.; Júnior, J.M.; Gonçalves, W.N.; et al. Modeling Hyperspectral Response of Water-Stress Induced Lettuce Plants Using Artificial Neural Networks. Remote Sens. 2019, 11, 2797. [Google Scholar] [CrossRef]
- Boonupara, T.; Udomkun, P.; Kajitvichyanukul, P. Quantitative Analysis of Atrazine Impact on UAV-Derived Multispectral Indices and Correlated Plant Pigment Alterations: A Heatmap Approach. Agronomy 2024, 14, 814. [Google Scholar] [CrossRef]
- Pérez-Marín, D.; Torres, I.; Entrenas, J.A.; Vega, M.; Sánchez, M.T. Pre-harvest screening on-vine of spinach quality and safety using NIRS technology. Spectrochim. Acta Part A Mol. Biomol. Spectrosc. 2019, 207, 242–250. [Google Scholar] [CrossRef]
- Vega-Castellote, M.; Pérez-Marín, D.; Torres, I.; Sánchez, M.T. Online NIRS analysis for the routine assessment of the nitrate content in spinach plants in the processing industry using linear and non-linear methods. LWT 2021, 151, 112192. [Google Scholar] [CrossRef]
- Taha, M.F.; ElManawy, A.I.; Alshallash, K.S.; ElMasry, G.; Alharbi, K.; Zhou, L.; Liang, N.; Qiu, Z. Using machine learning for nutrient content detection of aquaponics-grown plants based on spectral data. Sustainability 2022, 14, 12318. [Google Scholar] [CrossRef]
- Entrenas, J.A.; Pérez-Marín, D.; Torres, I.; Garrido-Varo, A.; Sánchez, M.T. Simultaneous detection of quality and safety in spinach plants using a new generation of NIRS sensors. Postharvest Biol. Technol. 2020, 160, 111026. [Google Scholar] [CrossRef]
- Ciganda, V.; Gitelson, A.; Schepers, J. Non-destructive determination of maize leaf and canopy chlorophyll content. J. Plant Physiol. 2009, 166, 157–167. [Google Scholar] [CrossRef]
- Netto, A.T.; Campostrini, E.; de Oliveira, J.G.; Bressan-Smith, R.E. Photosynthetic pigments, nitrogen, chlorophyll a fluorescence and SPAD-502 readings in coffee leaves. Sci. Hortic. 2005, 104, 199–209. [Google Scholar] [CrossRef]
- Cazzonelli, C.I. Carotenoids in nature: Insights from plants and beyond. Funct. Plant Biol. 2011, 38, 833–847. [Google Scholar] [CrossRef]
- Maoka, T. Carotenoids as natural functional pigments. J. Nat. Med. 2020, 74, 1–16. [Google Scholar] [CrossRef]
- Stahl, W.; Sies, H. Antioxidant activity of carotenoids. Mol. Asp. Med. 2003, 24, 345–351. [Google Scholar] [CrossRef]
- Ahamad, M.N.; Saleemullah, M.; Shah, H.U.; Khalil, I.A.; Saljoqi, A. Determination of beta carotene content in fresh vegetables using high performance liquid chromatography. Sarhad J. Agric. 2007, 23, 767. [Google Scholar]
- Daly, T.; Jiwan, M.A.; O’Brien, N.M.; Aherne, S.A. Carotenoid content of commonly consumed herbs and assessment of their bioaccessibility using an in vitro digestion model. Plant Foods Hum. Nutr. 2010, 65, 164–169. [Google Scholar] [CrossRef]
- Zheng, Z.; Liu, Y.; He, M.; Chen, D.; Sun, L.; Zhu, F. Effective band selection of hyperspectral image by an attention mechanism-based convolutional network. RSC Adv. 2022, 12, 8750–8759. [Google Scholar] [CrossRef] [PubMed]
- Wallace, T.C.; Giusti, M.M. Anthocyanins. Adv. Nutr. 2015, 6, 620–622. [Google Scholar] [CrossRef] [PubMed]
- Castañeda-Ovando, A.; de Lourdes Pacheco-Hernández, M.; Páez-Hernández, M.E.; Rodríguez, J.A.; Galán-Vidal, C.A. Chemical studies of anthocyanins: A review. Food Chem. 2009, 113, 859–871. [Google Scholar] [CrossRef]
- Kim, D.E.; Shang, X.; Assefa, A.D.; Keum, Y.S.; Saini, R.K. Metabolite profiling of green, green/red, and red lettuce cultivars: Variation in health beneficial compounds and antioxidant potential. Food Res. Int. 2018, 105, 361–370. [Google Scholar] [CrossRef]
- Qin, H.; Xu, Y.; Liu, B.; Gao, Y.; Zheng, Y.; Li, Q. UV-A supplement improved growth, antioxidant capacity, and anthocyanin accumulation in purple lettuce (Lactuca sativa L.). Horticulturae 2023, 9, 634. [Google Scholar] [CrossRef]
- Ali, A. Nitrate assimilation pathway in higher plants: Critical role in nitrogen signalling and utilization. Plant Sci. Today 2020, 7, 182–192. [Google Scholar] [CrossRef]
- Karwowska, M.; Kononiuk, A. Nitrates/nitrites in food—Risk for nitrosative stress and benefits. Antioxidants 2020, 9, 241. [Google Scholar] [CrossRef]
- Rosa, M.; Prado, C.; Podazza, G.; Interdonato, R.; González, J.A.; Hilal, M.; Prado, F.E. Soluble sugars: Metabolism, sensing and abiotic stress: A complex network in the life of plants. Plant Signal. Behav. 2009, 4, 388–393. [Google Scholar] [CrossRef]
- Barker, A.V.; Pilbeam, D.J. Handbook of Plant Nutrition; CRC Press: Boca Raton, FL, USA, 2015. [Google Scholar]
- Paciolla, C.; Fortunato, S.; Dipierro, N.; Paradiso, A.; De Leonardis, S.; Mastropasqua, L.; de Pinto, M.C. Vitamin C in Plants: From Functions to Biofortification. Antioxidants 2019, 8, 519. [Google Scholar] [CrossRef]
- Wolucka, B.A.; Goossens, A.; Inzé, D. Methyl jasmonate stimulates the de novo biosynthesis of vitamin C in plant cell suspensions. J. Exp. Bot. 2005, 56, 2527–2538. [Google Scholar] [CrossRef]
- Smirnoff, N. The role of active oxygen in the response of plants to water deficit and desiccation. New Phytol. 1993, 125, 27–58. [Google Scholar] [CrossRef] [PubMed]
- Shchepetilnikov, A.; Zarezin, A.M.; Muravev, V.; Gusikhin, P.; Kukushkin, I. Quantitative analysis of water content and distribution in plants using terahertz imaging. Opt. Eng. 2020, 59, 061617. [Google Scholar] [CrossRef]
- Manda, R.; Addanki, V.A.; Srivastava, S. Role of Drip Irrigation in Plant Health Management, Its Importance and Maintenance. Plant Arch. 2021, 21, 1294–1302. [Google Scholar] [CrossRef]
- White, P.J.; Broadley, M.R. Calcium in Plants. Ann. Bot. 2003, 92, 487–511. [Google Scholar] [CrossRef]
- LeCun, Y.; Bengio, Y.; Hinton, G. Deep learning. Nature 2015, 521, 436–444. [Google Scholar] [CrossRef] [PubMed]
- Zou, J.; Han, Y.; So, S.S. Overview of artificial neural networks. In Artificial Neural Networks: Methods and Applications; Springer: Berlin/Heidelberg, Germany, 2009; pp. 14–22. [Google Scholar]
- Goh, A.T. Back-propagation neural networks for modeling complex systems. Artif. Intell. Eng. 1995, 9, 143–151. [Google Scholar] [CrossRef]
- Schuster, M.; Paliwal, K.K. Bidirectional recurrent neural networks. IEEE Trans. Signal Process. 1997, 45, 2673–2681. [Google Scholar] [CrossRef]
- Li, H.; Liang, Y.; Xu, Q.; Cao, D. Key wavelengths screening using competitive adaptive reweighted sampling method for multivariate calibration. Anal. Chim. Acta 2009, 648, 77–84. [Google Scholar] [CrossRef] [PubMed]
- Geladi, P.; Kowalski, B.R. Partial least-squares regression: A tutorial. Anal. Chim. Acta 1986, 185, 1–17. [Google Scholar] [CrossRef]
- Kuhn, M.; Weston, S.; Keefer, C.; Coulter, N. Cubist Models for Regression, R package Vignette R package version 0.0; 2012, Volume 18, p. 480. Available online: https://citeseerx.ist.psu.edu/document?repid=rep1&type=pdf&doi=fd880d2b4482fc9b383435d51f6d730c02e0be36 (accessed on 26 March 2025).
- Kokaly, R.F.; Clark, R.N. Spectroscopic determination of leaf biochemistry using band-depth analysis of absorption features and stepwise multiple linear regression. Remote Sens. Environ. 1999, 67, 267–287. [Google Scholar] [CrossRef]
- Huang, Z.; Turner, B.J.; Dury, S.J.; Wallis, I.R.; Foley, W.J. Estimating foliage nitrogen concentration from HYMAP data using continuum removal analysis. Remote Sens. Environ. 2004, 93, 18–29. [Google Scholar] [CrossRef]
- Barnes, R.; Dhanoa, M.S.; Lister, S.J. Standard normal variate transformation and de-trending of near-infrared diffuse reflectance spectra. Appl. Spectrosc. 1989, 43, 772–777. [Google Scholar] [CrossRef]
- Sagi, O.; Rokach, L. Ensemble learning: A survey. Wiley Interdiscip. Rev. Data Min. Knowl. Discov. 2018, 8, e1249. [Google Scholar] [CrossRef]
- Luo, G. A review of automatic selection methods for machine learning algorithms and hyper-parameter values. Netw. Model. Anal. Health Inform. Bioinform. 2016, 5, 18. [Google Scholar] [CrossRef]
- Kauppinen, J.K.; Moffatt, D.J.; Mantsch, H.H.; Cameron, D.G. Fourier transforms in the computation of self-deconvoluted and first-order derivative spectra of overlapped band contours. Anal. Chem. 1981, 53, 1454–1457. [Google Scholar] [CrossRef]
- Kramer, O.; Kramer, O. Genetic Algorithms; Springer: Berlin/Heidelberg, Germany, 2017. [Google Scholar]
- Chung, J.; Gulcehre, C.; Cho, K.; Bengio, Y. Empirical evaluation of gated recurrent neural networks on sequence modeling. arXiv 2014, arXiv:1412.3555. [Google Scholar]
- Szegedy, C.; Liu, W.; Jia, Y.; Sermanet, P.; Reed, S.; Anguelov, D.; Erhan, D.; Vanhoucke, V.; Rabinovich, A. Going deeper with convolutions. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA, 7–12 June 2015; pp. 1–9. [Google Scholar]
- Yun, Y.H.; Wang, W.T.; Tan, M.L.; Liang, Y.Z.; Li, H.D.; Cao, D.S.; Lu, H.M.; Xu, Q.S. A strategy that iteratively retains informative variables for selecting optimal variable subset in multivariate calibration. Anal. Chim. Acta 2014, 807, 36–43. [Google Scholar] [CrossRef]
- Pal, M.; Maxwell, A.E.; Warner, T.A. Kernel-based extreme learning machine for remote-sensing image classification. Remote Sens. Lett. 2013, 4, 853–862. [Google Scholar] [CrossRef]
- Xanthopoulos, P.; Pardalos, P.M.; Trafalis, T.B.; Xanthopoulos, P.; Pardalos, P.M.; Trafalis, T.B. Linear discriminant analysis. In Robust Data Mining; Springer: Berlin/Heidelberg, Germany, 2013; pp. 27–33. [Google Scholar]
- Hochreiter, S. Long Short-term Memory. In Neural Computation; MIT-Press: Cambridge, MA, USA, 1997. [Google Scholar]
- Kleinbaum, D.G.; Dietz, K.; Gail, M.; Klein, M.; Klein, M. Logistic Regression; Springer: Berlin/Heidelberg, Germany, 2002. [Google Scholar]
- Popescu, M.C.; Balas, V.E.; Perescu-Popescu, L.; Mastorakis, N. Multilayer perceptron and neural networks. WSEAS Trans. Circuits Syst. 2009, 8, 579–588. [Google Scholar]
- Shenk, J.S.; Westerhaus, M.O. Populations structuring of near infrared spectra and modified partial least squares regression. Crop Sci. 1991, 31, 1548–1555. [Google Scholar] [CrossRef]
- Helland, I.S.; Næs, T.; Isaksson, T. Related versions of the multiplicative scatter correction method for preprocessing spectroscopic data. Chemom. Intell. Lab. Syst. 1995, 29, 233–241. [Google Scholar] [CrossRef]
- Alexopoulos, E.C. Introduction to multivariate regression analysis. Hippokratia 2010, 14, 23. [Google Scholar]
- Rish, I. An empirical study of the naive Bayes classifier. In Proceedings of the IJCAI 2001 Workshop on Empirical Methods in Artificial Intelligence, Seattle, WA, USA, 4 August 2001; Volume 3, pp. 41–46. [Google Scholar]
- Dunteman, G.H. Principal Components Analysis; Sage: Thousand Oaks, CA, USA, 1989; Volume 69. [Google Scholar]
- Jolliffe, I.T.; Cadima, J. Principal component analysis: A review and recent developments. Philos. Trans. R. Soc. A Math. Phys. Eng. Sci. 2016, 374, 20150202. [Google Scholar] [CrossRef]
- Barker, M.; Rayens, W. Partial least squares for discrimination. J. Chemom. A J. Chemom. Soc. 2003, 17, 166–173. [Google Scholar] [CrossRef]
- Handelman, G.S.; Kok, H.K.; Chandra, R.V.; Razavi, A.H.; Huang, S.; Brooks, M.; Lee, M.J.; Asadi, H. Peering into the black box of artificial intelligence: Evaluation metrics of machine learning methods. Am. J. Roentgenol. 2019, 212, 38–43. [Google Scholar] [CrossRef]
- Robnik-Šikonja, M.; Kononenko, I. Theoretical and empirical analysis of ReliefF and RReliefF. Mach. Learn. 2003, 53, 23–69. [Google Scholar] [CrossRef]
- Jain, L.C. Recurrent Neural Networks: Design and Applications; Medsker, L.R., Jain, L.C., Eds.; CRC Press: Boca Raton, FL, USA, 2000. [Google Scholar]
- Wang, F.; Jiang, M.; Qian, C.; Yang, S.; Li, C.; Zhang, H.; Wang, X.; Tang, X. Residual attention network for image classification. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, 21–26 July 2017; pp. 3156–3164. [Google Scholar]
- David, H.A.; Hartley, H.; Pearson, E.S. The distribution of the ratio, in a single normal sample, of range to standard deviation. Biometrika 1954, 41, 482–493. [Google Scholar] [CrossRef]
- Friedman, J.H. Stochastic gradient boosting. Comput. Stat. Data Anal. 2002, 38, 367–378. [Google Scholar] [CrossRef]
- Chris Ruffin, R.L.K.; Younan, N.H. A Combined Derivative Spectroscopy and Savitzky-Golay Filtering Method for the Analysis of Hyperspectral Data. GIScience Remote Sens. 2008, 45, 1–15. [Google Scholar] [CrossRef]
- Gorry, P.A. General least-squares smoothing and differentiation by the convolution (Savitzky-Golay) method. Anal. Chem. 1990, 62, 570–573. [Google Scholar] [CrossRef]
- Araújo, M.C.U.; Saldanha, T.C.B.; Galvao, R.K.H.; Yoneyama, T.; Chame, H.C.; Visani, V. The successive projections algorithm for variable selection in spectroscopic multicomponent analysis. Chemom. Intell. Lab. Syst. 2001, 57, 65–73. [Google Scholar] [CrossRef]
- Stitson, M.; Weston, J.; Gammerman, A.; Vovk, V.; Vapnik, V. Theory of support vector machines. Univ. Lond. 1996, 117, 188–191. [Google Scholar]
- Smola, A.J.; Schölkopf, B. A tutorial on support vector regression. Stat. Comput. 2004, 14, 199–222. [Google Scholar] [CrossRef]
- Cai, W.; Li, Y.; Shao, X. A variable selection method based on uninformative variable elimination for multivariate calibration of near-infrared spectra. Chemom. Intell. Lab. Syst. 2008, 90, 188–194. [Google Scholar] [CrossRef]
- Wei, P.; Lu, Z.; Song, J. Variable importance analysis: A comprehensive review. Reliab. Eng. Syst. Saf. 2015, 142, 399–432. [Google Scholar] [CrossRef]
- Fränti, P.; Virmajoki, O. Iterative shrinking method for clustering problems. Pattern Recognit. 2006, 39, 761–775. [Google Scholar] [CrossRef]
- Rabatel, G.; Marini, F.; Walczak, B.; Roger, J.M. VSN: Variable sorting for normalization. J. Chemom. 2020, 34, e3164. [Google Scholar] [CrossRef]
- Pathak, R. The Wavelet Transform; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2009. [Google Scholar] [CrossRef]
- Demas, J.; Crosby, G. The measurement of photoluminescence quantum yields. A Review. J. Chem. Phys. 1968, 48, 4726. [Google Scholar]
Technique | Data Acquisition Principle | Cost | Dimensions | Data Completeness | Acquisition Speed | Advantages | Disadvantages |
---|---|---|---|---|---|---|---|
Spectrometry [16] | Point measurement | Low | Small | Moderate | Fast | Low cost, high precision | No spatial information |
Multispectral Imaging [17] | Snapshot/Filter-based | Medium | Compact | Limited | Very fast | Low cost, portable | Limited spectral resolution |
Hyperspectral Imaging [18] | Pushbroom/Line scanning | High | Larger | High | Slow to moderate | High spectral and spatial resolution | Expensive, computationally intensive |
Study | How Clearly the Spectral Imaging Methodology Was Described? | How Clearly the Reference Biochemical Methods (e.g., Standard Laboratory Measurements) Were Described? | If Validation Procedures (Cross-Validation, Independent Test Sets) Were Clearly Stated? |
---|---|---|---|
[19] | Low Risk | Low Risk | High Risk |
[20] | Low Risk | Low Risk | Low Risk |
[21] | Low Risk | Low Risk | High Risk |
[22] | Low Risk | High Risk | Low Risk |
[23] | Low Risk | Low Risk | High Risk |
[24] | Low Risk | Low Risk | Low Risk |
[25] | Low Risk | Low Risk | Low Risk |
[26] | Low Risk | Low Risk | Low Risk |
[27] | Low Risk | Low Risk | Low Risk |
[28] | Low Risk | Low Risk | Low Risk |
[29] | Low Risk | High Risk | High Risk |
[30] | High Risk | Low Risk | Low Risk |
[31] | Low Risk | Low Risk | Low Risk |
[32] | Low Risk | Low Risk | Unknown Risk |
[33] | Low Risk | Low Risk | Low Risk |
[34] | Low Risk | Low Risk | Low Risk |
[35] | Low Risk | High Risk | Low Risk |
[36] | Low Risk | Low Risk | Low Risk |
[37] | Low Risk | Low Risk | Unknown Risk |
[38] | Low Risk | Low Risk | Low Risk |
[39] | Low Risk | Low Risk | Low Risk |
[40] | Low Risk | Low Risk | Low Risk |
[41] | Low Risk | Low Risk | Unknown Risk |
[42] | Low Risk | Low Risk | Low Risk |
[43] | Low Risk | Low Risk | Low Risk |
[44] | Low Risk | Low Risk | Low Risk |
[45] | Low Risk | Low Risk | Low Risk |
Plant | Spectral Range and Instrument | Dimensionality Reduction Techniques | Accuracy and Technique | Number of Samples | Ref. |
---|---|---|---|---|---|
Basil | Hyperspectral camera: 400 nm–1000 nm | Data dimensionality: Vegetative Indices (ARI, mARI) were calculated from the spectra. ANOVA analysis was used to see the significance of these parameters. | High correlation between VIs (ARI, mARI) and anthocyanins. Results: ARI R2 = 0.82 mARI R2 = 0.85 | Not specified | [19] |
Lettuce | Hyperspectral Camera: 400 nm–1000 nm | Data dimensionality: Spectral index (ARI) from the HRI | The document does not provide specific accuracy metrics. It discusses the relationships between hyperspectral indices (e.g., CI700, PSRI, and ARI) and physiological indicators of senescence | Not specified | [21] |
Lettuce | Multispectral camera: 475 nm (blue), 560 nm (green), 668 nm (red), 717 nm (red edge), and 840 nm (NIR). | Usage of VIs | Correlation between the most significant VIs and anthocyanin content BNDVI R 0.83 GNDVI R 0.81 NDVI R 0.82 | Not specified | [41] |
Lettuce | Hyperspectral Camera: 400–1000 nm | Data Dimensionality Reduction: UVE and UVE + CARS were used to eliminate uninformative variables and focus on critical wavelengths. | Model 1 data on test UVE + CARS + SNV + DBO + ELM R2 = 0.8617, RMSE = 0.0095, RPD (Test Set): 2.7192 UVE + CARS + FD + SABO + ELM R2 0.8255, RMSE 0.010, RPD 2.4207 UVE + SNV + DBO + ELM R2 0.7986, RMSE 0.0123, RPD 2.2533 VI3 + WOA + ELM R2 0.812, RMSE 0.011, RPD 2.3323 UVE + Raw + DBO + ELM R2 0.7612, RMSE 0.0142, RPD 2.0693 | 135 hyperspectral picture | [20] |
Lettuce | Fiberoptic spectroradiometer: 360 nm–1000 nm | Data dimensionality: Reflectance Indices (ChlRI, SIPI, R800, PRI, ARI) were calculated | Linear regression was performed for the significant RI Anthocyanins ARI R2 = 0.57 p = 0.029 | 60 | [31] |
Plant | Spectral Range and Instrument | Dimensionality Reduction Techniques | Accuracy and Technique | Number of Samples | Ref. |
---|---|---|---|---|---|
Lettuce | Hyperspectral Camera: 350 nm–2500 nm | Data dimensionality: First-order derivative (FDR) of reflectance was used to find the 13 most important principal components. | Multivariate PLS R = 0.8216 NRMES = 0.5701 Multivariate PCR R = 0.8198 NRMSE = 0.5727 Multivariate ANN with normalized full HSI data R = 0.4343 NRMES = 0.5356 | 300 | [22] |
Spinach | Dual-channel diode array spectrometer: 305 nm–1800 nm | Data dimensionality: Three methods: Competitive adaptive reweighted sampling (CARS) was used to eliminate the irrelevant variables of PLSR. VIs (PRI, PRI 500, PRI norm, RDVI, VOGREI3, NDVI, mrNDVI, TCARI, REIP, WI) calculated form the spectrum Parameters based on inflection points (IPs) of the spectrum (first and second derivatives) | CARS-PLSR R2 = 0.58 RMSEP = 0.35 Best VI linear model: mrNDVI R2 = 0.47 p < 0.001 Best IP linear model IP1 R2 = 0.44 p < 0.001 CARS-PLSR No data Best VI linear model: mrNDVI R2 = 0.16 p < 0.001 Best IP linear model IP5 R2 = 0.17 p < 0.001 | 1120 Spectrometer measurements | [27] |
Lettuce | Fiberoptic spectroradiometer: 360 nm–1000 nm | Data dimensionality: Reflectance Indices (ChlRI, SIPI, R800, PRI, ARI) were calculated | The variation in the RI caused by nitrogen deficiency was assessed during a 21-day cultivation period: is the effect size of a factor in percent; p is the level of significance of the factor effect. Lettuce 1 (Vitaminnyi) ChlRI 52.7 p < 0.0001 SIPI 40.2 p < 0.0001 R800 10.1 p = 0.0056 PRI 18.8 p = 0.0001 ARI 57.2 p < 0.0001 Lettuce 2 (Kokarda) ChlRI 50.9 p < 0.001 SIPI 0.02 p = 0.893 R800 3.5 p = 0.091 PRI 9.7 p = 0.005 ARI 0.3 p = 0.865 | 60 | [31] |
Plant | Spectral Range and Instrument | Dimensionality Reduction Techniques | Accuracy and Technique | Number of Samples | Ref. |
---|---|---|---|---|---|
Lettuce | Hyperspectral camera: 390 nm–1000 nm | Data dimensionality: Segmentation of pixels based on two VIs (NDVI, GI), PLSR used to identify the predictor variables | Harvest 1 (four weeks) Prediction of nitrogen PLSR Models PLS1 (single nutrient output): Rp2 = 0.72 RMSE = 0.58 PLS2 (multiple nutrient output): Rp2 = 0.65 RMSE = 0.64 Classification of nitrogen deficiency (F1, Precision, Recall) (PLS1, PLS2, PLSDA, MLP) Best Results: Threshold 1: PLS1 F1 = 0.75 Precision = 0.67 Recall = 0.86 PLSDA F1 = 0.75 Precision = 0.67 Recall = 0.86 MLP F1 = 0.75 Precision = 0.67 Recall = 0.86 Threshold 2: PLS1 F1 = 0.86 Precision = 1 Recall = 0.75 Harvest 2 (six weeks) Prediction of nitrogen PLSR Models PLS1 (single nutrient output): Rp2 = 0.88 RMSE = 0.37 PLS2 (multiple nutrient output): Rp2 = 0.88 RMSE = 0.37 Classification of nitrogen deficiency (F1, Precision, Recall) (PLS1, PLS2, PLSDA, MLP) Best Results: Threshold 1: PLS1 F1 = 1 Precision = 1 Recall = 1 PLS2 F1 = 1 Precision = 1 Recall = 1 PLSDA F1 = 1 Precision = 1 Recall = 1 MLP F1 = 1 Precision = 1 Recall = 1 Threshold 2: PLS1 F1 = 1 Precision = 1 Recall = 1 PLS2 F1 = 1 Precision = 1 Recall = 1 PLSDA F1 = 1 Precision = 1 Recall = 1 MLP F1 = 1 Precision = 1 Recall = 1 | 288 | [38] |
Lettuce | Portable spectroradiometer: 350 nm–2500 nm | Data Dimensionality:
Three methods to select the optimal wavelengths: - Principal Component Analysis (PCA) - Genetic Algorithm (GA) - Sequential Forward Selection (SFS). | Three regression methods (PLSR, BPNN, RF) were tested to estimate the nitrogen content: PLSR + PCA R2 0.58 RMSE 1.14 GA R2 0.95 RMSE 0.42 SFS R2 0.96 RMSE 0.30 BPNN + PCA R2 0.87 RMSE 0.6 GA R2 0.85 RMSE 3.25 SFS R2 0.97 RMSE 0.25 RF + PCA R2 0.90 RMSE 0.55 GA R2 0.95 RMSE 0.39 SFS R2 0.96 RMSE 0.36 | 2304 spectra (6 measurements per leaf, 3 leaves per plant, 8 plants per group, 4 groups, 4 growth stages) | [44] |
Plant | Spectral Range and Instrument | Dimensionality Reduction Techniques | Accuracy and Technique | Number of Samples | Ref. |
---|---|---|---|---|---|
Lettuce | Hyperspectral camera: 400 nm–1000 nm | Data dimensionality: Four linear regression fit models were used to identify the 24 most significant wave bands (out of 448). The inputs of these models were reflectance, the first derivative of the reflectance, and two spectral indices (RSI, NDSI). After that, the bands were reduced to 22 due to the very small improvement of the regression models using 24 bands. | PLSR R2 = 0.98 RMSE = 346 PCA R2 = 0.97 RMSE = 455 | 28 | [32] |
Spinach | Spectroscopy 834 nm–1475 nm 2403 nm–2502 nm (noise regions were removed). | Data Dimensionality: Principal Component Analysis (PCA) was applied to identify outliers and preprocess the data. | MPLS Model: R2 0.38–0.45 Standard Error of Prediction (SEP): 920 mg/kg. LOCAL Algorithm: R2 0.6 SEP: 758 mg/kg. | 516 spinach plants | [43] |
Spinach | Near-Infrared Spectroscopy (NIRS): 1600 nm–2400 nm. | Data Dimensionality: Principal Component Analysis (PCA) was used to decompose and compress the data matrix and detect outliers before calibration modeling. | MPLS
Validation results: R2 0.41, RDP = 1.29 Partial Least Squares-Discriminant Analysis (PLS-DA) was used for classifying nitrate content with classification accuracies ranging from 73 percent to 85 percent for different classes. | 128 spinach samples were analyzed | [42] |
Lettuce | 800 to 2500 nm Spectrometer | Data dimensionality: PCA was used to identify outliers among the spectra, and LDA was used to divide the spectra based on the lettuce variety. After the regression, PLS managed to identify nine components (spectral bands) | PLSR Model (only 191 samples by removing outliers) Square of determinant coefficient after cross-validation (Q2) = 0.90 RMSECV = 99 mg/kg | 1330 spectra (266 lettuce heads, 5 spectra per plant) | [33] |
Lettuce | Hyperspectral camera: 390.57 nm–1008.6 nm | Data dimensionality: Three Methods: - First derivative of the reflectance to identify band regions with the most differences - PLSR and PCA to identify the bands with more weight - VIP score of a predictor variable over “1” | Best results of each method over four lettuce species cultivated with two different systems each (8 datasets)
Results for hydroponically grown black-seeded Simpson lettuce: FDR: Rp2 = 0.93 RMSE = 437.19 PLSR/PCA: Rp2 = 0.98 RMSE = 185.63 VIP-Score: Rp2 = 0.99 RMSE = 111.51 However, there is high performance variability depending on the dataset | Not specified | [37] |
Plant | Spectral Range and Instrument | Dimensionality Reduction Techniques | Accuracy and Technique | Number of Samples | Ref. |
---|---|---|---|---|---|
Spinach | Vis-NIR spectrophotometer: 350 nm–2500 nm (400 nm–2250 nm considered) | Data dimensionality: the most significant bands were selected with PLS using the full spectrum as input. Then, the PLS model was tested again with only the selected wavelengths. | PLS model with full spectral range SG 125 R2 0.865 cross-validation RPD 1.459 cross-validation SG 127 R2 0.882 cross-validation RPD 1.559 cross-validation SNV R2 0.908 cross-validation RPD 1.768 cross-validation Baseline Correction R2 0.892 cross-validation RPD 1.629 cross-validation Detrending DT R2 0.897 cross-validation RPD 1.673 cross-validation MSC R2 0.908 cross-validation RPD 1.767 cross-validation Raw unprocessed R2 0.898 cross-validation RPD 1.679 cross-validation PLS model with selected wavelengths Raw data R2 0.869 cross-validation RPD 1.482 cross-validation | 261 spectra (9 treatments, 29 leaves per treatment) | [34] |
Spinach | NIR spectrophotometer: 908 nm–1676 nm | Data Dimensionality: Principal Component Analysis (PCA) was applied to study the population structure, and some bands were identified. However, the MLPS regression was performed using the full measured spectrum. | In the first part, cross-validation analysis was used to optimize the number of spectra to be taken at each production chain step: 1 spectrum for each plant in the first two stages and 2 spectra for each plant in the last. After that, an MLPS regression was used to estimate the nitrate content in each production stage: Field R2cv 0.59 RPDcv 1.55 Laboratory R2cv 0.52 RPDcv 1.45 After Washing R2cv 0.54 RPDcv 1.46 | 4000 spectra (77 plants; 5 spectra per plant during first two phases; 6 spectra per leaf; and between 4 and 10 spectra per plant during last phase) | [35] |
Plant | Spectral Range and Instrument | Dimensionality Reduction Techniques | Accuracy and Technique | Number of Samples | Ref. |
---|---|---|---|---|---|
Lettuce | Spectrograph + CCD camera: 400 nm–1000 nm | Data dimensionality: to reduce the computational load of the CNN, an attention mechanism was employed to focus on significant features of the spectra | Prediction based on hyperspectral data using three methods (CNN, PLSR, SVR): CNN Inception Module Stratified Sampling R2P = 0.9166 RMSE = 679.5709 Normal Sampling R2P = 0.9073 RMSE = 717.5676 Residual Module R2P = 0.9235 RMSE = 650.8244 Attention Module R2P = 0.9232 RMSE = 652.2112 Residual Module + Attention Module R2P = 0.9317 RMSE = 615.0037 PLSR R2P = 0.8782 RMSE = 821.2243 SVR R2P = 0.8942 RMSE = 801.7419 Prediction based on time series phenotypes combined with the best deep learning (Optimal-Spec) models based on hyperspectral data using four methods (TD, LSTM, GRU, BRNN): Optimal-Spec-TD R2P = 0.9435 RMSE = 559.2437 Optimal-Spec-LSTM R2P = 0.9354 RMSE = 598.1094 Optimal-Spec-GRU R2P = 0.9400 RMSE = 576.2199 Optimal-Spec-BRNN R2P = 0.9380 RMSE = 581.2459 | 433 spectra (560 plants: 160 Butter, 200 Head, 100 Leaf, and 100 Roman) | [36] |
Spinach | MicroNIR™ 1700 spectral range 910–1676 nm and Matrix-F (FT-NIR-based) spectral range 834–2502 nm (after removing noisy regions) | Data dimensionality: MPLS, PCA, SNV, DT, Spectral preprocessing: derivative treatments 1, 5, 5, 1 and 2, 5, 5, 1 | MicroNIR™ 1700 R2cv = 0.50 SECV = 633.73 mg/kg RPFcv = 1.41 Matrix-F R2cv = 0.44 SECV = 67,614 mg/kg RPFcv = 1.33 | 195 spinach plants. After removing outliers, calibration sets included 144–146 samples, and validation sets included 47 samples. | [45] |
Biochemical Traits and Plant | Spectral Range and Instrument | Dimensionality Reduction Techniques | Accuracy and Technique | Number of Samples | Ref. |
---|---|---|---|---|---|
Sugar Lettuce | Hyperspectral Camera: 350 nm–2500 nm | Data dimensionality: First-order derivative (FDR) of reflectance was used to find the 13 most important principal components. | Sucrose Multivariate PLS R = 0.7092 NRMSE = 0.8398 Multivariate PCR R = 0.5771 NRMSE = 0.9038 Multivariate ANN with normalized full HSI data R = −0.04568 NRMSE = 0.6889 Glucose Multivariate PLS R = 0.8402 NRMSE = 0.5422 Multivariate PCR R = 0.7967 NRMSE = 0.6017 Multivariate ANN with normalized full HSI data R = 0.02561 NRMSE = 0.6746 Fructose Multivariate PLS R = 0.8579 NRMSE = 0.538 Multivariate PCR R = 0.8155 NRMSE = 0.5788 Multivariate ANN with normalized full HSI data R = 0.3967 NRMSE = 0.5251 | 300 | [22] |
Sugar, Lettuce | Hyperspectral camera: 390.57 nm–1008.6 nm | Data dimensionality:
Three Methods: - First derivative of the reflectance to identify band regions with the most differences - PLSR and PCA to identify the bands with more weight - VIP score of a predictor variable over “1” | Best results of each method over four lettuce species cultivated with two different systems each (8 datasets)
Results for hydroponically grown black-seeded Simpson lettuce: FDR: Rp2 = 0.98 RMSE = 0.36 PLSR/PCA: Rp2 = 0.99 RMSE = 0.21 VIP-Score: Rp2 = 0.99 RMSE = 0.03 However, there is high performance variability depending on the dataset | Not specified | [37] |
Phosphorus, Lettuce | Hyperspectral Camera: 350 nm–2500 nm | Data dimensionality: First-order derivative (FDR) of reflectance was used to find the 13 most important principal components. | Multivariate PLS R = 0.9094 NRMSE = 0.7649 Multivariate PCR R = 0.8911 NRMSE = 0.4539 Multivariate ANN with normalized full HSI data R = 0.7155 NRMSE = 0.6155 | 300 | [22] |
Phosphorus, Lettuce | Fiberoptic spectroradiometer: 360 nm–1000 nm | Data dimensionality: Reflectance Indices (ChlRI, SIPI, R800, PRI, ARI) were calculated | The variation in the RI caused by phosphorus deficiency was assessed during a 21-day cultivation period: is the effect size of a factor in percent; p is the level of significance of the factor effect. Lettuce 1 (Vitaminnyi) ChlRI 37.7 p < 0.0001 SIPI 11.7 p = 0.0026 R800 10.0 p = 0.0057 PRI 26.9 p < 0.0001 ARI 47.3 p < 0.0001 Lettuce 2 (Kokarda) ChlRI 32.1 p < 0.0001 SIPI 0.6 p = 0.473 R800 12.8 p = 0.0012 PRI 12.0 p = 0.0018 ARI 2.1 p = 0.202 | 60 | [31] |
Biochemical Traits and Plant | Spectral Range and Instrument | Dimensionality Reduction Techniques | Accuracy and Technique | Number of Samples | Ref. |
---|---|---|---|---|---|
Phosphorus, Lettuce | Hyperspectral camera: 390 nm–1000 nm | Data dimensionality: Segmentation of pixels based on two VIs (NDVI, GI), PLSR used to identify the predictor variables | Only the best results are shown.
Harvest 1 (four weeks) Prediction of phosphorus PLSR Models PLS1 (single nutrient output): Rp2 = 0.71 RMSE = 0.13 PLS2 (multiple nutrient output): Rp2 = 0.74 RMSE = 0.12 Classification of phosphorus deficiency (F1, Precision, Recall) (PLS1, PLS2, PLSDA, MLP) Threshold 1: PLSDA F1 = 0.86 Precision = 0.86 Recall = 0.86 Threshold 2: PLS1 F1 = 0.86 Precision = 1 Recall = 0.75 PLS2 F1 = 0.86 Precision = 1 Recall = 0.75 Harvest 2 (six weeks) Prediction of phosphorus PLSR Models PLS1 (single nutrient output): Rp2 = 0.68 RMSE = 0.12 PLS2 (multiple nutrient output): Rp2 = 0.58 RMSE = 0.14 Classification of phosphorus deficiency (F1, Precision, Recall) Threshold 1: MLP F1 = 0.75 Precision = 0.75 Recall = 0.75 Threshold 2: MLP F1 = 0.86 Precision = 1 Recall = 0.75 | 288 | [38] |
Phosphorus, Lettuce | Portable spectroradiometer: 350 nm–2500 nm | Data Dimensionality:
Three methods to select the optimal wavelengths: - Principal Component Analysis (PCA) - Genetic Algorithm (GA) - Sequential Forward Selection (SFS). | Three regression methods (PLSR, BPNN, RF) were tested to estimate the phosphorus content: PLSR + PCA R2 0. 52 RMSE 4.48 GA R2 0.88 RMSE 1.92 SFS R2 0.83 RMSE 4.4 BPNN + PCA R2 0.9 RMSE 0.26 GA R2 0.88 RMSE 3 SFS R2 0.57 RMSE 4.35 RF + PCA R2 0.94 RMSE 0.2 GA R2 0.89 RMSE 0.35 SFS R2 0.85 RMSE 0.54 | 2304 spectra (6 measurements per leaf; 3 leaves per plant; 8 plants per group; 4 groups; and 4 growth stages). | [44] |
Potassium, Lettuce | Hyperspectral Camera: 350 nm–2500 nm | Data dimensionality: First-order derivative (FDR) of reflectance was used to find the 13 most important principal components. | Multivariate PLS R = 0.6442 NRMSE = 0.7649 Multivariate PCR R = 0.545 NRMSE = 0.8385 Multivariance ANN with normalized full HSI data R = 0.6155 NRMSE = 0.7667 | 300 | [22] |
Biochemical Traits and Plant | Spectral Range and Instrument | Dimensionality Reduction Techniques | Accuracy and Technique | Number of Samples | Ref. |
---|---|---|---|---|---|
Potassium Lettuce | Fiberoptic spectroradiometer: 360 nm–1000 nm | Data dimensionality: Reflectance Indices (ChlRI, SIPI, R800, PRI, ARI) were calculated | The variation in the RI caused by phosphorus deficiency was assessed during a 21-day cultivation period: is the effect size of a factor in percent, p is the level of significance of the factor effect. Lettuce 1 (Vitaminnyi) ChlRI 20.4 p < 0.0001 SIPI 3.6 p = 0.101 R800 0.01 p = 0.93 PRI 12.0 p < 0.0023 ARI 17.9 p < 0.0002 Lettuce 2 (Kokarda) ChlRI 30.7 p < 0.0001 SIPI 0.6 p = 0.829 R800 1.1 p = 0.362 PRI 5.2 p = 0.042 ARI 2.6 p = 0.157 | 60 | [31] |
Potassium, Lettuce | Hyperspectral camera: 400 nm–1000 nm | Data dimensionality: Four linear regression fit models were used to identify the 24 most significant wave bands (out of 448). The inputs of these models were reflectance, the first derivative of the reflectance, and two spectral indices (RSI, NDSI). After that, the bands were reduced to 22 due to the very small improvement of the regression models using 24 bands. | PLSR R2 = 0.97 RMSE = 131 PCA R2 = 0.94 RMSE = 191 | 28 | [32] |
Potassium, Lettuce | Hyperspectral camera: 390.57 nm–1008.6 nm | Data dimensionality:
Three Methods: - First derivative of the reflectance to identify band regions with the most differences - PLSR and PCA to identify the bands with more weight - VIP score of a predictor variable over “1” | Best results of each method over four lettuce species cultivated with two different systems each (8 datasets)
Results for hydroponically grown black-seeded Simpson lettuce: FDR: Rp2 = 0.97 RMSE = 83.69 PLSR/PCA: Rp2 = 0.97 RMSE = 80.84 VIP-Score: Rp2 = 0.99 RMSE = 30.02 However, there is high performance variability depending on the dataset | Not specified | [37] |
Biochemical Traits and Plant | Spectral Range and Instrument | Dimensionality Reduction Techniques | Accuracy and Technique | Number of Samples | Ref. |
---|---|---|---|---|---|
Potassium, Lettuce | Hyperspectral camera: 390 nm–1000 nm | Data dimensionality: Segmentation of pixels based on two VIs (NDVI, GI), PLSR used to identify the predictor variables | Harvest 1 (four weeks) Prediction of Potassium PLSR Models PLS1 (single nutrient output): Rp2 = 0.69 RMSE = 1.5 PLS2 (multiple nutrient output): Rp2 = 0.67 RMSE = 1.54 Classification of potassium deficiency (F1, Precision, Recall) (PLS1, PLS2, PLSDA, MLP) Best results: Threshold 1: PLS1 F1 = 0.73 Precision = 0.67 Recall = 0.80 Threshold 2: PLS1 F1 = 0.67 Precision = 1 Recall = 0.50 PLS2 F1 = 0.67 Precision = 1 Recall = 0.50 PLSDA F1 = 0.67 Precision = 1 Recall = 0.50 MLP F1 = 0.67 Precision = 1 Recall = 0.50 Harvest 2 (six weeks) Prediction of Potassium PLSR Models PLS1 (single nutrient output): Rp2 = 0.38 RMSE = 1.95 PLS2 (multiple nutrient output): Rp2 = 0.42 RMSE = 1.87 Classification of potassium deficiency (F1, Precision, Recall) Best results: Threshold 1: PLS2 | 288 | [38] |
Potassium, Lettuce | Hyperspectral camera: 390 nm–1000 nm | Data dimensionality: Segmentation of pixels based on two VIs (NDVI, GI), PLSR used to identify the predictor variables | F1 = 0.73 Precision = 0.8 Recall = 0.67 PLSDA F1 = 0.73 Precision = 0.8 Recall = 0.67 MLP F1 = 0.73 Precision = 0.8 Recall = 0.67 Threshold 2: PLS1 F1 = 0.4 Precision = 1 Recall = 0.25 PLSDA F1 = 0.4 Precision = 1 Recall = 0.25 | 288 | [38] |
Potassium, Lettuce | Portable spectroradiometer: 350 nm–2500 nm | Data Dimensionality:
Three methods to select the optimal wavelengths: - Principal Component Analysis (PCA) - Genetic Algorithm (GA) - Sequential Forward Selection (SFS). | Three regression methods (PLSR, BPNN, RF) were tested to estimate the potassium content: PLSR + PCA R2 0. 93 RMSE 0.37 GA R2 0.95 RMSE 0.34 SFS R2 0.92 RMSE 0.38 BPNN + PCA R2 0.9 RMSE 0.5 GA R2 0.9 RMSE 0.53 SFS R2 0.94 RMSE 0.36 RF + PCA R2 0.96 RMSE 0.32 GA R2 0.88 RMSE 2.83 SFS R2 0.96 RMSE 0.35 | 2304 spectra (6 measurements per leaf; 3 leaves per plant; 8 plants per group; 4 groups; and 4 growth stages). | [44] |
Biochemical Traits and Plant | Spectral Range and Instrument | Dimensionality Reduction Techniques | Accuracy and Technique | Number of Samples | Ref. |
---|---|---|---|---|---|
Vitamin C, Lettuce | Hyperspectral Camera: 350 nm–2500 nm | Data dimensionality: First-order derivative (FDR) of reflectance was used to find the 13 most important principal components. | Multivariate PLS R = 0.655 NRMSE = 0.5701 Multivariate PCR R = 0.6229 NRMSE = 0.4301 Multivariance ANN with normalized full HSI data R = 0.3259 NRMSE = 0.7478 | 300 | [22] |
Vitamin C, Spinach | Near-Infrared Spectroscopy (NIRS): 1600 nm–2400 nm. | Data Dimensionality: Principal Component Analysis (PCA) was used to decompose and compress the data matrix and detect outliers before calibration modeling. | MPLS
Validation results: R2 0.33, RDP = 1.21 | 128 spinach samples were analyzed | [42] |
Water (Relative Water Content, RWC), Spinach | Dual-channel diode array spectrometer: 305 nm–1800 nm | Data dimensionality:
Three methods: - Competitive adaptive reweighted sampling (CARS) was used to eliminate the irrelevant variables of PLSR. - VIs (PRI, PRI 500, PRI norm, RDVI, VOGREI3, NDVI, mrNDVI, TCARI, REIP, WI) calculated from the spectrum - Parameters based on inflection points (IPs) of the spectrum | CARS-PLSR R2 = 0.30 RMSEP = 1.43 Best VI linear model: WI R2 = 0.16 p = 0.019 Best IP linear model: IP1 R2 = 0.05 p < 0.001 | 1120 | [27] |
Water Stress, Lettuce | Spectrograph + CCD camera: 400 nm–1000 nm | Data dimensionality: to reduce the computational load of the CNN, an attention mechanism was employed to focus on significant features of the spectra | Prediction of water stress based on time series phenotypes combined with the best deep
learning (Optimal-Spec) models based on hyperspectral data Optimal-Spec Accuracy = 96.59 percent Optimal-Spec-TD Accuracy = 98.86 percent | 433 spectra (560 plants: 160 butter; 200 head; 100 leaf; and 100 Roman) | [36] |
Biochemical Traits and Plant | Spectral Range and Instrument | Dimensionality Reduction Techniques | Accuracy and Technique | Number of Samples | Ref. |
---|---|---|---|---|---|
Calcium, Lettuce | Hyperspectral camera: 400 nm–1000 nm | Data dimensionality: Four linear regression fit models were used to identify the 24 most significant wave bands (out of 448). The inputs of these models were reflectance, the first derivative of the reflectance, and two spectral indices (RSI, NDSI). After that, the bands were reduced to 22 due to the very small improvement of the regression models using 24 bands. | PLSR R2 = 0.91 RMSE = 43.3 PCA R2 = 0.86 RMSE = 54.5 | 28 | [32] |
Calcium, Lettuce | Specrograph + CCD camera: 400 nm–1000 nm | Data dimensionality: to reduce the computational load of the CNN, an attention mechanism was employed to focus on significant features of the spectra | Prediction based on hyperspectral data using three methods (CNN, PLSR, SVR): CNN Inception Module Stratified Sampling R2P = 0.8136 RMSE = 38.9377 Normal Sampling R2P = 0.8027 RMSE = 34.0846 Residual Module R2P = 0.8325 RMSE = 36.9115 Attention Module R2P = 0.8460 RMSE = 35.3928 Residual Module + Attention Module R2P = 0.8390 RMSE = 36.1864 PLSR R2P = 0.7608 RMSE = 44.1029 SVR R2P = 0.7215 RMSE = 49.9887 Prediction based on time series phenotypes combined with the best deep learning (Optimal-Spec) models based on hyperspectral data using four methods (TD, LSTM, GRU, BRNN): | 433 spectra (560 plants: 160 Butter, 200 Head, 100 Leaf, and 100 Roman) | [36] |
Calcium, Lettuce | Spectograph + CCD camera: 400 nm–1000 nm | Data dimensionality: to reduce the computational load of the CNN, an attention mechanism was employed to focus on significant features of the spectra | Optimal-Spec-TD R2P = 0.8675 RMSE = 32.8215 Optimal-Spec-LSTM R2P = 0.8716 RMSE = 32.3151 Optimal-Spec-GRU R2P = 0.8670 RMSE = 32.8901 Optimal-Spec-BRNN R2P = 0.8593 RMSE = 33.8303 | 433 spectra (560 plants: 160 Butter, 200 Head, 100 Leaf, and 100 Roman) | [36] |
Biochemical Traits and Plant | Spectral Range and Instrument | Dimensionality Reduction Techniques | Accuracy and Technique | Number of Samples | Ref. |
---|---|---|---|---|---|
Calcium, Lettuce | Hyperspectral camera: 390.57 nm–1008.6 nm | Data dimensionality: Three Methods: First derivative of the reflectance to identify band regions with the most differences PLSR and PCA to identify the bands with more weight VIP score of a predictor variable over “1” | Best results of each method over four lettuce species cultivated with two different systems each (8 datasets)
Results for hydroponically grown black-seeded Simpson lettuce: FDR: Rp2 = 0.75 RMSE = 23.76 PLSR/PCA: Rp2 = 0.65 RMSE = 27.87 VIP-Score: Rp2 = 0.65 RMSE = 25.42 | Not specified | [37] |
Calcium, Lettuce | Hyperspectral camera: 390 nm–1000 nm | Data dimensionality: Segmentation of pixels based on two VIs (NDVI, GI), PLSR used to identify the predictor variables | Only the best results are shown.
Harvest 1 (four weeks) Prediction of calcium PLSR Models PLS1 (single nutrient output): Rp2 = 0.12 RMSE = 0.4 PLS2 (multiple nutrient output): Rp2 = 0.24 RMSE = 0.37 Classification of calcium deficiency (F1, Precision, Recall) (PLS1, PLS2, PLSDA, MLP) Threshold 1: PLS1 F1 = 0.73 Precision = 0.67 Recall = 0.80 Threshold 2: MLP F1 = 0.5 Precision = 0.5 Recall = 0.5 Harvest 2 (six weeks) Prediction of calcium PLSR Models PLS1 (single nutrient output): Rp2 = 0.07 RMSE = 0.52 PLS2 (multiple nutrient output): Rp2 = 0.11 RMSE = 0.52 Classification of calcium deficiency (F1, Precision, Recall) Threshold 1: None F1 = 0 Precision = 0 Recall = 0 Threshold 2: MLP F1 = 0.69 Precision = 0.69 Recall = 0.69 | 288 | [38] |
Soluble Solid Content, Lettuce | Hyperspectral camera: 400 nm–1000 nm | Data dimensionality: Four linear regression fit models were used to identify the 24 most significant wave bands (out of 448). The inputs of these models were reflectance, the first derivative of the reflectance, and two spectral indices (RSI, NDSI). After that, the bands were reduced to 22 due to the very small improvement of the regression models using 24 bands. | PLSR R2 = 0.95 RMSE = 1.46 PCA R2 = 0.88 RMSE = 2.25 | 28 | [32] |
Soluble Solid Content, Spinach | Near-Infrared Spectroscopy (NIRS): 1600 nm–2400 nm. | Data Dimensionality Reduction: Principal Component Analysis (PCA) was used to decompose and compress the data matrix and detect outliers before calibration modeling. | MPLS Validation results: R2 0.85, RDP = 2.54 | 128 | [42] |
Biochemical Traits and Plant | Spectral Range and Instrument | Dimensionality Reduction Techniques | Accuracy and Technique | Number of Samples | Ref. |
---|---|---|---|---|---|
Soluble Solid Content, Spinach | NIR spectrophotometer: 908 nm–1676 nm | Data Dimensionality: Principal Component Analysis (PCA) was applied to study the population structure, and some bands were identified. However, the MLPS regression was performed using the full measured spectrum. | In the first part, cross-validation analysis was used to optimize the number of spectra to be taken at each production chain step: 1 spectrum for each plant in the first two stages and 2 spectra for each plant in the last. After that, an MLPS regression was used to estimate the SSC content in each production stage: Field R2cv 0.55 RPDcv 1.55 Laboratory R2cv 0.60 RPDcv 1.66 After Washing R2cv 0.62 RPDcv 1.76 | 4000 spectra (77 plants; 5 spectra per plant during first two phases; 6 spectra per leaf; and between 4 and 10 spectra per plant during last phase) | [35] |
Soluble Solid Content, Lettuce | Spectrograph + CCD camera: 400 nm–1000 nm | Data dimensionality: to reduce the computational load of the CNN, an attention mechanism was employed to focus on significant features of the spectra | Prediction based on hyperspectral data using three methods (CNN, PLSR, SVR): CNN Inception Module Stratified Sampling R2P = 0.8307 RMSE = 0.4629 Normal Sampling R2P = 0.8279 RMSE = 0.4030 Residual Module R2P = 0.8413 RMSE = 0.4481 Attention Module R2P = 0.8694 RMSE = 0.4065 Residual Module + Attention Module R2P = 0.8743 RMSE = 0.3989 PLSR R2P = 0.7913 RMSE = 0.5139 SVR R2P = 0.8023 RMSE = 0.5599 Prediction based on time series phenotypes combined with the best deep learning (Optimal-Spec) models based on hyperspectral data using four methods (TD, LSTM, GRU, BRNN): Optimal-Spec-TD R2P = 0.8743 RMSE = 0.3989 Optimal-Spec-LSTM R2P = 0.8764 RMSE = 0.3955 Optimal-Spec-GRU R2P = 0.8772 RMSE = 0.3941 Optimal-Spec-BRNN R2P = 0.8828 RMSE = 0.3851 | 433 spectra (560 plants: 160 butter, 200 head, 100 leaf, and 100 Roman) | [36] |
Cadmium, Lettuce | Fiber optic spectrometer: 200 nm–1100 nm (400 nm–980 nm considered) | Data Dimensionality: Four spectral indices were used | Regression analysis
SDr/SDy R2 = 0.872, F-value = 24.959 SDb/SDy R2 = 0.781, F-value = 13.041 (SDr − SDy)/(SDr + SDy) R2 = 0.792, F-value = 13.996 (SDb − SDy)/(SDb + SDy) R2 = 0.65, F-value = 6.8 | 45 | [26] |
Biochemical Traits and Plant | Spectral Range and Instrument | Dimensionality Reduction Techniques | Accuracy and Technique | Number of Samples | Ref. |
---|---|---|---|---|---|
pH, Lettuce | Hyperspectral camera: 400 nm–1000 nm | Data dimensionality: Four linear regression fit models were used to identify the 24 most significant wave bands (out of 448). The inputs of these models were reflectance, the first derivative of the reflectance, and two spectral indices (RSI, NDSI). After that, the bands were reduced to 22 due to the very small improvement of the regression models using 24 bands. | PLSR
R2 = 0.97 RMSE = 0.03 PCA R2 = 0.81 RMSE = 0.09 | 28 | [32] |
pH, Lettuce | Spectrograph + CCD camera: 400 nm–1000 nm | Data dimensionality: to reduce the computational load of the CNN, an attention mechanism was employed to focus on significant features of the spectra | Prediction based on hyperspectral data using three methods (CNN, PLSR, SVR): CNN Inception Module Stratified Sampling R2P = 0.9116 RMSE = 0.1944 Normal Sampling R2P = 0.9079 RMSE = 0.2072 Residual Module R2P = 0.9472 RMSE = 0.1502 Attention Module R2P = 0.9399 RMSE = 0.1603 Residual Module + Attention Module R2P = 0.9240 RMSE = 0.1804 PLSR R2P = 0.9240 RMSE = 0.1804 SVR R2P = 0.8753 RMSE = 0.2374 Prediction based on time series phenotypes combined with the best deep learning (Optimal-Spec) models based on hyperspectral data using four methods (TD, LSTM, GRU, BRNN): Optimal-Spec-TD R2P = 0.9583 RMSE = 0.1336 Optimal-Spec-LSTM R2P = 0.9420 RMSE = 0.1575 Optimal-Spec-GRU R2P = 0.9404 RMSE = 0.1597 Optimal-Spec-BRNN R2P = 0.9464 RMSE = 0.1514 | 433 spectra (560 plants: 160 Butter, 200 Head, 100 Leaf, and 100 Roman) | [36] |
Biochemical Traits and Plant | Spectral Range and Instrument | Dimensionality Reduction Techniques | Accuracy and Technique | Number of Samples | Ref. |
---|---|---|---|---|---|
pH, Lettuce | Hyperspectral camera: 390.57 nm–1008.6 nm | Data dimensionality: Three Methods: First derivative of the reflectance to identify band regions with the most differences PLSR and PCA to identify the bands with more weight VIP score of a predictor variable over “1” | Best results of each method over four lettuce species cultivated with two different systems each (8 datasets)
Results for hydroponically grown black-seeded Simpson lettuce: FDR: Rp2 = 0.95 RMSE = 0.03 PLSR/PCA: Rp2 = 0.98 RMSE = 0.02 VIP-Score: Rp2 = 0.98 RMSE = 0.016 However, there is high performance variability depending on the dataset | Not specified | [37] |
Magnesium, Lettuce | Hyperspectral camera: 390 nm–1000 nm | Data dimensionality: Segmentation of pixels based on two VIs (NDVI, GI), PLSR used to identify the predictor variables | Harvest 1 (four weeks) Prediction of magnesium PLSR Models PLS1 (single nutrient output): Rp2 = 0.34 RMSE = 0.34 PLS2 (multiple nutrient output): Rp2 = 0.28 RMSE = 0.36 Classification of magnesium deficiency (F1, Precision, Recall) (PLS1, PLS2, PLSDA, MLP) Threshold 1: PLSDA F1 = 0.57 Precision = 1 Recall = 0.40 Threshold 2: PLS1 F1 = 0.74 Precision = 0.78 Recall = 0.7 Harvest 2 (six weeks) Prediction of magnesium PLSR Models PLS1 (single nutrient output): Rp2 = 0.07 RMSE = 0.37 PLS2 (multiple nutrient output): Rp2 = 0.09 RMSE = 0.36 Classification of magnesium deficiency (F1, Precision, Recall) Best results: Threshold 1: None F1 = 0 Precision = 0 Recall = 0 Threshold 2: PLSDA F1 = 0.62 Precision = 0.57 Recall = 0.67 | 288 | [38] |
Biochemical Traits and Plant | Spectral Range and Instrument | Dimensionality Reduction Techniques | Accuracy and Technique | Number of Samples | Ref. |
---|---|---|---|---|---|
Sulfur, Lettuce | Hyperspectral camera: 390 nm–1000 nm | Data dimensionality: Segmentation of pixels based on two VIs (NDVI, GI), PLSR used to identify the predictor variables | Harvest 1 (four weeks) Prediction of Sulfur PLSR Models PLS1 (single nutrient output): Rp2 = 0.6 RMSE = 0.04 PLS2 (multiple nutrient output): Rp2 = 0.64 RMSE = 0.04 Classification of sulfur deficiency (F1, Precision, Recall) (PLS1, PLS2, PLSDA, MLP) Best results: Threshold 1: MLP F1 = 0.84 Precision = 0.72 Recall = 1 Threshold 2: PLS1 F1 = 0.71 Precision = 0.83 Recall = 0.63 PLSDA F1 = 0.71 Precision = 0.83 Recall = 0.63 Harvest 2 (six weeks) Prediction of Sulfur PLSR Models PLS1 (single nutrient output): Rp2 = 0.27 RMSE = 0.04 PLS2 (multiple nutrient output): Rp2 = 0.33 RMSE = 0.04 Classification of magnesium deficiency (F1, Precision, Recall) Best results: Threshold 1: MLP F1 = 0.84 Precision = 0.73 Recall = 1 Threshold 2: MLP F1 = 0.75 Precision = 0.75 Recall = 0.75 | 288 | [38] |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Dabek, A.; Mantovani, L.; Mirabella, S.; Vignati, M.; Cinquemani, S. Advancements in Non-Destructive Detection of Biochemical Traits in Plants Through Spectral Imaging-Based Algorithms: A Systematic Review. Algorithms 2025, 18, 255. https://doi.org/10.3390/a18050255
Dabek A, Mantovani L, Mirabella S, Vignati M, Cinquemani S. Advancements in Non-Destructive Detection of Biochemical Traits in Plants Through Spectral Imaging-Based Algorithms: A Systematic Review. Algorithms. 2025; 18(5):255. https://doi.org/10.3390/a18050255
Chicago/Turabian StyleDabek, Aleksander, Lorenzo Mantovani, Susanna Mirabella, Michele Vignati, and Simone Cinquemani. 2025. "Advancements in Non-Destructive Detection of Biochemical Traits in Plants Through Spectral Imaging-Based Algorithms: A Systematic Review" Algorithms 18, no. 5: 255. https://doi.org/10.3390/a18050255
APA StyleDabek, A., Mantovani, L., Mirabella, S., Vignati, M., & Cinquemani, S. (2025). Advancements in Non-Destructive Detection of Biochemical Traits in Plants Through Spectral Imaging-Based Algorithms: A Systematic Review. Algorithms, 18(5), 255. https://doi.org/10.3390/a18050255