Estimation of Leaf Chlorophyll a, b and Carotenoid Contents and Their Ratios Using Hyperspectral Reflectance

Japanese horseradish (wasabi) grows in very specific conditions, and recent environmental climate changes have damaged wasabi production. In addition, the optimal culture methods are not well known, and it is becoming increasingly difficult for incipient farmers to cultivate it. Chlorophyll a, b and carotenoid contents, as well as their allocation, could be an adequate indicator in evaluating its production and environmental stress; thus, developing an in situ method to monitor photosynthetic pigments based on reflectance could be useful for agricultural management. Besides original reflectance (OR), five pre-processing techniques, namely, first derivative reflectance (FDR), continuum-removed (CR), de-trending (DT), multiplicative scatter correction (MSC), and standard normal variate transformation (SNV), were compared to assess the accuracy of the estimation. Furthermore, five machine learning algorithms—random forest (RF), support vector machine (SVM), kernel-based extreme learning machine (KELM), Cubist, and Stochastic Gradient Boosting (SGB)—were considered. To classify the samples under different pH or sulphur ion concentration conditions, the end of the red edge bands was effective for OR, FDR, DT, MSC, and SNV, while a green-peak band was effective for CR. Overall, KELM and Cubist showed high performance and incorporating pre-processing techniques was effective for obtaining estimated values with high accuracy. The best combinations were found to be DT–KELM for chl a (RPD = 1.511–5.17, RMSE = 1.23–3.62 μg cm−2) and chl a:b (RPD = 0.73–3.17, RMSE = 0.13–0.60); CR–KELM for chl b (RPD = 1.92–5.06, RMSE = 0.41–1.03 μg cm−2) and chl a:car (RPD = 1.31–3.23, RMSE = 0.26–0.50); SNV–Cubist for car (RPD = 1.63–3.32, RMSE = 0.31–1.89 μg cm−2); and DT–Cubist for chl:car (RPD = 1.53–3.96, RMSE = 0.27–0.74).


Introduction
Chlorophyll pigments (chl) consist of two main types, a (chl a) and b (chl b). Their contents relate closely to primary production because they absorb sunlight and convert sunlight, water, and carbon dioxide into carbohydrates and oxygen [1]. In addition, it has been reported that the chl a:chl b ratio (chl a:b) increases sharply in a linear manner at low light intensity but increases gradually and linearly at higher light intensities [2]. As a result, chl a:b is effective in predicting the abundance of chl associated with photosystem I or II and has been used to evaluate the response of vegetation to changing environmental conditions [3,4], as well as to disease and nutritional and algorithms were compared to clarify which algorithms were effective for evaluating three photosynthetic pigments and their ratios. Random forests (RF) is an extremely successful algorithm for the classification and regression method [37] and has been used for chl estimation in previous studies [23,38]. The support vector machine (SVM) is another successful algorithm and has been widely used with a Gaussian kernel function [39], and it could be used as a benchmark with RF. Some previous studies reported that the Cubist algorithm was superior to RF in predicting LAI [40] and SVM for estimating acid detergent fibre in forage [41]. In addition, Stochastic Gradient Boosting (SGB) also has been applied to biomass estimation, and its high performance has been reported [42]. Therefore, Cubist and SGB could be more effective options than RF and SVM. In addition to these algorithms, the kernel-based extreme learning machine (KELM) was considered because KELM possesses significant robustness due to a few hyperparameters (i.e., the regulation coefficient (Cr) and the kernel parameter (Kp)) and few optimisation constraints, which could be an advantage in regression applications [43]. Thus, this study examines the five aforementioned algorithms (RF, SVM, KELM, Cubist, and SGM) to determine an optimal approach for analysing reflectance data obtained from wasabi. Thus, the five aforementioned algorithms are evaluated. Although the combination of vegetation indices and machine learning is an interesting option, it has led to inconclusive results in previous studies [44] and was thus not applied in this study.
Furthermore, some pre-processing techniques have provided some better solutions to retrieve vegetation characteristics from hyperspectral reflectance. First derivative reflectance (FDR) analysis is one of the common spectral pre-processing techniques, and it effectively removes background effects and enhances subtle spectral features as well as enhancing weak spectral features [45,46]. The continuum-removal (CR) transformation, which is a brightness normalization technique that fits a convex hull over the original reflectance data, has also been applied to enhance the spectral features and eliminate or reduce unrelated effects [47,48]. Standard normal variate (SNV) and multiplicative scatter correction (MSC) could be used to reduce the noise in the raw reflectance data caused by light scattering and baseline drift [49]. Furthermore, SNV and then de-trending (DT) was effective at reducing the effect of additive interference of scattered light from particles [50]. However, the combination of spectral pre-processing techniques and machine learning algorithms for the estimation of photosynthetic pigments and their ratios have not been fully considered.
The main objectives of this study are (1) to assess the potential of hyperspectral reflectance under high stress conditions based on the measurements from two wasabi cultivars grown in conditions of varying pH and sulphur ion concentration; (2) to evaluate the effects of pre-processing techniques; and (3) to identify which machine learning algorithms were the most appropriate for estimating the three photosynthetic pigments and their ratios.

Measurements and Datasets
The measurements from the two popular wasabi cultivars, "Onimidori" and "Mazuma", were analysed in this study ( Figure 1). The wasabi clonal plants were cultivated individually in Wagner pots (1/5000 a) containing 3 L of tap water adjusted to a pH of 6.0 using HCl and NaOH and continuously aerated. After 1 week, slightly modified solutions of 0.1×Hoagland [51], which is one of the most popular solution compositions for growing plants and contains macronutrients of 0. 25  were supplied stepwise for 1 week each at 1/100 and 1/10 strengths to adapt the plants to the hydroponic system under standard nutrient solution conditions. The potential of hydrogen is one of the factors to define the formation of a complex between a polyelectrolyte and a protein and influences the quality of the wasabi. Then, four different pH conditions, pH 5, 7, 8, and 9, were applied. Ten samples were cultivated under standard conditions (pH 6). The importance of sulphur has been reported for improving the allyl isothiocyanate concentration and yield, which determines the pungency and nitrogen fertilization with sulphur amendment [52]. Thus, four sulphur ion concentrations were also applied, namely the standard condition (1×S), which is 0.58 mM SO 4 2− ; conditions of zero (0×S); half (0.5×S); and 1.5 times (1.5×S) sulphur concentration. Five samples for each condition and cultivar, except for 0×S (4 and 5 for "Onimidori" and "Mazuma", respectively) using solutions of (NH 4   Reflectance spectra of the samples were measured using an ASD Fieldspec4 spectroradiometer (Analytical Spectral Devices Inc., USA) with a leaf clip. Spectra were recorded with a sampling resolution of 1 nm. The splice correction function within the ViewSpec Pro Software (Analytical Spectral Devices Inc., USA) was applied to modify the spectral drifts at two wavelengths (1000 and 1800 nm), reflected by inherent variations caused by the three detectors [53].
Following the reflectance measurements, leaf discs were punched, and pigment concentrations were measured using dual-beam scanning ultraviolet-visible spectrophotometers (UV-1900, Shimadzu, Japan). Wellburn's method [54] was applied to quantify chl a, b and car based on absorption.
Based on a previous study, a stratified random-sampling approach, which is a method of sampling that involves the division of all the measurements into smaller sub-groups like strata, was applied. In this study, the strata were formed based on treatments (i.e., varying pH or sulphur ion concentration) and cultivars. Then, the measurements were divided into three groups: a training dataset (50%), validation dataset (25%), and test dataset (25%) [55]. This approach was repeated one hundred times to ensure robust results. Compared to the simple random sampling method, this method is effective to avoid missing representative data in the divided datasets [56].

Pre-Processing of the Raw Reflectance Data
Pre-processing techniques have been proposed for reducing noise or modifying the base and slope shift from spectral data, and some previous studies showed that they could sharpen the peaks and valleys of spectra [57,58]. Thus, applying pre-processing transformations was considered to enhance the more chemically relevant peaks and reduce the effects of baseline shifts and overall curvature over the original reflectance (OR). First derivative reflectance (FDR) is effective in reducing baseline variation and increasing the resolutions of spectral peak features [59,60]. Continuum removal (CR) is a brightness normalization technique that has been applied to enhance related changes [61]. De-trending (DT) has been used to correct wavelength-dependent scattering effects and account for the variation in baseline shift and curvilinearity by fitting a second-degree polynomial through each spectrum [50]. Multiplicative scatter correction (MSC) and standard normal variate transformation (SNV) have also been used to eliminate the effect of noise, baseline drift, and light scattering of the spectrogram [62]. All methods were implemented using R version 3.5.0 [63] and the "prospectr" package [64].

Regression in Machine Learning
A genetic algorithm (GA) is effective for identifying which bands were effective to evaluate vegetation properties, and it offers some tips to design effective sensors for drone-based or satellite remote sensing, which usually have spectral bands less than ten channels. Thus, the variable section based on a GA was also adopted in this study. The effectiveness of the regression models based on random forest (RF), support vector machine (SVM), kernel-based extreme learning machine (KELM), Cubist, and Stochastic Gradient Boosting (SGB) was investigated to estimate chl a, b, car, and their ratios from the hyperspectral reflectance data. Bayesian optimization was applied with the Gaussian process [65] using R version 3.5.0 [63] and the "rBayesianoptimization" package [66] to select the optimal combinations of the hyperparameters of the machine learning algorithms.
RF is an ensemble learning technique that uses a set of Classification and Regression Trees (CARTs) to make predictions [67]. They have been used in several remote-sensing fields, and their high performances have been reported for classification, regression, and several different machine learning problems [68]. Many models based on CARTs have generated a bagging approach. Although each CART is generally independently generated without any pruning and each node is split using user-defined hyperparameters, the number of trees, and the variables used to split the nodes [67], some previous studies have shown that randomizing the splitting rule was effective for improving the performance of the ensembles [69]. Thus, three additional hyperparameters, which defined the rules to generate decision trees, including the minimum number of unique cases in a terminal node (nodesize), the maximum depth at which a tree should be grown (nodedepth), and the number of random splits (nsplit), were also optimized using R version 3.5.0 [63] and the "randomForestSRC" package [70].
SVM is a kernel-based learning method, and the Gaussian radial basis function (RBF) kernel, which is the typical general-purpose kernel, was applied in this study [71] and its two hyperparameters, including the regularisation parameter C and the kernel bandwidth σ, were optimized using the "e1071" package [72].
Although the Extreme Learning Machine (ELM) is a single hidden layer feed-forward neural network composed of a vast number of nonlinear nodes and a hidden layer bias, these are defined randomly and the number of its hyperparameters is fewer than deep learning according to Huang [73]. Similar to SVM, the RBF kernel was used for fitting a non-linear model, and the hyperparameters of the regulation coefficient (Cr) and kernel parameter (Kp) were optimized. KELM code was generated based on the MATLAB code, downloaded from https://www.ntu.edu.sg/home/egbhuang/elm_codes.html.
Cubist is a rule-based model tree approach, and its leaves are expressed as multivariate linear regression models with two steps: (1) establishing a set of rules that divides the training data into smaller subsets and (2) fitting a regression model to these smaller subsets [74]. The number of committee models (committee) and neighbours (neighbour) used for correcting the model predictions were optimized using the "Cubist" package [75]. Adjusting the committee has a boosting similar effect. A nearest neighbour algorithm was applied to the leaf node to use an ensemble approach combination.
SGB built an ensemble of shallow and weak successive trees using a random sub-sample of the dataset for each iteration to improve the computation speed and prediction accuracy, avoiding the overfitting of the training data [76]. These could be a powerful algorithm when they are combined, producing a committee. The number of iterations and basic functions in the additive expansion, the maximum depth of each tree, the learning rate, and the minimum number of observations in the terminal nodes of the trees are optimized using the "gbm" package [77].

Statistical Criteria
Stepwise linear discriminant analysis [78,79] was used to identify which wavelengths had significant differences related to the pH conditions or S strengths (p < 0.001), and a combination of forward and backward stepwise regression was adopted in a multiple regression model.
To assess the performances of the regression models, the root-mean-square error (RMSE, Equation (1)) and the ratio of performance to deviation (RPD, Equation (2)) were used [80] since RPD is a widely applied indicator with a clear definition, e.g., Category A (RPD > 2.0), Category B (1.4 ≤ RPD ≤ 2.0), and Category C (RPD < 1.4). It is then very suitable to compare the index performance for the different datasets directly and especially fits for examining the robustness across different datasets.
where SD is the standard deviation of the measurements, n is the number of samples, y i is the real value, andŷ i is the estimated value.
Using "randomForestSRC" package, it is possible to assess the contributions of each variable based on the variable importance (VIMP), which is calculated from training data and defined as the difference between the prediction error when the variable is noisy versus the prediction error caused by randomly permuting it [70,81]. However, the strategies of other algorithms are sometimes too different to obtain acceptable results [13,82]. Then, a black box data-based sensitivity analysis (DSA) was applied to judge which narrow bands contributed to generating the regression models [83]. DSA is similar to a computationally efficient one-dimensional sensitivity analysis [84]. However, DSA uses several training samples instead of a baseline vector, and then it performs a pure black-box use of the fitted models by querying the fitted models with sensitivity samples and recording their responses [83]. Figure 2 shows the boxplots of the chl a, b, and car contents of the different pH conditions or S-strength treatments. The contents per leaf area (cm 2 ) ranged from 9.84 to 27.29 µg for chl a, 2.91 to 8.91 µg for chl b, and 2.23 to 5.57 µg for car. There were no significant differences in photosynthetic pigments contents between the two cultivars among different pH conditions and S-strengths (p < 0.05, based on the Tukey-Kramer test). The relationships between the pigment ratios and pH conditions or S-strengths are shown in Figure 3. Their ranges were 2.93-4.92, 2.11-5.74, and 2.66-7.48 for chl a:b, chl a:car, and chl:car, respectively. For the chl a:b ratio, significant differences were not confirmed between the two cultivars nor among S-strength treatments, while significant differences were confirmed between two cultivars for pH 5; pH 5 and 9 for Mazuma; and pH 6 and 9 for Onimidori (p < 0.05, based on the Tukey-Kramer test).  Negative correlations were confirmed between the pH value and chl a, b and car. The correlations of Mazuma were stronger than those of Onimidori (Table 1). Although a negative correlation was also confirmed for chl a:car and chl:car, the correlation between pH and chl a:car was not significant when the dataset was separated by cultivars (p values were 0.206 and 0.054 for Onimidori and Mazuma, respectively). On the contrary, a positive correlation was confirmed between pH and chl a:b; it was significant for all, as well as for Mazuma. The photosynthetic pigments contents and S-strength were more strongly correlated with the photosynthetic pigment contents than with pH. This positive correlation was generally confirmed, except for chl a, b of Mazuma (r = −0.328, p = 0.158).

Spectral Patterns
The mean reflectance spectra and standard deviations for each pH condition and S strength are shown in Figure 4. Generally, some differences in spectra were confirmed over the domain between 400 and 900 nm among the pH conditions or S-strength values. Tables 2 and 3 show the effective wavelength to distinguish five pH conditions or four S-strengths based on the stepwise discriminant analysis (p < 0.001). To classify five different pH conditions, the ends of the red edge bands (706-756 nm) were effective. Then, they were selected for OR, FDR, DT, MSC, and SNV, while a green-peak band (536 nm) was selected for CR. For FDR, the start of the red edge bands (614 and 619 nm) was also selected. Reflectance values near 700 nm were also effective for identifying samples at four S-strengths; this domain was selected for all spectra. Additionally, a green peak band (514 nm) was selected for OR, and 893 nm was selected for CR.

Accuracy Assessment
The performances of the regression models based on the machine learning algorithms are shown in Figure 5 and SNV-KELM were always categorized as "A" or "B". Poor models, namely those in "Category C", were confirmed in some cases when OR was used for estimating car.
To estimate the ratio of chl a:b, chl a:car, and total chl:car, there were not always applicable combinations of pre-processing and machine learning algorithms except for the total chl:car estimation based on DT-Cubist. Nevertheless, Cubist and KELM possessed mean RPD vales with all the pre-processing for chl a:car and total chl:car. Although KELM still possessed the mean RPD values greater than 1.4 for chl a:b, except for SNV, the RMSE of SNV-KELM was greater than and the averaged RPD values were less than for the combinations with SNV (mean RPD value of 1.24).

Sensitivity Analysis
The importance of each wavelength was evaluated based on the DSA for each algorithm at a 25-nm interval. Generally, RF, Cubist, and SGB had double peaks at 550-575 and 700-725 nm for all the spectra. Although this tendency was also confirmed for SVM and KELM, their importance was relatively lower than RF, Cubist, and SGB; the peaks were obscure. For MSC and SNV, small peaks were confirmed for RF and SNV. For RF, the green peak had a greater influence on chl b estimations than on chl b estimations for all spectra; this tendency was obscure for the other algorithm. The importance at 400-425 nm of SGM was also high for OR, DT, MSC, and SNV. Similar to the chl a estimation, the two peaks for all the spectra and the importance at 400-425 nm of SGM were confirmed. However, the importance near 650-675 nm of FDR was higher than that of chl a. For the car estimation, the high importance near 530 nm was confirmed from OR, FDR, CR, and MSC. However, the two peaks were also confirmed from FDR, MSC, and SNV near 1875-1900 and 2000-2025 nm for RF, Cubist, and SGB.
The importance values for chl a:b were relatively low, and no band possessed an importance more than 10% except for the combinations of Cubist and DT or FDB. However, peaks at 675-700 nm were confirmed for OR, FDR, DT, and SNV, a peak near 1150-1175 nm for MSC, a peak at 1375-1400 nm for OR and DT, and a peak at 1425-1450 nm for FDR. The patterns of importance distribution were similar between the chl a:car and chl:car estimations. The high importance at 400-425 nm of SGB was also confirmed for OR, DT, MSC, and SNV. While the peak near 700-725 nm was confirmed for OR, FDR, CR, and DT, OR also possessed a peak near 625-650 nm. MSC possessed a peak near 2075-2100 nm.

Characteristics of the Samples Used in This Study
Changing chl a:b is either caused by the kind of materials involved or by the different developmental stages, fertilizers, chemicals, moisture, and other environments. Chl a:b is considered generally to be 3 [85] and is positively correlated with the ratio of the PSII cores to the light-harvesting chlorophyll-protein complex [86]. In this study, the mean chl a:b values were 3.21, 3.33, 3.33, and 3.29 for samples under pH 5-9; and 3.52, 3.33, 3.75, 3.68, and 3.95 for samples under 0×S, 0.5×S, 1×S, and 1.5×S. These values were larger than 3, representative of causing stress under high-light or nitrogen-lacking conditions. Furthermore, chl:car is also a good indicator for evaluating environmental stress in plants [8]. There was a linear relationship between the formation of light-harvesting complex II (LHCII) aggregates and nonphotochemical quenching [87]. The increase in chl:car implies an increase in LHCII [88]. The mean chl:car values were 5. 73, 6.11, 5.64, 4.76, and 5.31 for samples under pH 5-9 and 5.68, 7.02, 6.11, and 6.83 for samples under 0×S, 0.5×S, 1×S, and 1.5×S, respectively. Chl a:car possessed a significant positive correlation with chl:car in this study (r = 0.99, p < 0.001) and has been used to evaluate light stress and Light-Harvesting Complex Stress [9,89] [9]. These indicators imply that the dataset used in this study included measurements under high-stress environments.

Performance of Different Pre-Processing and Machine Learning Algorithms
DT was the best or second-best technique, and its combinations with KELM or Cubist were effective for all the parameters focused on in this study. The advantages of SNV were not fully confirmed, although it was the second-best technique for estimating chl:car. DT shows an excellent performance in removing the effects of baseline shift and curvilinearity from the original reflectance [50]; it was also effective in improving accuracies. MSC and SNV were also adjusted for baseline shifts between samples, and more corrections could be conducted than with DT. For instance, MSC minimizes the additive and multiplicative effects in reflectance, and SNV removes the multiplicative interferences of scatter and particle size [90]. However, leaf-scale measurements were conducted with leaf-clips in this study, and more corrections may not be required. As a result, their advantages have not been shown in this study, and slop corrections were not required for the study data. As a result, FDR, which is effective in enhancing resolution, as well as correcting baseline shifts [91], but is influenced by slop errors, also showed high performances for chl a, b and their ratios. CR was effective in estimating the car-related parameters (i.e., car, chl a:car, and chl:car). In this study, reflectance values at 400 nm and the end of the red edge were normally selected as the continuum endpoints, enhancing the changes near the green peaks. Many studies have shown the advantage of green peaks for estimating car and some vegetation indices based on green peaks have been previously proposed [92][93][94][95][96]. CR also succeeded in enhancing the green peak features.
Generally, KELM and Cubist were the best algorithms. KELM possessed better performances for estimating chl a, b and their ratios, while Cubist was more effective in estimating car. SVM and KELM are kernel-based algorithms and, thus, an inappropriate selection of hyperparameters related to kernel function made the estimation accuracies worse [97]. A grid search was used to find an adequate set of hyperparameters of SVM in previous studies, with several suitable combinations being found for the training data [98,99]. However, some of the combinations are not applicable to other datasets as they result in overfitting. This phenomenon was also confirmed in this study, in which the Bayesian optimization was applied to tune the hyperparameters, confirming the RPD values less than 1.0 (for example, 11 times for chl a estimations based on OR). In contrast, the variances of Kp were smaller than σ. RF, Cubist, and SGB are stochastic modelling techniques involving ensemble regression trees or rule-based models; the high performances of Cubist and SGB have been reported in previous studies [42]. Although the high performance of Cubist was also confirmed in this study, the accuracies of SGB were the worst at 12, 19, 18, 22, 43, and 37 times for chl a, b, car, chl a:car, and chl:car, respectively. SGB improves the robustness for imbalanced training data dealing with interactions and avoids problems arising from a wrong learning rate [76]. However, sampling a fraction of the training data was required in this strategy. The sample size might be too small to obtain fractions from the training data, leading to overfitting. The better performance of Cubist compared to RF was reported in previous studies [42,100,101] and confirmed in this study.
The differences in reflectance between healthy and stressed vegetation can be detected in changes to the green peak and the red edge [102], and then the shifts of the green peak and the red edge inflexion point were confirmed within the pH conditions or S strengths (Figure 4). For estimating the photosynthetic pigment contents, the importance at the green peak and the red edge inflexion point was larger when RF was applied; on the contrary, their importance was obscure for KELM. In order to avoid these phenomena, KELM could be a better option.

Direct Estimation vs. Indirect Estimation for Pigment Ratio
In this study, the regression models were generated directly to estimate the pigment ratio. However, these ratios could be calculated from the estimation results of chl a, b and car. Therefore, the estimation accuracies were compared to more accurate stress evaluations. For the RPD values from the estimated chl a, b and car, the direct estimations were generally superior to the indirect estimations. Therefore, direct estimations should be used to evaluate the stress of plants. Although the differences of CR-RF, MSC-RF, and MSC-Cubist for chl:car were significantly negative values (p < 0.01 based on a t-test), their RPD values were still poor compared with other combinations between the pre-processing and machine learning algorithms. As such, CR-RF, MSC-RF, and MSC-Cubist could not be a better choice for estimating chl:car.
It was reported that chl:car estimation based on hyperspectral reflectance is not straightforward due to non-obvious absorption features in the visible spectral range [103]. In this study, this phenomenon was also confirmed for chl a:b and chl a:car, and the advantages of direct estimation were clearer for them, which implied hyperspectral reflectance could detect some physiological signal and could be widely used in stress detection. As a result, the reflectance values at some wavelengths allowed for evaluating the risk of early mortality as well as the appearance and ingredients of wasabi before their cultivation.

Conclusions
The potential of vegetation reflectance for quantifying three photosynthetic pigment contents, including chlorophyll a, b and carotenoid, and their ratios, was evaluated for wasabi. In this study, five pre-processing techniques, first derivative reflectance (FDR), continuum-removal (CR) transformation, de-trending (DT), multiplicative scatter correction (MSC), and standard normal variate (SNV), were applied in addition to the original reflectance using five machine learning algorithms (random forests (RF), support vector machine (SVM), kernel-based extreme learning machine (KELM), Cubist, and Stochastic Gradient Boosting (SGB)). The superior usefulness of CR, DT, and SVN was confirmed, and the combination with KELM or Cubist was most effective for estimating the content of the three photosynthetic pigments and their ratios. For carotenoid estimation, OR was selected 22 times as having the best hyperspectral data, and it was still useful. However, the spectrum after pre-processing was selected more than 80 times for estimating leaf chlorophyll a, b contents and pigment ratios. This implied that pre-processing helps improve the estimation accuracies and can be used for the in situ measurements of leaf properties.
According to the results, hyperspectral data allowed for evaluating the stresses as well as the appearance and ingredients of wasabi before their cultivation, and the information provided from hyperspectral reflectance can be used to carry out more suitable nutrient management, facilitating quality control and plant maintenance for less experienced farmers.
Supplementary Materials: The following are available online at http://www.mdpi.com/2072-4292/12/19/3265/s1, Table S1: Mean reflectance spectra and standard deviations for the different pH conditions or S-strength treatments, Table S2: Estimated values from OR, Table S3: Estimated values from FDR, Table S4: Estimated values from CR, Table S5: Estimated values from DT, Table S6: Estimated values from MSC, Table S7: Estimated values from SNV.

Conflicts of Interest:
The author declares no conflict of interest.