Estimation of Glucosinolates and Anthocyanins in Kale Leaves Grown in a Plant Factory Using Spectral Reﬂectance

: The spectral reﬂectance technique for the quantiﬁcation of the functional components was applied in different studies for different crops, but related research on kale leaves is limited. This study was conducted to estimate the glucosinolate and anthocyanin components of kale leaves cultivated in a plant factory based on diffuse reﬂectance spectroscopy through regression methods. Kale was grown in a plant factory under different treatments. After speciﬁc periods of transplantation, leaf samples were collected, and reﬂectance spectra were measured immediately from nine different points on each leaf. The same leaf samples were freeze-dried and stored for analysis of the functional components. Regression procedures, such as principal component regression (PCR), partial least squares regression (PLSR), and stepwise multiple linear regression (SMLR), were applied to relate the functional components with the spectral data. In the laboratory analysis, progoitrin and glucobrassicin, as well as cyanidin and malvidin, were found to be dominating components in glucosinolates and anthocyanins, respectively. From the overall analysis, the SMLR model showed better performance, and the identiﬁed wavelengths for estimating the glucosinolates and anthocyanins were in the early near-infrared (NIR) region. Speciﬁcally, reﬂectance at 742, 761, 787, 796, 805, 833, 855, 932, 947, and 1000 nm showed a strong correlation.


Introduction
Kale (Brassica oleracea var. alboglabra) is one of the major sources of phytonutrient components (e.g., glucosinolates, anthocyanins, carotenoids, amino acids, and sugars), from which glucosinolates and anthocyanins are well known for containing cancer-chemo preventive compounds. In basic terms, glucosinolates and anthocyanins are the combination of secondary metabolites, enriched with nitrogen and sulfur-containing glycosides, available in species of the Brassicaceae families [1]. In-vitro and in-vivo studies reported that glucosinolates and their breakdown components inhibited many cancer development steps, such as phase I and II modulation of detoxification enzymes [2]. Consumption of anthocyanins reduces the risk of cardiovascular problems, diabetes, and cancer due to their anti-inflammatory and antioxidant activities [3,4].
Quantification of functional components in vegetables and fruits is important to nutritionists and researchers but also essential to farmers for producing nutrient-rich crops. Usually, functional components are analyzed in the laboratory using different cyanins, carotenoids, amino acids, and sugars) is also essential for ensuring nutritional levels and determining proper harvesting schedules [9,37,38].
Although controlled-environment cultivation increases the concentration of the functional components of crops, the accumulation rate of every component is not the same. Quantification of the functional components of various vegetables using the spectral reflectance technique was reported in some studies, but significant differences in results were observed, even among species of the same crop family, due to cultivation methods and environmental parameters. As research related to the quantification of functional components on kale leaves is limited, the objective of this study was to estimate the contents of glucosinolates and anthocyanins in leaves of kale grown in a plant factory through UV/VIS/NIR-diffuse reflectance spectroscopy data.

Kale Cultivation in a Plant Factory
A plant factory is a closed crop cultivation facility used to grow high-value crops of a high quality throughout the year by utilizing artificially controlled environmental parameters. In this study, a plant factory was used to maintain the desired levels of the controlled environmental parameters precisely and evaluate their effect on the accumulation of the nutritional components of kale. For this purpose, three experiments for the ambient environmental factors (temperature, humidity, and carbon dioxide (CO 2 )), three experiments for the light conditions (light type, light intensity, and light photoperiod), and one experiment for the electrical conductivity (EC) were implemented. In each experiment, five different levels for each environmental factor were implemented. All the treatments were prepared following the guidelines of the horticultural crop cultivation process and summarized in Table 1. A wireless sensor and control networks were used for monitoring and controlling the ambient environmental conditions, as detailed by Chung et al. [39].
Kale was selected because it is a nutrient-dense vegetable and is considered a healthy and popular food in many countries for its powerful medicinal properties. A commercial kale variety with smooth, green leaves and a hard and fibred stem was cultivated in the plant factory. Kale seeds were sown in a hydroponic germination sponge, and three weeks after germination, healthy seedlings with true leaves were transplanted into the plant beds. Twenty-four plants were placed in each plant bed. Seedlings were grown using a recycletype aeroponic nutrient management system over a period of about 40 to 50 days ( Figure 1). Commercial nutrient solutions A and B (Daeyu Co., Ltd., Seoul, Korea) were used, and the target nutrient level was monitored and managed once a day using the electrical conduction (EC) and pH sensors. The nutrient solution was sprayed onto the plant root zone for two minutes at 15-min intervals. The unused solution was returned to the nutrient mixing tank, and filtration and sterilization were performed using a commercial UV-sterilizer and filter (HY-600F, Haiyang, China). Additional distilled water and stock solutions were added during nutrient replenishment to prepare the target nutrient solution. Kale was selected because it is a nutrient-dense vegetable and is considered a healthy and popular food in many countries for its powerful medicinal properties. A commercial kale variety with smooth, green leaves and a hard and fibred stem was cultivated in the plant factory. Kale seeds were sown in a hydroponic germination sponge, and three weeks after germination, healthy seedlings with true leaves were transplanted into the plant beds. Twenty-four plants were placed in each plant bed. Seedlings were grown using a recycle-type aeroponic nutrient management system over a period of about 40 to 50 days ( Figure 1). Commercial nutrient solutions A and B (Daeyu Co., Ltd., Seoul, Korea) were used, and the target nutrient level was monitored and managed once a day using the electrical conduction (EC) and pH sensors. The nutrient solution was sprayed onto the plant root zone for two minutes at 15-min intervals. The unused solution was returned to the nutrient mixing tank, and filtration and sterilization were performed using a commercial UV-sterilizer and filter (HY-600F, Haiyang, China). Additional distilled water and stock solutions were added during nutrient replenishment to prepare the target nutrient solution.

Leaf Sample Collection
Two and four weeks after transplanting, leaf sample collection was performed in three steps. Mature and healthy leaf samples were visually selected and collected accord-

Leaf Sample Collection
Two and four weeks after transplanting, leaf sample collection was performed in three steps. Mature and healthy leaf samples were visually selected and collected according to their color and size condition for spectral reflectance measurement and analysis of functional components. Three normal-sized, matured, and healthy leaves were harvested from each plant, and total nine leaves were collected from three plants in each plant bed. Three replications were applied. In total, 27 (3 leaves × 3 plants × 3 replications) kale leaves were harvested. The reflectance spectra were measured first, and the leaves were transferred to the chemical laboratory immediately (to minimize the degradation of nutrient contents) for functional component analyses using a commercial high-performance liquid chromatography (HPLC) machine (model: 1200 series, Agilent Technologies, Santa Clara, CA, USA). Figure 2 shows the different growth stages of kale after different periods of transplantation, a selected plant, and a harvested leaf sample for reflectance acquisition and functional components analysis. The measured reflectance spectra and functional components from 9 leaves were averaged to represent one data point. In total, 204 data points (34 treatments × 2 sampling time × 3 replications) for glucosinolates contents and 90 data points (15 treatments of temperature, humidity, and CO 2 × 2 sampling time × 3 replications) for anthocyanins contents were obtained. CA, USA). Figure 2 shows the different growth stages of kale after different periods of transplantation, a selected plant, and a harvested leaf sample for reflectance acquisition and functional components analysis. The measured reflectance spectra and functional components from 9 leaves were averaged to represent one data point. In total, 204 data points (34 treatments × 2 sampling time × 3 replications) for glucosinolates contents and 90 data points (15 treatments of temperature, humidity, and CO2 × 2 sampling time × 3 replications) for anthocyanins contents were obtained.

Reflectance Spectra Acquisition
Right after the leaf sample collection, the reflected spectra were measured from each sample leaf using a spectrometer (model: Jaz-Combo-2, Ocean Optics, FL, USA). The applied wavelength range was 190 to 1130 nm with an interval of 0.37 nm, where one detector device provided 190 to 890 nm (UV/VIS) and other detector devices provided 470 to 1130 nm (NIR). However, the wavelength range from 300 to 1050 nm was considered during analysis to avoid excessive noise at the edge of the wavelengths. The received spectra from the mentioned detectors were joined, centering at about 720 nm. The reflected spectra were collected in the dark to minimize the noise caused by background effects and other circumstances. The spectrometer was operated by software provided by the manufacturer. Following the methodology of the previous studies of [9,37,40], the spectral reflectance data were measured from 9 sampling points over the blade part of each sampling leaf, as shown in Figure 3.

Reflectance Spectra Acquisition
Right after the leaf sample collection, the reflected spectra were measured from each sample leaf using a spectrometer (model: Jaz-Combo-2, Ocean Optics, FL, USA). The applied wavelength range was 190 to 1130 nm with an interval of 0.37 nm, where one detector device provided 190 to 890 nm (UV/VIS) and other detector devices provided 470 to 1130 nm (NIR). However, the wavelength range from 300 to 1050 nm was considered during analysis to avoid excessive noise at the edge of the wavelengths. The received spectra from the mentioned detectors were joined, centering at about 720 nm. The reflected spectra were collected in the dark to minimize the noise caused by background effects and other circumstances. The spectrometer was operated by software provided by the manufacturer. Following the methodology of the previous studies of [9,37,40], the spectral reflectance data were measured from 9 sampling points over the blade part of each sampling leaf, as shown in Figure 3.
CA, USA). Figure 2 shows the different growth stages of kale after different periods of transplantation, a selected plant, and a harvested leaf sample for reflectance acquisition and functional components analysis. The measured reflectance spectra and functional components from 9 leaves were averaged to represent one data point. In total, 204 data points (34 treatments × 2 sampling time × 3 replications) for glucosinolates contents and 90 data points (15 treatments of temperature, humidity, and CO2 × 2 sampling time × 3 replications) for anthocyanins contents were obtained.

Reflectance Spectra Acquisition
Right after the leaf sample collection, the reflected spectra were measured from each sample leaf using a spectrometer (model: Jaz-Combo-2, Ocean Optics, FL, USA). The applied wavelength range was 190 to 1130 nm with an interval of 0.37 nm, where one detector device provided 190 to 890 nm (UV/VIS) and other detector devices provided 470 to 1130 nm (NIR). However, the wavelength range from 300 to 1050 nm was considered during analysis to avoid excessive noise at the edge of the wavelengths. The received spectra from the mentioned detectors were joined, centering at about 720 nm. The reflected spectra were collected in the dark to minimize the noise caused by background effects and other circumstances. The spectrometer was operated by software provided by the manufacturer. Following the methodology of the previous studies of [9,37,40], the spectral reflectance data were measured from 9 sampling points over the blade part of each sampling leaf, as shown in Figure 3.

Extraction of Glucosinolates and Anthocyanins
The freshly harvested leaf samples were freeze-dried using liquid nitrogen for 48 h and ground into a fine powder using a pestle and mortar. Part of the freeze-dried samples (100 mg) was separated, and the crude glucosinolates were extracted with 70% boiling methanol (4.5 mL). The diethyl-aminoethyl (DEAE) anion exchange columns were used to obtain delsulphated glucosinolates. Distilled water (1.5 mL) was used to eluate delsulphated glucosinolates. The prepared eluates were analyzed using an HPLC (model: 1200 series, Agilent Technologies, CA, USA) after filtering the delsulphated glucosinolates through a 0.45 µm polytetrafluoroethylene (PTFE) syringe filter. HPLC conditions were set as follows: a C18 column (150 × 3.0 mm, 3 µm, Inertsil ODS-3, GL Sciences, Tokyo, Japan) was used, the elution solution included ultra-pure water (solvent A) and 100% acetonitrile (solvent B), flow rate was 0.4 mL min −1 , detector wavelength was set at 227 nm, and sinigrin was used as an external standard [41]. The results of glucosinolates are given as mmol g −1 dried weight of the samples.
The content of anthocyanin in the leaf samples was analyzed as mentioned by [42,43]. In brief, anthocyanins were extracted overnight at 22 • C. The freeze-dried products (8-10 mg) were extracted with 1 mL of solvent (MeOH:AcOH:H 2 O = 80:0.2:19.8). The extracts were filtered through a 0.45 µm filter before analysis with the HPLC system. HPLC conditions were set as follows: the normalized collision energy was set to 30%, a C18 column (150 × 2 m, Imtakt Corporation, Kyoto, Japan) was used, flow rate was 0.

Statistical Analyses Procedures
A total of 2027 reflectance values (300 to 1050 nm with a 0.37 nm interval) were obtained from each sampling point by the detectors. Although excessive noise was omitted by considering 300 to 1050 nm reflectance spectra instead of 190 to 1130 nm (default range), smoothing (median filter with 21 smoothing points) and 1st derivative methods were additionally applied to remove outliers and reach the actual spectra, and for resolution enhancement, respectively. This combined application technique was applied following the reference [46]. After the preliminary processing, transformation of the raw reflectance value (Ref) was performed independently to check the non-linear correlation of reflectance spectra to the functional components. The used transformations of variables were squared power ( , and inversion (1/Ref) [47].
Due to the similarity of the wavebands, the diffuse reflectance spectra are highly correlated. To reduce the multicollinearity effects and overfitting, some reflectance values were removed using the default procedures of the software. Then, the relationship between reflectance spectra and the content of functional components was investigated using several regression procedures, namely principal component regression (PCR), partial least squares regression (PLSR), and stepwise multiple linear regression (SMLR), to reduce the multicollinearity effects through MATLAB software (version: vR2013b, The MathWorks Ins, Natick, MA, USA). A ten-fold cross-validation (CV) process was applied to optimize the regression results. A total of ten subsets were generated from the dataset. One subset was used as the testing set, and the rest were used for the training set. The models containing from one to ten variables were determined for the remaining observations. The prediction residuals were determined through comparison with the removed subset. This process was repeated 10 times, with each of the subsets used exactly once as the testing data. Then, the average performance across all 10 trials was combined to find the validation residual variance. The coefficient of determination (R 2 ) and root mean square error (RMSE) were also calculated in order to select the proper regression model [9,48].
The PCR model is generally used for analyzing multiple regression data to avoid prediction instabilities caused by multicollinearity [49]. The general linear matrix form of the PCR model is shown in Equation (1) with usual notation.
whereŶ: dependent or response variable, X: independent or controlled variables, andβ: regression coefficient. In comparison with PCR, PLSR delivers a better predictive linearrelationship, and is computed using a selected number of the latent factors from both datasets [50]. The model is linear for each sample n, and the value Y nj is: where Y j : the p dependent variables, X i : the k explanatory variables, and β i : the regression coefficient. Here, i indicates the number of variables. The basic model and notations of PCR and PLSR are similar, but the method of β i determination is different. In SMLR, a multivariate model is constructed for the dependent variable (Y) considering some selected descriptive parameters (independent variables) [51,52]. The model of SMLR can be expressed as Equation (3).
where Y: dependent or response variable, X 1~Xi : predictor variables (surface reflectance bands), β 0 : constant variable, and β 1~βi : estimated weighted regression coefficient of X 1~Xi , respectively. Additionally, the Pearson correlation coefficient was implemented to assess the relationship between the reflectance spectra and functional components. Standardized beta coefficient analysis was also performed by the PCR and PLSR models using the square power of the reflectance data.

Glucosinolate and Anthocyanin Contents
The yield, total glucosinolates, and anthocyanins of kale under the different treatments of each experiment is summarized in Table 2. Besides this, the contents of each laboratory-analyzed glucosinolate and anthocyanin component are shown in Table 3. The number of samples were different, as some of the glucosinolate and anthocyanin components were not identified in all samples due to negligible (nearly zero) content. The quantity of some functional components was not satisfactory, and these were ignored. The minimum, maximum, and mean (non-normal distribution) concentration are also shown in Table 3. Depending on the leaf samples, five to eight components of glucosinolates and four components of anthocyanins were detected. From the results of HPLC analysis, we found that the highest proportion of glucosinolate content was represented by progoitrin (38.61 ± 46.46 µmol g −1 DW), and the most abundant glucosinolate component was neoglucobrassicin (0.40 ± 1.03 µmol g −1 DW). Similarly, cyanidin (134.10 ± 92.91 µg g −1 ) and pelargonidin (0.14 ± 0.11 µg g −1 ) represented the highest and lowest contents of anthocyanin, respectively. However, most of the glucosinolate and anthocyanin components were below 3 µmol g −1 DW.

Characteristics of Spectral Data
The pre-processed reflectance spectra of kale leaves showing low and high concentrations of glucosinolates and anthocyanins are presented in Figure 4. A median filter with 21 smoothing points along with a 1st derivative was applied to pre-process the reflectance spectral data. Data processing with a greater number of smoothing points reduces the noise of raw data, but valuable information that cannot be identified by visual inspection could be lost. As shown in Figure 4, peaks appeared at about 530-570 nm due to pigment variation [53], specifically foliar chlorophyll content [54], while no major difference was observed from 300 to 500 nm, but a significant difference was found among different concentrations of glucosinolates and anthocyanins from 700 to 1050 nm for strong absorption by carotenoids and chlorophylls. Overall, reflectance percentages were lower and almost similar in the visible region (except the peaks at 550 nm), but higher reflectance was obtained in the early NIR region (700 to 1050 nm), where low and high concentrations of glucosinolate and anthocyanins could be distinguished clearly.

Characteristics of Spectral Data
The pre-processed reflectance spectra of kale leaves showing low and high concentrations of glucosinolates and anthocyanins are presented in Figure 4. A median filter with 21 smoothing points along with a 1 st derivative was applied to pre-process the reflectance spectral data. Data processing with a greater number of smoothing points reduces the noise of raw data, but valuable information that cannot be identified by visual inspection could be lost. As shown in Figure 4, peaks appeared at about 530-570 nm due to pigment variation [53], specifically foliar chlorophyll content [54], while no major difference was observed from 300 to 500 nm, but a significant difference was found among different concentrations of glucosinolates and anthocyanins from 700 to 1050 nm for strong absorption by carotenoids and chlorophylls. Overall, reflectance percentages were lower and almost similar in the visible region (except the peaks at 550 nm), but higher reflectance was obtained in the early NIR region (700 to 1050 nm), where low and high concentrations of glucosinolate and anthocyanins could be distinguished clearly. Pearson correlation coefficients between reflectance and an example glucosinolate component (sinigrin) are shown in Figure 5. These correlation coefficients provide transparent views of the relationships between spectra and functional components. The absolute values of correlations were higher for some transformations (e.g., raw spectra, square root, square power, and logarithm) than for exponent and inversion in the near-infrared regions. Wavelengths above 700 nm were highly correlated (|r| ≥ 0.5) with the concentration of functional components as functional components were highly sensitive to spectra in the NIR regions [23,26]. Pearson correlation coefficients between reflectance and an example glucosinolate component (sinigrin) are shown in Figure 5. These correlation coefficients provide transparent views of the relationships between spectra and functional components. The absolute values of correlations were higher for some transformations (e.g., raw spectra, square root, square power, and logarithm) than for exponent and inversion in the near-infrared regions. Wavelengths above 700 nm were highly correlated (|r| ≥ 0.5) with the concentration of functional components as functional components were highly sensitive to spectra in the NIR regions [23,26].
Horticulturae 2021, 7, x FOR PEER REVIEW 10 of 17 Figure 5. Representation of the correlation coefficients between the spectral reflectance and sinigrin content over the entire wavelength range.

Performance of PCR, PLSR, and SMLR Using Raw Data
A summary of the raw reflectance data analysis using the PCR, PLSR, and SMLR models over the wavelength region (from 300 to 1050 nm) is presented in Table 4. The performance of the PCR model was not satisfactory compared with the PLSR and SMLR models. The coefficients of determination (R 2 ) of the PCR models were in the range of 0.4 to 0.5 for 4-methoxyglucobrassicin, neoglucobrassicin, malvidin, and pelargonidin whereas the other components showed worse results using the PCR model (R 2 : < 0.4). Using the PLSR model, a strong correlation (R 2 : ≥ 0.90, RMSE: ≈1 µmol g −1 DW) was found for glucobrassicin and malvidin; a fair correlation (R 2 : ≥ 0.60, RMSE: < 0.5 µmol g −1 DW) was obtained for sinigrin, 4-methoxyglucobrassicin, neoglucobrassicin, and pelargonidin and a poor correlation (R 2 : ≤ 0.60) was observed for gluconapin and cyanidin contents The SMLR model performed well (R 2 : ≥ 0.80, RMSE: ≈1 µmol g −1 DW) for all glucosinolate components, whereas poor performance was shown for anthocyanins (R 2 : 0.14, RMSE: 60 µg g −1 ) and cyanidin, and fair performance (R 2 : 0.46, RMSE: 0.04 µg g −1 ) for pelargonidin and malvidin (R 2 : 0.67, RMSE: 2.09 µg g −1 ). Although good results were obtained for some specific functional components, the transformation of spectral reflectance data was used to include possible non-linear relationships.  Figure 5. Representation of the correlation coefficients between the spectral reflectance and sinigrin content over the entire wavelength range.

Performance of PCR, PLSR, and SMLR Using Raw Data
A summary of the raw reflectance data analysis using the PCR, PLSR, and SMLR models over the wavelength region (from 300 to 1050 nm) is presented in Table 4. The performance of the PCR model was not satisfactory compared with the PLSR and SMLR models. The coefficients of determination (R 2 ) of the PCR models were in the range of 0.4 to 0.5 for 4-methoxyglucobrassicin, neoglucobrassicin, malvidin, and pelargonidin, whereas the other components showed worse results using the PCR model (R 2 : < 0.4). Using the PLSR model, a strong correlation (R 2 : ≥ 0.90, RMSE: ≈1 µmol g −1 DW) was found for glucobrassicin and malvidin; a fair correlation (R 2 : ≥ 0.60, RMSE: < 0.5 µmol g −1 DW) was obtained for sinigrin, 4-methoxyglucobrassicin, neoglucobrassicin, and pelargonidin; and a poor correlation (R 2 : ≤ 0.60) was observed for gluconapin and cyanidin contents. The SMLR model performed well (R 2 : ≥ 0.80, RMSE: ≈1 µmol g −1 DW) for all glucosinolate components, whereas poor performance was shown for anthocyanins (R 2 : 0.14, RMSE: 60 µg g −1 ) and cyanidin, and fair performance (R 2 : 0.46, RMSE: 0.04 µg g −1 ) for pelargonidin and malvidin (R 2 : 0.67, RMSE: 2.09 µg g −1 ). Although good results were obtained for some specific functional components, the transformation of spectral reflectance data was used to include possible non-linear relationships. Table 4. Summary of raw reflectance data analysis using PCR, PLSR, and SMLR models over the wavelength region (from 300 to 1050 nm). A summary of the transformed reflectance data analysis using the PCR, PLSR, and SMLR models over the wavelength region (from 300 to 1050 nm) is shown in Table 5  The B-coefficient values obtained from the PCR and PLSR models reveal important wavelengths for the quantification of functional components. Figure 6 shows the B-coefficient values using square reflectance data for the PCR and PLSR models for glucobrassicin calibration. Wavelengths at 761, 890, 933, and 1000 nm were identified by the PCR model, and the wavelengths determined by the PLSR model were 742, 761, 787, 796, 805, 833, 855, 932, 947, and 1000 nm. The peak in the VIS range at about 579 nm is associated with the green region due to electronic transition. Thus, the wavelength at 544 nm was assigned to chlorophyll, while the wavelengths at 742, 761, 787, 796, 805, 833, 855, 932, 947, and 1000 nm had a strong correlation with glucosinolate content. The B-coefficient values obtained from the PCR and PLSR models reveal important wavelengths for the quantification of functional components. Figure 6 shows the B-coefficient values using square reflectance data for the PCR and PLSR models for glucobrassicin calibration. Wavelengths at 761, 890, 933, and 1000 nm were identified by the PCR model, and the wavelengths determined by the PLSR model were 742, 761, 787, 796, 805, 833, 855, 932, 947, and 1000 nm. The peak in the VIS range at about 579 nm is associated with the green region due to electronic transition. Thus, the wavelength at 544 nm was assigned to chlorophyll, while the wavelengths at 742, 761, 787, 796, 805, 833, 855, 932, 947, and 1000 nm had a strong correlation with glucosinolate content. Figure 6. B-coefficient analysis performed by the PCR (top) and PLSR (bottom) models using square power of reflectance data for glucobrassicin. Figure 6. B-coefficient analysis performed by the PCR (top) and PLSR (bottom) models using square power of reflectance data for glucobrassicin.

Discussion
According to the laboratory (HPLC) analysis, progoitrin was the most dominant glucosinolate component in kale leaves, and the most abundant components were glu-coalyssin, gluconasturtiin, and 4-methoxyglucobrassicin. On the other hand, cyanidin was the main component of anthocyanins followed by malvidin. The abundant components were pelargonidin and delphinidin. This variation in functional components depends on crop species, growth stage, cultivation methods, and ambient environment or climatic conditions [23,38,[55][56][57][58]. For example, glucoiberin, glucoraphanin, and sinigrin were observed as dominating components, sequentially, in kale plants cultivated in a closed chamber and soil-based system [57]. Besides this, glucobrassicin and sinigrin were identified as dominating components in kale when cultivated in open-field conditions [55]. The authors of [44] found sinigrin to be a dominant component in green kale and progoitrin in red kale. A difference in concentrations was also observed among studies based on the cultivation period. However, most of the studies identified almost similar types of abundant components [5,43,44].
The reflected spectra from the kale leaf surface represent the status of leaf photochemical and morphological properties. Minimum reflectance indicates a higher concentration and maximum reflectance indicates a lower concentration of glucosinolates and anthocyanins. In this study, no meaningful variation was observed in the visible region, except the peak around 530 to 570 nm ( Figure 4). This sudden higher reflectance (peak) was observed due to pigment variation [53], specifically in foliar chlorophyll content [54], which was very familiar in the species of Brassicaceae family and indicates a lower glucosinolate content level in that region. A significant difference in glucosinolate content was detected for the 700 to 1050 nm wavelengths. Reflectance variation or overlapping during functional components analysis could occur in different cultivation facilities (i.e., greenhouse, plant factory, or open field), covering material types, radiation intensities or artificial light types and even the color or pigment properties of crop species (i.e., red and green kale or lettuce) [59][60][61]. The reflectance variation might occur in the visible region or NIR region based on the cultivation method, such as hydroponic or organic systems. A strong correlation between the pigment contents of red and green lettuces under different light intensities was also observed by the authors of [59,60].
Besides this, preprocessing of the reflectance spectral is necessary to minimize unwanted background information, along with accentuating the absorption features of the spectra. It also helps to attain accurate models and reduces the number of latent variables [53]. Smoothing and 1st derivative methods were applied in this study. Neto et al. [53] also applied smoothing, as well as the 1st and 2nd derivatives, and observed that the 1st derivative was the best pretreatment process for predicting anthocyanin content. They also mentioned that this pretreatment removed the non-chemical effects, resolved the overlapped bands, and provided a better version of the target data. In addition, PCR, PLSR, and SMLR models were applied in this study, and a significant correlation between spectral reflectance and glucosinolate and anthocyanin concentrations was observed in the NIR region through the SMLR model and Pearson's correlation coefficients test. A similar result was observed during functional components analysis on Chinese cabbage leaves in our previous study [9] and also in the literature [21,62].
Several studies were conducted to identify the variation of crops' physical, chemical, and biological properties based on crop species, cultivation methods, environmental conditions, and fertilization [30,63]. Among these, effects of cultivation systems, such as plant factory, greenhouse, or open field, and cultivation methods, such as soil-based and hydroponics, play a vital role in the variation of the glucosinolate and anthocyanin contents of crops ( Table 6). The authors of [30] compared the functional components of kale (Brassica oleracea var. alboglabra) grown in three different conditions and found glucobrassicin, sinigrin, neoglucobrassicin, and progoitrin to be dominant components, sequentially, and 4-methoxyglucobrassicin to be an abundant component. They reported that the functional components of kale were higher when grown in the plant factory than in the greenhouse and open field due to potential cultivation conditions. Glucoiberin, glucoraphanin, sinigrin, and glucobrassicin were detected as major components of kale (Brassica oleracea var. acephala) when cultivated using soil-based methods in a closed chamber and an open field, respectively [38,55,57].

Conclusions
This study focused on the determination of glucosinolate and anthocyanin contents in kale leaves using the diffuse reflectance spectroscopy technique, where kale plants were cultivated in a plant factory under different levels of environmental factors. The results showed that progoitrin and glucobrassicin, as well as cyanidin and malvidin, were found to be dominating components in glucosinolates and anthocyanins, respectively, in laboratory analysis. Among the applied regression methods, SMLR showed better performance compared with the PCR and PLSR models. Important wavelengths for estimating glucosinolates and anthocyanins were laid between 700 to 1050 nm. Although a similar methodology was applied in some studies on other crops, very little research has been conducted on kale plants. The components of glucosinolates and anthocyanins and their related reflectance spectrum vary based on the crop species, cultivation methods, and environmental parameters, so crop-specific accurate model development is essential. The findings of this study would be useful for designing or improving any multiple-wavelength property sensor applicable for on-site functional components determination of any crops.

Data Availability Statement:
The data presented in this study are available within the article.

Conflicts of Interest:
The authors declare no conflict of interest.