Next Article in Journal
Physical, Chemical, and Sensory Properties of a Turmeric-Fortified Pineapple Juice Beverage
Previous Article in Journal
New Developments in Nutraceuticals and Functional Food Products: Microorganisms as Alternative Sources of Nutritive and Beneficial Components
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Improving the Accuracy of Saffron Adulteration Classification and Quantification through Data Fusion of Thin-Layer Chromatography Imaging and Raman Spectral Analysis

1
Chenoweth Laboratory, Department of Food Science, University of Massachusetts Amherst, 102 Holdsworth Way, Amherst, MA 01003, USA
2
Department of Chemistry, University of Massachusetts, Amherst, MA 01002, USA
*
Author to whom correspondence should be addressed.
Foods 2023, 12(12), 2322; https://doi.org/10.3390/foods12122322
Submission received: 7 May 2023 / Revised: 2 June 2023 / Accepted: 7 June 2023 / Published: 9 June 2023
(This article belongs to the Section Food Analytical Methods)

Abstract

:
Agricultural crops of high value are frequently targeted by economic adulteration across the world. Saffron powder, being one of the most expensive spices and colorants on the market, is particularly vulnerable to adulteration with extraneous plant materials or synthetic colorants. However, the current international standard method has several drawbacks, such as being vulnerable to yellow artificial colorant adulteration and requiring tedious laboratory measuring procedures. To address these challenges, we previously developed a portable and versatile method for determining saffron quality using a thin-layer chromatography technique coupled with Raman spectroscopy (TLC-Raman). In this study, our aim was to improve the accuracy of the classification and quantification of adulterants in saffron by utilizing mid-level data fusion of TLC imaging and Raman spectral data. In summary, the featured imaging data and featured Raman data were concatenated into one data matrix. The classification and quantification results of saffron adulterants were compared between the fused data and the analysis based on each individual dataset. The best classification result was obtained from the partial least squares—discriminant analysis (PLS-DA) model developed using the mid-level fusion dataset, which accurately determined saffron with artificial adulterants (red 40 or yellow 5 at 2–10%, w/w) and natural plant adulterants (safflower and turmeric at 20–100%, w/w) with an overall accuracy of 99.52% and 99.20% in the training and validation group, respectively. Regarding quantification analysis, the PLS models built with the fused data block demonstrated improved quantification performance in terms of R2 and root-mean-square errors for most of the PLS models. In conclusion, the present study highlighted the significant potential of fusing TLC imaging data and Raman spectral data to improve saffron classification and quantification accuracy via the mid-level data fusion, which will facilitate rapid and accurate decision-making on site.

Graphical Abstract

1. Introduction

Saffron, the stigma of Crocus sativus L., which is also called “the red gold”, is one of the most expensive agricultural products on the market. For centuries, saffron has been utilized as a medicinal herb, spice, and colorant [1,2,3,4,5]. Throughout history, saffron has been vulnerable to economic adulteration, which involves the mixing of low-quality spices with saffron, the addition of plant materials, and the use of natural or artificial colorants to imitate the color of saffron [6].
Color strength is one of the key attributes used to describe saffron quality. The current standard method for saffron color strength analysis is protocol ISO 3632-2, issued by the International Organization for Standardization [7,8]. This method uses UV-vis spectroscopy and high-performance liquid chromatography (HPLC) to determine the color strength and crocin content, which is the active ingredient responsible for saffron’s color expression. Despite its widespread use, this method still has several limitations. For instance, some studies have reported that UV-vis spectroscopy can only detect saffron adulterants up to 20% w/w and is unable to distinguish between artificial yellow adulterants that display similar absorbance values to pure saffron [9,10]. Other well-adopted methods, such as HPLC and gas chromatography (GC), still have limitations such as long sample preparation, long test duration, high costs, and the need for specialized laboratory settings and trained personnel.
Previously, we developed a rapid and field-deployable method based on thin-layer chromatography (TLC) and Raman spectroscopy to determine saffron quality as well as saffron adulteration. By utilizing TLC as a separation substrate, the optical TLC pattern of pure saffron and adulterated saffron specimens could be easily captured via a camera and converted to digital imaging data under ambient light and 365 nm UV light [10]. Raman spectroscopy, in the meantime, provides molecular information on pure or adulterated saffron specimens from the TLC chip [11]. The established TLC-Raman method has demonstrated its capability to determine saffron grades and identify common adulterations. However, upon further execution of this method for adulterant quantification, we identified several notable drawbacks. In general, Raman signals provide information for quantifying the purity of saffron, while TLC patterns are used primarily for identifying saffron adulterations. Unfortunately, the lack of communication between these two data blocks limits the full potential of the method from being exploited. For instance, the imaging method failed to accurately determine adulteration levels at high yellow 5 concentrations (6–10% w/w) due to color saturation on TLC chips. In this case, the decrease in crocin concentration caused by yellow 5 adulteration can still be detected by the Raman spectrometer. Nonetheless, without communication between these data blocks, the final decision-making process relies on separate data analyses of the TLC pattern and Raman spectroscopy.
Data fusion, the analysis of concatenating several datasets into a single fused data block, has shown great potential to improve existing performance in spectroscopic analysis [12,13,14]. The integration of multiple datasets through data fusion enables interactivity and mutual information among each data block, resulting in a reduction of spurious sources of variability and diminished prediction errors when compared to analyses based on individual datasets. The concatenation of data can be carried out and categorized at three different levels: data level (low-level fusion), feature data level (mid-level fusion), and decision level (high-level fusion). With spectral data, data level and feature data level fusions are more suitable to be executed. In low-level data fusion, data from all measurement sources are simply combined into a single matrix after some data pretreatments for each individual data block. Mid-level data fusion involves the extraction of relevant features from each data block separately, which are then concatenated to form a single data block. Features can be defined as either relevant original variables or latent variables that are extracted through multivariate analysis models [12,13,14,15].
Extensive data fusion research has been conducted in the fields of food quality analysis and food fraud analysis. Most of these studies have yielded positive outcomes when employing fused datasets [16]. The integration of similar datasets, such as mass spectrometry, UV-vis, near-infrared (NIR), mid-infrared (MIR), or Raman spectral data, has been a primary focus in classifying or quantifying food quality attributes, such as the quality of soybeans [17], characterizing olive oil and essential oils [18,19], and determining the geographical origins of wines and saffron [20,21].
The food industry has efficiently utilized imaging data fusion techniques, such as E-eye or computer vision, for post-harvest quality analysis of fruits, providing advantages in terms of speed and cost-effectiveness [16]. Moreover, imaging analysis has garnered significant attention in various food categories. This includes assessing the sensory scores of fish fillets [22], evaluating apple fruit firmness [23], and detecting bruises in strawberries and blueberries [24,25]. Additionally, researchers have explored the fusion of imaging data with spectral data, which has proven successful in monitoring the fermentation quality of black tea and classifying green tea [26,27]. The aim of this paper was to evaluate the effectiveness of data fusion methodologies to improve the classification and quantification accuracy of adulterated saffron using Raman and TLC imaging data. In this study, a mid-level data fusion strategy was evaluated for classification and quantification of saffron quality, respectively. In brief, saffron specimens were adulterated with artificial colorants or extraneous natural plant materials at different levels, from 2% to 100%. Both adulterated and pure saffron specimens were prepared into sample solutions before each sample droplet was developed on TLC chips. Then raw imaging data and Raman spectra were collected on the TLC chip, respectively. Subsequently, featured imaging data were extracted by taking L*a*b* values at characteristic locations in the image (that were, edge and midplane), while featured Raman data were generated through variable influence on projection analysis (VIP) of the crocin characteristic Raman peaks. At last, the fused data matrix was completed by concatenating the featured imaging data and the featured Raman data. The fused data were used for multivariate classification and regression via partial least squares discriminant analysis (PLS-DA) and partial least squares analysis (PLS), respectively. The classification and quantification results of saffron adulterants from the fused data were also compared against the analysis results based on each single dataset.

2. Materials and Methods

2.1. Chemicals and Reagents

All natural specimens, i.e., saffron, turmeric powders, and safflower, were purchased from online sources. Each plant material was acquired from three different suppliers with a similar price range. Artificial colorants (Allura red and tartrazine) were purchased from two suppliers, which were IFC Solutions (Linden, NJ, USA) and Sigma Aldrich (Merk KGaA, Darmstadt, Germany). TLC aluminum plates (Silicagel 60W F254S) were purchased from EMD Millipore Corporation (Billerica, MA, USA).

2.2. Sample Preparation

All plant materials were pulverized using a Newtry high-speed food mill (Guangzhou, China) set at high speed for three minutes. Sample particle sizes were standardized by sieving through a 500 µm mesh sieve after pulverization of raw plant material. The resulting powders were then transferred into glass vials and stored in a light-shielded desiccator. Natural adulterants (safflower and turmeric) were predominantly utilized to manipulate the weight of saffron samples. To calibrate the analysis for the presence of these adulterants, samples were spiked with varying weight percentages of pure saffron (0%, 20%, 40%, 60%, 80%, and 100%). Lower spike levels for natural adulterants were considered, but it was deemed impractical to use trace levels of these plant materials for the purpose of adulteration in normal practices. To achieve the desired appearance and color intensity, artificial adulterants (tartrazine yellow and Allura red) were typically added to saffron in lower amounts compared to natural adulterants. Hence, we used weight percentages of 0%, 2%, 4%, 6%, 8%, and 10% to calibrate the level of artificial adulterants mixed with pure saffron. For all samples, 50 mg of pure or spiked powdered samples were dissolved in 50 mL of deionized water to prepare raw sample solutions. The untreated solutions were subsequently subjected to filtration using a 0.45 µm PES filter (GE Lifesciences, Marlborough, MA, USA). Subsequently, 2 µL of pure saffron or saffron-spiked solutions were deposited onto TLC plates using a pipette. Three droplets of the same sample solution were applied to the same location on the TLC plate, ensuring that the preceding droplet had completely dried before adding the next one.

2.3. Data Acquisition

2.3.1. Raw and Featured Raman Spectral Data

Raw Raman spectral data were collected at the center of each TLC pattern by a portable Raman system (TSI Incorporated, Shoreview, MN, USA), which was equipped with a 780 nm laser source. Each sample was measured in triplicate with five data collection spots for each replicate with an acquisition time of 10 s. The measurements were carried out at max laser power (500 mW) with a spectral range of 100 to 2200 cm−1. Thus, 75 spectra were collected for each type of adulteration, and 15 spectra were collected for pure saffron. A total of 315 spectra were collected for data analysis in this study. The Raman spectrum of each adulterated sample at different adulteration concentrations can be found in Supporting Figures S1–S4. Figure 1 shows the adulterated saffron Raman spectra, which were collected on TLC silica gel substrates. Featured Raman data (1000~1050 cm−1, 1130~1240 cm−1, 1270~1300 cm−1, and 1500~1580 cm−1) were extracted by reviewing the main compounds responsible for saffron color expression. Feature selection was also referred to as the result of the variable influence on projection (VIP, VIP value > 1) from the SIMCA software (Malmö, Sweden, version 14.1), which summarizes the importance of each Raman peak in the PLS-DA model.

2.3.2. Raw and Featured Imaging Data

The collection of raw imaging data and the process to generate the featured imaging data from TLC chips could be found in our previous work with some modifications [10]. In brief, two separate images were taken under ambient and UV (365 nm) light conditions. The ambient and UV images were then combined into a single image, as depicted in Figure 2A. To collect imaging data, Adobe Photoshop CS6 (64-bit) was utilized to split RGB images into the red, green, and blue channels (Figure 2C). A color picker tool (32 × 32 pixels square) was used to collect 20 data points from each TLC pattern per channel, with 10 points taken at ½ diameter distance from the center and 10 points taken from the TLC ring. This was performed for both the ambient light image and the UV light image under the green and blue channels, respectively (Figure 2C). The red channel data were excluded due to insufficient pattern information on the image (Supporting Figures S5–S8).
The featured imaging data can be expressed as shown below:
X A m b i e n t = X a m b i e n t   g r e e n   c h a n n e l   d a t a , X a m b i e n t   b l u e   c h a n n e l   d a t a
X U V = X U V   g r e e n   c h a n n e l   d a t a , X U V   b l u e   c h a n n e l   d a t a
X F e a t u r e d   I m a g i n g = X A m b i e n t , X U V
where the maximum, minimum, and average lightness values at the blue and green channels (Figure 2C) were collected at each sample collection point. Each adulteration level was measured in triplicate, resulting in a total of 7560 featured imaging data collected (7200 for adulterated saffron and 360 for pure saffron).

2.4. Data Fusion

Figure 3 illustrates the fusion strategy, where featured Raman data and featured imaging data were measured and fused into one data block in the mid-level fusion model. Normalization was applied to both data blocks prior to the fusion according to the equation below [28].
x i = x i min X max X min X
The fused data can be expressed via the equation below:
X F u s e d = X F e a t u r e d   R a m a n , X F e a t u r e d   I m a g i n g
where XFused stands for the fused data block, XFeatured Raman and XFeatured Imaging represent the single data blocks of featured Raman and featured imaging data, respectively. Next, XFused, XFeatured Raman and XImaging data blocks were loaded in PLS-DA and PLS models for multivariate qualitative and quantitative analyses.

2.5. Partial Least Squares Discriminant Analysis

Partial least squares discriminant analysis (PLS-DA) was used to conduct classification analysis in SIMCA (14.1) software. The entire classification model was classified by adulterant type using Pareto scaling. This scaling method was chosen due to its ability to retain the proximity of the scaled measurements to the original data, minimizing disturbances to the raw data, minimizing disturbances to the feature data, and ensuring more reliable feature peaks that are less vulnerable to noise [29]. The significant compounds were optimized via cross-validations. In the meantime, a total of 210 files from each dataset were selected as a training group, which included 50 spectra for each adulterant and 10 spectra for pure saffron. As a result, 3D scattered point graphs (Figure 4) and a misclassification table (Table 1) were generated. In addition, the performance of the data fusion strategy was evaluated by the remaining 105 spectra working as an external validation group, which included 25 spectra for each type of spiked saffron and 5 spectra of pure saffron (Table 2).

2.6. Partial Least Square (PLS)—Regression of Quantification

The PLS regression model was used to determine saffron adulteration levels. Raw data, featured data, and fused data from the classification study were directly used in the quantification analysis. Spectral data and imaging data were set up as variable X, whereas the adulteration level was set up as variable Y. The result was presented as a predicted value versus an observed value, as shown in Figure 5. The performance of each PLS plot can be partially expressed by the maximum of R2 values (goodness of fit) or the minimum of the root-mean-square error of prediction (RMSEP) values and the root-mean-square of cross-validation (RMSECV) values.

3. Results and Discussion

3.1. The Effect of Mid-Level Fusion on Classification Performance

As is shown in Figure 4A, the featured imaging data itself showed a clear cluster separation for safflower, turmeric, and yellow 5 adulterated samples. However, heavy cluster overlapping was observed between turmeric and red 40, yellow 5, and pure saffron clusters. These results were also reflected in the misclassification table. In Table 1A, 20 out of 50 turmeric samples were misclassified as red 40. In addition, all pure saffron samples were misclassified as yellow 5 adulterated samples. This was due to the low discrimination between these samples on TLC chips, which was also reported in our previous study [10].
In Figure 4B, the featured Raman data generated more scattered clusters for samples with natural adulterants. On the other hand, samples with artificial adulterants overlapped with pure saffron and thus led to misclassification in Table 1B. This was due to the low spike level (2–10% w/w) for artificially adulterated specimens as compared to the one for natural adulterants (20–100% w/w). The reason for spiking lower concentrations of artificial adulterants was their superior color intensity compared to natural adulterants. Therefore, less concentration was required to achieve the same color appearance as the natural adulterants. However, this posed analytical challenges when detecting these artificial adulterants. The result of mid-level data fusion on classification accuracy can be seen in Figure 4C and Table 1C. Both the PLS-DA plot and the misclassification table indicated significant improvements in cluster separation and correction rate compared to the results from each individual data block. In the meantime, the PLS-DA plot with mid-level fused data exhibited tighter clusters, yet each class was distributed with better separation, especially for red 40 and yellow 5 adulterated specimens (Figure 4C). Each individual data block provided a complementary piece of information that helped in the classification of spiked saffron samples. For instance, red 40 and turmeric-spiked saffron samples that could not be clearly differentiated using the imaging data (Figure 4A and Table 1A) were separated in the PLS-DA plot when featured Raman data were fused. Similarly, the introduction of the imaging data also helped enhance the classification capabilities of the featured Raman data to discriminate specimens with red 40 and yellow 5 adulterants (Figure 4B and Table 1B). The collaborative effect between these two data blocks in the fused matrix achieved a satisfying accuracy of 99.52% (Table 1C). These findings were in line with most reported results, indicating that data fusion yielded clear improvements in classification accuracies compared to individual analytical methods. For instance, enhanced classification results in identifying hazelnut paste adulteration were reported by combining FT-Raman and NIR spectroscopy as the fused dataset [30]. Likewise, the classification of the geographical location of a medicinal plant (Gentiana rigescens) demonstrated improved performance through the fusion of UV-vis and infrared spectroscopy data [31].

3.2. Mid-Level Fusion Model Validation for Adulteration Classification

The performance of the PLS-DA model was validated using external validation samples (n = 105), and the results are shown in Table 2. It is clear to see that the validation group had an excellent classification result among all adulterated specimens. Most samples with safflower, turmeric, and red 40 adulterants were all correctly identified, with adulteration levels ranging from 2–10% w/w for artificial adulterants and 20–100% for natural adulterants. The model achieved a correct rate of 99.2%, in which no adulterated specimens were classified as pure saffron in the validation group, giving it a 100% accuracy rate on adulterated sample determination.

3.3. The Effect of Data Fusion on PLS Model Quantification Performances

Figure 5 shows the PLS plots that predict spike levels in different adulterated saffron samples. Three data blocks, namely, XFeatured Imaging, XFeatured Raman, and XFused, were used to build each PLS plot, respectively. The R2 and RMSECV values in Table 3 were used to describe the goodness of fit of the model and the model’s ability to predict unknown samples. The RMSEP values indicate the goodness of the prediction of the model using external validation samples (n = 25). However, it is worth mentioning that the RMSECV and RMSEP values are positively correlated to the scale of the data. In this case, samples with natural adulterants (20–100% w/w) were expected to have larger RMSECV and RMSEP values than samples with artificial adulterants (2–10% w/w). Thus, artificial adulterants (red 40 and yellow 5) and natural adulterants (safflower and turmeric) should be compared separately due to the different adulteration levels between these two types of adulterants.
Separately, the PLS plots based on the featured imaging and Raman data in Figure 5C(a,b),D(a,b) showed poor prediction performances in the cases of red 40 and yellow 5 adulterated samples. As previously mentioned, low spiking levels were the main reason behind this phenomenon. In addition, sample droplets suffered from diffusion problems in every direction on TLC substrates. Thus, the actual concentrations of the target adulterants on the chips were usually lower in the test spots. This issue can potentially be solved by concentrating the sample solution or dropping multiple sample droplets at the same spot. However, this approach may bring new detection challenges, such as stronger interference from crocin.
When the fused data was used, higher R2 and lower prediction errors were achieved in most cases (Table 3). The best improvement was observed in Figure 5D(c), where the fused data block of yellow 5 adulterated specimens produced a significant improvement in the fittings of the plot. This result indicated the existing collaborative effects when the imaging and featured Raman data were combined. These effects, although not significant for most adulterated samples, still helped improve the quantification capabilities of the model.
When using actual validation samples, a similar trend could be seen with the RMSEP values. That is, the model had better performance using the fused data block for most adulterated samples, showing lower errors of prediction.
Similar studies also reported that the integration of independent datasets, such as the fusion of electronic nose and electronic tongue data, resulted in improved regression models for predicting juice quality parameters, including pH, titratable acidity, vitamin C, total soluble solids, phenolic content, and color indices of red wine. In addition, these studies consistently found higher R2 values in both the calibration and validation sets for the combined dataset compared to each individual measurement in most cases [32,33].
Nevertheless, it is worth noting that the improvement of the fusion strategy on prediction accuracies varied across different parameters or attributes. While certain specific variables demonstrated significant enhancements in prediction accuracies as a result of the fusion approach, the majority of variables experienced only modest improvements. Moreover, upon analyzing the quantification plots, it was observed that the fused dataset closely resembled the highest-performing individual dataset, indicating the preservation of key characteristics and similarities between the fused data and the most optimal standalone dataset [32,33].
Despite the overall improved quantification performance achieved with the fused data, there were some exceptions that brought some concerns about the effectiveness of the model. For example, the reduction in RMSEP and RMSECV values was not consistent when the fused data were used in PLS models. Samples spiked with safflower and red 40 showed slightly higher RMSEP and RMSECV values in the fused data, respectively. It was reasonable to believe that sample variation might have caused this issue. However, it still exposed one weakness of utilizing data fusion in quantification analysis. That is, the concatenation of different data blocks might cause deleterious effects when bad data blocks or outliers were introduced. This problem might not be obvious in sample classification since it is in favor of sample variation. However, it was very sensitive to quantification analysis. This inconsistent phenomenon was discussed in similar studies, where the presence of redundant information or the absence of feature information could impair the effectiveness of the fused data block [34,35]. This could be attributed to inappropriate feature extraction methods, which may result in datasets that are data-rich but information-poor or lead to the loss of important feature information [16,36]. In the present study, the selection of featured imaging data, such as pattern location and data acquisition numbers for each collection area, was determined manually. Consequently, it would be challenging to evaluate which manually defined imaging texture or pattern was the most sensitive to different adulteration concentrations. Furthermore, in this study, all variables in the fused data were uniformly weighted, assuming equal importance for classification or quantification predictions. Although this approach was proven suitable for classification work, a variable-wise weighted fusion approach may be more appropriate for quantification tasks [36].

4. Conclusions

The present study examined the effectiveness of mid-level data fusion strategies on the improvement of saffron adulteration classification and quantification accuracies using TLC imaging data and Raman spectral data. Our results indicated that mid-level data fusion had excellent classification performance. Under this setting, all spiked samples were identified and distinguished from pure saffron samples. In the meantime, the validation result exhibited great capabilities to distinguish each adulterant with excellent accuracy (safflower, 100%; turmeric, 100%; red 40, 96%; yellow 5, 100%).
In the meantime, quantification accuracies of both artificially and naturally adulterated samples showed better performances using the fused data block, achieving higher R2 values with lower errors. However, upon analyzing the results from external validation samples, it was found that the improvements in quantification accuracies were not markedly significant in comparison to the advancements observed in the classification results.
Further work needs to be performed to establish better protocols for using imaging data in quantitative analysis. This could involve assessing various strategies for imaging feature extraction to further improve the accuracy of quantification. Other data blocks can also be investigated and introduced to finalize both classification and quantification models. Furthermore, the established result can be further concatenated to produce a high-level fused result, which consists of both classification and quantification results. First, classification results are obtained from mid-level data fusion using the PLS-DA model. Then, corresponding quantification algorithms will be chosen based on the classification result. Depending on the classification results, the quantification results could be expressed either as saffron grade or adulterant level. Eventually, the final developed algorithm would be able to automatically determine saffron authenticity, adulterant identification, spike level, and pure saffron grade at the same time to facilitate fast decision-making on site.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/foods12122322/s1, Figure S1. Raw Raman spectra of saffron extracts spiked with Red 40 at different levels. Figure S2. Raw Raman spectra of saffron extracts spiked with Yellow 5 at different levels. Figure S3. Raw Raman spectra of saffron extracts spiked with safflower at different levels. Figure S4. Raw Raman spectra of saffron extracts spiked with turmeric at different levels. Figure S5. Raw imaging data of Red 40 spiked saffron specimens under different lighting and processing conditions. Figure S6. Raw imaging data of Yellow 5 spiked saffron specimens under different lighting and processing conditions. Figure S7. Raw imaging data of safflower spiked saffron specimens under different lighting and processing conditions. Figure S8. Raw imaging data of turmeric spiked saffron specimens under different lighting and processing conditions.

Author Contributions

Conceptualization, L.H. and H.D.; methodology, H.D.; software, H.D.; validation, L.H., J.L. and H.D.; formal analysis, H.D.; investigation, L.H.; resources, L.H.; data curation, Q.G. and H.D.; writing—original draft preparation, Q.G. and H.D.; writing—review and editing, L.H., J.L. and H.D.; visualization, H.D.; supervision, L.H. and J.L.; project administration, L.H. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Heidarbeigi, K.; Mohtasebi, S.S.; Foroughirad, A.; Ghasemi-Varnamkhasti, M.; Rafiee, S.; Rezaei, K. Detection of Adulteration in Saffron Samples Using Electronic Nose. Int. J. Food Prop. 2014, 18, 1391–1401. [Google Scholar] [CrossRef]
  2. Sujata, V.; Ravishankar, G.A.; Venkataraman, L.V. Methods for the analysis of the saffron metabolites crocin, crocetins, picrocrocin and safranal for the determination of the quality of the spice using thin-layer chromatography, high-performance liquid chromatography and gas chromatography. J. Chromatogr. A 1992, 624, 497–502. [Google Scholar] [CrossRef]
  3. Huang, W.; Li, F.; Liu, Y.; Long, C. Identification of Crocus sativus (Iridaceae) and its adulterants by using DNA barcoding technique. Planta Med. 2014, 80, 36–42. [Google Scholar] [CrossRef]
  4. Melnyk, J.P.; Wang, S.N.; Massimo, M.F. Chemical and biological properties of the world’s most expensive spice: Saffron. Food Res. Int. 2010, 43, 1981–1989. [Google Scholar] [CrossRef]
  5. Hamidpour, R.; Hamidpour, S.; Hamidpour, M.; Shahlari, M. Effect of Crocus sativus and its active compounds for the treatment of several diseases: A review. Int. J. Case Rep. Images 2013, 4, 666. [Google Scholar] [CrossRef]
  6. Varliklioz Er, S.; Eksi-Kocak, H.; Yetim, H.; Boyaci, I.H. Novel Spectroscopic Method for Determination and Quantification of Saffron Adulteration. Food Anal. Methods 2016, 10, 1547–1555. [Google Scholar] [CrossRef]
  7. ISO 3632-1; Saffron (Crocus sativus Linneaus). Part 1: Specifications. International Organization For Standardization: London, UK, 2011.
  8. ISO 3632-2; Saffron (Crocus sativus Linneaus). Part 2: Test Methods. International Organization for Standardization: London, UK, 2010.
  9. Sabatino, L.; Scordino, M.; Gargano, M.; Belligno, A.; Traulo, P.; Gagliano, G. HPLC/PDA/ESI-MS Evaluation of Saffron (Crocus sativus L.) Adulteration. Nat. Prod. Commun. 2011, 6, 1934578X1100601. [Google Scholar] [CrossRef]
  10. Dai, H.; Gao, Q.; He, L. Rapid Determination of Saffron Grade and Adulteration by Thin-Layer Chromatography Coupled with Raman Spectroscopy. Food Anal. Methods 2020, 13, 2128–2137. [Google Scholar] [CrossRef]
  11. Zheng, J.; He, L. Surface-Enhanced Raman Spectroscopy for the Chemical Analysis of Food. Compr. Rev. Food Sci. Food Saf. 2014, 13, 317–328. [Google Scholar] [CrossRef]
  12. Borràs, E.; Ferré, J.; Boqué, R.; Mestres, M.; Aceña, L.; Busto, O. Data fusion methodologies for food and beverage authentication and quality assessment—A review. Anal. Chim. Acta 2015, 891, 1–14. [Google Scholar] [CrossRef] [PubMed]
  13. Cocchi, M. Data Fusion Methodology and Applications, 1st ed.; Elsevier: Amsterdam, The Netherlands, 2019; ISSN Book 31. [Google Scholar]
  14. Castanedo, F. A review of data fusion techniques. Sci. World J. 2013, 2013, 704504. [Google Scholar] [CrossRef]
  15. Rodriguez-Mendez, M.; Arrieta, A.; Parra, V.; Bernal, A.; Vegas, A.; Villanueva, S.; Gutierrez-Osuna, R.; AntoniodeSaja, J. Fusion of three sensory modalities for the multimodal characterization of red wines. IEEE Sens. J. 2004, 4, 348–354. [Google Scholar] [CrossRef]
  16. Zhou, L.; Zhang, C.; Qiu, Z.; He, Y. Information fusion of emerging non-destructive analytical techniques for food quality authentication: A survey. TrAC Trends Anal. Chem. 2020, 127, 115901. [Google Scholar] [CrossRef]
  17. Brás, L.P.; Bernardino, S.A.; Lopes, J.A.; Menezes, J.C. Multiblock PLS as an approach to compare and combine NIR and MIR spectra in calibrations of soybean flour. Chemom. Intell. Lab. Syst. 2005, 75, 91–99. [Google Scholar] [CrossRef]
  18. Maléchaux, A.; Laroussi-Mezghani, S.; Le Dréau, Y.; Artaud, J.; Dupuy, N. Multiblock chemometrics for the discrimination of three extra virgin olive oil varieties. Food Chem. 2020, 309, 125588. [Google Scholar] [CrossRef]
  19. Mehl, F.; Marti, G.; Merle, P.; Delort, E.; Baroux, L.; Sommer, H.; Wolfender, J.-C.; Rudaz, S.; Boccard, J. Integrating metabolomic data from multiple analytical platforms for a comprehensive characterisation of lemon essential oils. Flavour Fragr. J. 2015, 30, 131–138. [Google Scholar] [CrossRef]
  20. Dos Santos, C.A.T.; Páscoa, R.N.; Sarraguça, M.C.; Porto, P.A.; Cerdeira, A.L.; González-Sáiz, J.M.; Pizarro, C.; Lopes, J.A. Merging vibrational spectroscopic data for wine classification according to the geographic origin. Food Res. Int. 2017, 102, 504–510. [Google Scholar] [CrossRef]
  21. Biancolillo, A.; Foschi, M.; D’Archivio, A.A. Geographical classification of Italian saffron (Crocus sativus L.) by multi-block treatments of UV-Vis and IR spectroscopic data. Molecules 2020, 25, 2332. [Google Scholar] [CrossRef]
  22. Cheng, J.H.; Sun, D.W. Data fusion and hyperspectral imaging in tandem with least squares-support vector machine for prediction of sensory quality index scores of fish fillet. LWT-Food Sci. Technol. 2015, 63, 892–898. [Google Scholar] [CrossRef]
  23. Wang, S.; Huang, M.; Zhu, Q. Model fusion for prediction of apple firmness using hyperspectral scattering image. Comput. Electron. Agric. 2012, 80, 1–7. [Google Scholar] [CrossRef]
  24. Nanyam, Y.; Choudhary, R.; Gupta, L.; Paliwal, J. A decision-fusion strategy for fruit quality inspection using hyperspectral imaging. Biosyst. Eng. 2012, 111, 118–125. [Google Scholar] [CrossRef]
  25. Fan, S.; Li, C.; Huang, W.; Chen, L. Data fusion of two hyperspectral imaging systems with complementary spectral sensing ranges for blueberry bruising detection. Sensors 2018, 18, 4463. [Google Scholar] [CrossRef]
  26. Jin, G.; Wang, Y.J.; Li, M.; Li, T.; Huang, W.J.; Li, L.; Deng, W.-W.; Ning, J. Rapid and real-time detection of black tea fermentation quality by using an inexpensive data fusion system. Food Chem. 2021, 358, 129815. [Google Scholar] [CrossRef]
  27. Mishra, P.; Nordon, A.; Asaari, M.S.M.; Lian, G.; Redfern, S. Fusing spectral and textural information in near-infrared hyperspectral imaging to improve green tea classification modelling. J. Food Eng. 2019, 249, 40–47. [Google Scholar] [CrossRef]
  28. Salvi, M.; Michielli, N.; Molinari, F. Stain Color Adaptive Normalization (SCAN) algorithm: Separation and standardization of histological stains in digital pathology. Comput. Methods Programs Biomed. 2020, 193, 105506. [Google Scholar] [CrossRef] [PubMed]
  29. Van den Berg, R.A.; Hoefsloot, H.C.; Westerhuis, J.A.; Smilde, A.K.; Van der Werf, M.J. Centering, scaling, and transformations: Improving the biological information content of metabolomics data. BMC Genom. 2006, 7, 142. [Google Scholar] [CrossRef]
  30. Márquez, C.; López, M.I.; Ruisánchez, I.; Callao, M.P. FT-Raman and NIR spectroscopy data fusion strategy for multivariate qualitative analysis of food fraud. Talanta 2016, 161, 80–86. [Google Scholar] [CrossRef]
  31. Qi, L.; Zhang, J.; Zhao, Y.; Zuo, Z.; Wang, Y.Z.; Jin, H. Characterization of Gentiana rigescens by Ultraviolet–Visible and Infrared Spectroscopies with Chemometrics. Anal. Lett. 2017, 50, 1497–1511. [Google Scholar] [CrossRef]
  32. Qiu, S.; Wang, J.; Gao, L. Qualification and quantisation of processed strawberry juice based on electronic nose and tongue. LWT-Food Sci. Technol. 2015, 60, 115–123. [Google Scholar] [CrossRef]
  33. Rodriguez-Mendez, M.L.; Apetrei, C.; Gay, M.; Medina-Plaza, C.; De Saja, J.A.; Vidal, S.; Aagaard, O.; Ugliano, M.; Wirth, J.; Cheynier, V. Evaluation of oxygen exposure levels and polyphenolic content of red wines using an electronic panel formed by an electronic nose and an electronic tongue. Food Chem. 2014, 155, 91–97. [Google Scholar] [CrossRef] [PubMed]
  34. Biancolillo, A.; Marini, F.; Ruckebusch, C.; Vitale, R. Chemometric strategies for spectroscopy-based food authentication. Appl. Sci. 2020, 10, 6544. [Google Scholar] [CrossRef]
  35. Westerhuis, J.A.; Coenegracht, P.M. Multivariate modelling of the pharmaceutical two-step process of wet granulation and tableting with multiblock partial least squares. J. Chemom. A J. Chemom. Soc. 1997, 11, 379–392. [Google Scholar] [CrossRef]
  36. Yuan, X.; Huang, B.; Wang, Y.; Yang, C.; Gui, W. Deep learning-based feature representation and its application for soft sensor modeling with variable-wise weighted SAE. IEEE Trans. Ind. Inform. 2018, 14, 3235–3243. [Google Scholar] [CrossRef]
Figure 1. Raman spectra of adulterated saffron samples collected on TLC substrates.
Figure 1. Raman spectra of adulterated saffron samples collected on TLC substrates.
Foods 12 02322 g001
Figure 2. (A) TLC pattern of adulterated saffron specimen under ambient and UV light; (B) featured imaging data collection point; (C) channel split in imaging processing software (Adobe Photoshop).
Figure 2. (A) TLC pattern of adulterated saffron specimen under ambient and UV light; (B) featured imaging data collection point; (C) channel split in imaging processing software (Adobe Photoshop).
Foods 12 02322 g002
Figure 3. Schematic view of mid-level data fusion processing.
Figure 3. Schematic view of mid-level data fusion processing.
Foods 12 02322 g003
Figure 4. PLS-DA 3D-plots of (A) featured imaging data; (B) featured Raman data; (C) fused data.
Figure 4. PLS-DA 3D-plots of (A) featured imaging data; (B) featured Raman data; (C) fused data.
Foods 12 02322 g004
Figure 5. Predicted vs. actual PLS plots of spiked saffron samples using featured imaging, featured Raman, and fused data blocks for calibration (black diamond) and external validation (white diamond).
Figure 5. Predicted vs. actual PLS plots of spiked saffron samples using featured imaging, featured Raman, and fused data blocks for calibration (black diamond) and external validation (white diamond).
Foods 12 02322 g005
Table 1. Misclassification table of spiked saffron samples based on (A) featured Imaging data, (B) featured Raman data, and (C) fused data as the training group.
Table 1. Misclassification table of spiked saffron samples based on (A) featured Imaging data, (B) featured Raman data, and (C) fused data as the training group.
MembersCorrect (%)SafflowerTurmericRed 40Yellow 5Pure Saffron
A. Featured Imaging data
Safflower50100%500000
Turmeric5056%2282000
Red 405096%004820
Yellow 550100%000500
Pure Saffron100%000100
Total21083.81%522868620
B. Featured Raman Data
Safflower5094%473000
Turmeric5094%347000
Red 405080%0040100
Yellow 55072%0014360
Pure Saffron10100%000010
Total21085.71%5050544610
C. Fused Data
Safflower50100%500000
Turmeric50100%149000
Red 405098%005000
Yellow 550100%000500
Pure Saffron10100%000010
Total21099.52%5149505010
Note: Highlighted numbers indicate accurately classified samples.
Table 2. Misclassification table of the validation group based on the fused data block.
Table 2. Misclassification table of the validation group based on the fused data block.
MembersCorrect (%)SafflowerTurmericRed 40Yellow 5Pure Saffron
Fused data (external validataion)
Safflower25100%250000
Turmeric25100%025000
Red 402596%002410
Yellow 525100%000250
Pure Saffron5100%00005
Total10599.20%252524265
Note: Highlighted numbers indicate accurately classified samples.
Table 3. Statistics for each data block in external and cross-validations.
Table 3. Statistics for each data block in external and cross-validations.
Featured Imaging DataFeatured Raman DataFused Data
R2RMSEPRMSECVR2RMSEPRMSECVR2RMSEPRMSECV
Safflower0.9236.156.020.9594.5411.090.9614.785.97
Turmeric0.9467.307.960.9783.535.370.9823.115.05
Red 400.9231.280.820.8921.651.370.9781.030.97
Yellow 50.8732.841.160.8681.601.380.9901.550.72
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Dai, H.; Gao, Q.; Lu, J.; He, L. Improving the Accuracy of Saffron Adulteration Classification and Quantification through Data Fusion of Thin-Layer Chromatography Imaging and Raman Spectral Analysis. Foods 2023, 12, 2322. https://doi.org/10.3390/foods12122322

AMA Style

Dai H, Gao Q, Lu J, He L. Improving the Accuracy of Saffron Adulteration Classification and Quantification through Data Fusion of Thin-Layer Chromatography Imaging and Raman Spectral Analysis. Foods. 2023; 12(12):2322. https://doi.org/10.3390/foods12122322

Chicago/Turabian Style

Dai, Haochen, Qixiang Gao, Jiakai Lu, and Lili He. 2023. "Improving the Accuracy of Saffron Adulteration Classification and Quantification through Data Fusion of Thin-Layer Chromatography Imaging and Raman Spectral Analysis" Foods 12, no. 12: 2322. https://doi.org/10.3390/foods12122322

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop