Next Article in Journal
Comparative Study of Coupling Techniques in Lamb Wave Testing of Metallic and Cementitious Plates
Previous Article in Journal
Applying OGC Sensor Web Enablement Standards to Develop a TDR Multi-Functional Measurement Model
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Online Feature Selection for Robust Classification of the Microbiological Quality of Traditional Vanilla Cream by Means of Multispectral Imaging

by
Alexandra Lianou
1,
Arianna Mencattini
2,
Alexandro Catini
2,
Corrado Di Natale
2,
George-John E. Nychas
1,
Eugenio Martinelli
2,* and
Efstathios Z. Panagou
1,*
1
Laboratory of Microbiology and Biotechnology of Foods, Department of Food Science and Human Nutrition, Agricultural University of Athens, Iera Odos 75, 11855 Athens, Greece
2
Department of Electronic Engineering, University of Rome for Vergata, via del Politecnico 1, 00133 Roma, Italy
*
Authors to whom correspondence should be addressed.
Sensors 2019, 19(19), 4071; https://doi.org/10.3390/s19194071
Submission received: 31 July 2019 / Revised: 12 September 2019 / Accepted: 16 September 2019 / Published: 20 September 2019
(This article belongs to the Section Optical Sensors)

Abstract

:
The performance of an Unsupervised Online feature Selection (UOS) algorithm was investigated for the selection of training features of multispectral images acquired from a dairy product (vanilla cream) stored under isothermal conditions. The selected features were further used as input in a support vector machine (SVM) model with linear kernel for the determination of the microbiological quality of vanilla cream. Model training (n = 65) was based on two batches of cream samples provided directly by the manufacturer and stored at different isothermal conditions (4, 8, 12, and 15 °C), whereas model testing (n = 132) and validation (n = 48) were based on real life conditions by analyzing samples from different retail outlets as well as expired samples from the market. Qualitative analysis was performed for the discrimination of cream samples in two microbiological quality classes based on the values of total viable counts [TVC ≤ 2.0 log CFU/g (fresh samples) and TVC ≥ 6.0 log CFU/g (spoiled samples)]. Results exhibited good performance with an overall accuracy of classification for the two classes of 91.7% for model validation. Further on, the model was extended to include the samples in the TVC range 2–6 log CFU/g, using 1 log step to define the microbiological quality of classes in order to assess the potential of the model to estimate increasing microbial populations. Results demonstrated that high rates of correct classification could be obtained in the range of 2–5 log CFU/g, whereas the percentage of erroneous classification increased in the TVC class (5,6) that was close to the spoilage level of the product. Overall, the results of this study demonstrated that the UOS algorithm in tandem with spectral data acquired from multispectral imaging could be a promising method for real-time assessment of the microbiological quality of vanilla cream samples.

1. Introduction

Quality and safety assurance are of paramount importance for the agro-food industry of the 21st century. Currently food quality and safety assessment are ensured through microbiological, physicochemical and sensory analyses. However, the majority of these methods are labor-intensive, time-consuming, destructive, require experienced personnel, and most importantly, provide retrospective information [1]. Thus, several analytical approaches have been proposed for the non-destructive and rapid monitoring and assessment of food quality and safety including sensor arrays (e.g., electronic noses), spectroscopy methods (e.g., vibrational, NMR or mass spectroscopy techniques), as well as imaging technology approaches [2,3,4,5]. With regard to imaging technology approaches, hyperspectral imaging spectroscopy, coupled with image analysis techniques and chemometric analyses, has been extensively investigated in recent studies for the purpose of food quality evaluation and monitoring [6,7,8,9,10]. Meanwhile, multispectral image analysis, although less studied than hyperspectral imaging, has also been proposed as a promising optical sensing technique for the rapid and non-invasive assessment of the quality attributes of fresh produce commodities [11,12] and the microbiological spoilage of meat and poultry [13,14]. Multispectral imaging (MSI) combines spectral and spatial information, with the former corresponding to the visual and short wave near infrared range of the spectrum and being obtained in discrete bands [15,16]. In this way, this technique has the ability of providing information pertinent to the so-called “surface chemistry” of a given food sample [15].
Hyperspectral images are composed of hundreds of wavelengths for each pixel of the object under study resulting in a high data processing load that may increase the complexity of online application. Moreover, a common problem in pattern recognition is the high collinearity and data redundancy among the variables affecting the robustness of classification or predictive models [17,18]. It is thus necessary to employ feature selection as an important step in identifying the small set of features that carries most of the information contained in the samples [19,20,21,22,23,24]. In this way, the exclusion of non-informative variables from the analysis will result in a simpler model with improved discrimination or prediction capacity. Several feature selection processes have been reported in the literature for developing multispectral imaging quality and safety control systems, based on multivariate methods such as Principal Components Analysis (PCA), Partial Least Squares Regression (PLSR), Stepwise Regression (SE), Successive Projection Algorithm (SPA), Genetic Algorithms (GA) [25]. The selection of the most appropriate approach depends on the nature of the problem under investigation, the size of the dataset, and the kind of sensors used. Feature selection is a crucial part of the previous algorithms and may strongly affect the final performance of classification. In this work, the Unsupervised On-line feature Selection (UOS) approach was employed for feature selection introduced by Magna et al. [26]. This approach is based on two different feature selection procedures. The first feature selection step is applied to the training data removing the non-informative features from the initial dataset. The second procedure is applied every time that a class label has to be assigned to a new sample. For each test sample, a decision-making process uses the training templates and the test sample data to further tune the most appropriate new set of features to build a classification model for the specific test sample.
In this context, the objective of the present study was to assess the performance of UOS on experimental datasets derived from a work aiming at the evaluation of multispectral imaging as means of characterizing the microbiological quality of a traditional dairy product (vanilla cream).

2. Materials and Methods

2.1. Vanilla Cream Samples

For a detailed description of the experimental procedure the reader should refer to a previously published work [27]. In brief, the cream samples employed in this work were packages (170 g) of commercially prepared product that were obtained: (a) directly from the manufacturer within three days of production (65 samples), (b) from different retail stores at various points of their shelf life (48 samples) and (c) from expired samples in retail outlets that were returned to the manufacturer (132 samples). The samples were subjected to microbiological analysis for the determination of their microbial load in terms of total viable counts (TVC) whereas at the same time multispectral images were acquired according to the procedures described in Section 2.3. Samples from two different production batches obtained directly from the manufacturer were stored at 4, 8, 12, and 15 °C in high-precision incubators (MIR-153 Sanyo Electric Co., Osaka, Japan) for a maximum period of 40 days. At appropriate time intervals during storage, depending on storage temperature, duplicate samples were subjected to the above-mentioned analyses.

2.2. Microbiological Analysis

A weighted amount (25 g) of cream sample was transferred aseptically to a sterile stomacher bag (Seward Medical, London, UK) containing 225 mL of sterilized quarter-strength Ringer’s solution (Lab M Limited, Lancashire, UK) and homogenized in a Stomacher device (Lab Blender 400, Seward Medical) for 1 min at room temperature. For the enumeration of total mesophiles (i.e., total viable counts (TVC)), appropriate serial decimal dilutions in Ringer’s solution were surface plated on tryptic glucose yeast agar (Biolife, Milan, Italy) and colonies were counted after incubation of the plates at 30 °C for 48 h. The obtained microbiological data were expressed as log (colony forming units) per gram of cream (log CFU/g).

2.3. Image Acquisition and Analysis

Images from each cream sample were acquired using the VideometerLab system originally developed by the Technical University of Denmark [28] and commercialized by “Videometer A/S” (http://www.videometer.com). This instrument acquires multispectral images in 18 different non-uniformly distributed wavelengths ranging from UV (405 nm) to short wave NIR (970 nm). During spectra collection, a data cube of spatial and spectral data of size m × n × 18 is acquired for each cream sample (m × n denotes the size in pixels). This process results in the generation of an increased amount of data, representing samples in a time series experiment and/or under different storage conditions (i.e., temperature in this case). Prior to image acquisition the system was subjected to a light set up procedure known as “autolight” and calibrated radiometrically and geometrically as previously described [29]. Each cream sample (i.e., plastic cup after removal of the aluminum lidding foil) was placed inside an Ulbricht sphere in which the camera is top-mounted and the corresponding multispectral image of the product’s surface was acquired. In order to remove redundant information related to the background (e.g., plastic cup) and not to the cream sample per se, image segmentation was performed using an in-house method for automated multispectral image analysis. This recently developed method allows for a high-throughput and unbiased analysis of food samples and has been proven to be highly reliable and robust [30]. Briefly, the corresponding processing system embeds a phase of selecting the optimal channel combination without user intervention and, more importantly, regardless of the food sample. Then, the appropriate threshold for the extraction of the informative sample area (i.e., the cream area only) is defined automatically and individually based on Gaussian Mixture Models [31], a robust machine learning technique adapted to this particular application [30]. After the segmentation process of each image, the pixels’ values for each wavelength were normalized into the range [0,1] and the mean reflectance spectrum (i.e., mean intensity of pixels within the informative area) along with the corresponding standard deviation values were calculated. Hence, the multispectral imaging data utilized in subsequent analyses consisted of 18 mean and 18 standard deviation values of the reflectance spectra. A typical spectrum of a fresh vanilla cream sample compared to spoiled samples stored at 4 and 8 °C for 40 days as well as at 12 and 15 °C for 18 days is presented in Figure 1. It is evident that no clear pattern between fresh and spoiled samples could be obtained in the visible region of the spectrum; however, some differences could be established in the near-infrared area, a fact that necessitates the implementation of feature selection for the effective discrimination of the samples in quality classes.

2.4. Data Labeling

The overall dataset consisted of 245 spectra of cream samples, from which (a) 65 spectra were acquired during the storage of the samples at isothermal conditions and used in model training, (b) 48 spectra were captured from samples coming from retail outlets at various time points of shelf life and used in model validation, and finally (c) 132 spectra were collected from expired samples and used in model testing. The input matrix for analysis contained as variables (features) the 18 average and 18 standard deviation values of the reflectance spectra acquired from 245 vanilla cream samples, whereas the output matrix contained the results of the microbiological analyses for the determination of TVCs in the respective samples. As a preliminary approach, we intended to apply the procedure to a binary classification problem that consisted in automatically determining the samples having a TVC higher than 6 log CFU/g or smaller than 2 log CFU/g. The idea was to construct a model able to accurately recognize fresh (TVC ≤ 2) or spoiled samples (TVC ≥ 6). The latter threshold value could be considered as the general microbiological guideline applied in food quality [32]. It must be underlined that the value of TVC ≤ 2 is a critical in-house threshold for the Quality Control Department of the dairy industry that supplied the cream samples, because if the population of TVC is less than 2 log CFU/g then the product is released immediately to the market with a designated shelf life of 30 days. So far, the determination of this threshold is defined by expensive, laborious, and time-consuming microbiological analysis. Thus, multispectral imaging could become a low cost, rapid and fully automated method to facilitate the decision-taking process of quality managers in order to release the product in the market, by correlating the outcome of microbiological analysis to multispectral images. As a second step, the recognition model will be applied to those test samples in the range of TVC ∈ (2,6) to assess the capacity of the system to recognize progressively increasing microbial populations. The data labelling scenario is presented in Table 1.

2.5. Dynamic Feature Selection (DFS) Method

Under the assumption that feature selection plays a crucial role in the recognition performance, especially due to the heterogeneity of the test set, a UOS based on a dynamic feature selection (DFS) procedure intended to optimally select model features according to each specific test data was employed in this work. More specifically, in line with the UOS approach recently developed [26,33,34] the following three-level DFS approach was designed, as summarized in Figure 2.

2.5.1. Training-Dependent Feature Elimination Step

The Fisher Discriminant Score (FDS) is used to sort all the available features in the training set according to their correlation with the classification problem [26]. Considering a two-class problem, let us assume that each class has Ld, d = {1,2} training vectors i each composed by elements b i k , i = 1, …, Ld, k =1, …, M, with M the total number of features.
The FDSk for each feature k, is then defined as the ratio between intra-class and inter-class variance and computed as follows:
F D S k = S B k / S W k   ,
where S B k and S W k are the intra-class and the inter-class variance, respectively. More specifically, S B k is defined as:
S B k = d = 1 2 ( b { Y = Y d } k b ¯ k ) 2 ,
with Y d = { 0 , 1 } , indicating the two class labels, b ¯ k representing the average value of the feature values b k computed over all the classes and SWk defined as:
S W k = d = 1 2 1 L d i = 1 L d ( b { Y = Y d } i k b ¯ d k ) 2 .
with b { Y = Y d } i k the i-th element of feature k-th for the class labelled as Y d and b ¯ d k the average value of k-th feature in the class labelled as Y d . For each feature k, S B k quantifies the dispersion of training samples in a class (by their variance) with respect to the global average value of that feature and S W k quantifies the sum of the dispersions of the elements in each class d (i.e., the sum of the inter-class variances).
The higher the FDSk is the more the k-th feature is representative of at least one class (high SBk and low SWk). On the contrary, features with a FDSk smaller than a given threshold, here indicated as thFD, will be immediately rejected (see first decision block in Figure 2).

2.5.2. Online Test-Dependent Feature Elimination Step

This step utilizes two criteria for further selecting features each time a test set is acquired and processed. The decision is made based on the training set and the test sample newly acquired. This step aims to online remove the features in which either the test sample is surrounded by samples of different classes (high probability of misclassification) or it is far from all class distributions (feature outlier values). For this task, we designed the two following criteria.
Let us denote with s the test sample and with s k its k-th element, named for brevity test element.
Determining Feature Outlier Values. The proposed method is founded on two criteria. Considering the generic feature k-th, the first criterion is based on the ratio between the Mahalanobis distance of the test sample s k from the distribution of the classes, indicated as MRk ( s k ). This value is computed as follows:
M R k ( s k ) = M k ( s k , Y = Y 1 ) · M k ( s k , Y = Y 1 ) M k ( s k , Y = Y 2 )   ,
where M k ( s k , Y = Y d ) indicates the Mahalanobis distance of the test sample s k from the training elements belonging to the class labelled as Y d and it is described as:
M k ( s k , Y = Y d ) = ( s k b ¯ d k ) ( C o v ( b { Y = Y d } k ) ) 1 ( s k b ¯ d k ) ,
where C o v ( b { Y = Y d } k ) is the sample covariance matrix of the class labelled as Y d . The M R k ( s k ) gives a quantitative evaluation of the distance of the two classes distribution and the test sample. Lower values of the M R k ( s k ) descriptor indicate that the test sample s k is near to a specific class and far from the other one. Therefore, this means that specific feature is representative of at least one class. Hence, for the task of analysing features with a smaller value of M R k , we set an additional threshold value thMR (see second decision block in Figure 2).
Determining probability of misclassification. The second criterion is applied during the online feature selection procedure. In this step, the algorithm evaluates the maximum probability Pk for the test example s k to fit to different class distribution. For the k-th feature, Pk is calculated in the following way:
P k ( s k ) = max d ( 1 2 π σ d k exp ( ( s k b ¯ d k ) 2 2 σ d k 2 ) ) ,
where σ d k is the standard deviation of training sample values for feature k-th in the class labelled as Y d . The higher P k ( s k ) values are the higher is the probability of the test sample s k to belong to a given class distribution. To evaluate the features that have a higher probability to belong to a given class, we defined a threshold value thP (see third decision block in Figure 2).
For each test sample s , only the features in the training set that satisfy all the three conditions are considered to build the classification model.
Among the possible models, in this work we have considered as classifier a Support Vector Machine (SVM) with linear kernel [35]. A new set of features is selected at each new test sample step; it is important to note that features not considered in the online feature selection procedure are re-inserted in the training set and then considered for the feature selection of new test data.

3. Results

3.1. DFS Optimization Procedure

With the aim to demonstrate the effectiveness of the DFS approach and its superiority with respect to standard feature selection for classification, the assessment procedure shown in Figure 3 was designed. By an exhaustive optimum search procedure, we designed three nested loop procedures in which each threshold, th1, th2, th3 varies within a specific range, th1 ∈ [0.13–0.17], step size equal to 0.01, th2 ∈ [0.74–0.77], step size 0.01, th3 ∈ [0.08–0.12], step size 0.01. At each iteration, the training set along with the corresponding label ytrain were fed as training data to the DFS and to the SVM classification model. The SVM model constructed is tested on the validation set thus producing an estimated label vector y ^ val. The value is compared against the actual validation label vector y val. The three thresholds are optimized by minimizing the absolute error between y val and y ^ val. After the thresholds have been set to optimal values, the DFS is run again over the training and test set, the SVM model is constructed over the selected features and an output label is assigned to the test set, y ^ test.

3.2. Simulation Results

In a first simulation, we collected the results of the proposed approach obtained when applied over test samples having a TVC smaller than 2 log CFU/g or larger than 6 log CFU/g. Results are presented in terms of accuracy of classification, i.e., the number of total samples correctly classified as fresh or spoiled with respect to the total number of samples in the fresh and spoiled range. Results are shown in terms of confusion matrix in Figure 4. The first column indicates the total number of negative instances in the test dataset (124 fresh samples) and the second column represents the total number of positive instances (8 spoiled samples). The confusion matrix indicates that out of the 124 fresh samples, only 114 were attributed to the negative class (i.e., correctly classified as fresh) and 10 were assigned to the spoiled class (i.e., misclassified as spoiled), leading to a specificity value (true negative rate) of 91.9%. Analogously, out of the eight spoiled samples, seven were assigned to the positive class (i.e., correctly classified as spoiled) and only 1 to the negative class (i.e., misclassified as fresh), resulting in a sensitivity value (true positive rate) of 87.5%. Globally, the system achieved an accuracy value equal to 91.7% for both classes. When applying the same approach also to all the samples in the test dataset we obtained the results reported in Table 2. As a general trend, it can be observed that positive assignments increase with TVC values, whereas negative assignments decrease for TVC increase. Specifically, apart from the fresh (TVC ≤ 2.0 log CFU/g) and spoiled (TVC ≥ 6.0 log CFU/g) samples, the system was able to predict accurately the population of TVC in the range between 2 and 5 log CFU/g. However, as the sample approaches the spoilage condition the percentage of erroneous predictions increases. Reasonably, the most difficult cases are those close to spoilage level. It could be thus suggested that TVC values below 5.0 log CFU/g could be associated with fresh samples, whereas the values of TVC ∈ [5,6) could be regarded as a transition zone to spoilage and could be thus associated to the state of incipient spoilage of vanilla cream samples.

3.3. Comparative Analysis

In order to demonstrate the effectiveness of the DFS approach in discriminating fresh from spoiled samples, some additional simulations were performed. Specifically, we decided to compare the following alternative methods: first, we applied different classification algorithms such as Linear and Quadratic Discriminant Analysis (LDA) and (QDA) approaches [36]. Then, we applied a standard Stepwise Regression (SE) static feature selection within a SVM classifier. Results are all presented in Table 3. It can be noted that LDA and QDA did not reach a high accuracy value, while presenting an adequate balancing of the results in critical categories of spoilage (third and fourth columns). This is due to the fact that DFS is the key part of the algorithm and manages to keep high discrimination performance. On the contrary, when DFS was replaced with standard SE feature selection approach, the performance severely got worse at the expense of low accuracy values (70%). More specifically, out of the 124 fresh samples, 74 were assigned to the spoiled category.

4. Discussion

Multispectral imaging has been increasingly applied as a powerful analytical tool for rapid and non-destructive determination of food quality and safety, including meat [37,38,39,40], fish [41,42,43], constituent analysis [44,45], adulteration [29,46,47], fruits and vegetables [48,49,50], nuts and grains [51,52] and fungal identification and growth inhibition [53,54]. To our knowledge, there are no published data related to the application and evaluation of multispectral image analysis as a means of appraising the microbiological quality of milk and dairy products. In a recent work [30] a high throughput multispectral image processing pipeline was reported for several foods, including vanilla cream. However, the purpose of that work was not to determine the microbiological quality of the samples during storage, but rather to develop an automated segmentation method for the acquired multispectral images based on unsupervised machine learning approach (Gaussian Mixture Models). An important point in this work is that the developed model included as much variability as possible, as in real life, since model development was based on two different product batches supplied directly by the manufacturer of the vanilla cream that were maintained under laboratory conditions, whereas model testing and validation were based on samples from different retail outlets as well as expired samples from the market. The performance of UOS approach has been successfully used in the past with simulated and experimental datasets for robust classification with drifting and faulty gas sensors [26] as well as in the case of self-repairing classification algorithms for chemical sensor array [55], affective computing [33], and developmental disorders recognition [34]. Not by chance all the considered scenarios present an intrinsic data heterogeneity that makes it usually difficult to develop an effective discrimination strategy using standard paradigms (i.e., diversity of emotion manifesting, autism phenotyping, food spoilage distribution).
As a further proof of concept, we analyzed how the different features were selected during the online procedure. To this scope, the selected features of the training dataset were firstly analyzed with a leave-one-out cross validation procedure. In a second step, consideration was given on the features selected with the test dataset. The results are shown in Figure 5 where the selection rate for all the features is illustrated. We separated the 18 average reflectance spectra features (upper graph) and the 18 standard deviation reflectance spectra features (lower graph), for the sake of demonstrating different behavior of the two groups of features highlighting the different information content carried by the two kind of descriptors. As it can be observed, there was a strong discrepancy in features selected for different datasets especially for the average reflectance spectra. This demonstrates once more the importance of the DFS approach for such real-life practical scenarios, where unpredictable fluctuations of the experimental conditions can drastically affect the measurement of data.
It needs to be noted that the UOS processing algorithm was applied for the first time on a food commodity (vanilla cream) with very promising results for the microbiological quality characterization of the product based on the analysis of multispectral images. It is characteristic that 96% of the samples with TVC < 2.0 log CFU/g were correctly classified, a fact that creates new perspectives for the dairy industry to employ multispectral imaging spectroscopy in the quality control pipeline for the rapid estimation of the microbiological quality of vanilla cream and the immediate release of the product to the market.

5. Conclusions

The results obtained in this work demonstrated that the UOS algorithm employed for a DFS of spectral data acquired from multispectral images of vanilla cream samples stored at isothermal conditions (4, 8, 12 and 15 °C) for an overall period of 40 days allowed the development of a robust classification model for rapid and non-destructive determination of the microbiological quality of this dairy product for the given set of experimental conditions. The model was extensively tested with independent data from retail outlets at different time points during the shelf-life of the product to include as much variability as possible. As confirmed by the results, the UOS strategy was able to maintain low classification error in the test dataset, especially in the class corresponding to the “fresh” vanilla cream samples, which is used as a criterion by the manufacturer for product release to the market. This fact provides promising perspectives for direct implementation of multispectral imaging in tandem with the UOS strategy in the quality control pipeline of the dairy industry.

Author Contributions

Conceptualization, A.L., E.Z.P. and G.-J.N.; methodology, E.Z.P. and E.M.; software, A.M., A.C., C.D.N. and E.M.; validation, A.M., A.C. and E.M.; formal analysis, A.M., C.D.N., A.C. and E.M.; investigation, E.M. and E.Z.P.; resources, E.M.; data curation, A.M., A.C. and E.M.; writing—original draft preparation, E.Z.P. and E.M.; writing—review and editing, E.Z.P. and E.M.; visualization, A.M. and E.M.; supervision, E.Z.P. and E.M.; project administration, E.M. and G.-J.N.; funding acquisition, G.-J.N. and E.M.

Funding

This work has been supported by the project “PhasmaFOOD”, funded from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 732541.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Nychas, G.-J.E.; Panagou, E.Z.; Mohareb, F. Novel approaches for food safety management and communication. Curr. Opin. Food Sci. 2016, 12, 13–20. [Google Scholar] [CrossRef] [Green Version]
  2. Russ, J.C. Image analysis of foods. J. Food Sci. 2015, 80, E1974–E1987. [Google Scholar] [CrossRef] [PubMed]
  3. Medina, S.; Pereira, J.A.; Silva, P.; Perestrelo, R.; Camara, J.S. Food fingerprints-A valuable tool to monitor food authenticity and safety. Food Chem. 2019, 278, 144–162. [Google Scholar] [CrossRef] [PubMed]
  4. ElMasry, G.; Mandour, N.; Al-Rejaie, S.; Belin, E.; Rousseau, D. Recent applications of multispectral imaging in seed phenotyping and quality monitoring—An overview. Sensors 2019, 19, 1090. [Google Scholar] [CrossRef] [PubMed]
  5. Su, W.H.; He, H.J.; Sun, D.W. Non-destructive and rapid evaluation of staple foods quality by using spectroscopic techniques: a review. Crit. Rev. Food Sci. Nutr. 2017, 57, 1039–1051. [Google Scholar] [CrossRef] [PubMed]
  6. Cheng, J.H.; Sun, D.W. Rapid quantification analysis and visualization of Escherichia coli loads in grass carp fish flesh by hyperspectral imaging method. Food Bioprocess Tech. 2015, 8, 951–959. [Google Scholar] [CrossRef]
  7. Feng, Y.Z.; Sun, D.W. Near-infrared hyperspectral imaging in tandem with partial least squares regression and genetic algorithm for non-destructive determination and visualization of Pseudomonas loads in chicken fillets. Talanta 2013, 109, 74–83. [Google Scholar] [CrossRef] [PubMed]
  8. He, H.J.; Sun, D.W.; Wu, D. Rapid and real-time prediction of lactic acid bacteria (LAB) in farmed salmon flesh using near-infrared (NIR) hyperspectral imaging combined with chemometric analysis. Food Res. Int. 2014, 62, 476–483. [Google Scholar] [CrossRef]
  9. ElMasry, G.M.; Nakauchi, S. Image analysis operations applied to hyperspectral images for non-invasive sensing of food quality—A comprehensive review. Biosyst. Eng. 2016, 142, 53–82. [Google Scholar] [CrossRef]
  10. Wu, D.; Sun, D.W. Advanced applications of hyperspectral imaging technology for food quality and safety analysis and assessment: A review-Part II: Applications. Innov. Food Sci. Emerg. Tech. 2013, 19, 15–28. [Google Scholar] [CrossRef]
  11. Su, W.H.; Sun, D.W. Multispectral imaging for plant food quality analysis and visualization. Compr. Rev. Food Sci. F. 2017, 17, 220–239. [Google Scholar] [CrossRef]
  12. Liu, C.; Liu, W.; Lu, X.; Ma, F.; Chen, W.; Yang, J.; Zheng, L. Application of multispectral imaging to determine quality attributes and ripeness stage in strawberry fruit. PLoS ONE 2014, 9, e87818. [Google Scholar] [CrossRef] [PubMed]
  13. Dissing, B.S.; Papadopoulou, O.S.; Tassou, C.C.; Ersbøll, B.K.; Carstensen, J.M.; Panagou, E.Z.; Nychas, G.-J.E. Using multispectral imaging for spoilage detection of pork meat. Food Bioprocess Tech. 2013, 6, 2268–2279. [Google Scholar] [CrossRef]
  14. Wang, K.; Pu, H.; Sun, D.W. Emerging spectroscopic and spectral imaging techniques for the rapid detection of microorganisms: An overview. Compr. Rev. Food Sci. F. 2018, 17, 256–273. [Google Scholar] [CrossRef]
  15. Carstensen, J.M.; Hansen, M.E.; Lassen, N.K.; Hansen, P.W. Creating surface chemistry maps using multispectral vision technology. In 9th Medical Image Computing and Computer Assisted Intervention (MICCAI)—Workshop on Biophotonics Imaging for Diagnostics and Treatment, Institute of Mathematical Modelling; Ersbøll, B., Jørgensen, T.M., Eds.; Technical University of Denmark: Lyngby, Denmark, 2006. [Google Scholar]
  16. Mehl, P.M.; Chen, Y.R.; Kim, M.S.; Chan, D.E. Development of hyperspectral imaging technique for the detection of apple surface defects and contaminations. J. Food Eng. 2004, 61, 67–81. [Google Scholar] [CrossRef]
  17. Brereton, R.G. Consequences of sample size, variable selection and model validation and optimisation for predicting classification ability from analytical data. Trend. Anal. Chem. 2006, 25, 1103–1111. [Google Scholar] [CrossRef]
  18. Dai, Q.; Cheng, J.H.; Sun, D.W.; Zeng, X.A. Advances in feature selection methods for hyperspectral image processing in food industry applications. Crit. Rev. Food Sci. Nutr. 2015, 56, 1368–1382. [Google Scholar] [CrossRef]
  19. Ariana, D.P.; Lu, R. Hyperspectral waveband selection for internal defect detection of pickling cucumbers and whole pickles. Comput. Electron. Agric. 2010, 74, 137–144. [Google Scholar] [CrossRef]
  20. Su, W.H.; Sun, D.W. Evaluation of spectral imaging for inspection for adulterants in terms of common wheat flour, cassava flour and corn flour in organic Avatar wheat (Triticum spp.) flour. J. Food Eng. 2017, 59–69. [Google Scholar] [CrossRef]
  21. Kamruzzaman, M.; Makino, Y.; Oshita, S.; Liu, S. Assessment of visible near-infrared hyperspectral imaging as a tool for detection of horsemeat adulteration in minced beef. Food Bioprocess Tech. 2015, 8, 1054–1062. [Google Scholar] [CrossRef]
  22. Kamruzzaman, M.; Makino, Y.; Oshita, S. Rapid and non-destructive detection of chicken adulteration in minced beef using visible near-infrared hyperspectral imaging and machine learning. J. Food Eng. 2016, 170, 8–15. [Google Scholar] [CrossRef]
  23. Su, W.H.; Sun, D.W. Comparative assessment of feature-wavelength eligibility for measurement of water binding capacity and specific gravity of tuber using diverse spectral indices stemmed from hyperspectral images. Comput. Electron. Agric. 2016, 130, 69–82. [Google Scholar] [CrossRef]
  24. Su, W.H.; Bakalis, S.; Sun, D.W. Chemometrics in tandem with near infrared (NIR) hyperspectral imaging and Fourier transform mid infrared (FT-MIR) microspectroscopy for variety identification and cooking loss determination of sweet potato. Biosyst. Eng. 2019, 180, 70–86. [Google Scholar] [CrossRef]
  25. Pu, H.; Kamruzzaman, M.; Sun, D.-W. Selection of feature wavelengths for developing multispectral imaging systems for quality, safety and authenticity of muscle foods: A review. Trend. Food Sci. Tech. 2015, 45, 86–104. [Google Scholar] [CrossRef]
  26. Magna, G.; Mosciano, F.; Martinelli, E.; Di Natale, C. Unsupervised On-Line Selection of Training Features for a robust classification with drifting and faulty gas sensors. Sensor. Actuat. B: Chem. 2018, 258, 1242–1251. [Google Scholar] [CrossRef]
  27. Lianou, A.; Malavazos, C.; Triantafyllou, I.; Nychas, G.-J.E.; Panagou, E.Z. Rapid assessment of the microbiological quality of pasteurized vanilla cream by means of Fourier transform infrared spectroscopy in tandem with support vector machine analysis. Food Anal. Meth. 2018, 11, 840–847. [Google Scholar] [CrossRef]
  28. Carstensen, J.M.; Hansen, J.F. An Apparatus and a Method of Recording an Image of an Object. US Patent EP1051660, 2005. [Google Scholar]
  29. Ropodi, A.I.; Pavlidis, D.E.; Mohareb, F.; Panagou, E.Z.; Nychas, G.-J.E. Multispectral image analysis approach to detect adulteration of beef and pork in raw meats. Food Res. Int. 2015, 67, 12–18. [Google Scholar] [CrossRef]
  30. Tsakanikas, P.; Pavlidis, D.E.; Nychas, G.-J.E. High throughput multispectral image processing with applications in food science. PLoS ONE 2015, 10, e0140122. [Google Scholar] [CrossRef]
  31. McLachlan, G.; Peel, D. Finite Mixture Models; Wiley Series in Probability and Statistics; John Wiley & Sons Inc.: New York, NY, USA, 2000. [Google Scholar]
  32. Gram, L.; Ravn, L.; Rasch, M.; Bruch, J.B.; Christiansen, A.B.; Givskov, M. Food spoilage interactions between food spoilage bacteria. Int. J. Food Microbiol. 2002, 87, 79–97. [Google Scholar] [CrossRef]
  33. Mosciano, F.; Mencattini, A.; Ringeval, F.; Schuller, B.; Martinelli, E.; Di Natale, C. An array of physical sensors and an adaptive regression strategy for emotion recognition in a noisy scenario. Sensor. Actuators A Phys. 2017, 267, 48–59. [Google Scholar] [CrossRef] [Green Version]
  34. Mencattini, A.; Mosciano, F.; Comes, M.C.; Di Gregorio, T.; Raguso, G.; Daprati, E.; Ringeval, F.; Schuller, B.; Di Natale, C.; Martinelli, E. An emotional modulation model as signature for the identification of children developmental disorders. Sci. Rep. 2018, 8, 14487. [Google Scholar] [CrossRef] [PubMed]
  35. Burges, C.J. A tutorial on support vector machines for pattern recognition. Data Min. Knowl. Disc. 1998, 2, 121–167. [Google Scholar] [CrossRef]
  36. Duda, R.O.; Hart, P.E.; Stork, D.G. Pattern Classification; John Wiley & Sons: New York, NY, USA, 2012. [Google Scholar]
  37. Ma, F.; Yao, J.; Xie, T.; Liu, C.; Chen, W.; Chen, C.; Zheng, L. Multispectral imaging for rapid and non-destructive determination of aerobic plate count (APC) in cooked pork sausages. Food Res. Int. 2014, 62, 902–908. [Google Scholar] [CrossRef]
  38. Liu, J.; Cao, Y.; Wang, Q.; Pan, W.; Ma, F.; Liu, C.; Chen, W.; Yang, J.; Zheng, L. Rapid and non-destructive identification of water-injected beef samples using multispectral image analysis. Food Chem. 2016, 190, 938–943. [Google Scholar] [CrossRef] [PubMed]
  39. Tsakanikas, P.; Pavlidis, D.; Panagou, E.; Nychas, G.-J. Exploiting multispectral imaging for non-invasive contamination assessment and mapping of meat samples. Talanta 2016, 161, 606–614. [Google Scholar] [CrossRef]
  40. Feng, C.H.; Makino, Y.; Oshita, S.; Garcia Martin, J.F. Hyperspectral imaging and multispectral imaging as the novel techniques for detecting defects in raw and processed meat products: Current state-of-the-art research advances. Food Control 2018, 84, 165–176. [Google Scholar] [CrossRef]
  41. Airado-Rodríguez, D.; Høy, M.; Skaret, J.; Wold, J.P. From multispectral imaging of autofluoresence to chemical and sensory images of lipid oxidation in cod caviar paste. Talanta 2014, 122, 70–79. [Google Scholar] [CrossRef]
  42. Cheng, J.H.; Sun, D.W.; Qu, J.H.; Pu, H.B.; Zhang, X.C.; Song, Z.; Chen, X.; Zhang, H. Developing a multispectral imaging for simultaneous prediction of freshness indicators during chemical spoilage of grass carp fish fillet. J. Food Eng. 2016, 182, 9–17. [Google Scholar] [CrossRef]
  43. Fengou, L.C.; Lianou, A.; Tsakanikas, P.; Gkana, E.N.; Panagou, E.Z.; Nychas, G.-J.E. Evaluation of Fourier transform infrared spectroscopy and multispectral imaging as means of estimating the microbiological spoilage of farmed sea bream. Food Microbiol. 2019, 79, 27–34. [Google Scholar] [CrossRef]
  44. Liu, C.; Liu, W.; Chen, W.; Yang, J.; Zheng, L. Feasibility in multispectral imaging for predicting the content of bioactive compounds in intact tomato fruit. Food Chem. 2015, 173, 482–488. [Google Scholar] [CrossRef] [PubMed]
  45. Tang, C.; He, H.; Li, E.; Li, H. Multispectral imaging for predicting sugar content of ‘Fuji’ apples. Opt. Laser Techn. 2018, 106, 280–285. [Google Scholar] [CrossRef]
  46. Liu, C.; Hao, G.; Su, M.; Chen, Y.; Zheng, L. Potential of multispectral imaging combined with chemometric methods for rapid detection of sucrose adulteration in tomato paste. J. Food Eng. 2017, 215, 78–83. [Google Scholar] [CrossRef]
  47. Ropodi, A.I.; Panagou, E.Z.; Nychas, G.-J.E. Multispectral imaging (MSI): a promising method for the detection of minced beef adulteration with horsemeat. Food Control 2017, 73, 57–63. [Google Scholar] [CrossRef]
  48. Løkke, M.M.; Seefeldt, H.F.; Skov, T.; Edelenbos, M. Color and textural quality of packaged wild rocket measured by multispectral imaging. Postharvest Biol. Tech. 2013, 75, 86–95. [Google Scholar] [CrossRef]
  49. Huang, W.; Li, J.; Wang, Q.; Chen, L. Development of a multispectral imaging system for online detection of bruises on apples. J. Food Eng. 2015, 146, 62–71. [Google Scholar] [CrossRef]
  50. Khodabakhshian, R.; Emadi, B.; Khojastehpour, M.; Golzarian, M.R. Determining quality and maturity of pomegranates using multispectral imaging. J. Saudi Soc. Agric. Sci. 2017, 16, 322–331. [Google Scholar] [CrossRef] [Green Version]
  51. Kalkan, H.; Beriat, P.; Yardimci, Y.; Pearson, T.C. Detection of contaminated hazelnuts and ground red chilli pepper flakes by multispectral imaging. Comput. Electron. Agric. 2011, 77, 28–34. [Google Scholar] [CrossRef]
  52. Sendin, K.; Manley, M.; Williams, P.J. Classification of white maize defects with multispectral imaging. Food Chem. 2018, 243, 311–318. [Google Scholar] [CrossRef]
  53. Clemmensen, L.H.; Hansen, M.E.; Ersbøll, B.K.; Frisvad, J.C. A method for comparison of growth media in objective identification of Penicillium based on multispectral imaging. J. Microbiol. Meth. 2007, 69, 249–255. [Google Scholar] [CrossRef]
  54. Ebrahimi, P.; van den Berg, F.; Aunsbjerg, S.D.; Honoré, A.; Benfeldt, C.; Jensen, H.M.; Engelsen, S.B. Quantitative determination of mold growth and inhibition by multispectral imaging. Food Control 2015, 55, 82–89. [Google Scholar] [CrossRef]
  55. Magna, G.; Di Natale, C.; Martinelli, E. Self-repairing classification algorithms for chemical sensor array. Sensor. Actuat. B Chem. 2019, 297, 126721. [Google Scholar] [CrossRef]
Figure 1. Reflectance spectra (mean ± standard deviation values) of selected vanilla cream samples corresponding to different storage conditions.
Figure 1. Reflectance spectra (mean ± standard deviation values) of selected vanilla cream samples corresponding to different storage conditions.
Sensors 19 04071 g001
Figure 2. A schematic representation of the Dynamic Feature Selection (DFS) process.
Figure 2. A schematic representation of the Dynamic Feature Selection (DFS) process.
Sensors 19 04071 g002
Figure 3. A schematic representation of the procedure for DFS assessment over the MSI dataset.
Figure 3. A schematic representation of the procedure for DFS assessment over the MSI dataset.
Sensors 19 04071 g003
Figure 4. Confusion matrix of the classification results achieved in the fresh (first column) and spoiled (second column) vanilla cream samples.
Figure 4. Confusion matrix of the classification results achieved in the fresh (first column) and spoiled (second column) vanilla cream samples.
Sensors 19 04071 g004
Figure 5. Histogram of the feature selection rate of the 18 average reflectance spectra (upper) and the 18 standard deviations of reflectance spectra (lower) features for the two datasets. Blue bars refer to the training sample and red bars indicate the test samples. The training results are obtained using a leave-one-out cross validation procedure.
Figure 5. Histogram of the feature selection rate of the 18 average reflectance spectra (upper) and the 18 standard deviations of reflectance spectra (lower) features for the two datasets. Blue bars refer to the training sample and red bars indicate the test samples. The training results are obtained using a leave-one-out cross validation procedure.
Sensors 19 04071 g005
Table 1. Dataset composition for multispectral imaging (MSI) data. Training and validation samples having TVC counts in the range (2, 6) log CFU/g have been excluded from the analysis. Test dataset is analyzed in total.
Table 1. Dataset composition for multispectral imaging (MSI) data. Training and validation samples having TVC counts in the range (2, 6) log CFU/g have been excluded from the analysis. Test dataset is analyzed in total.
Training DataValidation DataTest Data
TVC ≤ 2TVC ≥ 6TVC ≤ 2TVC ≥ 6TVC ≤ 2TVC ∈ (2,6)TVC ≥ 6
29/6536/657/4841/48106/13218/1328/132
Table 2. Classification results for test samples in the whole range of TVC counts obtained during the microbiological analyses of vanilla cream samples.
Table 2. Classification results for test samples in the whole range of TVC counts obtained during the microbiological analyses of vanilla cream samples.
Positive AssignmentNegative Assignment
TVC <24/103 (4%)99/103 (96%)
TVC ∈ [2,3)0/7 (0%)7/7 (100%)
TVC ∈ [3,4)0/2 (0%)2/2 (100%)
TVC ∈ [4,5)0/2 (0%)2/2 (100%)
TVC ∈ [5,6)6/10 (60%)4/10 (40%)
TVC ≥ 67/8 (87.5%)1/8 (12.5%)
Table 3. Classification results of the comparative analysis. Linear Discriminant analysis (LDA) and Quadratic Discriminant Analysis (QDA) are used instead of SVM with the DFS approach. SVM classifier is then used with standard SE feature selection approach. Accuracy values are listed together with a broad report of the sensitivity values for each subcategory of TVC (2,5) and (5,6).
Table 3. Classification results of the comparative analysis. Linear Discriminant analysis (LDA) and Quadratic Discriminant Analysis (QDA) are used instead of SVM with the DFS approach. SVM classifier is then used with standard SE feature selection approach. Accuracy values are listed together with a broad report of the sensitivity values for each subcategory of TVC (2,5) and (5,6).
Comparative Analysis
MethodAccuracySensitivity Per Subcategory
TVC < 6 vs. TVC ≥ 6TVC ∈ (2 ÷ 5)TVC ∈ (5 ÷ 6)
SVM + DFS91.7%0% to SPOILED60% to SPOILED
LDA + DFS84.9%57% to SPOILED70% to SPOILED
QDA + DFS85.7%14% to SPOILED80% to SPOILED
SVM + SE70.2%40% to SPOILED80% to SPOILED

Share and Cite

MDPI and ACS Style

Lianou, A.; Mencattini, A.; Catini, A.; Di Natale, C.; Nychas, G.-J.E.; Martinelli, E.; Panagou, E.Z. Online Feature Selection for Robust Classification of the Microbiological Quality of Traditional Vanilla Cream by Means of Multispectral Imaging. Sensors 2019, 19, 4071. https://doi.org/10.3390/s19194071

AMA Style

Lianou A, Mencattini A, Catini A, Di Natale C, Nychas G-JE, Martinelli E, Panagou EZ. Online Feature Selection for Robust Classification of the Microbiological Quality of Traditional Vanilla Cream by Means of Multispectral Imaging. Sensors. 2019; 19(19):4071. https://doi.org/10.3390/s19194071

Chicago/Turabian Style

Lianou, Alexandra, Arianna Mencattini, Alexandro Catini, Corrado Di Natale, George-John E. Nychas, Eugenio Martinelli, and Efstathios Z. Panagou. 2019. "Online Feature Selection for Robust Classification of the Microbiological Quality of Traditional Vanilla Cream by Means of Multispectral Imaging" Sensors 19, no. 19: 4071. https://doi.org/10.3390/s19194071

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop