You are currently viewing a new version of our website. To view the old version click .
Foods
  • Article
  • Open Access

23 April 2025

Method of Detecting Microorganisms on the Surface of Mandarin Fish Based on Hyperspectral and Information Fusion

,
,
,
,
,
,
and
1
College of Engineering, Huazhong Agricultural University, Wuhan 430070, China
2
School of Electrical and Electronic Engineering, Wuhan Polytechnic University, Wuhan 430070, China
*
Author to whom correspondence should be addressed.
This article belongs to the Section Food Analytical Methods

Abstract

Microorganisms play a key role in fish spoilage and quality deterioration, making the development of a rapid, accurate, and efficient technique for detecting surface microbes essential for enhancing freshness and ensuring the safety of mandarin fish consumption. This study focused on the total viable count (TVC) and Escherichia coli levels in the dorsal and ventral parts of fish, and we constructed a detection model using hyperspectral imaging and data fusion. The results showed that comprehensive and simplified models were successfully developed for quantitative detection across all wavelengths. The models performed best at predicting microbial growth on the dorsal side, with the RAW-CARS-PLSR model proving the most effective at predicting TVC and E. coli counts in that region. The RAW-PLSR model was identified as the optimal predictor of the E. coli concentration on the ventral side. A fusion model in the decision layer constructed using the Dempster–Shafer theory of evidence outperformed models relying solely on spectral or textural information, making it an optimal approach for detecting surface microbes in mandarin fish. The best prediction accuracy for dorsal TVC concentration achieved an Rp value of 0.9337, whereas that for ventral TVC concentration reached 0.8443. For the E. coli concentration, the optimal Rp values were 0.8180 for the dorsal section and 0.8512 for separate analysis.

1. Introduction

Mandarin fish, classified within the order Perciformes, family Sinipercidae, subfamily Siniperchinae, and genus Siniperca (Basilewsky), is a freshwater species known for its delicate savory flesh. Characterized by relatively few bones and a broad, thick body, it is highly valued for its palatability and nutritional richness, containing over 20% protein per 100 g along with various trace elements, contributing to its significant economic value. The freshness of mandarin fish can directly affect their flavor, texture, and nutritional benefits, whereas deterioration can lead to toxicity and health risks primarily driven by microbial activity [1]. During the capture and postmortem phases, microbial interactions initiate biochemical processes that lead to rigor mortis, maturation, autolysis, and eventual decomposition in fish. Although the flesh remains fresh during rigor mortis and maturation, microbial infiltration accelerates the transition to autolysis and spoilage [2]. Thus, microbial activity is the primary determinant of decay, with bacterial counts and coliform levels serving as the key indicators of fish freshness.
Merchants often use freezing for storage, which can compromise the quality, making freshness assessment crucial. Current technologies for evaluating fish freshness include sensory evaluation, chemical indicator tests (e.g., pH levels, TVB-N content, K value, moisture, fat, and protein content), physical indicator tests (e.g., color, texture, and water-holding capacity), and biomimetic sensor technologies such as electronic noses and tongues [3]. However, traditional methods are inadequate for rapid, non-destructive evaluation and lack widespread application. Hyperspectral imaging (HSI) technology provides a noninvasive, rapid analytical approach that captures optical images that reflect both the internal chemical indicators and geometric features of fish samples. In a study by Zhou et al. [4], hyperspectral information was extracted using three different methods to simulate the growth of Pseudomonas fluorescens. The Baranyi model coupled with the Ratkowsky square root model (BRSR model) and the Huang model paired with the Ratkowsky square root model (HRSR model) were employed to directly construct growth simulations for P. fluorescens in pork using a one-step method [4]. The texture feature information in HSI as a key two-dimensional characteristic not only represents the grayscale spatial arrangement of samples but also provides insights into their quality. Gamilla et al. [5] converted color images obtained from machine vision into CIELAB values and used an analysis of variance to assess the freshness of different meat products. By integrating spectral and image texture data, this method simplifies complex information while simultaneously assessing both internal and external quality, enhancing the accuracy and stability of detection models, and significantly improving the non-destructive evaluation of fish meat [6].
The method proposed in this study integrates spectral analysis and information fusion to assess mandarin fish freshness, effectively addressing the limitations of traditional rapid non-destructive testing techniques. This approach provides an innovative and practical method to evaluate the freshness of mandarin fish in production settings. To verify the effectiveness of hyperspectral technology in the quantitative detection of surface microbial counts in mandarin fish, this study demonstrates that hyperspectral technology is effective, accurate, and highly adaptable for the quantitative detection of fish meat. These findings provide a strong theoretical foundation for the development of portable detection devices.

2. Materials and Methods

2.1. Acquisition of Mandarin Fish Samples

The experimental samples were obtained from the Yuehuo Li Life Plaza in Wuhan City, Hubei Province. The fish were culled by a blow to the head, and their viscera were removed. In a sterile environment, surgical tools were used to extract tissue samples from both the dorsal and ventral sides of the mandarin fish, carefully removing surface fat and connective tissue. The samples were precisely cut to 4.0 cm × 4.0 cm × 1.5 cm, sealed in sterile Petri dishes, and stored at a constant temperature of 4 °C in a biochemical incubator, with a total of 108 samples.
On days 0, 2, 3, 4, 5, and 6, six samples from both dorsal and ventral sections were collected for spectroscopic scanning. After scanning, the total microbial load and Escherichia coli count were measured according to national standards. The total microbial load was calculated using the GB4789.2-2022 standard [7], and the E. coli count was calculated using the GB 4789.3-2016 standard [8]. The experiment was conducted in three batches, each consisting of 36 dorsal and 36 ventral samples with 108 samples per batch. Figure 1 shows photographs of the dorsal and ventral samples, highlighting the differences in color, texture, and tissue structure. Therefore, this study focused on spectroscopic imaging and microbial counting tests for these distinct areas to determine the most effective sites for detecting surface microbial populations in mandarin fish [9].
Figure 1. Back and abdominal samples of mandarin fish.

2.2. Collection and Information Extraction of Hyperspectral Images of Mandarin Fish

The experimental hyperspectral imaging system shown in Figure 2 consists of a hyperspectral imager (SPECIM, V10E, Helsiki, Finland), high-precision electronically controlled elevating platform (Beijing Zhuolihan Optical Instrument Co., Ltd., HSIA-TSA300-IMS, Beijing, China), collection box, cooling fans, halogen light sources, and a Lenovo ThinkPad computer (Beijing, China). The spectrometer captured wavelengths from 300 to 1100 nm with a spectral resolution of 2.8 nm.
Figure 2. Hyperspectral image acquisition system. 1. Computer; 2. hyperspectral imager; 3. acquisition box; 4. halogen lighting source; 5. carrier stage; 6. high-precision electrically controlled movable elevating platform.
The hyperspectral imaging system captures images in a single acquisition, encompassing multiple sample details under varying adulteration ratios. Initially, the corrected images were segmented to extract individual spectral images of each sample. These segmented images then underwent region of interest (ROI) extraction to minimize background spectral interference. Following threshold segmentation, morphological operations such as dilation and erosion were applied to create a mask that accurately identified the ROI. Finally, the average spectrum of the sample was obtained from the masked localized area. Figure 3 illustrates the sample mask image, where the white area represents the spectral data extraction region, demonstrating the precise alignment of the mask with the sample.
Figure 3. Mask image of the sample.

2.3. Data Processing Methods

2.3.1. Spectral Preprocessing Techniques

Owing to the complexities of spectral information and uncertainties arising from noise and errors during data collection, this study applied several spectral preprocessing techniques including smoothing [10], normalization [11], derivation [12], multiplicative scattering correction (MSC) [13], and standard normal variate transformation (SNV) [14].

2.3.2. Extraction of Texture Feature Parameters

In this experiment, the gray level co-occurrence matrix (GLCM) was utilized to extract texture information from images [15], with the characteristic image data selected at 0°, 45°, 90°, and 135° in the hyperspectral images, encompassing six variables: mean, contrast, correlation, homogeneity, energy (also referred to as second moment), and entropy [16].

2.3.3. Methodologies for Constructing Quantitative Models

Principal Component Regression (PCR) is a multivariate calibration method that combines Principal Component Analysis (PCA) with linear regression [17].
Partial Least Squares Regression (PLSR) utilizes a factor extraction approach that renders the new factorial variables mutually independent, adhering to the G-M conditions. The process begins with the establishment of a regression model between the dependent variables and the factors, which is then transformed into a regression model with independent variables [18].
The Backpropagation (BP) Neural Network is a multi-layer feedforward neural network that learns through an iterative error backpropagation process consisting of forward and backward propagation phases [19].

2.3.4. Wavelength Selection Method

In this study, the Successive Projection Algorithm (SPA) [20], Competitive Adaptive Reweighting Algorithm (CARS) [21], and Genetic Algorithm (GA) [22] were employed to optimize predictive models for total viable count (TVC) and E. coli concentrations in the dorsal and ventral regions of mandarin fish.

2.3.5. Information Fusion

Data layer fusion is the most basic level of integration and involves a direct combination of raw data. Although this approach preserves a large amount of information, it results in high temporal and spatial complexity, thereby reducing real-time capabilities. Moreover, the noise and interference inherent in raw data may negatively affect the results. This method is suitable for fusing information from homogeneous sensors but operates at low processing speeds [23].
Feature layer fusion represents an intermediate level of fusion in which information is condensed through preprocessing and feature extraction, thereby reducing noise and improving real-time processing capabilities. This approach preserves essential information and enhances noise resistance, making it suitable for the rapid computation and retention of critical feature information [24].
Decision-layer fusion represents an advanced level of fusion, in which data undergo preprocessing and modeling before integration. This approach optimizes the computational load and improves interference resistance, which may lead to information loss. In this study, the Dempster–Shafer theory was applied to further enhance the model performance [25].

3. Results and Discussion

3.1. Analysis of Microbial Spoilage in Mandarin Fish Samples

The TVC and E. coli concentrations in the mandarin fish samples were analyzed to effectively reflect the freshness of the samples. The statistical results for TVC and E. coli concentrations are shown in Figure 4 and Table 1. Using the plate counting method, the TVC concentration of the back ranged from 3.7475 log10 CFU/g to 7.4458 log10 CFU/g, whereas the TVC concentration of the abdomen ranged from 4.4024 log10 CFU/g to 10.6590 log10 CFU/g. The E. coli concentration of the back samples varied from 2.0000 log10 CFU/g to 9.4433 log10 CFU/g, and that of the abdominal samples ranged from 2.1761 log10 CFU/g to 9.4814 log10 CFU/g. These results show a reasonable range of variation, effectively reflecting the progression of mandarin fish samples from freshness to spoilage. These data are beneficial for establishing a stable and accurate predictive model [26].
Figure 4. Changes in TVC and E. coli values on the surface of mandarin fish.
Table 1. Basic statistical results of TVC and E. coli concentrations in mandarin fish.
During the period from days 0 to 2, the total bacterial count and E. coli numbers increased relatively slowly. However, from days 2 to 4, the growth rates of both total bacteria and E. coli increased significantly. Starting on day 5, as the storage time increased, the growth rate of the bacteria slowed again, which aligns with the typical patterns of bacterial growth curves. Additionally, the total bacterial count and E. coli numbers in the abdominal samples were generally higher than those in the back samples, indicating a higher susceptibility to spoilage in the abdominal samples. This difference is attributed to variations in tissue composition between the abdomen and the back [27].
Furthermore, national standards specify that aquatic products are considered to be spoiled when the total bacterial count exceeds 7 log10 CFU/g. This indicates that within the first 4 days (from day 0 to day 4), the mandarin fish samples remained in a usable state. However, after day 5, the freshness of the mandarin fish samples significantly declined, with microbial indicators beginning to exceed acceptable limits, rendering them inedible [28].

3.2. Sample Set Division

As shown in Table 2, 108 dorsal and 108 ventral samples of mandarin fish were subdivided at a ratio of 2:1, resulting in a training set and a prediction set. The dorsal and ventral samples included 72 and 36 specimens in the training and prediction sets, respectively. The SPXY algorithm was used to divide the sample set with the dorsal and ventral training sets employed for modeling, and the performance was subsequently assessed using the prediction set [29].
Table 2. Statistical results of microorganism quantity on the surface of mandarin fish after sample set division.

3.3. Detection of Microorganisms on the Surface of Mandarin Fish Based on Spectral Dimension Analysis

3.3.1. Analysis of a Full-Wavelength Detection Model of Microorganisms on the Surface of Mandarin Fish

After preprocessing the spectral data of the dorsal region of the mandarin fish samples, full-wavelength regression prediction models for TVC and E. coli concentrations were established using Partial Least Squares Regression (PLSR), Principal Component Analysis (PCR), and Backpropagation (BP) Neural Networks. As shown in Table 3, both the TVC and E. coli concentration prediction models were based on RAW-PCR. For the TVC concentration, the training set yielded Rc = 0.8642 and RMSEC = 0.8914, the cross-validation set yielded Rcv = 0.6798 and RMSECV = 1.4540, and the prediction set yielded Rp = 0.8402 and RMSEP = 1.0600. For the E. coli concentration, the training set achieved Rc = 0.8871 and RMSEC = 1.0297, the cross-validation set showed Rcv = 0.7788 and RMSECV = 1.4189, and the prediction set yielded Rp = 0.8181 and RMSEP = 1.2500.
Table 3. Results of prediction models for TVC and E. coli values in mandarin fish backs using different pretreatment methods.
After preprocessing the spectral data of the abdominal region of the mandarin fish samples, full-wavelength regression prediction models for the TVC and E. coli concentrations were developed using PLSR, MLR, and BP Neural Networks. As shown in Table 4, the optimal prediction model for the TVC concentration in the abdominal region was the RAW-PLSR model, with the training set yielding Rc = 0.9115 and RMSEC = 0.7259, the cross-validation set showing Rcv = 0.7995 and RMSECV = 1.0751, and the prediction set resulting in Rp = 0.7706 and RMSEP = 1.0797. Similarly, the best prediction model for the E. coli concentration was the RAW-PLSR model, with the training set achieving Rc = 0.9486 and RMSEC = 0.6724, cross-validation set showing Rcv = 0.7746 and RMSECV = 1.3626, and prediction set yielding Rp = 0.7802 and RMSEP = 1.2725.
Table 4. Results of prediction models for abdominal TVC and E. coli values of mandarin fish established using different pretreatment methods.

3.3.2. Simplified Model Analysis of Surface Microorganism Detection of Mandarin Fish

As shown in Table 4, both the dorsal and ventral sample groups demonstrated satisfactory modeling performance when raw data were used to construct predictive models for the TVC and E. coli concentrations. Consequently, this study aimed to streamline the model construction process by selecting characteristic wavelengths from the original data using techniques such as CARS, GA, and SPA for model simplification.
Figure 5 illustrates the application of Competitive Adaptive Reweighted Sampling (CARS), the Successive Projection Algorithm (SPA), and the Genetic Algorithm (GA) to select the feature wavelengths from dorsal samples of mandarin fish to evaluate the total viable count (TVC) concentration. CARS identified 47 characteristic wavelengths, primarily within the 840–860 and 950–1000 nm ranges, with fewer in the 550–650 nm spectrum during 50 Monte Carlo samplings. The SPA selected 19 characteristic wavelengths, mainly distributed within the 710–780 and 860–945 nm ranges, with fewer in the 950–1000 nm range, and a more uniform presence across the 500–710 nm spectrum. After 20 evaluations, the GA identified 26 characteristic wavelengths, predominantly in the 475–550 nm range, with minimal presence across the 600–1000 nm spectrum.
Figure 5. Distribution of characteristic wavelengths predicted for TVC and E. coli values in SPA-screened back samples.
To predict the E. coli concentration in the dorsal samples of mandarin fish, CARS identified 39 characteristic wavelengths, primarily within the 650–710 and 920–970 nm spectra, with fewer distributed between 470 and 550 nm during 50 Monte Carlo samplings. The SPA selected 36 characteristic wavelengths with a relatively even distribution while being similarly sparse in the 470–550 nm range. After 20 evaluations, the GA identified 26 characteristic wavelengths concentrated in the 470–710 nm range, particularly within the 470–550 nm range, with almost no detection in the 720–1000 nm spectrum, exhibiting distinct selection patterns compared with CARS and the SPA.
As shown in Table 5, an optimized model for correlating TVC and E. coli concentrations in the dorsal muscles of mandarin fish was developed using the feature wavelengths selected by the CARS method. This model was termed RAW-CARS-PLSR and employed PLSR, thereby demonstrating its superior accuracy and predictive performance. For the TVC concentration, the training set achieved a correlation coefficient (Rc) of 0.9226 with a standard error of calibration (RMSEC) of 0.7541, the cross-validation set had an Rcv of 0.8299 with a standard error of cross-validation (RMSECV) of 1.1058, and the prediction set yielded an Rp of 0.8437 with a standard error of prediction (RMSEP) of 0.9512. For the E. coli concentration, the cross-validation set achieved an Rcv of 0.8576 with an RMSECV of 1.1635, whereas the prediction set recorded an Rp of 0.7632 with an RMSEP of 1.4046. Compared with other wavelength selection methods, the CARS-based model demonstrated higher correlation coefficients and lower errors in both the training and prediction sets. This is consistent with the findings of Baek et al. (2020), who demonstrated that the modeling performance of the feature wavelength selection method was overall superior [30].
Table 5. Simplified models of TVC and E. coli values in back samples using different wavelength selection methods for RAW pretreatment.
As shown in Figure 6, for feature wavelength selection in mandarin fish abdominal samples to predict TVC concentration, CARS identified 38 characteristic wavelengths during 50 Monte Carlo samplings, primarily between 620–750 and 920–950 nm, with fewer wavelengths in the 420–550 and 780–875 nm ranges. The SPA selected 36 characteristic wavelengths, mainly concentrated between 820–1000 and 470–710 nm, with almost no distribution between 720 and 830 nm. After 20 evaluations, the GA identified 24 characteristic wavelengths predominantly within the 470–550 nm range, with minimal presence between 710 and 1000 nm. This demonstrates a divergence from the CARS and SPA results and indicates insufficient modeling performance.
Figure 6. Prediction of characteristic wavelength distribution using TVC and E. coli values of abdominal samples screened by CARs.
As shown in Table 6, to predict the E. coli concentration in mandarin fish abdominal samples, CARS identified 58 characteristic wavelengths during 50 Monte Carlo samplings, primarily distributed between 700 and 950 nm, with fewer in the 420–550 nm and 610–700 nm ranges. The SPA selected 19 characteristic wavelengths, mainly between 490–570 and 860–930 nm, with a sparse distribution between 760 and 860 nm. The GA applied to dorsal samples for E. coli concentration prediction identified 28 characteristic wavelengths primarily concentrated in the 470–550 nm range, with almost no wavelengths detected between 710 and 1000 nm.
Table 6. Simplified models of abdominal TVC and E. coli values established using different wavelength selection methods under RAW pretreatment.
When different wavelength selection methods were adopted to develop simplified models for predicting TVC and E. coli concentrations in the abdomen of mandarin fish, the PLSR model based on Competitive Adaptive Reweighted Sampling (RAW-CARS-PLSR) demonstrated the best performance. For the TVC concentration, the training set achieved Rc = 0.9437 and RMSEC = 0.5836. The cross-validation set showed Rcv = 0.8935 and RMSECV = 0.8037, and the prediction set yielded Rp = 0.8464 and RMSEP = 0.9474. For E. coli concentration, the training set recorded Rc = 0.9516 and RMSEC = 0.6528, the cross-validation set achieved Rcv = 0.8868 and RMSECV = 0.9956, and the prediction set resulted in Rp = 0.7239 and RMSEP = 1.4035.

3.3.3. Comparison and Analysis of Full Wavelength Spectral Data of Different Detection Sites and Simplified Models

As shown in Table 7, a comparison of predictive models for TVC on the dorsal and ventral surfaces of mandarin fish revealed that the model for the ventral section outperformed that for the dorsal area, indicating superior accuracy in forecasting the total microbial count on the fish surface. The utilization of untreated raw data produced the most effective model performance, with the Polymerase Chain Reaction (PCR) model demonstrating notable effectiveness in quantitative predictions. Additionally, the application of CARS to select characteristic wavelengths further enhances its efficacy.
Table 7. Comparison of prediction models for surface TVC and E. coli values of mandarin fish.
For the models predicting of E. coli concentration, the ventral section demonstrated superior modeling performance compared with the dorsal section, whereas the dorsal models exhibited a smaller margin of error in predictions. The optimal preprocessing method, involving the use of raw data with the PCR model and CARS, proved to be the most effective for constructing predictive models. This is consistent with the findings of Ma et al. [31], who indicated that spectral information varied across different parts of fish fillets.

3.4. Detection of Surface Microorganisms of Mandarin Fish Based on Texture Information

3.4.1. Mandarin Fish Surface Microbial Detection Model Based on Feature Band Texture Analysis

For texture feature extraction from hyperspectral images of mandarin fish, the gray level co-occurrence matrix (GLCM) method was used to derive six texture parameters, including the mean, contrast, correlation, homogeneity, energy, and entropy, from the samples of the main body and belly regions. These parameters served as variables for constructing a microbial detection model based on surface texture information. During feature extraction, gray-level quantization was set to 16 levels, with a distance (d) of 1. GLCMs were computed in four orientations, that is, 0°, 45°, 90°, and 135°, and the averages across these directions were used as textural feature parameters. To normalize the data, the sum of all parameters was calculated, and each parameter was divided by this sum, scaling the values within the range [0, 1].
As demonstrated in Section 3.4, the CARS method significantly improved the accuracy of model predictions, making it the most effective among the three feature wavelength selection methods. This section describes the application of the CARS method. The spectral data from the 472.5 to 997.5 nm bands were initially divided into 20 equal subsets, followed by the development of 21 subsets and a full-spectrum microbial PLSR detection model. The RMSEs of the subset-based PLSR models were compared with those of the full-spectrum model, and the four subsets with the lowest RMSE, each less than that of the full-spectrum model, were selected. CARS was then applied to these four subsets to further refine the bands, reducing the variable numbers, eliminating the redundant information, and extracting the optimal texture features for modeling.
Using this methodology, the feature wavelengths identified for the dorsal TVC prediction were 656.25, 658.75, 670, 673.75, 675, 858.75, and 861.25 nm, with the corresponding texture features producing 48 texture variables per sample. For the ventral TVC prediction, the feature wavelengths were 631.25, 633.75, 637.5, 715, 718.75, 733.75, 935, 941.25, and 942.5 nm, resulting in 54 texture variables per sample.
For the dorsal E. coli concentration prediction, the extracted feature wavelengths were 653.75, 656.25, 673.75, 948.75, 952.5, 966.25, and 977.5 nm, yielding 42 texture variables per sample. For the ventral E. coli prediction, the feature wavelengths were 712.5, 718.75, 721.25, 770, 771.25, 780, 781.25, 872.5, 875, 925, and 932.5 nm, resulting in 66 texture variables per sample. The modeling outcomes for microbial concentration predictions based on texture features from both anatomical regions are presented in Table 8.
Table 8. Comparison of TVC and E. coli value prediction models for mandarin fish surfaces based on texture analysis of characteristic bands.
As shown in Table 8, when using the characteristic band texture analysis method to build the TVC concentration prediction model for the surface of mandarin fish, the PCR model demonstrated the best performance. For the dorsal prediction model, the training set achieved an Rc of 0.7324 with an RMSEC of 1.1411, the cross-validation set showed Rcv = 0.6471 and RMSECV = 0.5138, and the prediction set yielded Rp = 0.7168 and RMSEP = 1.3571. For the abdominal prediction model, the training set recorded Rc = 0.7535 and RMSEC = 1.4471, the cross-validation set achieved Rcv = 0.6728 and RMSECV = 0.5047, and the prediction set resulted in Rp = 0.7104 and RMSEP = 1.2748. A comparison of the two anatomical regions indicated that the abdominal model outperformed the dorsal model, which is consistent with the results of the previous model based on spectral dimensions.
When constructing an E. coli concentration prediction model on the surface of mandarin fish using texture analysis of feature bands, the PCR model demonstrated the best performance. For the dorsal model, the training set achieved an Rc of 0.7221 with RMSEC = 1.4241, the cross-validation set showed Rcv = 0.6418 and RMSECV = 1.3987, and the prediction set yielded Rp = 0.7097 and RMSEP = 1.4635. For the abdominal model, the training set recorded Rc = 0.7549 and RMSEC = 1.3341, the cross-validation set achieved Rcv = 0.6646 and RMSECV = 1.3352, and the prediction set resulted in Rp = 0.7401 and RMSEP = 1.3789.
However, the prediction model based on feature band texture analysis has certain limitations compared with the spectral-dimension-based model, resulting in a slightly lower overall modeling performance.

3.4.2. Regression Model Based on Principal Component Image Texture Analysis

This study applied a Principal Component Analysis (PCA) to reduce the dimensionality of hyperspectral images from the dorsal and ventral regions of mandarin fish, selecting the first three principal component images (PC1, PC2, and PC3), which accounted for 95.2%, 2.9%, and 0.9% of the cumulative spectral variance, respectively, totaling over 98%. The texture features of these three principal components were then extracted using the gray level co-occurrence matrix (GLCM), resulting in 18 texture feature parameters. The training and testing sets were structured into matrices of 18 (feature count) × 72 (sample count) and 18 (feature count) × 36 (sample count), respectively. The results are presented in Table 9.
Table 9. Comparison of TVC and E. coli value prediction models for mandarin fish surfaces based on principal component image texture analysis.
As shown in Table 9, when predicting TVC concentration on the surface of mandarin fish using the texture analysis method of principal component images, the PCR model demonstrated the best performance among the models for both the dorsal and abdominal regions. For the dorsal model, the training set achieved a correlation coefficient Rc of 0.7318 with RMSEC = 1.3998, the cross-validation set showed Rcv = 0.6654 and RMSECV = 1.517, and the prediction set yielded Rp = 0.7074 and RMSEP = 1.4122. For the abdominal model, the training set recorded an Rc of 0.7633 with RMSEC = 1.3323, the cross-validation set achieved Rcv = 0.6943 and RMSECV = 1.4378, and the prediction set resulted in Rp = 0.7394 and RMSEP = 1.3875.
When predicting E. coli concentrations in the dorsal and abdominal regions of mandarin fish using principal component image texture analysis, the PCR model outperformed the other models. For the dorsal region, the training set achieved Rc = 0.7079 and RMSEC = 1.2064, the cross-validation set showed Rcv = 0.6383 and RMSECV = 1.4211, and the prediction set yielded Rp = 0.6831 and RMSEP = 1.3273. The PCR model performed slightly better, with the training set recording Rc = 0.7635 and RMSEC = 1.1138, the cross-validation set achieving Rcv = 0.6896 and RMSECV = 1.4699, and the prediction set resulting in Rp = 0.7459 and RMSEP = 1.2511.
Overall, the prediction model based on the PCA of image textures outperformed the model using feature band texture analysis but remained inferior to the spectral dimension model. Compared with the optimal models presented in Section 3.4 and Section 3.5, the correlation coefficients Rc, Rcv, and Rp were significantly lower, indicating that the texture information was less effective at predicting the microbial counts on the surface of mandarin fish than the spectral dimension data. The research by Liu et al. [32] also supports this viewpoint, demonstrating that when studying salted meat, there is a significant difference in the accuracy of models established using the two types of information, with spectral information performing notably better. Although the texture information primarily reflected the external quality of the samples, the spectral data provided a more comprehensive representation of internal quality. Because the decay process progressed from the outside inward, the spectral data offered a distinct advantage for predicting microbial counts on the fish surface.

3.5. Surface Microbial Detection of Mandarin Fish Based on Information Fusion

3.5.1. Surface Microbial Detection Model of Mandarin Fish Based on Feature Layer Fusion

Before developing a predictive model for TVC concentration on the surface of mandarin fish using feature layer fusion, differences in the physical significance and numerical range of spectral and textural data were considered. To improve the accuracy and effectiveness of the model, data normalization was applied by adjusting both the spectral and textural features to a uniform range of [0, 1]. After normalization, feature fusion was performed to ensure data comparability, thereby facilitating the construction of a more stable model. The results of the microbial detection model based on feature layer fusion are presented in Table 10.
Table 10. PCR model results of TVC and E. coli value prediction in mandarin fish based on feature layer fusion.
As shown in Table 10, the TVC concentration PCR model developed using feature-layer fusion demonstrated advantages over models based only on textural information, as reflected by the increased Rc, Rcv, and Rp values. However, it remained slightly inferior to the models that used only spectral data, primarily owing to the suboptimal performance of texture information, indicating a limitation in the representativeness of texture feature extraction. In terms of stability, the small discrepancy between RMSEC, RMSECV, and RMSEP suggested that while the accuracy decreased, the stability improved.
When constructing the E. coli concentration prediction model, a trend similar to that of the TVC concentration prediction model was observed. Compared to the models based on spectral dimensions, the coefficients of determination (Rc, Rcv, and Rp) decreased, whereas the root mean square errors (RMSEC, RMSECV, and RMSEP) increased, indicating that feature-layer fusion did not improve the modeling effectiveness compared to using a single information source.

3.5.2. Surface Microbial Detection Model of Mandarin Fish Based on Decision-Level Fusion

Using the optimal predictive model results from Section 2.2 for the TVC concentrations and E. coli values in the dorsal and ventral samples of mandarin fish, combined with the superior texture-based predictive results from Section 2.3, these inputs were integrated into a detection model based on decision-level fusion, resulting in the development of a microbial detection model for the surface of mandarin fish.
In the decision-level fusion process of spectral and texture information from the dorsal and ventral samples of mandarin fish, the first step involved combining the optimal predictive outcomes from both channels into a two-dimensional decision vector, which was used as the input for modeling using the PCR method. Next, a second decision-level fusion approach was applied using the Dempster–Shafer evidence theory algorithm to convert the two-dimensional decision vector into a one-dimensional decision vector, which was again used as the input for PCR modeling. Finally, a comparative analysis was conducted to evaluate the modeling performance of these two decision-level fusion techniques.
As shown in Table 11, the TVC concentration prediction model developed through decision-level fusion exhibited strong overall stability, with the decision coefficients (Rc, Rcv, and Rp) closely aligned and predominantly higher, indicating consistent performance across datasets. Additionally, the root mean square errors (RMSEC, RMSECV, and RMSEP) demonstrated minimal differences and were notably lower than those of the models using only spectral or textural information. The decision-level fusion model for predicting E. coli concentrations in mandarin fish also demonstrated an overall favorable modeling outcome. Although the model constructed through direct merging did not exhibit a clear advantage over the spectral-dimension model in terms of the decision coefficients (Rc and Rp), the RMSEC, RMSECV, and RMSEP values were comparable, reflecting stable and reliable performance. Ahmed et al. [33] suggest that information fusion technology involves associating or integrating data and information from single or multiple channels in various ways, enabling more accurate acquisition of target information. It can also compensate for the limitations of the data that a single channel cannot obtain. Although this increases system complexity, it often leads to significant performance improvements.
Table 11. PCR model results of TVC and E. coli value prediction of mandarin fish based on decision level fusion.
The performance of the prediction model was further enhanced by applying the D-S evidence theory algorithm, which improved the stability and accuracy of the TVC concentration model based on decision-level fusion. This demonstrates that combining information from different sources can effectively enhance predictive capabilities. The optimal TVC concentration prediction model for the dorsal region of mandarin fish achieved Rc = 0.9516 and RMSEC = 0.9315, a cross-validation set Rcv = 0.7469 and RMSECV = 0.8822, and a prediction set Rp= 0.9337 and RMSEP = 0.8389. For the abdominal TVC concentration model, the results were Rc = 0.9401 and RMSEC = 0.7387. The cross-validation set achieved Rcv = 0.8903 and RMSECV = 0.9026, and the prediction set obtained Rp = 0.8443 and RMSEP = 1.1464. The optimal E. coli concentration prediction model for the dorsal region achieved Rc = 0.8897 and RMSEC = 0.9699, the cross-validation set achieved Rcv = 0.8786 and RMSECV = 1.0146, and the prediction set obtained Rp= 0.8180 and RMSEP=1.2501. For the abdominal region, the E. coli model results were Rc = 0.9469 and RMSEC = 0.7443. The cross-validation set achieved Rcv = 0.8969 and RMSECV = 0.8995, and the prediction set obtained Rp= 0.8512 and RMSEP = 1.1224. These results indicate that the decision-level fusion model is the optimal approach for predicting microbial concentrations in mandarin fish.

4. Conclusions

A novel model utilizing hyperspectral imaging technology was proposed to detect the concentrations of TVC and E. coli in mandarin fish, predicting the concentrations on both the dorsal and ventral sides during storage from 0 to 6 days. The analytical results indicated that the predictive model constructed using the D-S evidence theory for decision-level fusion was significantly more effective than models based only on spectral or texture information. Decision-level fusion proved to be the optimal method for predicting microbial counts on fish surfaces. In conclusion, the application of hyperspectral imaging technology for assessing the freshness of mandarin fish is both feasible and reliable. However, further exploration and research are still necessary, particularly for the detection of surface microorganisms based on texture information, as the predictive performance of the established models remains relatively weak and requires algorithmic enhancement to improve accuracy. Additionally, this study focused solely on developing a hyperspectral-based model for predicting microbial quantities; however, the underlying detection mechanisms remain unclear and warrant further investigation. In the future, it may also be possible to develop a non-destructive detection device based on hyperspectral technology for microbial quantification, making it more suitable for practical industrial applications.

Author Contributions

Conceptualization, T.Y.; methodology, L.M. and Y.F.; validation, Y.M. and Z.G.; formal analysis, Y.W.; investigation, T.Y.; resources, L.M.; data curation, T.Y.; writing—original draft preparation, T.Y. and Y.M.; writing—review and editing, Y.M.; visualization, L.K.; supervision, H.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the HZAU-AGIS Cooperation Fund (grant number SZYJY2021028) and the Fundamental Research Funds for the Central Universities (grant number 2662020GXPY003).

Institutional Review Board Statement

Not applicable.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Acknowledgments

The authors would like to thank all reviewers who participated in the review and the person who provided linguistic assistance during the preparation of this manuscript.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
1st De1st Derivative
BPBackpropagation
BPABasic Probability Assignment
BBABasic Belief Assignment
CARSCompetitive Adaptive Reweighted Sampling
E. coliEscherichia coli
GAGenetic Algorithm
GLCMGrey level co-occurrence matrix
MSCMultiplicative scattering correction
PCAPrincipal Component Analysis
PCRPrincipal Component Regression
PLSRPartial Least Squares Regression
R2Coefficients of determination
RMSERoot Mean Square Error
ROIRegion of interest
SNVStandard normal variate transformation
SPXYSample set partitioning based on joint x–y distance
SVCSupport Vector Classification
TVCTotal viable count
VNVector Normalization

References

  1. Yan, A.S.; Xiong, C.X.; Qian, J.W.; Wang, X.D. A study on the rate of flesh content of mandarin fish and nutritional quality of the flesh. J. Huazhong Agric. Univ. 1995, 14, 80–84. (In Chinese) [Google Scholar]
  2. Zeng, Z.Y.; Ding, Z.L.; Zhou, A.N.; Zhu, C.B.; Yang, S.; Fei, H. Bacterial diseases inSiniperca chuatsi: Status and therapeutic strategies. Vet. Res. Commun. 2024, 48, 3579–3592. [Google Scholar] [CrossRef]
  3. Ma, Y.; Wang, X.; Tian, Y.; Wang, Q. Effect of a Hypoxia-Controlled Atmosphere Box on Egg Respiration Intensity and Quality. Appl. Sci. 2022, 12, 380. [Google Scholar] [CrossRef]
  4. Zhou, B.; Fan, X.; Song, J.; Wu, J.; Pan, L.; Tu, K.; Peng, J.; Dong, Q.; Xu, J.; Wu, J. Growth simulation of Pseudomonas fluorescens in pork using hyperspectral imaging. Meat Sci. 2022, 188, 108767. [Google Scholar] [CrossRef]
  5. Gamilla, H.T.; Yuan, H.; Brad, K. Fresh meat color evaluation a structured light imaging system. Food Res. Int. 2015, 71, 100–107. [Google Scholar]
  6. Liu, Z.; Huang, Y.; Wang, J.; Yuan, G.; Pang, J. Multiscale Information Fusion Based on Large Model Inspired Bacterial Detection. Big Data Min. Anal. 2025, 8, 1–17. [Google Scholar] [CrossRef]
  7. GB 4789.2-2022; National Food Safety Standard, Food Microbiological Examination, Aerobic Plate Count. Standardization Administration of the People’s Republic of China: Beijing, China, 2022.
  8. GB 4789.3-2016; National Food Safety Standard—Food Microbiological Examination—Coliforms Count. Standardization Administration of the People’ s Republic of China: Beijing, China, 2016.
  9. Liu, S.; Lioe, T.S.; Sun, L.; Adriaenssens, E.M.; McCarthy, A.J.; Sekar, R. Validation of CrAssphage Microbial Source Tracking markers and comparison with Bacteroidales Markers for Detection and Quantification of Faecal Contaminations in Surface Water. Environ. Pollut. 2025, 366, 125403. [Google Scholar] [CrossRef]
  10. Hansen, P.C.; Jensen, T.K. Smoothing Norm Preconditioning for Regularizing Minimum Inesidual Methods. SIAM J. Matrix Anal. Appl. 2006, 29, 1–14. [Google Scholar] [CrossRef]
  11. Gillis, N.; Vavasis, S.A. Semidefinite Programming Based Preconditioning for More Robust Near-Separable Nonnegative Matrix Factorization. Siam J. Optimiz. 2015, 25, 677–698. [Google Scholar] [CrossRef]
  12. Housman, J.A.; Kiris, C.C.; Hafez, M.M. Time-Derivative Preconditioning Methods for Multicomponent Flows—Part II: Two-Dimensional Applications. J. Appl. Mech. 2009, 76, 031013. [Google Scholar] [CrossRef]
  13. Andriulli, F.P.; Cools, K.; Bagci, H.; Olyslager, F.; Buffa, A.; Christiansen, S. A multiplicative Calderón preconditioner for the electric field integral equation. IEEE Antenna. Propag. 2008, 56, 2398–2412. [Google Scholar] [CrossRef]
  14. Mizutani, T. Robustness Analysis of Preconditioned Successive Projection Algorithm for General Form of Separable NMF Problem. Linear Algebra Appl. 2016, 497, 1–22. [Google Scholar] [CrossRef]
  15. Wang, L.; Sun, J.; Li, T. Intelligent sports feature recognition system based on texture feature extraction and SVM parameter selection. J. Intell. Fuzzy Syst. 2020, 39, 4847–4858. [Google Scholar] [CrossRef]
  16. Ravindran, G.; Titus, T.J.; Ganesh, V.; Devi, V.S.S. Analysis of Image Segmentation Techniques for Texture Feature Extraction. Int. J. Eng. Sci. Res. Technol. 2017, 6, 66–71. [Google Scholar]
  17. You, J.; Tulpan, D.; Ellis, J.L. Predicting pellet quality using multiple linear regression with Principal Component Analysis (PCA). J. Anim. Sci. 2024, 102, 154–155. [Google Scholar] [CrossRef]
  18. Nesrstová, V.; Wilms, I.; Palarea-Albaladejo, J.; Filzmoser, P.; Martín-Fernández, J.A.; Friedecký, D.; Hron, K. Principal balances of compositional data for regression and classification using partial least squares. J. Chemom. 2023, 37, e3518. [Google Scholar] [CrossRef]
  19. Xue, C.; Xu, W.; Bao, Y.; Zhao, L.; Liu, X.; Zheng, R. Prediction of Converter Tapping Weight Based on Principal Component Analysis–Whale Optimization Algorithm–Backpropagation Algorithm Neural Network Model. Steel Res. Int. 2024, 95, 2300410. [Google Scholar] [CrossRef]
  20. Canova, L.D.S.; Vallese, F.D.; Pistonesi, M.F.; de Araújo Gomes, A. An improved successive projections algorithm version to variable selection in multiple linear regression. Anal. Chim. Acta 2023, 1274, 341560. [Google Scholar] [CrossRef]
  21. Kabir, E.; Guikema, S.D.; Quiring, S.M. Power outage prediction using data streams: An adaptive ensemble learning approach with a feature- and performance-based weighting mechanism. Risk Anal. 2024, 44, 686–704. [Google Scholar] [CrossRef]
  22. Jyakhwo, S.; Serov, N.; Dmitrenko, A.; Vinogradov, V.V. Machine Learning Reinforced Genetic Algorithm for Massive Targeted Discovery of Selectively Cytotoxic Inorganic Nanoparticles. Small 2024, 20, e2305375. [Google Scholar] [CrossRef]
  23. Tao, L.; Wang, F.; Li, Y.; Wu, J.; Jiang, X.; Xi, Q. A cross-texture haptic model based on tactile feature fusion. Multimed. Syst. 2024, 30, 159. [Google Scholar] [CrossRef]
  24. Tao, B.; Bai, J.; Wu, H.; Miao, F. Design and research of multi-information fusion RFID sensor for fruit and vegetable quality detection. Microchim. Acta 2024, 191, 568. [Google Scholar] [CrossRef]
  25. Wang, H.; Huang, Z.; Lou, S.; Li, W.; Liu, G.; Tang, Y. In Silico Prediction of Skin Sensitization for Compounds via Flexible Evidence Combination Based on Machine Learning and Dempster-Shafer Theory. Chem. Res. Toxicol. 2024, 37, 894–909. [Google Scholar] [CrossRef] [PubMed]
  26. Zhang, L.; Li, Y.; Peng, Y.; Wang, W.; Jiang, F.; Tao, F.; Shan, J. Determination of pork freshness attributes by hyperspectral imaging technique. Trans. Chin. Soc. Agric. Eng. 2012, 28, 254–259. [Google Scholar]
  27. Han, F.; Huang, X.; Teye, E.; Gu, F.; Gu, H. Nondestructive detection of fish freshness during its preservation by combining electronic nose and electronic tongue techniques in conjunction with chemometric analysis. Anal. Methods 2013, 6, 529–536. [Google Scholar] [CrossRef]
  28. Huang, X.; Ding, R.; Han, F.K.; Dai, H.; Teye, E.; Xu, F. Rapid and nondestructive evaluation of fish Freshness by near infrared reflectance spectroscopy combined with chemometrics analysis. Anal. Methods 2014, 6, 9675–9683. [Google Scholar]
  29. Chen, Y.; Zeng, H.B.; Li, Y. Stability analysis of linear delayed systems based on an allowable delay set partitioning approach. Automatica 2024, 163, 111603. [Google Scholar] [CrossRef]
  30. Baek, I.; Lee, H.; Cho, B.-K.; Mo, C.; Chan, D.E.; Kim, M.S. Shortwave infrared hyperspectral imaging system coupled with multivariable method for TVB-N measurement in pork. Food Control 2020, 124, 107854. [Google Scholar] [CrossRef]
  31. Ma, J.; Sun, D.-W.; Qu, J.-H.; Pu, H. Prediction of textural changes in grass carp fillets as affected by vacuum freeze drying using hyperspectral imaging based on integrated group wavelengths. Opt. Commun. 2017, 82, 377–385. [Google Scholar] [CrossRef]
  32. Liu, D.; Pu, H.; Sun, D.-W.; Wang, L.; Zeng, X.-A. Combination of spectra and texture data of hyperspectral imaging for prediction of pH in salted meat. Food Chem. 2014, 160, 330–337. [Google Scholar] [CrossRef]
  33. Ahmed, F.; Al-Mamun, H.A.; Bari, A.S.M.H.; Hossain, E.; Kwan, P. Classification of crops and weeds from digital images: A support vector machine approach. Crop Prot. 2012, 40, 98–104. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Article Metrics

Citations

Article Access Statistics

Multiple requests from the same IP address are counted as one view.