Article

Non-Destructive and Real-Time Discrimination of Normal and Frozen-Thawed Beef Based on a Novel Deep Learning Model

Rui Xi, Xiangyu Lyu, Jun Yang, Ping Lu, Xinxin Duan, David L. Hopkins and Yimin Zhang
1 Shandong Key Laboratory of Intelligent Production Technology and Equipment for Facility Horticulture, Shandong Engineering Research Center of Agricultural Equipment Intelligentization, Shandong Agricultural University, Tai'an 271018, China
2 Lab of Beef Processing and Quality Control, Shandong Agricultural University, Tai'an 271018, China
* Author to whom correspondence should be addressed.
Foods 2025, 14(19), 3344; https://doi.org/10.3390/foods14193344
Submission received: 1 August 2025 / Revised: 7 September 2025 / Accepted: 17 September 2025 / Published: 26 September 2025
(This article belongs to the Section Food Engineering and Technology)

Abstract

Discrimination between normal (fresh/non-frozen) and frozen-thawed beef is crucial for ensuring food safety. This paper proposes a novel, non-destructive and real-time "you only look once" model for normal and frozen-thawed beef discrimination (YOLO-NF) built with deep learning techniques. The simple, parameter-free attention module (SimAM) and the squeeze-and-excitation (SE) attention mechanism were introduced to enhance the model's performance. A total of 1200 beef samples were used, and their images were captured with a charge-coupled device (CCD) camera. For model development, the training set comprised 3888 images after data augmentation, while the validation set and test set each included 216 original images. Experimental results on the test set showed that the YOLO-NF model achieved precision, recall, F1-Score and mean average precision (mAP) of 95.5%, 95.2%, 95.3% and 98.6%, respectively, significantly outperforming the YOLOv7, YOLOv5 and YOLOv8 models. Additionally, gradient-weighted class activation mapping (Grad-CAM) was adopted to interpret the model's decision basis. Moreover, the model was deployed behind a web interface for user convenience, and the discrimination time on the local server was 0.94 s per image, satisfying the requirements for real-time processing. This study provides a promising technique for high-performance and rapid meat quality assessment in food safety monitoring systems.

1. Introduction

Beef is an important part of the global meat industry and is favored by many consumers for its high nutritional value. In recent years, per capita and total beef consumption have increased dramatically due to population growth and rising incomes [1]. Freezing can effectively preserve the nutrients in beef; however, the ice crystals formed during freezing cause physical and chemical damage to the meat, resulting in a loss of quality and sensory characteristics [2]. Despite these quality differences, it is difficult for consumers to distinguish frozen-thawed from normal (fresh/non-frozen) beef by appearance alone. Mislabelling of normal and frozen-thawed beef can occur both intentionally and unintentionally, which seriously infringes consumers' rights and interests and may also lead to food safety issues. Indeed, discrimination between normal and frozen-thawed beef has long been a necessity in the food industry to protect consumers in terms of both economic and food quality aspects [3,4].
Currently, the leading non-invasive technologies for assessing meat quality include spectroscopic technologies, ultrasonic technology, magnetic resonance imaging (MRI) and nuclear magnetic resonance (NMR) technology [5,6,7,8,9,10], as well as deep learning technologies [11,12]. Spectroscopic technologies acquire spectral information from the object to be measured and enable non-invasive detection. Yu et al. [13] studied the discrimination of fresh, frozen-stored and frozen-thawed beef cuts using hyperspectral imaging and achieved accuracies of 92.8% with partial least squares discriminant analysis (PLS-DA), 97.8% with support vector machine (SVM) and 95.0% with back-propagation artificial neural network (BP-ANN). Ultrasonic technology relies on two basic parameters, the ultrasonic signal and the ultrasonic velocity, to assess beef characteristics. Wu et al. [14] evaluated the effects of ultrasonic-assisted thawing at different frequency modes on the thawing efficiency and quality of frozen beef, and the results showed that single-frequency and dual-frequency ultrasound-assisted thawing treatments have potential application in detecting thawed beef. MRI and NMR both have high sensitivity for distinguishing fresh and frozen-thawed beef, as their signal strength depends on the presence of magnetically active nuclei in the samples. Perez-Palacios et al. [15] evaluated the capability of MRI and computer vision techniques to classify fresh and frozen-thawed beef and to predict physico-chemical, texture and sensory characteristics; the results revealed that the computational analysis of MRI, especially the image-analysis algorithm, should be chosen according to the aim and the type of sample, whereas the analyzed characteristic was not decisive.
These studies have demonstrated the value of the abovementioned technologies for assessing normal and frozen-thawed beef. However, challenges such as high costs, complex technological requirements, demanding data processing and difficulty in achieving real-time application persist during implementation. For example, redundant or irrelevant spectral features may be present due to the high dimensionality of spectral data, resulting in models that perform well on training data but poorly on test data, so spectral feature selection and optimization must be completed before modeling. Ultrasonic, MRI and NMR technologies require specialized and costly equipment, and specialized technicians are needed for operation and data analysis. Therefore, a non-destructive and real-time assessment method is urgently needed to discriminate between normal and frozen-thawed beef.
Deep learning excels in automatic feature extraction: it can automatically extract and learn useful features from raw data without human intervention, thereby reducing human bias and avoiding intricate feature engineering [16,17]. Given these advantages, deep learning has been widely employed in meat quality assessment [18,19,20,21]. For example, Abd Elfattah et al. [22] assessed meat freshness using an optimized deep learning method and achieved an accuracy of 98.5%, a sensitivity of 98.5% and a specificity of 99.2%. Lin et al. [23] used a fluorescent sensing array and deep learning to monitor meat (beef, chicken and pork) freshness; SqueezeNet, a lightweight convolutional neural network, was applied to automatically identify the freshness level of the meats from fluorescent sensor array images, obtaining an accuracy of 98.2%. Huang et al. [24] used an improved ResNet-50 model to achieve automatic recognition and classification of pork primal cuts, with an identification accuracy of 94.5%.
However, to the authors' knowledge, research into the discrimination of normal and frozen-thawed beef based on deep learning remains scarce. In this paper, a novel "you only look once" model for normal and frozen-thawed beef discrimination (YOLO-NF) is proposed and deployed using deep learning techniques to achieve non-destructive and real-time discrimination of normal and frozen-thawed beef. The simple, parameter-free attention module (SimAM) and the squeeze-and-excitation (SE) attention mechanism were adopted in the YOLO-NF model to enhance its discrimination performance. Moreover, the proposed model was deployed behind a web interface to improve accessibility for users. It is hoped that the current research will provide an accurate, non-destructive and real-time detection method for identifying normal and frozen-thawed beef.

2. Materials and Methods

A flowchart of the proposed method for discrimination of normal and frozen-thawed beef is shown in Figure 1. The pipeline consists of data preparation, development of a novel discrimination model and deployment of that model. Data augmentation methods were used during data pre-processing, deep learning techniques were used to develop the discrimination model, and the model was deployed with the Gradio framework.

2.1. Data Acquisition and Pre-Processing

As shown in Figure 2a, the image acquisition equipment consisted of a charge-coupled device (CCD) camera (LBAS-GE60-17M/C, LUSTER LightTech Co., Ltd. (Luster), Beijing, China) with a resolution of 3072 × 2048 pixels, a circular light source, a collection shed (0.6 m × 0.6 m × 0.6 m) and a personal computer. All samples in this study were from our previous work [3,25,26]. Briefly, the M. longissimus lumborum were obtained at 24 h postmortem from both sides of 114 Simmental crossbred bulls (age: 18–24 months), and the loins were cut into 2.54–3 cm thick steaks in a commercial beef harvest facility. All steaks were vacuum-packed and transported to the laboratory on ice within 3 h. A total of 1200 samples were used in this study, including 700 normal samples and 500 frozen-thawed samples; some of the sample images are shown in Figure 2b. Of the sample images, 90% (1080 images; normal: 630, frozen-thawed: 450) were used for model development, and the remaining 10% (120 images; normal: 70, frozen-thawed: 50) were used for model deployment validation. The 1080 images were randomly split into training, validation and test sets at a 6:2:2 ratio (648, 216 and 216 images, respectively) for model development, with the normal-to-frozen-thawed ratio maintained at 7:5 across all subsets. To enhance sample diversity and model robustness, data augmentation techniques (blurring, rotation, additive Gaussian noise, random erasing and RandAugment; see Figure 2c) were applied to the training set. This process generated 3240 new images, which were combined with the original 648 training images, yielding a total of 3888 images (normal: 2268, frozen-thawed: 1620) for model training.
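To make the augmentation step concrete, the sketch below composes the listed operations with torchvision. The specific parameter values (blur kernel size, rotation range, noise level, erasing probability and RandAugment settings) are illustrative assumptions rather than the settings actually used for YOLO-NF, and in a detection pipeline the bounding-box annotations would need to be transformed alongside the images.

```python
import torch
from torchvision import transforms

class AddGaussianNoise:
    """Additive Gaussian noise on a float image tensor in [0, 1]."""
    def __init__(self, std=0.02):
        self.std = std

    def __call__(self, img):
        return (img + torch.randn_like(img) * self.std).clamp(0.0, 1.0)

# Illustrative composition of the augmentations named in the text
# (rand-augment, rotation, blurring, additive Gaussian noise, random erasing).
train_augment = transforms.Compose([
    transforms.RandAugment(num_ops=2, magnitude=9),  # rand-augment (PIL input)
    transforms.RandomRotation(degrees=15),           # rotation
    transforms.GaussianBlur(kernel_size=5),          # blurring
    transforms.ToTensor(),                           # PIL image -> float tensor in [0, 1]
    AddGaussianNoise(std=0.02),                      # additive Gaussian noise
    transforms.RandomErasing(p=0.5),                 # random erasing
])
```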

2.2. The Proposed YOLO-NF Model

2.2.1. Overall Network Structure of the YOLO-NF Model

To achieve non-destructive and real-time discrimination for normal and frozen-thawed beef, a novel YOLO-NF model was proposed in this study. Based on the YOLOv7 model [27,28], the SimAM module and the SE attention mechanism module were integrated into the neck and backbone networks to enhance the model’s abilities of feature extraction and global information perception, thus improving the discrimination performance of the model. The overall network structure of the YOLO-NF model is shown in Figure 3.

2.2.2. Network Structure of YOLOv7 Model

YOLO, a real-time object detection algorithm, transforms the object detection task into a regression problem, enabling the prediction of object bounding boxes and class probabilities via a single neural network in one forward pass, which is suitable for scenarios demanding real-time processing. The YOLOv7 model is part of the successive versions developed and improved based on the original YOLO object detection model. It has inherited the basic ideas of the YOLO model while introducing distinct optimizations and enhancements.
The YOLOv7 model (baseline model) is composed of four main parts: Input, Backbone, Neck and Head, as shown in Figure 4. The input module scales the input image to a unified pixel size to reduce the computation cost. The backbone network comprises the Conv-BN-SiLU (CBS) module for feature extraction, the efficient layer aggregation network (ELAN) module, which accelerates model convergence by combining CBS blocks through multiple branches, and a max-pooling and convolution (MPConv) module that improves the feature extraction ability. The neck network employs CBS, ELAN-W, MPConv and spatial pyramid pooling cross-stage partial convolution (SPPCSPC) modules to refine and enhance the features extracted from the backbone layers through feature fusion. The ELAN-W module differs from ELAN only in its concatenation operation. The SPPCSPC module is used to change the channel dimensions and consists of max-pooling and CBS modules.
The head network consists of the reparameterization (REP) module and the Conv-BN-Sigmoid (CBM) module. The REP module adjusts the three different-scale outputs from the neck layers by combining convolution (Conv) and batch normalization (BN) layers. The CBM module predicts the targets and creates anchor frames.
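For readers unfamiliar with the notation, the following is a minimal PyTorch sketch of the Conv-BN-SiLU (CBS) building block referred to above; the default kernel size and stride are assumptions for illustration, not the exact configuration of every CBS instance in YOLOv7.

```python
import torch.nn as nn

class CBS(nn.Module):
    """Conv-BN-SiLU block: convolution, batch normalization, SiLU activation."""
    def __init__(self, in_channels, out_channels, kernel_size=3, stride=1):
        super().__init__()
        self.conv = nn.Conv2d(in_channels, out_channels, kernel_size,
                              stride=stride, padding=kernel_size // 2, bias=False)
        self.bn = nn.BatchNorm2d(out_channels)
        self.act = nn.SiLU(inplace=True)

    def forward(self, x):
        return self.act(self.bn(self.conv(x)))
```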

2.2.3. SimAM Module

The SimAM module [29] adaptively adjusts the weight of each pixel by calculating the similarity between each pixel in the feature map and its surrounding pixels, and it infers 3-D attention weights for the feature map in a layer without adding parameters to the original network, thus enhancing the representation of feature maps and improving model performance. This well-established attention mechanism has been widely validated and applied in deep learning. A schematic diagram of the SimAM module is shown in Figure 5.
To achieve better weight allocation, the SimAM module constructs an energy function based on neuroscience theory to find the important neurons by measuring the linear separability between neurons. The function is defined as follows.
\[ e_t = \frac{4\left(\sigma^{2} + \lambda\right)}{\left(t - \mu\right)^{2} + 2\sigma^{2} + 2\lambda} \tag{1} \]
where $e_t$ denotes the neuron importance measure, $t$ is the target neuron of the input feature in the current channel, $\mu$ and $\sigma^2$ are the mean and variance of all neurons in the channel (computed once to avoid repeated calculations and reduce computational cost), and $\lambda$ is a weight constant. Equation (1) indicates that the lower $e_t$ is, the greater the difference between neuron $t$ and its surrounding neurons and the higher its importance. Finally, excessive values are suppressed by the Sigmoid function, as follows.
\[ \tilde{X} = \mathrm{Sigmoid}\left(\frac{1}{e_t}\right) \cdot X \tag{2} \]
In order to enhance the feature extraction capability of the model, the SimAM module was integrated into the YOLOv7 network. As shown in Figure 6, we propose the ELAN-SA module, constructed by adding SimAM after the concatenation layer of the ELAN-W module so that critical pixel-level features are adaptively emphasized.
The ELAN-SA module inherits the existing features of the ELAN-W module while allowing more focused processing of sample features. ELAN-SA modules were added before the concatenation layers and output layers of the neck network to enhance the feature refinement ability and feature fusion efficiency of the model.
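A minimal PyTorch sketch of the SimAM computation in Equations (1) and (2), following the public reference implementation of Yang et al. [29], is given below; the value λ = 1 × 10⁻⁴ is the default from that reference code and is an assumption here, not necessarily the setting used in YOLO-NF.

```python
import torch
import torch.nn as nn

class SimAM(nn.Module):
    """Parameter-free attention: weight each neuron by Sigmoid(1 / e_t), cf. Eqs. (1)-(2)."""
    def __init__(self, e_lambda=1e-4):
        super().__init__()
        self.e_lambda = e_lambda
        self.activation = nn.Sigmoid()

    def forward(self, x):
        b, c, h, w = x.size()
        n = h * w - 1
        # (t - mu)^2 per spatial position, with mu taken over each channel
        d = (x - x.mean(dim=[2, 3], keepdim=True)).pow(2)
        # variance estimate over the remaining neurons in each channel
        v = d.sum(dim=[2, 3], keepdim=True) / n
        # inverse energy (importance), up to the constants of Equation (1)
        e_inv = d / (4 * (v + self.e_lambda)) + 0.5
        return x * self.activation(e_inv)
```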

2.2.4. SE Attention Mechanism Module

The SE attention mechanism [30] compresses the spatial dimensions of the feature maps through a global average pooling operation, which makes global context information available to each channel and enhances the model's capability to perceive global features. It has been widely applied in deep learning frameworks. As shown in Figure 7, the SE network consists of three main steps: the squeeze, excitation and scale operations.
The squeeze operation compresses each feature map channel by global average pooling; for channel $c$ it is defined as follows.
\[ z_c = F_{sq}\left(u_c\right) = \frac{1}{H \times W} \sum_{i=1}^{H} \sum_{j=1}^{W} u_c\left(i, j\right), \quad z \in \mathbb{R}^{C} \tag{3} \]
where $u_c(i, j)$ is the value at position $(i, j)$ of channel $c$ of the feature map.
The excitation operation obtains a weighting coefficient for the importance of each channel using two fully connected layers: the first reduces the channel dimension and the second restores it, with a ReLU activation function providing the nonlinear transformation in between. Finally, the weighting coefficients are normalized by a Sigmoid activation function.
The scale operation assigns channel importance by multiplying the normalized weighting coefficient $s_c$ obtained above for each channel with the original feature map $u_c$, as presented in Equation (4).
\[ F_{scale}\left(s_c, u_c\right) = \tilde{x}_c = s_c \cdot u_c \tag{4} \]
To enhance the model's capability to focus on beef features, the SE attention mechanism was adopted in this study. As shown in Figure 8, the SE attention mechanism was inserted between the backbone and neck networks. By weighting the beef feature maps from the backbone, SE highlights prominent features, enabling the neck network to fully fuse discriminative information for accurate differentiation.
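A minimal PyTorch sketch of the squeeze, excitation and scale operations (Equations (3) and (4)) is shown below; the reduction ratio of 16 follows the original SE paper [30] and is an assumption for the configuration used here.

```python
import torch.nn as nn

class SEBlock(nn.Module):
    """Squeeze-and-excitation: global average pooling, two FC layers, channel scaling."""
    def __init__(self, channels, reduction=16):
        super().__init__()
        self.squeeze = nn.AdaptiveAvgPool2d(1)          # squeeze: Equation (3)
        self.excitation = nn.Sequential(
            nn.Linear(channels, channels // reduction),  # reduce channel dimension
            nn.ReLU(inplace=True),
            nn.Linear(channels // reduction, channels),  # restore channel dimension
            nn.Sigmoid(),                                # normalize weighting coefficients
        )

    def forward(self, x):
        b, c, _, _ = x.size()
        s = self.excitation(self.squeeze(x).view(b, c)).view(b, c, 1, 1)
        return x * s                                     # scale: Equation (4)
```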

2.3. Evaluation Metrics

Evaluation metrics including precision (P), recall (R), F1-Score and mean average precision (mAP) were introduced to evaluate the performance of the beef discrimination model. These metrics are defined as follows.
\[ \mathrm{Precision} = \frac{TP}{TP + FP} \tag{5} \]
\[ \mathrm{Recall} = \frac{TP}{TP + FN} \tag{6} \]
\[ \mathrm{F1\text{-}Score} = \frac{2 \times P \times R}{P + R} \tag{7} \]
\[ \mathrm{mAP} = \frac{1}{n} \sum_{i=1}^{n} AP_i \tag{8} \]
where TP is true positive, TN is true negative, FP represents false positive and FN represents false negative.
The precision indicator measures the proportion of true positive samples among the predicted positive samples. The recall indicator represents the proportion of correctly predicted positive samples among all actual positive samples. The F1-Score is the harmonic mean of precision and recall and provides a balance between the two.
The Precision-Recall curve is obtained by plotting recall on the x-axis and precision on the y-axis, and the area under the curve gives the average precision (AP). The mean average precision (mAP) is the average of AP over all classes and is a crucial indicator for evaluating the performance of the beef discrimination model; the higher the value, the better the discrimination effect. A commonly used variant is mAP@0.5, which denotes the mAP computed from the Precision-Recall curve at an intersection over union (IoU) threshold of 0.5.
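As a simple illustration of Equations (5)–(7), the helper below computes precision, recall and F1-Score from true positive, false positive and false negative counts; the counts in the usage line are made-up numbers, not results from this study.

```python
def precision_recall_f1(tp: int, fp: int, fn: int):
    """Precision, recall and F1-Score from detection counts (Equations (5)-(7))."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = 2 * precision * recall / (precision + recall) if (precision + recall) else 0.0
    return precision, recall, f1

# Hypothetical counts for illustration only.
p, r, f1 = precision_recall_f1(tp=200, fp=10, fn=12)
```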

2.4. Supplementary Model Analyses

Qualitative analysis, ablation experiments and model interpretation supplement the quantitative evaluation metrics.
Qualitative analysis visually presents misidentified samples to intuitively explain model behavior, compensating for the limitations of quantitative metrics. Ablation experiments validate the effectiveness of each module via four models (baseline, baseline + SimAM, baseline + SE and YOLO-NF) trained under consistent parameters. Model interpretation uses feature maps extracted from convolutional layers and activation heat maps generated by gradient-weighted class activation mapping (Grad-CAM) to reveal the model's decision basis and enhance interpretability.
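The sketch below outlines the Grad-CAM idea used for model interpretation: gradients of a class score with respect to a chosen convolutional layer are averaged into channel weights, which then weight that layer's activations. It assumes a classification-style scalar score for simplicity; applying it to a detection head such as YOLO-NF's requires selecting the class score of a particular predicted box, and the target layer and class index below are placeholders rather than the authors' actual choices.

```python
import torch
import torch.nn.functional as F

def grad_cam(model, image, target_layer, class_index):
    """Minimal Grad-CAM: channel-weighted activations of target_layer, ReLU, normalize."""
    activations, gradients = [], []
    h1 = target_layer.register_forward_hook(lambda m, i, o: activations.append(o))
    h2 = target_layer.register_full_backward_hook(lambda m, gi, go: gradients.append(go[0]))

    model.eval()
    score = model(image)[0, class_index]   # assumes output of shape (batch, num_classes)
    model.zero_grad()
    score.backward()
    h1.remove()
    h2.remove()

    weights = gradients[0].mean(dim=(2, 3), keepdim=True)             # channel weights
    cam = F.relu((weights * activations[0]).sum(dim=1, keepdim=True))  # weighted sum + ReLU
    cam = F.interpolate(cam, size=image.shape[2:], mode="bilinear", align_corners=False)
    return (cam - cam.min()) / (cam.max() - cam.min() + 1e-8)          # heat map in [0, 1]
```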

2.5. Model Deployment

For user-friendliness, the proposed model was deployed behind a web interface based on the Gradio framework. Gradio is a Python-based framework for building interfaces to deep learning applications, and a user interface built with it can be shared with others through web connections. The deployment is composed of a front-end and a back-end, as depicted in Figure 9. Users upload the beef images to be identified through the front-end interface; the back-end then performs the discrimination task on the uploaded images, and the results are returned to the front-end interface.
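A minimal sketch of such a Gradio front-end is shown below. The `discriminate_beef` function is a hypothetical wrapper around the trained model and returns a placeholder string here; it is not the authors' actual back-end code.

```python
import gradio as gr
from PIL import Image

def discriminate_beef(image: Image.Image) -> str:
    # In a real deployment, run YOLO-NF inference here and format the predicted
    # class (normal / frozen-thawed) and its confidence.
    return "normal beef (confidence: 0.96)"  # placeholder result

demo = gr.Interface(
    fn=discriminate_beef,
    inputs=gr.Image(type="pil", label="Beef image"),
    outputs=gr.Textbox(label="Discrimination result"),
    title="YOLO-NF beef discrimination",
)

# share=False keeps the app on the local server; share=True creates a public URL.
demo.launch(share=False)
```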

3. Results and Discussion

3.1. Experimental Setup

All experiments were carried out on a personal computer running Python 3.8 and PyTorch 2.3.0, equipped with an Intel(R) Core(TM) i9-14900HX CPU, an NVIDIA GeForce RTX 4060 GPU and CUDA 11.8. The SGD optimizer with the CIoU loss function was employed in this study. The YOLO-NF model was trained for 80 epochs on this computer with an initial learning rate of 0.0001 and a final learning rate of 0.1; further hyperparameter configurations are listed in Table 1.
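For reference, a minimal optimizer setup matching the SGD values in Table 1 (momentum 0.97, weight decay 0.0005, initial learning rate 0.0001) might look as follows; the placeholder network stands in for YOLO-NF, and the learning-rate schedule is omitted because only the initial and final rates are reported.

```python
import torch
import torch.nn as nn

model = nn.Sequential(nn.Conv2d(3, 16, 3), nn.SiLU())  # placeholder for the YOLO-NF network
optimizer = torch.optim.SGD(
    model.parameters(),
    lr=0.0001,           # initial learning rate (Table 1)
    momentum=0.97,       # momentum (Table 1)
    weight_decay=0.0005  # weight decay (Table 1)
)
```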

3.2. Experimental Training Process

All images were scaled to 640 × 640 pixels to reduce the computational effort during training. The training process is displayed in Figure 10. The overall loss exhibited a decreasing trend and converged after 80 epochs. Precision and recall increased gradually with the number of epochs and reached a peak, stabilizing around it, after 80 epochs. These results indicate that the proposed YOLO-NF model has strong fitting and generalization ability and is suitable for identification of normal and frozen-thawed beef.

3.3. Quantitative Analysis

Experimental results on the test set indicated that the YOLO-NF model achieved a precision of 95.5%, a recall of 95.2%, and thus obtained an F1-Score of 95.3%. The mAP was 98.6%, and the average running time per image was 0.022 s, meeting the demand for real-time identification [31].
To validate the performance of the proposed model for normal and frozen-thawed beef discrimination, quantitative comparisons were conducted with the YOLOv7, YOLOv8 and YOLOv5 models [32]. These models were retrained in this study with the same dataset, experimental setup, initial training parameters and evaluation metrics as the proposed YOLO-NF model. The statistical results in Table 2 show that the proposed model has the best discrimination performance, followed by the YOLOv7 model and then the YOLOv8 and YOLOv5 models.
Precision of the YOLO-NF model was 12.7%, 21.9% and 16.2% higher than that of the YOLOv7, YOLOv8 and YOLOv5 models, respectively. Recall of the YOLO-NF model was improved by 15.5%, 10.6% and 11.9%, respectively, compared with the YOLOv7, YOLOv8 and YOLOv5 models. The F1-Score of the YOLO-NF model was 14.1%, 16.6% and 14.1% higher than that of the above models, respectively, and the mAP was 8.7%, 13.6% and 13.8% higher, respectively. The average running time per image of the YOLO-NF model was 0.022 s, longer than those of the YOLOv7, YOLOv8 and YOLOv5 models. This reflects a general trade-off in deep learning: improving model performance usually requires an increase in model complexity, that is, more floating point operations (FLOPs), which can reduce model efficiency [33].
Discrimination performance indicators of the proposed model were all improved compared with the baseline model, and this indicates that the SimAM and SE modules can significantly improve the discrimination performance of the model. Meanwhile, the addition of these modules increases the model complexity, which in turn leads to longer running times. This is consistent with the common trade-off between performance and efficiency in deep learning as mentioned earlier.
Precision-Recall curves of discrimination results for all classes of the above models are shown in Figure 11. The overall discrimination performance of the YOLO-NF model was significantly better than other models, followed by YOLOv7 model.
To further explore the discrimination results of the YOLO-NF model compared with the baseline model, Precision-Recall curves of the YOLO-NF and YOLOv7 models for all classes, normal beef and frozen-thawed beef are shown in Figure 12. The overall mAP for beef discrimination of the YOLO-NF model was 98.6%, outperforming the 89.9% achieved by the YOLOv7 model. The mAPs for normal and frozen-thawed beef discrimination of the YOLO-NF model were 98.7% and 98.5%, respectively, which were 3.8% and 13.5% higher than those of the YOLOv7 model.
Table 3 shows more detailed quantitative comparisons, and Figure 13 displays the confusion matrices of the YOLO-NF and YOLOv7 models for normal and frozen-thawed beef discrimination; both demonstrate that the proposed YOLO-NF model achieved strong discrimination performance, especially on frozen-thawed beef.

3.4. Qualitative Analysis

Qualitative discrimination results of the YOLO-NF model for normal and frozen-thawed beef are shown in Figure 14. As can be seen, the YOLO-NF model achieves accurate discrimination of normal and frozen-thawed beef.
Qualitative comparisons of discrimination results between the YOLO-NF model and the baseline model were also investigated, as presented in Figure 15. Discrimination results for normal beef are displayed in Figure 15a, in which normal beef samples were misidentified as frozen-thawed beef by the baseline model but correctly identified by the YOLO-NF model. Discrimination results for frozen-thawed beef are shown in Figure 15b, where one frozen-thawed beef sample was mistakenly detected as normal beef by the baseline model but correctly categorized by the YOLO-NF model, and the other frozen-thawed beef sample was correctly discriminated by the baseline model, but with a much lower confidence than that of the YOLO-NF model. This qualitative analysis indicates that the SimAM and SE modules in the YOLO-NF model make the model focus more on sample features while enhancing its ability to perceive global information, thereby improving its discrimination performance.

3.5. Ablation Experiments

A series of ablation experiments were carried out to analyze the performance enhancement brought by different modules. The performance comparison results with different schemes are presented in Table 4. Scheme A was the baseline model. Scheme B was the baseline model with the SimAM module, which could dynamically adjust the weight of each pixel to enhance important features and suppress irrelevant ones. Scheme C presented the baseline model with the SE module, which could enhance the perception ability of the model for global features. Scheme D indicated the YOLO-NF model, which utilized the SimAM and SE modules to improve the performance for discrimination of normal and frozen-thawed beef.
Scheme B outperformed Scheme A with an improvement of 11.6% in precision, 9% in recall, 10.3% in F1-Score and 6.9% in mAP. This indicates that the SimAM module can focus more on target features and suppress irrelevant information, thus improving the model performance.
Scheme C was superior to Scheme A in discrimination performance, especially in the indicator of recall, which was improved by 14.5%, and the other indicators were improved by 4.8% to 10.6%. This showed that the SE module made global context information available for each channel and enhanced the ability of the model to perceive global features.
Compared with Scheme A, the model performance was significantly enhanced in Scheme D (P = 95.5%, R = 95.2%, F1-Score = 95.3%, mAP = 98.6%): precision increased by 12.7%, recall by 15.5%, F1-Score by 14.1% and mAP by 8.7%. By incorporating the SimAM and SE modules, Scheme D significantly improves the feature extraction and localization capabilities of the model, thereby improving precision and recall and ultimately enhancing overall performance.

3.6. Model Interpretations

Feature maps obtained from different convolutional layers were extracted to interpret the model’s learning process, as shown in Figure 16. Shallow convolutional layers are used to extract the low-level features such as edges, contours, and textures. The feature maps produced by these layers have a high degree of similarity to the original image, with clear details and almost all the information of the original image included, as shown in Figure 16a. As the network goes deeper, the feature maps become increasingly abstract, as shown in Figure 16b,c. In the feature maps output by deep convolutional layers, information about the original image decreases, while that about the abstract semantics of the target increases, as shown in Figure 16d. Through the combined operations of multiple convolutional layers in the model, target features can be effectively extracted.
The gradient-weighted class activation mapping (Grad-CAM) method was utilized to generate activation heat maps by mapping the category prediction back to the corresponding spatial locations of the original image. Heat maps provide an intuitive representation of the areas the model focuses on and its sensitivity to certain features, which helps us understand the decision basis of the model. The redder the color, the more attention is paid to the area. As shown in Figure 17, the baseline model placed less focus on sample features, whereas our proposed YOLO-NF model devoted more attention to them. This is because the SimAM and SE modules in the YOLO-NF model enhance the sample feature weights and the model's ability to perceive global features, which also explains why the discrimination performance of the YOLO-NF model is better than that of the baseline model.

3.7. Deployment of the YOLO-NF Model

The Gradio framework was adopted in the deployment of the proposed model, and the user interface built with it can be shared with others through web connections. If the share parameter is set to “False”, it means that the deployed model is only available on the local server and can only be accessed by a few people, as shown in Figure 18a, which enhances the security of the data. If the share parameter is set to “True”, a public uniform resource locator (URL) will be created, as shown in Figure 18b, which enables the deployed model to be accessed through a browser, thus improving the accessibility for users.
Taking the model deployment on the local server as an example, Figure 19 illustrates the discrimination process in detail. Users could either click the camera icon button to take a picture or upload an image from the photo album to the backend for identification, as shown in Figure 19a, and the identification results are exhibited to users, as depicted in Figure 19b. From uploading images to results return, the process takes about 0.94 s, meeting the demand for real-time identification [31]. Compared to the average running time of 0.022 s on the test set, the increased time is due to data transmission between the front-end and the back-end.

4. Conclusions

This study developed and deployed a novel YOLO-NF model to achieve non-destructive and real-time discrimination between normal and frozen-thawed beef using deep learning techniques. Quantitative analyses confirmed that the model achieved a discrimination mAP of 98.6% with an average running time of 0.022 s per image. Ablation experiments and activation heat maps validated the effectiveness of SimAM and SE modules in enhancing feature extraction and clarifying the decision-making basis, respectively. Additionally, deployment via Gradio yielded a total discrimination time of 0.94 s per image on a local server, highlighting its potential for user-friendly on-site use.
These findings advance food quality assessment by providing a rapid, high-performance, and non-invasive approach for beef quality inspection. The high mAP ensures reliable differentiation between normal and frozen-thawed beef, which is pivotal for preventing mislabeling and safeguarding consumer rights. The real-time capability of the model enables its seamless integration into industrial workflows, mitigating reliance on labor-intensive manual inspection and facilitating efficient market supervision.
However, this study has limitations that should be addressed in future work. First, the model was trained and tested on a dataset of 1200 beef samples, which may restrict its generalizability to larger or more diverse populations. Second, validation was limited to beef (M. longissimus lumborum), and adaptation to other meat types (e.g., pork, lamb) will require targeted training, though transfer learning could accelerate this process. Third, while an independent dataset (120 samples) was used for deployment validation, external validation with third-party datasets is needed to further confirm the model’s robustness across different acquisition conditions and laboratories.
This study highlights deep learning’s potential in food safety, laying a theoretical and technical foundation for quality control in the meat industry. Future work will focus on expanding the dataset, testing across multiple meat types, and conducting external validation to enhance the model’s practical utility.

Author Contributions

Conceptualization, Y.Z. and R.X.; Methodology, R.X.; Software, X.L.; Validation, X.L.; Formal Analysis, R.X.; Investigation, R.X.; Resources, Y.Z.; Data Curation, X.L., J.Y., P.L. and X.D.; Writing—Original Draft Preparation, R.X.; Writing—Review & Editing, Y.Z. and D.L.H.; Visualization, X.L.; Supervision, Y.Z.; Project Administration, Y.Z.; Funding Acquisition, Y.Z. and R.X. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the earmarked fund for China Agriculture Research System (beef and yak), grant number CARS-37; Shandong Provincial Natural Science Foundation, grant number ZR2024QC037.

Data Availability Statement

The original contributions presented in the study are included in the article, further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Shu, Y.; Lv, J.; Lou, A.; Quan, W.; Shen, Q.W. Protein Lysine Acetylation Regulates Beef Color Stability During Storage. LWT 2025, 220, 117468. [Google Scholar] [CrossRef]
  2. Rehman, S.U.; Seo, J.-K.; Romanyk, M.; Shin, D.-J.; Kim, Y.H.B. Effects of Aging and Repeated Freeze-thaw Cycles on Quality Attributes, Physicochemical and Biochemical Properties, and Sensory Characteristics of Beef Sirloins. Appl. Food Res. 2024, 4, 100612. [Google Scholar] [CrossRef]
  3. An, Y.; Wang, Q.; Lu, P.; Liu, Y.; Gao, S.; Fowler, S.; Liang, R.; Zhu, L.; Wang, W.; Zhang, Y. Characterization and Identification of Frozen-thawed Beef: The Interactions of Frozen Storage and Chilled Display Time. Food Sci. Anim. Prod. 2025, 3, 9240098. [Google Scholar] [CrossRef]
  4. Evans, S.D.; Nott, K.P.; Kshirsagar, A.A.; Hall, L.D. The Effect of Freezing and Thawing on the Magnetic Resonance Imaging Parameters of Water in Beef, Lamb and Pork Meat. Int. J. Food Sci. Technol. 1998, 33, 317–328. [Google Scholar] [CrossRef]
  5. Tejerina, D.; Oliván, M.; García-Torres, S.; Franco, D.; Sierra, V. Use of Near-Infrared Spectroscopy to Discriminate DFD Beef and Predict Meat Quality Traits in Autochthonous Breeds. Foods 2022, 11, 3274. [Google Scholar] [CrossRef]
  6. Antequera, T.; Caballero, D.; Grassi, S.; Uttaro, B.; Perez-Palacios, T. Evaluation of Fresh Meat Quality by Hyperspectral Imaging (HSI), Nuclear Magnetic Resonance (NMR) and Magnetic Resonance Imaging (MRI): A review. Meat Sci. 2021, 172, 108340. [Google Scholar] [CrossRef]
  7. Park, S.; Hong, S.-J.; Kim, S.; Ryu, J.; Roh, S.; Kim, G. Classification of Fresh and Frozen-Thawed Beef Using a Hyperspectral Imaging Sensor and Machine Learning. Agriculture 2023, 13, 918. [Google Scholar] [CrossRef]
  8. Zhang, Z.; Wang, S.; Zhang, Y. Non-Destructive Detection of Water Content in Pork Based on NIR Spatially Resolved Spectroscopy. Agriculture 2023, 13, 2114. [Google Scholar] [CrossRef]
  9. Zhang, J.; Toldrá, F.; Kang, D.; Zhou, L.; Wang, J.; Zhang, W.; Hu, Y. Benefits of Ultrasonic Technology Application in Meat Field and Its Influential Mechanism: A Review. Crit. Rev. Food Sci. Nutr. 2024, 1–26. [Google Scholar] [CrossRef]
  10. Soulat, J.; Monteils, V.; Andueza, D. The Freezing Duration and the Number of Freeze-thaw Cycles Influence the Near Infrared Spectra of Ageing Beef Muscles. LWT 2024, 214, 117147. [Google Scholar] [CrossRef]
  11. Hou, M.; Zhong, X.; Zheng, O.; Sun, Q.; Liu, S.; Liu, M. Innovations in Seafood Freshness Quality: Non-destructive Detection of Freshness in Litopenaeus vannamei using the YOLO-Shrimp Model. Food Chem. 2025, 463, 141192. [Google Scholar] [CrossRef]
  12. Liu, J.; Wang, L.; Xu, H.; Pi, J.; Wang, D. Research on Beef Marbling Grading Algorithm Based on Improved YOLOv8x. Foods 2025, 14, 1664. [Google Scholar] [CrossRef]
  13. Yu, Y.; Chen, W.; Zhang, H.; Liu, R.; Li, C. Discrimination among Fresh, Frozen–Stored and Frozen–Thawed Beef Cuts by Hyperspectral Imaging. Foods 2024, 13, 973. [Google Scholar] [CrossRef]
  14. Wu, B.; Qiu, C.; Guo, Y.; Zhang, C.; Guo, X.; Bouhile, Y.; Ma, H. Ultrasonic-Assisted Flowing Water Thawing of Frozen Beef with Different Frequency Modes: Effects on Thawing Efficiency, Quality Characteristics and Microstructure. Food Res. Int. 2022, 157, 111484. [Google Scholar] [CrossRef] [PubMed]
  15. Perez-Palacios, T.; Ávila, M.; Antequera, T.; Torres, J.P.; González-Mohino, A.; Caro, A. MRI-Computer Vision on Fresh and Frozen-Thawed Beef: Optimization of Methodology for Classification and Quality Prediction. Meat Sci. 2023, 197, 109054. [Google Scholar] [CrossRef] [PubMed]
  16. Matsuo, Y.; LeCun, Y.; Sahani, M.; Precup, D.; Silver, D.; Sugiyama, M.; Uchibe, E.; Morimoto, J. Deep Learning, Reinforcement Learning, and World Models. Neural Netw. 2022, 152, 267–275. [Google Scholar] [CrossRef] [PubMed]
  17. Agarwal, M.; Dwivedi, A.K.; Hazra, D.; Gupta, S.K.; Garg, D. Development of IoT Enabled Deep Learning Model for Indian Food Classification: An Approach Based on Differential Evaluation. Food Anal. Methods 2025, 18, 172–189. [Google Scholar] [CrossRef]
  18. Bai, Z.; Zhu, R.; He, D.; Wang, S.; Huang, Z. Adulteration Detection of Pork in Mutton Using Smart Phone with the CBAM-Invert-ResNet and Multiple Parts Feature Fusion. Foods 2023, 12, 3594. [Google Scholar] [CrossRef]
  19. Xiong, Y.; Li, Y.; Wang, C.; Shi, H.; Wang, S.; Yong, C.; Gong, Y.; Zhang, W.; Zou, X. Non-Destructive Detection of Chicken Freshness Based on Electronic Nose Technology and Transfer Learning. Agriculture 2023, 13, 496. [Google Scholar] [CrossRef]
  20. Genç, İ.Y.; Gürfidan, R.; Açikgözoğlu, E. Quality Determination of Frozen-Thawed Shrimp Using Machine Learning Algorithms Powered by Explainable Artificial Intelligence. Food Anal. Methods 2025, 18, 935–945. [Google Scholar] [CrossRef]
  21. Genç, İ.Y.; Gürfidan, R.; Yiğit, T. Quality Prediction of Seabream Sparus Aurata by Deep Learning Algorithms and Explainable Artificial Intelligence. Food Chem. 2025, 474, 143150. [Google Scholar] [CrossRef]
  22. Abd Elfattah, M.; Ewees, A.A.; Darwish, A.; Hassanien, A.E. Detection and Classification of Meat Freshness Using an Optimized Deep Learning Method. Food Chem. 2025, 489, 144783. [Google Scholar] [CrossRef]
  23. Lin, Y.; Ma, J.; Sun, D.-W.; Cheng, J.-H.; Zhou, C. Fast Real-Time Monitoring of Meat Freshness Based on Fluorescent Sensing Array and Deep Learning: From Development to Deployment. Food Chem. 2024, 448, 139078. [Google Scholar] [CrossRef]
  24. Huang, H.; Zhan, W.; Du, Z.; Hong, S.; Dong, T.; She, J.; Min, C. Pork Primal Cuts Recognition Method via Computer Vision. Meat Sci. 2022, 192, 108898. [Google Scholar] [CrossRef]
  25. Yang, J.; Chen, X.; Duan, X.; Li, K.; Cheng, H.; Sun, G.; Luo, X.; Hopkins, D.L.; Holman, B.W.B.; Zhang, Y.; et al. Investigation of Oxygen Packaging to Maintain Beef Color Stability and Microbiology Safety after Periods of Long-Term Superchilled Storage. Meat Sci. 2024, 215, 109548. [Google Scholar] [CrossRef] [PubMed]
  26. Zhang, Y.; Holman, B.W.B.; Mao, Y.; Chen, X.; Luo, X.; Hopkins, D.L.; Zhang, Y. Determination of a pH Threshold for Dark Cutting Beef Based On Visual Evaluation By Asian Consumers. Meat Sci. 2021, 172, 108347. [Google Scholar] [CrossRef] [PubMed]
  27. Li, X.; Xue, S.; Li, Z.; Fang, X.; Zhu, T.; Ni, C. A Candy Defect Detection Method Based on StyleGAN2 and Improved YOLOv7 for Imbalanced Data. Foods 2024, 13, 3343. [Google Scholar] [CrossRef] [PubMed]
  28. Qin, J.; Zhou, H.; Yi, H.; Ma, L.; Nie, J.; Huang, T. YOLOv7-GCM: A Detection Algorithm for Creek Waste Based on Improved YOLOv7 Model. Pattern Anal. Appl. 2024, 27, 116. [Google Scholar] [CrossRef]
  29. Yang, L.; Zhang, R.-Y.; Li, L.; Xie, X. SimAM: A Simple, Parameter-Free Attention Module for Convolutional Neural Networks. In Proceedings of the 38th International Conference on Machine Learning, Virtual, 18–24 July 2021; pp. 11863–11874. [Google Scholar]
  30. Hu, J.; Shen, L.; Sun, G. Squeeze-and-excitation Networks. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA, 18–22 June 2018; pp. 7132–7141. [Google Scholar]
  31. Lin, Y.; Ma, J.; Cheng, J.-H.; Sun, D.-W. Visible Detection of Chilled Beef Freshness Using a Paper-Based Colourimetric Sensor Array Combining with Deep Learning Algorithms. Food Chem. 2024, 441, 138344. [Google Scholar] [CrossRef] [PubMed]
  32. Kılcı, O.; Koklu, M. Automated Classification of Biscuit Quality Using YOLOv8 Models in Food Industry. Food Anal. Methods 2025, 18, 815–829. [Google Scholar] [CrossRef]
  33. Hu, X.; Chu, L.; Pei, J.; Liu, W.; Bian, J. Model Complexity of Deep Learning: A survey. Knowl. Inf. Syst. 2021, 63, 2585–2619. [Google Scholar] [CrossRef]
Figure 1. The workflow of the discrimination method in this study. (a) Dataset preparation. For sample images, first row: normal beef; second row: frozen-thawed beef. (b) Discrimination model. (c) Model deployment.
Figure 2. Flowchart of data acquisition and pre-processing. (a) Data acquisition equipment. (b) Partial beef sample images. First row: normal beef; second row: frozen-thawed beef. (c) Data augmentation for training set. First row: normal beef; second row: frozen-thawed beef. (d) Partial images for model training. First row: normal beef; second row: frozen-thawed beef.
Figure 3. Overall network structure of the YOLO-NF model. Within this structural diagram, dark backgrounds denote the network structure sections, light-colored filled regions provide supplementary explanations of modules within these sections, and core improved modules are distinctly highlighted using red dashed boxes.
Figure 4. Network structure of the original YOLOv7 model. Within this structural diagram, dark backgrounds denote the network structure sections, and light-colored filled areas offer supplementary descriptions of modules within these sections.
Figure 5. Schematic diagram of the SimAM module.
Figure 6. Schematic diagram of the ELAN-SA module.
Figure 7. Schematic diagram of the SE network.
Figure 8. Network structure with the SE attention mechanism.
Figure 9. Flowchart of the model deployment.
Figure 10. Model training and validation results. First row: results on the training set; second row: results on the validation set.
Figure 11. Precision-Recall curves of discrimination results for all classes with the proposed YOLO-NF model and other models (YOLOv5, YOLOv7 and YOLOv8).
Figure 12. Precision-Recall curves of the proposed YOLO-NF model and YOLOv7 model. (a) Precision-Recall curves for all classes discrimination. (b) Precision-Recall curves for normal beef discrimination. (c) Precision-Recall curves for frozen-thawed beef discrimination.
Figure 13. Confusion matrices for discrimination of normal and frozen-thawed beef. (a) The proposed YOLO-NF model. (b) The baseline (YOLO v7) model.
Figure 14. Some discrimination results of the YOLO-NF model. (a) Discrimination results of normal beef. (b) Discrimination results of frozen-thawed beef.
Figure 15. Comparison of qualitative discrimination results between the YOLO-NF model and baseline (YOLOv7) model. (a) Normal beef. (b) Frozen-thawed beef.
Figure 16. Feature maps visualization. (a) Feature maps produced by Layer_0. (b) Feature maps produced by Layer_14. (c) Feature maps produced by Layer_35. (d) Feature maps produced by Layer_90.
Figure 17. Activation heat maps of the YOLO-NF model and the baseline (YOLOv7) model for normal and frozen-thawed beef discrimination.
Figure 18. User interfaces of the deployed model under different deployment methods. (a) Interface for the locally deployed model. (b) Interface for the model deployed via public URL. In the figure, the Chinese text “将图像拖放到此处-或-点击上传” is defined as “Drag and drop the image here-or-click to upload”, which instructs users on how to upload an image for beef discrimination.
Figure 19. Local deployment discrimination process. (a) Image upload. (b) Result presentation to users. In the figure, the Chinese text “将图像拖放到此处-或-点击上传” is defined as “Drag and drop the image here-or-click to upload”, which instructs users on how to upload an image for beef discrimination.
Table 1. Hyperparameter configurations for all experiments in this study.

Hyperparameter          Configuration
Epoch                   80
Initial learning rate   0.0001
Final learning rate     0.1
Batch size              4
Momentum                0.97
Weight decay            0.0005
Input image size        640 × 640
Table 2. Quantitative comparisons of discrimination performance between the proposed YOLO-NF model and other models (YOLOv5, YOLOv7 and YOLOv8). Abbreviations: mAP (mean average precision) is the average of AP (average precision), a core indicator for evaluating beef discrimination model performance; FLOPs (floating point operations) measure model computational complexity.

Model     Precision (%)   Recall (%)   F1-Score (%)   mAP@0.5 (%)   FLOPs (G)   Running Time (s)
YOLOv5    79.3            83.3         81.2           84.8          15.8        0.015
YOLOv8    73.6            84.6         78.7           85.0          28.4        0.011
YOLOv7    82.8            79.7         81.2           89.9          103.2       0.0214
YOLO-NF   95.5            95.2         95.3           98.6          103.4       0.022
Table 3. Detailed discrimination results of the YOLO-NF model and baseline model. Abbreviations: mAP (mean average precision) is the average of AP (average precision), a core indicator for evaluating beef discrimination model performance.

Samples         Model      Precision (%)   Recall (%)   F1-Score (%)   mAP@0.5 (%)
All             Baseline   82.8            79.7         81.2           89.9
All             YOLO-NF    95.5            95.2         95.3           98.6
Normal          Baseline   94.2            79.3         86.1           94.9
Normal          YOLO-NF    93.1            99.6         96.2           98.7
Frozen-thawed   Baseline   71.4            80.2         75.5           85.0
Frozen-thawed   YOLO-NF    97.9            90.9         94.3           98.5
Table 4. Ablation experiment results. A check mark (√) indicates that the module is included; a dash (–) indicates that it is not.

Scheme   Baseline   SimAM   SE   Precision (%)   Recall (%)   F1-Score (%)   mAP@0.5 (%)
A        √          –       –    82.8            79.7         81.2           89.9
B        √          √       –    94.4            88.7         91.5           96.8
C        √          –       √    87.6            94.2         91.8           97.1
D        √          √       √    95.5            95.2         95.3           98.6