Next Article in Journal
Laser-Based Fabrication of Hydrogel Scaffolds for Medicine: From Principles to Clinical Applications
Previous Article in Journal
Preparation and Characterization of Chemically Cross-Linked Xanthan/Poly(Vinylalcohol) Hydrogel Films Containing Cerium Oxide Nanoparticles for Potential Application in Removal of Methylene Blue and Crystal Violet Dyes
Previous Article in Special Issue
The Thermodynamic and Gelation Properties of Ovalbumin and Lysozyme
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Assessment of Frozen Stored Silver Carp Surimi Gel Quality Using Synthetic Data-Driven Machine Learning (SDDML) Model

1
Department of Agricultural and Biosystems Engineering, Iowa State University, Ames, IA 50011, USA
2
Department of Computer Science, Iowa State University, Ames, IA 50011, USA
3
Department of Chemical and Biological Engineering, Iowa State University, Ames, IA 50011, USA
*
Author to whom correspondence should be addressed.
Gels 2025, 11(10), 810; https://doi.org/10.3390/gels11100810
Submission received: 30 August 2025 / Revised: 23 September 2025 / Accepted: 8 October 2025 / Published: 9 October 2025
(This article belongs to the Special Issue Application of Composite Gel in Food Processing and Engineering)

Abstract

The invasive Silver Carp (Hypophthalmichthys molitrix) in North America represents a promising resource for surimi production; however, its gel formability deteriorates significantly during frozen storage. This study investigated the deterioration of gel properties in Silver Carp surimi over six months of frozen storage, and showed that short-term frozen storage (<2 months) was beneficial for surimi gel-forming ability, while extended frozen storage (>2 months) tended to have detrimental effects. The adverse effect of long-term frozen storage could be mitigated via using food additives (e.g., manufactured microfiber, transglutaminase, and chicken skin collagen), among which transglutaminase was the most effective. Transglutaminase at a relatively low level (0.1 wt%) could effectively negate frozen storage’s effects, and produced surimi gel with quality attributes (e.g., gel strength, hardness, and chewiness) at levels comparable to those from fresh fish samples. To assess the effects of the addition of various food additives for quality improvement, a synthetic data-driven machine learning (SDDML) approach was developed. After testing multiple algorithms, the random forest model was shown to yield synthetic data points that represented experimental data characteristics the best (R2 values of 0.871–0.889). It also produced improved predictions for gel quality attributes from control variables (i.e., additive levels) compared to using experimental data alone, showing the potential to overcome data scarcity issues when only limited experimental data are available for ML models. A synthetic dataset of 240 data points was shown to supplement the experimental dataset (60 points) well for assessment of the Frozen Silver Carp (FSC) surimi gel quality attributes. The SDDML method could be used to find optimal recipes for generating additive profiles to counteract the adverse effects of frozen storage and to improve surimi gel quality to upgrade underutilized invasive species to value-added food products.

1. Introduction

Silver Carp (Hypophthalmichthys molitrix), one of the Asian Carps, is threatening the ecosystem of North America as an invasive species [1]. The utilization of the Asian Carp as a food resource remains limited because they are foreign to the American diet. To promote harvesting of these invasive fish, market-welcomed products are needed to produce strong economic incentives. Surimi gel is a fully cooked, protein-rich food made with fish mince through thermal gelation, which could provide a pathway for the marketing and utilization of silver carp. Surimi gel is very popular in East Asian countries and has gained increasing appeal in the European and North American markets over the last decade. Surimi gel is usually made with marine fish, and the overfishing and depletion of marine fish species are threatening the health of the surimi gel industry. Utilization of freshwater invasive fish such as Silver Carp (SC) could ameliorate this problem, reduce the cost of surimi gel production, and at the same time generate a commercial market for the fish with increased customer interest.
However, SC performs less well in surimi gelation. Surimi gel is formed when the myofibrillar proteins (about 70% of total protein in fish muscle) are aggregated and cross-linked through a thermal process [2,3]. The myofibrillar proteins of SC, especially myosin (about 60% of total myofibrillar proteins), are susceptible to the “modori” phenomenon, which causes gel degradation during the thermal process due to the breakdown of myosin by endogenous proteases [4,5]. The freshness of the fish is also a critical point for surimi gel products. The frozen storage process to extend fish shelf-life could lead to unexpected myofibrillar protein denaturation, aggregation, chemical modification, and loss of water-holding capacity (WHC), which causes poor surimi gel quality [6,7,8,9,10,11,12].
Protein denaturation during frozen storage is a detrimental process that contrasts with controlled thermal gelation. Unlike the reversible, ordered unfolding that forms a strong elastic gel network during heating [6], freezing causes irreversible damage [7]. The synergistic effect of ice crystallization and low temperature disrupts the protein’s secondary and tertiary structures [8]. This random unfolding exposes hydrophobic residues, leading to disordered and irreversible aggregation that prevents effective gel formation.
Furthermore, freezing induces harmful chemical modifications. It generates reactive oxygen species that oxidize amino acids, impairing crucial disulfide bond formation for crosslinking [9,10]. It also disrupts cells, releasing endogenous enzymes that proteolyze myofibrillar proteins, further diminishing gel-forming capacity [11].
This collective denaturation, aggregation, and modification cause myofibrillar proteins to lose their native function, resulting in a severe loss of water-holding capacity (WHC) [12]. Consequently, premium surimi products rely on fresh fish, making them expensive. This creates a high demand for remedies, such as surimi from FSC, that can mitigate these freeze-induced defects and improve gel quality, enhancing their economic viability.
The duration of the frozen storage also impacts the surimi gel quality. Research on four commonly used surimi fish in Thailand (i.e., threadfin bream, bigeye snapper, lizardfish, and croaker) stored at −18 °C for a 6-month period found a declining trend in gel-forming ability due to protein denaturation, and the degree of the deterioration in gel-forming ability was dependent upon the species and frozen time [13]. Research on SC captured in China stored at −18 °C for a 3-month period similarly revealed that the gel strength decreased along with the frozen time [14]. Although the native SC in China and the invasive SC in North America are the same species, research has found that significant genetic differences exist between the SC population in China (Yangtze, Pearl, and Amur Rivers) and the population in North America (Mississippi River) [15]. The differences in diet and growth conditions also influence the protein characteristics of fish [16,17]. Therefore, a study using SC captured in North America is necessary to understand the effects of frozen storage on surimi gel formation of the SC surimi in North America. In addition, research on Herring fish revealed that frozen time and pH could affect the rheological properties of Herring fish protein [18]. However, there are few studies on the effects of frozen storage time on the rheological properties of SC surimi paste. Understanding frozen-storage effects on SC surimi rheology can guide its utilization as food ink for 3D printing-based innovative food production, which can lead to value-added food production to further enhance the economic incentives for its utilization.
Our previous study found that additives such as manufactured microfiber (MMF), transglutaminase (TG), and chicken collagen (CLG) can improve the gel strength, texture properties, WHC, and gel microstructures of frozen-stored SC (FSC) surimi gel [19]. MMF is a cellulosic fiber with a length of a few micrometers and a width of a few hundred nanometers, serving as a potent reinforcing agent for protein crosslinking with a high aspect ratio, superior mechanical strength, and a high hydroxyl group density [20]. TG is broadly used in the food industry as meat glue since it can catalyze surimi gel cross-linking with ε-(γ-glutamyl) lysine bonds [21]. CLG is a functional protein that enriches the amino acids to form more protein crosslinking during thermal gel formation [22]. A Response-Surface Modeling (RSM) was developed in the previous study for process optimization to find the best MMF, TG, and CLG combination for improving FSC surimi gel, with limited success due to the complexity of the problem [19]. Recent research indicated that RSM’s fixed-order polynomial regression performed less well than the more accurate and flexible Machine Learning (ML) models [23]. Therefore, ML-based models are explored in this study to better elucidate how additives may be used to counteract FSC surimi’s poor gelation performance.
However, the construction of effective machine learning models can be greatly restricted due to the lack of sufficient data. Preparation of surimi gel samples out of FSC is time- and cost-consuming. In our previous report, we had only four replicated sample groups with 15 treatments (i.e., three levels of each of the three additives, with different combinations in a Box–Behnken experimental design [19]) for each group, which yielded 60 data points in total. The sparsity of the available data was part of the reason why the RSM model did not work very well. The scarcity of data in a small dataset could cause issues such as overfitting and high noise for ML models as well [24]. As physical constraints always exist in most experimental data acquisition work, data augmentation techniques such as data synthesis have been explored as a means to mitigate data scarcity issues related to small-dataset limitations [25]. In this study, we aimed to improve an ML-driven model for FSC surimi gel optimization using data synthesis to supplement the experimental dataset. Various regression approaches, including Linear Regression, Polynomial Regression, Ridge Regression, Support Vector Regression (SVR), Decision Tree, Random Forest, Gradient Boosting, Bayesian Ridge, Automatic Relevance Determination for Bayesian Ridge (ARD), Multilayer Perceptron (MLP), and Extreme Gradient Boosting (XGB) were investigated as means to generate synthetic dataset which could mimic the original experimental data, and the one with the best performance (i.e., highest coefficient of determination R2) was used to generate synthesized dataset. Kernel Density Estimation (KDE) was employed to generate the synthesized data based on the original experimental data because it performed better on small datasets [26]. The synthesized data were screened with the constraints of showing statistical characteristics mimicking the original dataset until a pre-determined size of dataset (i.e., 120, 180 and 240 data points) was reached, and the synthesized data were then pooled with the original experimental data to train ML models for the assessment and optimization of FSC surimi gel properties with different valuation of control variables (i.e., levels of additives).
In addition, in this study, the effects of frozen storage time on surimi gel quality for SC captured in North America was evaluated by gel strength, texture profile analysis (TPA) properties (e.g., hardness, springiness, and cohesiveness), rheological properties (via the small-amplitude oscillatory shear method (SAOS) and hysteresis loop test), expressible moisture (EM), and cooking loss. A Synthetic Data-Driven ML (SDDML) model was developed to predict the gel properties from additive dosage, and ultimately, determine the optimal additive combinations for achieving the desired gel properties tailored towards applications such as 3D printing food ink. The methods created in this research could be easily expanded to include other quality attributes and could be adaptable for other Asian Carp and/or fish species, providing a powerful tool for developing ML-driven predictive models for product optimization.

2. Results and Discussion

2.1. Gel Strength and Textural Properties

Gel strength, defined as the multiplication of breaking force and deformation distance [27], is one of the important parameters to evaluate surimi gel quality and the degree of protein denaturation [14]. Figure 1A shows a significant decrease (p < 0.05) in gel strength of control FSC surimi samples from fresh to 6-month frozen. There was a sharp decline from fresh to 2-month frozen, and it became relatively stable from 2-month frozen up to 6-month frozen. These results are consistent with the findings in SC captured in China [14] and threadfin bream, bigeye snapper, and croaker captured in Thailand [13], which also showed a noticeable decrease within the first 2 months frozen and followed by stabilization. The gel strength of the control samples decreased by 36.2% from fresh to 6-month frozen. The data for the 4th and 5th months of frozen storage were not reported as the 3-month time point was deemed a functionally critical threshold, demonstrating severe irreversible protein denaturation [13,14]. The experimental design focused on capturing the progression of quality loss during the active degradation phase (0–3 months) and a final endpoint representing extreme storage conditions (6 months). Indeed, no significant changes were observed in samples of 4- and 5-month frozen storage from those of either 3 months frozen storage or 6 months frozen storage.
The reduction in gel strength showed that the frozen storage progressively weakens the gel formability of FSC due to protein denaturation and loss of cross-linking ability [7,8]. The one-month frozen control sample showed no significant difference from the fresh sample because at this point, the proteins were just partially denatured. They had begun to lose their structural integrity, but were not completely unfolded and aggregated yet. After 2 months of frozen storage, more proteins had become partially or fully denatured, and after 6 months, a large portion of the proteins were fully denatured, and their gelation performance was significantly deteriorated. In contrast, the 0.1 wt% TG-added samples maintain stable gel strength along with frozen time, remaining statistically higher than the control samples after 2–6 months. Notably, there was no significant difference between 0.1 wt% TG-added samples for all frozen periods and the fresh control sample, suggesting that 0.1 wt% TG enzyme is sufficient to mitigate the adverse frozen impacts of SC on gel strength. TG enzyme catalyzes the formation of covalent ε-(γ-glutamyl) lysine bonds, thereby restoring the cross-linking ability damaged by frozen storage [21,27]. The other two additives (MMF and CLG) were also investigated at various levels to assess their abilities to negate the adverse effects of frozen storage at the 1-month time point. None showed any statistically significant effects on the quality attributes of the gel (e.g., gel strength, hardness, chewiness, etc.); hence, their effects were not further investigated. Consequently, additives such as microfiber (MMF) and collagen (CLG) were ineffective at negating these adverse effects. This is because their mode of action is that of an external filler rather than a functionally restorative for the denatured protein network. While MMF provides an extra physical network, especially for water retention [20], and CLG forms a kind of weak gelatin gel separately [22], neither additive can reverse the underlying protein denaturation or integrate well with the damaged proteins to restore the quality attributes.
Textural properties, including hardness, cohesiveness, springiness, resilience, and chewiness, were measured by Texture Profile Analysis (TPA) to simulate the chewing process of the surimi gel [28]. Hardness measures the force needed to fracture the gel. Cohesiveness indicates the gel’s ability to maintain its integrity during a second compression. Springiness is the capacity to return to its original shape after deformation, following a brief delay. In contrast, resilience is the immediate capacity to recover its height after deformation, without any delay. The product of hardness, cohesiveness, and springiness results in chewiness, which indicates how easily the surimi gel can be chewed [29].
Figure 1B shows a sharp decline in the hardness of the control FSC surimi gel from fresh to 2-month frozen, after which it remained relatively stable. Figure 1C indicates a rapid decrease from fresh to 1-month frozen and remained relatively stable afterward. Overall, hardness decreased by 25.5% and chewiness decreased by 25.6% from fresh to 6-month frozen, likely due to protein denaturation and loss of cross-linking ability during the frozen storage [7,8]. The decline in gel properties of the control samples from fresh to 2-month frozen storage showed that the protein denaturation was partial and in progress, which then stabilized after 2 months, suggesting that enough protein denaturation had occurred by 2 months; further frozen storage did not bring significant changes after that. In contrast, the 0.1 wt% TG-added FSC surimi samples had no significant differences from the fresh control sample, suggesting that 0.1 wt% TG was sufficient to counteract the adverse frozen impacts on hardness and chewiness. The results of hardness and chewiness had similar trends with gel strength because hardness and chewiness are generally consistent with gel strength, indicating the integrity of the 3-D gel network [27].
Table 1 lists the remaining TPA results, including springiness, cohesiveness, and resilience. No significant difference was observed in springiness, indicating that the frozen effect on springiness is too minor to be detected by TPA. Similarly, there was no significant difference in the cohesiveness readings among the samples, suggesting that frozen storage did not substantially impair the ability of the gel to maintain its structural integrity during deformation. In contrast, the 0.1 wt% TG-added samples exhibited higher cohesiveness than the control samples because TG enzyme promoted the formation of covalent ε-(γ-glutamyl) lysine bonds, strengthening the integrity of the 3-D gel network [21,27]. Interestingly, the resilience of the control samples sharply increased from fresh to 1-month frozen and then gradually decreased. This is consistent with the findings that short-term frozen storage can temporarily enhance the texture properties through partial protein denaturation and aggregation [30]. When a small or controlled amount of protein denaturation occurred, it acted as a beneficial “pre-cooking” step, leading to mild unfolding that exposed reactive parts of the protein (like hydrophobic groups and sulfhydryl groups) that are normally buried inside the folded structure. However, when denaturation went beyond a certain threshold, it became destructive, breaking down the protein’s ability to form a gel at all. In our study, this threshold for frozen storage of silver carp was shown to be 2 months of storage time. In comparison, the 0.1 wt% TG-added samples maintained higher resilience than the control, indicating that TG-enhanced cross-linking improved the gel’s ability to recover its structure after deformation [21,27].

2.2. Cooking Loss and Expressible Moisture

Cooking loss is the loss of liquid and soluble matter percentage during heating, mainly from water loss [31]. A denser and more stable network of surimi gel can retain more water (i.e., higher WHC) [30], resulting in lower cooking loss. In Figure 2A, the cooking loss of control samples is shown to sharply decline in the first 2 months of frozen storage, and then return to the level of the fresh control samples, consistent with the findings that short-term frozen storage can temporarily increase WHC by partial protein denaturation and aggregation [30]. These limited changes exposed hydrophilic groups and hydrophobic patches, prompting the formation of a temporary network to effectively entrap water, leading to a transient rise in WHC [30]. The cooking loss of 0.1 wt% TG-added samples increased rapidly after the first month of frozen storage. Two factors may explain this: (i) the WHC decreased during long-term frozen storage [14,30], though the minor change in the control samples after 3 months frozen storage suggested this effect might be quite limited; and (ii) the excessive TG cross-linking reduced protein–water interactions, thereby reducing WHC [32]. The results suggested that 0.1 wt% TG was effective when proteins were still relatively intact (after short-term frozen, <2 months), but became excessive in long-term frozen storage (>2 months storage), where denatured proteins became less able to retain water.
Expressible moisture (EM) is defined as the percentage of loose water released under force relative to total moisture and is commonly used to evaluate WHC [13]. A high EM indicates a poor ability to retain water (i.e., low WHC). As shown in Figure 2B, no significant difference was observed among the control samples over frozen storage up to six months, nor between the control and the 0.1 wt% TG-added samples at the same frozen period. This suggested that the frozen time effect on EM was too minor to be detected for WHC evaluation of control samples. A significant decrease in EM was observed in samples with 0.1 wt% TG after short-term frozen storage (<2 months), indicating an improvement in WHC. This aligned with previous findings, suggesting this TG concentration was effective when the frozen-caused partial protein denaturation and aggregation were overcome. However, its efficacy diminished with long-term storage (>2 months), as extensive protein denaturation reduced the substrate availability and functionality necessary for TG to form a well-hydrated gel network.

2.3. Rheological Properties

2.3.1. Small-Amplitude Oscillatory Shear (SAOS) Test

The SAOS test was conducted to measure the storage modulus (G′), loss modulus (G″), and the tan б, which were used to determine the relationship between elastic (solid-like) and viscous (liquid-like) behavior. In Figure 3, the G′s of both the control and 0.1 wt% TG-added samples were consistently greater than the corresponding G″s (i.e., tan б < 1), demonstrating solid-like behavior of well-formed gel [33]. Furthermore, the G′ and G″ were parallel and gradually increasing, indicating a strong gel network [33], which would benefit 3-D printing of the fish paste as a food ink to maintain its shape and ensure consistent extrusion flow during 3-D printing [34]. In Figure 3A, an increase in storage modulus (G′) of the control FSC paste was observed from fresh to 2-month frozen storage, and declined thereafter, with a notable reduction at 6 months. These results aligned with the findings that a short-term freeze can temporarily enhance the gel network [30], and the extended frozen time damages the protein structure, and decreases both G′ and G″. In contrast, the viscoelastic properties of the 0.1 wt% TG-added sample, as shown in Figure 3B, were relatively stable, suggesting that the TG enzyme can effectively mitigate the frozen effects on viscoelastic properties. The TG enzyme counteracted the damage of frozen storage by forming strong covalent ε-(γ-glutamyl) lysine bonds. This restored the proteins’ cross-linking ability, creating a stable gel network with robust viscoelastic properties [21,27].

2.3.2. Hysteresis Loop Assessment

Thixotropic behavior refers to the ability of a material to decrease in viscosity under applied shear stress and gradually recover when the stress is removed, which is an important property to evaluate the 3-D printability of a food ink [33]. Viscosity deduction during extrusion for 3-D printing allows the surimi paste to flow smoothly through the nozzle, while viscosity recovery after the extrusion would be instrumental in maintaining the printed shape and preventing its collapse. Hence, the samples with strong thixotropic behavior would be favored as food ink for 3D printing. Figure 4 showed that all samples exhibited thixotropic loops, indicating that all of them had thixotropic behavior. The degree of thixotropy is reflected by the area of the thixotropic loop [33]. Figure 4 shows a notable increase in loop area from fresh to 2 months of frozen storage for both the control and 0.1 wt% TG-added samples, which then returned to the level of the fresh sample after extended frozen storage. The TG-added samples exhibited a larger area in short-term frozen storage than the control samples, indicating that the short-term frozen storage and 0.1 wt% TG addition can improve the thixotropy of FSC paste, thereby benefiting its 3-D printability.
Shear-thinning behavior refers to decreased viscosity with increasing shear rate [33]. In 3D printing, viscosity plays a critical role: if the food ink is too viscous, it cannot pass smoothly through the nozzle, whereas the liquid-like food with low viscosity fails to maintain its printed shape. Therefore, selecting a suitable viscosity range is essential for 3D printing operations. Figure 5 shows that all samples exhibited shear-thinning behavior once the shear rate exceeded 0.025 s−1. At the same shear rate, the viscosity of the control samples increased from fresh to 2-month frozen storage, then dropped back after extended frozen storage (Figure 5A). In Figure 5B, the viscosity of the 0.1 wt% TG-added samples increased from fresh to 1-month frozen storage, then returned to the fresh level. These results are consistent with the finding that short-term frozen storage can increase the viscosity of myofibrillar protein [30], thereby improving control of FSC paste viscosity for use as food ink.
Overall, our findings indicated that among the three additives investigated, only TG was truly effective in terms of negating the adverse effects of long-term (>2 months) frozen storage on the gelation properties of the FSC. However, as indicated in our previous work [19], MMF and CLG could improve surimi gel properties when used in synergy with TG. To tailor the FSC gel properties to meet specific needs, such as 3D printing food ink, optimal recipes of additives need to be created. Such a task of optimization requires the ability to make sound assessments and/or predictions of quality attributes based on the compositional control variables (i.e., additive levels), which calls for reliable predictive models. Machine learning algorithms have been reported to provide better predictions than the RSM method [23,24]. In this study, we aimed to explore ML modeling for the assessment of FSC gel qualities. The first task was to explore data synthesis to expand the limited experimental dataset we had at hand, and then to select the best ML models to assess the FSC gel properties based on control variables.

2.4. Synthetic Data-Driven Machine Learning (SDDML) Models

2.4.1. Identify the Appropriate ML Model for Data Synthesis

The key to successfully creating synthesized data that could represent the characteristics of the experimental dataset is to identify the statistical characteristics of the experimental dataset that can be applied to the synthesized data. In this study, three common quality attributes used to evaluate gel quality (e.g., gel strength, hardness, and chewiness) were used as the basis to evaluate the characteristics of the experimental dataset. Various popular ML methods were tested using the normalized original data, with the aim of correlating the controlled variables (e.g., levels of additives of MMF, TG, and CLG) to the three quality attributes. The corresponding determination coefficients (R2) obtained with the validation dataset are shown in Table 2 for various ML methods investigated. Generally, methods based on linear regression did not perform well for predicting the quality attributes from the controlled variables, evidenced by the consistently lower R2, suggesting that simple linear regression cannot capture the complexity of the dataset. In comparison, multilayer perception (MLP) showed a relatively low mean R2 with occasionally higher best-case values, which is reasonable because this method is sensitive to the dataset size (60 data points is too small) [35]. Polynomial and ridge regression methods performed better than the linear model but were outperformed by the tree-based ensemble methods (Random Forest, Gradient Boosting, and Extreme Gradient Boosting (XGB)). Among them, the random forest model achieved the highest mean of R2 values for the three quality attributes (0.464 for gel strength, 0.605 for hardness, and 0.617 for chewiness). The best-case values of R2 for the random forest model were also relatively higher, indicating that the random forest method was the most appropriate model for capturing the characteristics of this dataset. Hence, it was chosen as the method for data synthesis.

2.4.2. Synthetic Data Generation and Validation

Figure 6 shows the process of synthetic data generation for our SDDML method. The details of the process were described in Section 4 below. The time needed for the synthetic data generation function to generate 240 synthetic data points was 1 to 2 min. The Random Forest model trained on the generated dataset and tested on experimental data points achieved relatively high R2 scores for gel strength (0.702), hardness (0.805), and chewiness (0.831), indicating that the generated synthetic data represented the original experimental data well. Then the generated and original data were mixed and split randomly (240 training and 60 testing) to find the R2 distribution of the Random Forest model (Figure 7). The mean R2 was further improved, with 0.889 for gel strength, 0.871 for hardness, and 0.888 for chewiness. These results suggested that the synthetic data generated by the SDDML method not only preserved the intrinsic characteristics of the original experimental dataset but also enhanced model robustness and predictive accuracy when combined with real data.
However, it should be noted that the improvement over model prediction accuracy by synthetic data was not limitless. There was no new information carried by the synthetic data, as they were created to represent the characteristics of the experimental data. Their function was to highlight what was already in the experimental data to assist training of the ML algorithm, not to add any new features. Therefore, when synthetic data were combined with the experimental data, the prediction accuracy of the SDDML model was better than that of the ML model of experimental data alone, but not by a lot. To test the prediction accuracy more directly, 18 experimental data points were randomly selected as a testing set, and then the remaining 42 experimental data points were pooled with various numbers of synthetic data points to generate the training dataset for the random forest ML model. The predicted values of the three quality attributes for the testing set were compared to the actual values. The average determination coefficients (R2) out of 100 repetitions (each repetition with a randomly selected 18 data point testing set) were compared. As shown in Table 3, the prediction accuracy was improved, evidenced by increases in R2 values as well as decreases in the margins of errors when the size of the synthetic dataset increased, significantly for hardness and chewiness, but not significantly for gel strength, even when the size of the synthetic dataset went beyond 1200. Furthermore, the improvement of R2 values over ML with only experimental data was not very large, especially for gel strength (~3%). The improvements over the prediction of textual properties (hardness and chewiness) were larger (~7% and 10%, respectively) in comparison. This is reasonable because the prediction for gel strength had the lowest accuracy to begin with for the original experimental data; meanwhile, the Random Forest model performed better at predicting hardness and chewiness values, leading to a larger improvement with the synthetic data from the SDDML method. The synthetic data would provide better training for the SDDML model, hence improving the prediction accuracy. However, they would not provide new information that was not contained in the experimental dataset, which is its limitation.
The less-than-impressive improvement of prediction accuracy of FSC quality attributes out of the control variables (i.e., additive levels) could be attributed to the complexity of the correlations between them. It is reasoned that changes in control variables led to changes in network crosslinking patterns (i.e., microstructures) in the FSC surimi gel formed, which in turn led to changes in quality attributes. Therefore, when efforts were made to predict the quality attributes out of the control variables directly, the complex intermediate process of crosslinking formation was bypassed, and the prediction accuracy, hence, was intrinsically compromised to a certain degree. This obstacle could be overcome by constructing multi-stage ML models corresponding to the intermediate steps: a first-layer ML model to correlate control variables to attributes of the crosslinking network of the FSC gel, then a second-layer ML model to correlate the microstructural parameters to the quality attributes. At each layer, more direct connections from variables to predictors would yield a better assessment of the quality attributes in the end. Work is underway to pursue this approach, and the data synthesis approach developed in this study will be utilized to improve ML assessment at each layer.

3. Conclusions

This study investigated the effects of frozen storage on the gel quality of surimi prepared from invasive SC in North America and explored remedies using food additives with the SDDML approach. Extended frozen storage significantly reduced gel strength, hardness, and chewiness due to protein denaturation and aggregation, while short-term frozen storage could be beneficial. The addition of 0.1 wt% TG could effectively negate the detrimental effects of extended frozen storage and brought the key texture properties to levels comparable to those of fresh fish samples, demonstrating its potential as a simple and effective mitigation strategy for frozen storage. However, the high cost of TG and the growing demand for nutrition-enhanced products call for the use of combined additives to gain more economic and quality advantages. As promising alternatives, CLG and MMF were combined with TG in our previous study to obtain higher WHC and better textural properties [19]. To find the best recipe of CLG + MMF + TG, assessment tools to predict quality attributes for surimi gel from control variables are needed. As the complex additive interaction patterns are difficult to predict from first principles, machine learning models were explored in this study as possible solutions. Random forest algorithm was shown to be the most reliable predictive assessment tool for the FSC gel. A data synthesis method was developed to supplement the limited experimental data to provide better assessment performance. The proposed SDDML method successfully generated synthetic data that closely represented the original dataset and enhanced model prediction accuracy when combined with experimental data. This approach offered a pathway to address the small dataset limitation common in food science and provided a robust and accessible framework of synthetic data-driven predictive modeling for non-experts. Ultimately, the SDDML approach offered a promising tool to guide the optimization of additive profiles to overcome the negative impacts of long-term frozen storage on surimi gel quality and improve the control over surimi gel production to generate products to meet specific needs. These findings not only supported the broader utilization of invasive Silver Carp in value-added food production but also established an adaptable methodological framework for other food science applications.

4. Materials and Methods

4.1. Materials

Silver Carp fillets (with skin and bones) were obtained from Two Rivers Fisheries, Inc. in Wickliffe, KY, USA. Manufactured microfiber (MMF) was supplied by the University of Maine, Orono, ME, USA. A food-grade TG (100–120 U/g activity), selected for its economic viability and common use in food applications, was purchased from Modernist Pantry, Eliot, ME, USA. Chicken collagen (CLG) was a product of Essentia Protein Solutions, Ankeny, IA, USA. The Sodium chloride, NaCl, was bought from Fisher Scientific, Waltham, MA, USA. Whatman filter paper No. 1 with a diameter of 110 mm (Whatman International Ltd., Maidstone, UK) was used throughout the research. All the chemicals used in this study were of analytical grade.

4.2. Surimi Gel Making

The surimi production protocol was adapted from a method established by Zhang et al. [27]. The fresh SC fillets were split into five groups: one fresh group and four groups subjected to frozen storage for 1, 2, 3, and 6 months. Fresh SC fillets were cut into 4–5 cm segments, and manually deboned and deskinned. The prepared fish segments underwent three washing cycles with cold deionized water (5 °C) to eliminate residual blood and soluble remains. Excess surface water was removed using gauze, resulting in a final moisture content of 78–80%. Then the prepared fish segments were minced using a meat blender (Oster 21304893 Cup Mini Food Chopper, Sunbeam Products, Inc., Boca Raton, FL, USA) maintained below 10 °C in an ice bath for 2 min to form a uniform mince. Then the mince was screened through a 16-mesh screen on ice to remove the fascia and small bone particles. The screened mince was divided half and half: one for control samples, and the other was supplemented with TG. Based on the findings of our previous study that low dosage of TG had a strong effect to enhance FSC surimi [19], 0.1 wt% TG was applied. The control group was mixed with 2.5 wt% NaCl, while the TG group was mixed with 0.1 wt% TG and 2.5 wt% NaCl. Both were blended under the chilled condition for 1 min. A small portion of the paste was stored at 5 °C for further Rheology tests, while the remainder was stuffed into 20 × 20 mm cylindrical casings, which were sealed and subjected to a two-step heat treatment: first at 35 °C for 2 h, followed by 90 °C for 20 min. This heating process was selected to optimize TG enzyme activity and gel formation [27]. Each group has three replicated samples. After heating, the gels were removed from their casings and stored at 5 °C until analysis. The frozen-stored SC fillets were thawed at 5 °C for 24 h once they reached the targeted frozen time, and followed the same protocol as the fresh group to make samples.

4.3. Gel Strength Test

Gel strength was determined as the product of breaking force (N) and deformation distance (m) at the point of break. These parameters were measured using a TA.XT Plus Texture Analyzer (Texture Technologies Corp., Surrey, UK) fitted with a P/5 probe, following a method adapted from Zhang et al. [27]. The instrument settings were: a pre-test speed of 2.0 mm/s, test speed of 1.0 mm/s, post-test speed of 10 mm/s, a trigger force of 10 g, and a 50% compression ratio.

4.4. Texture Profile Analysis (TPA) Test

The textural properties of the surimi gels, including hardness, cohesiveness, springiness, resilience, and chewiness, were measured using a Texture Profile Analysis (TPA). The analysis was also conducted on a TA.XT Plus Texture Analyzer (Texture Technologies Corp., Surrey, UK) according to a procedure adapted from Zhang et al. [27]. A cylindrical probe (P/50) was used to compress the samples to 30% of their original height at a test speed of 1.0 mm/s, with pre-test and post-test speeds set to 2.0 mm/s and 10 mm/s, respectively. A trigger force of 10 g was applied to initiate the test. The TPA parameters were automatically calculated from the resulting force-time curve by Exponent (Version 6.1, Stable Micro Systems, Surrey, UK).

4.5. Cooking Loss and Expressible Moisture

The method to find cooking loss was modified from Zhu et al. [31]. The mass of each surimi sample before heating gelation was recorded as m0. After heating, the surface water of surimi gel was wiped with filter paper, and the surimi mass was recorded the mass m1. The cooking loss was calculated as follows:
Cooking loss = (m0 − m1)/m0 × 100%
The expressible moisture was determined by modifying Petcharat’s group [36]. The gel samples were sectioned into 5 mm-thick discs and weighed (X). Each disc was then sandwiched between two layers of filter paper below and two layers above. A constant 5 kg weight was applied to the stack for two minutes. Following this compression, the sample was reweighed (Y). The expressible moisture was calculated as a percentage of the initial sample weight using the following equation:
Expressible moisture = (X − Y)/X × 100%.

4.6. Rheological Test

The viscoelastic properties of the SC paste were characterized using small-amplitude oscillatory shear (SAOS) testing on a Discovery HR-2 Rheometer (TA Instruments, New Castle, DE, USA). Following the method of Xie et al. [33], a frequency sweep was performed at a fixed strain of 1% across a frequency range of 0.1 to 100 Hz. Sample discs (1 mm thick) were tested using a 40 mm diameter parallel plate geometry. The storage modulus (G′), loss modulus (G″), and the tan б were recorded in response to the frequency change.
The thixotropic behavior of the SC paste was evaluated using a hysteresis loop test, following a method adapted from Xie et al. [33]. The test involved increasing the shear rate from 0.01 to 100 s−1 over a period of 180 s (upward curve), followed by a decrease to 0.01 s−1 over the same duration (downward curve). Sample discs (1 mm thick) were tested using a 40 mm diameter parallel plate geometry. Shear stress and viscosity were recorded throughout the test to characterize the time-dependent rheological properties of each sample.

4.7. Multiple Machine Learning Models Training

To identify the optimal predictive model for synthetic data generation, numerous regression algorithms were evaluated in Jupyter Notebook (Version 7.4, Project Jupyter, hosted by LF Charities, Inc., San Francisco, CA, USA) with Python (Version 3.11, Python Software Foundation, Wilmington, DE, USA). These algorithms included Linear Regression, Polynomial Regression, Ridge Regression, Support Vector Regression (SVR), Decision Tree, Random Forest, Gradient Boosting, Bayesian Ridge, Automatic Relevance Determination (ARD), Multilayer Perceptron (MLP), and Extreme Gradient Boosting (XGB). The hyperparameters of these models were set as default values because they were usually chosen by library developers to work reasonably well across a variety of datasets and provided a baseline performance. Although changing hyperparameters would improve the model performance sometimes, it increases the complexity of the method, which is not tech-friendly for non-experts.
Each model was trained on our previously published experimental dataset for gel strength, hardness, and chewiness of FSC surimi [19]. The data were generated using a Box–Behnken design (BBD) to investigate the impact of combinations of additives with different dosages (0.1, 0.5, 1.0 wt% MMF; 0.1, 0.5, 1.0 wt%TG; and 1.0, 3.0, 5.0 wt% CLG) on FSC surimi properties. The BBD comprised 15 combinations, each with 4 repetitions, resulting in a total of 60 data points. For model development, this dataset was randomly separated into training (42) and validation (18) groups. The model performance was assessed by comparing the predicted values (ŷi) and actual values (yi) using the coefficient of determination (R2). The equation of R2 calculation was as follows, where n is the number of data points [37]:
R 2 = 1 i = 1 n y i y i ^ 2 i = 1 n y i y ¯ 2 ,   w h e r e   y ¯ = 1 n i = 1 n y i .
The R2 directly reflects the proportion of variance in the response explained by the model and allows consistent comparison across gel strength, hardness, and chewiness. Error-based metrics such as RMSE or SMAPE are scale-dependent and less comparable across characteristics, while R2 provides an intuitive, unit-free measure [37]. The best-performing model, defined by the highest mean of R2 value, was subsequently employed in the data generation model.

4.8. Synthetic Data Generation and Validation

The synthetic data generation method was modified from Zhang’s group [25], who used the multi-distribution global trend diffusion (MD-MTD) method to generate spectral-like data, which is too complicated for our small, tabulated dataset. For a small dataset, kernel density estimation (KDE) is a common method to generate new data by approximating the underlying probability distribution [26]. The main steps of the data generation method were shown in Figure 7 and could be described as follows:
  • Initial Model Training: A Random Forest Regressor (RFR) was trained on the original, limited dataset of input-output pairs (x_original and y_original);
  • Modeling the Input and Output Distribution: A Kernel Density Estimation model was independently fitted to both the input and output values of the original data, and generated new input and output values (x_new and y_new) that resided in regions supported by the original data;
  • Finding Absolute Error: The trained RFR was used to predict the output for x_new to gain y_pred. The absolute error was calculated between the y_pred and y_new for the same x_new;
  • Defining Threshold: The validation threshold (τ) was defined as a fraction of the original output’s standard deviation (SD) as follows, where the fit value was adjustable to reach a desired bias between y_pred and y_new:
    τ = fit value × SD;
5.
Validation Loop: If absolute error ≤ τ, the x_new and y_new were deemed consistent with the RFR’s understanding of the input-output relationship and were accepted. If absolute error > τ, the y_new was rejected, as the discrepancy between the KDE-generated output and the RFR-predicted output is too large, and returned to step 2;
6.
Termination Condition: The loop iterates, batch-wise, until the number of accepted samples reaches the exact target.
Following the SDDML method with a fit value of 0.003, 240 synthetic data points were generated for gel strength, hardness, and chewiness. The 240 generated data were used as a training group to train the Random Forest model, and the 60 original data were used as a testing group. The R2 of the model was computed to validate the representativeness of the generated data.
To evaluate the model’s accuracy, a combination of original and synthetic points was split into training (240) and validation (60) sets. The distribution of R2 scores for gel strength, hardness, and chewiness was analyzed.
For a more direct test, 18 original data points were held out as a final testing set. The model was then trained on the remaining 42 original data points, augmented with different synthetic data sizes (0, 60, 120, 240, 600, 1200). The average determination coefficients (R2) out of 100 repetitions (each repetition with a randomly selected 18 data point testing set) were compared.

4.9. Data Analysis and Model Program

JMP (Version 17.2, SAS Institute Inc., Cary, NC, USA) was used for Tukey’s honestly significant difference (Tukey’s HSD) test to determine the significant differences among datasets. The p-value was set to p < 0.05.
Jupyter Notebook (Version 7.4, Project Jupyter, USA) with Python (Version 3.11, Python Software Foundation, Wilmington, DE, USA) was used to plot figures and build models. A template of an SDDML Jupyter Notebook file (accessed on 27 August 2025), including the program described in Section 4.7 and Section 4.8, is available below: https://github.com/JingyiY99/SDDML-Temp (accessed on 27 August 2025).

Author Contributions

Conceptualization, J.Y. and C.Y.; methodology, J.Y., S.C., and C.Y.; software, J.Y. and S.C.; validation J.Y., S.C., T.T., and C.Y.; formal analysis, J.Y.; investigation, J.Y. and T.T.; resources, C.Y.; data curation, J.Y.; writing—original draft preparation, J.Y.; writing—review and editing, J.Y., S.C., T.T., and C.Y.; visualization, J.Y. and C.Y.; supervision, C.Y.; project administration, C.Y. All authors have read and agreed to the published version of the manuscript.

Funding

The authors acknowledge partial funding provided through USDA Hatch Grant for NC 1194 multistate project “Nanotechnology and Biosensors” to support this research.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available upon request from the corresponding author.

Acknowledgments

The authors would like to acknowledge the developers and contributors of the open-source Python ecosystem, particularly the authors of packages used in this research (e.g., Jupyter, NumPy, pandas, scikit-learn, and Matplotlib), whose work greatly facilitated the data analysis and visualization.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
SCSilver Carp
FSCFrozen Silver Carp
SDDMLSynthetic Data-Driven Machine Learning
WHCWater-holding Capacity
MMFManufactured Microfiber
TGTransglutaminase
CLGCollagen
RSMResponse-Surface Modeling
MLMachine Learning
SVRSupport Vector Regression
ARDAutomatic Relevance Determination for Bayesian Ridge
MLPMultilayer Perceptron
XGBExtreme Gradient Boosting
KDEKernel Density Estimation
TPATexture Profile Analysis
SAOSSmall-amplitude Oscillatory Shear
EMExpressible Moisture
MD-MTDMulti-distribution Global Trend Diffusion
RFRRandom Forest Regressor
SDStandard Deviation
HSDHonestly Significant Difference
QAsQuality Attributes
SEMStandard Error of the Mean

References

  1. Chapman, D.C.; Hoff, D.H. Chapter 1: Introduction. In Invasive Asian Carps in North America; American Fisheries Society, AFS Symposium: Bethesda, MD, USA, 2011; Volume 74. [Google Scholar] [CrossRef]
  2. Sun, X.D.; Holley, R.A. Factors influencing gel formation by myofibrillar proteins in Muscle Foods. Compr. Rev. Food Sci. Food Saf. 2010, 10, 33–51. [Google Scholar] [CrossRef]
  3. Yang, N.; Fan, X.; Yu, W.; Huang, Y.; Yu, C.; Konno, K.; Dong, X. Effects of microbial transglutaminase on gel formation of frozen-stored longtail southern cod (patagonotothen ramsayi) mince. LWT 2020, 128, 109444. [Google Scholar] [CrossRef]
  4. Wang, Q.; Jiao, X.; Yan, B.; Meng, L.; Cao, H.; Huang, J.; Zhao, J.; Zhang, H.; Chen, W.; Fan, D. Inhibitory effect of microwave heating on cathepsin L-induced degradation of myofibrillar protein gel. Food Chem. 2021, 357, 129745. [Google Scholar] [CrossRef] [PubMed]
  5. Luo, Y.; Shen, H.; Pan, D.; Bu, G. Gel properties of surimi from silver carp (Hypophthalmichthys molitrix) as affected by heat treatment and soy protein isolate. Food Hydrocoll. 2008, 22, 1513–1519. [Google Scholar] [CrossRef]
  6. Xiong, Z.; Shi, T.; Jin, W.; Bao, Y.; Monto, A.R.; Yuan, L.; Gao, R. Gel performance of surimi induced by various thermal technologies: A Review. Crit. Rev. Food Sci. Nutr. 2022, 64, 3075–3090. [Google Scholar] [CrossRef]
  7. Lee, S.; Jo, K.; Jeong, H.G.; Choi, Y.-S.; Kyoung, H.; Jung, S. Freezing-induced denaturation of myofibrillar proteins in frozen meat. Crit. Rev. Food Sci. Nutr. 2022, 64, 1385–1402. [Google Scholar] [CrossRef]
  8. Arsiccio, A.; Pisano, R. The ice-water interface and protein stability: A Review. J. Pharm. Sci. 2020, 109, 2116–2130. [Google Scholar] [CrossRef] [PubMed]
  9. Zhang, Y.; Ertbjerg, P. On the origin of Thaw loss: Relationship between freezing rate and protein denaturation. Food Chem. 2019, 299, 125104. [Google Scholar] [CrossRef] [PubMed]
  10. Lund, M.N.; Heinonen, M.; Baron, C.P.; Estévez, M. Protein oxidation in muscle foods: A Review. Mol. Nutr. Food Res. 2010, 55, 83–95. [Google Scholar] [CrossRef]
  11. Benjakul, S.; Visessanguan, W.; Tueksuban, J. Changes in physico-chemical properties and gel-forming ability of lizardfish (Saurida tumbil) during post-mortem storage in ice. Food Chem. 2003, 80, 535–544. [Google Scholar] [CrossRef]
  12. Cheng, W.; Sun, D.W.; Pu, H.; Wei, Q. Interpretation and rapid detection of secondary structure modification of actomyosin during frozen storage by near-infrared hyperspectral imaging. J. Food Eng. 2019, 246, 200–208. [Google Scholar] [CrossRef]
  13. Benjakul, S.; Visessanguan, W.; Thongkaew, C.; Tanaka, M. Effect of frozen storage on chemical and gel-forming properties of fish commonly used for surimi production in Thailand. Food Hydrocoll. 2005, 19, 197–207. [Google Scholar] [CrossRef]
  14. Yu, Q.; Liu, J.; Liu, Y.; Zheng, Y.; Pi, R.; Mubango, E.; Tan, Y.; Luo, Y.; Hong, H. Inhibitive effect of cryoprotectants on the oxidative and structural changes in myofibrillar proteins of unwashed mince from silver carp during frozen storage. Food Res. Int. 2022, 161, 111880. [Google Scholar] [CrossRef]
  15. Lu, G.; Wang, C.; Zhao, J.; Liao, X.; Wang, J.; Luo, M.; Zhu, L.; Bernatzhez, L.; Li, S. Evolution and genetics of Bighead and silver carps: Native population conservation versus Invasive Species Control. Evol. Appl. 2020, 13, 1351–1362. [Google Scholar] [CrossRef]
  16. Rungruangsak-Torrissen, K.; Stien, L.H.; Daae, B.S.; Vågseth, T.; Thorsheim, G.B.; Tobin, D.; Ritola, O. Different dietary levels of protein to lipid ratio affected digestive efficiency, skeletal growth, and muscle protein in rainbow trout families. Sch. Res. Exch. 2009, 2009, 1–13. [Google Scholar] [CrossRef]
  17. Cardoso, P.G.; Gonçalves, O.; Cavalheri, T.; Amorim, V.E.; Cao, W.; Alexandrino, D.A.; Jia, Z.; Carvalho, M.F.; Vaz-Pires, P.; Ozório, R.O. Combined effects of temperature and dietary lipid level on body composition, growth, and freshness profile in European Seabass, Dicentrarchus labrax. Animals 2023, 13, 1068. [Google Scholar] [CrossRef] [PubMed]
  18. Geirsdottir, M.; Hlynsdottir, H.; Thorkelsson, G.; Sigurgisladottir, S. Solubility and viscosity of herring (Clupea harengus) proteins as affected by freezing and frozen storage. J. Food Sci. 2007, 72, C376–C380. [Google Scholar] [CrossRef] [PubMed]
  19. Yang, J.; Yu, X.; Dong, X.; Yu, C. Improvement of Surimi Gel from Frozen-Stored Silver Carp. Gels 2024, 10, 374. [Google Scholar] [CrossRef]
  20. Hubbe, M.A.; Tayeb, P.; Joyce, M.; Tyagi, P.; Kehoe, M.; Dimic-Misic, K.; Pal, L. Rheology of nanocellulose-rich aqueous suspensions: A Review. BioResources 2017, 12, 9556–9661. [Google Scholar] [CrossRef]
  21. Miwa, N. Innovation in the food industry using microbial transglutaminase: Keys to success and future prospects. Anal. Biochem. 2020, 597, 113638. [Google Scholar] [CrossRef]
  22. Adamiak, K.; Sionkowska, A. Current methods of collagen cross-linking: Review. Int. J. Biol. Macromol. 2020, 161, 550–560. [Google Scholar] [CrossRef]
  23. Kang, J.-K.; Lee, Y.-J.; Son, C.-Y.; Park, S.-J.; Lee, C.-G. Alternative assessment of machine learning to polynomial regression in response surface methodology for predicting decolorization efficiency in textile wastewater treatment. Chemosphere 2025, 370, 143996. [Google Scholar] [CrossRef]
  24. Dou, B.; Zhu, Z.; Merkurjev, E.; Ke, L.; Chen, L.; Jiang, J.; Zhu, Y.; Liu, J.; Zhang, B.; Wei, G.-W. Machine learning methods for small data challenges in molecular science. Chem. Rev. 2023, 123, 8736–8780. [Google Scholar] [CrossRef]
  25. Zhang, R.; Li, X.; Sun, C.; Yang, S.; Tian, Y.; Tian, J. State of charge and temperature joint estimation based on ultrasonic reflection waves for lithium-ion battery applications. Batteries 2023, 9, 335. [Google Scholar] [CrossRef]
  26. Plesovskaya, E.; Ivanov, S. An empirical analysis of KDE-based generative models on small datasets. Procedia Comput. Sci. 2021, 193, 442–452. [Google Scholar] [CrossRef]
  27. Zhang, N.; Yang, N.; Yu, W.; Jin, Z.; Jiang, P.; Yu, C.; Dong, X. Effects of microbial Transglutaminase on textural, water distribution, and microstructure of frozen-stored longtail Southern Cod (Patagonotothen ramsayi) fish mince gel. J. Texture Stud. 2022, 53, 844–853. [Google Scholar] [CrossRef]
  28. Alemu, T. Texture profile and design of food product. J. Agric. Hortic. Res. 2023, 6, 272–281. [Google Scholar] [CrossRef]
  29. Peleg, M. The instrumental texture profile analysis revisited. J. Texture Stud. 2019, 50, 362–368. [Google Scholar] [CrossRef] [PubMed]
  30. An, Y.; You, J.; Xiong, S.; Yin, T. Short-term frozen storage enhances cross-linking that was induced by transglutaminase in surimi gels from silver carp (Hypophthalmichthys molitrix). Food Chem. 2018, 257, 216–222. [Google Scholar] [CrossRef]
  31. Zhu, J.; Cheng, Y.; Ouyang, Z.; Yang, Y.; Ma, L.; Wang, H.; Zhang, Y. 3D printing surimi enhanced by surface crosslinking based on dry-spraying transglutaminase, and its application in dysphagia diets. Food Hydrocoll. 2023, 140, 108600. [Google Scholar] [CrossRef]
  32. Gaspar, A.L.; de Góes-Favoni, S.P. Action of microbial transglutaminase (MTGase) in the modification of Food Proteins: A Review. Food Chem. 2015, 171, 315–322. [Google Scholar] [CrossRef]
  33. Xie, Y.; Yu, X.; Wang, Y.; Yu, C.; Prakash, S.; Zhu, B.; Dong, X. Role of dietary fiber and flaxseed oil in altering the physicochemical properties and 3D printability of COD protein composite gel. J. Food Eng. 2022, 327, 111053. [Google Scholar] [CrossRef]
  34. Liu, Y.; Sun, Q.; Wei, S.; Xia, Q.; Pan, Y.; Ji, H.; Deng, C.; Hao, J.; Liu, S. Insight into the correlations among rheological behaviour, protein molecular structure and 3D printability during the processing of surimi from Golden Pompano (Trachinotus ovatus). Food Chem. 2022, 371, 131046. [Google Scholar] [CrossRef] [PubMed]
  35. Liu, R.; Li, Y.; Tao, L.; Liang, D.; Zheng, H.-T. Are we ready for a new paradigm shift? A survey on visual deep MLP. Patterns 2022, 3, 100520. [Google Scholar] [CrossRef] [PubMed]
  36. Petcharat, T.; Chaijan, M.; Karnjanapratum, S. Effect of furcellaran incorporation on gel properties of sardine surimi. Int. J. Food Sci. Technol. 2021, 56, 5957–5967. [Google Scholar] [CrossRef]
  37. Chicco, D.; Warrens, M.J.; Jurman, G. The coefficient of determination R2 is more informative than SMAPE, MAE, MAPE, MSE and RMSE in regression analysis evaluation. PeerJ Comput. Sci. 2021, 7, e623. [Google Scholar] [CrossRef]
Figure 1. Gel strength (A), hardness (B), and chewiness (C) results of control FSC surimi samples (blue) and 0.1 wt% TG-added FSC surimi samples (orange) along with frozen storage time. Uppercase letters on error bars show the significant differences (p < 0.05).
Figure 1. Gel strength (A), hardness (B), and chewiness (C) results of control FSC surimi samples (blue) and 0.1 wt% TG-added FSC surimi samples (orange) along with frozen storage time. Uppercase letters on error bars show the significant differences (p < 0.05).
Gels 11 00810 g001
Figure 2. Cooking Loss (A) and EM (B) results of control FSC surimi samples (blue) and 0.1 wt% TG-added FSC surimi samples (orange) along with frozen storage time. Uppercase letters on error bars show the significant differences (p < 0.05).
Figure 2. Cooking Loss (A) and EM (B) results of control FSC surimi samples (blue) and 0.1 wt% TG-added FSC surimi samples (orange) along with frozen storage time. Uppercase letters on error bars show the significant differences (p < 0.05).
Gels 11 00810 g002
Figure 3. The storage modulus (G′) (solid line with solid marker), loss modulus (G″) (dashed line with solid marker), and the tan б (dashed line with hollow marker) of the control FSC paste samples (A) and 0.1 wt% TG-added FSC paste samples (B) along with frozen storage time.
Figure 3. The storage modulus (G′) (solid line with solid marker), loss modulus (G″) (dashed line with solid marker), and the tan б (dashed line with hollow marker) of the control FSC paste samples (A) and 0.1 wt% TG-added FSC paste samples (B) along with frozen storage time.
Gels 11 00810 g003
Figure 4. The thixotropic loop of the control FSC paste samples (A) and 0.1 wt% TG-added FSC paste samples (B) along with frozen storage time.
Figure 4. The thixotropic loop of the control FSC paste samples (A) and 0.1 wt% TG-added FSC paste samples (B) along with frozen storage time.
Gels 11 00810 g004
Figure 5. The viscosity loop of the control FSC paste samples (A) and 0.1 wt% TG-added FSC paste samples (B) along with frozen storage time.
Figure 5. The viscosity loop of the control FSC paste samples (A) and 0.1 wt% TG-added FSC paste samples (B) along with frozen storage time.
Gels 11 00810 g005
Figure 6. The flowchart of the SDDML method for data generation.
Figure 6. The flowchart of the SDDML method for data generation.
Gels 11 00810 g006
Figure 7. The distribution of R2 for the Random Forest model with the mixed dataset of gel strength (A), hardness (B), and chewiness (C).
Figure 7. The distribution of R2 for the Random Forest model with the mixed dataset of gel strength (A), hardness (B), and chewiness (C).
Gels 11 00810 g007
Table 1. Springiness, cohesiveness, and resilience results of control FSC surimi samples and 0.1 wt% TG-added FSC surimi samples along with frozen storage time.
Table 1. Springiness, cohesiveness, and resilience results of control FSC surimi samples and 0.1 wt% TG-added FSC surimi samples along with frozen storage time.
Storage Time
(Months)
TreatmentSpringiness
(Mean ± SEM, n = 3)
Cohesiveness
(Mean ± SEM, n = 3)
Resilience
(Mean ± SEM, n = 3)
0Control0.901 ± 0.003 A0.773 ± 0.001 C0.405 ± 0.002 D
0.1 wt% TG0.881 ± 0.014 A0.794 ± 0.003 BC0.440 ± 0.003 BC
1Control0.881 ± 0.013 A0.777 ± 0.007 C0.434 ± 0.007 BC
0.1 wt% TG0.937 ± 0.012 A0.815 ± 0.009 AB0.482 ± 0.007 A
2Control0.913 ± 0.004 A0.775 ± 0.003 C0.443 ± 0.003 B
0.1 wt% TG0.939 ± 0.017 A0.825 ± 0.003 A0.496 ± 0.006 A
3Control0.907 ± 0.003 A0.793 ± 0.003 BC0.430 ± 0.001 BCD
0.1 wt% TG0.917 ± 0.005 A0.823 ± 0.003 A0.497 ± 0.003 A
6Control0.903 ± 0.004 A0.773 ± 0.003 C0.417 ± 0.003 CD
0.1 wt% TG0.902 ± 0.007 A0.817 ± 0.003 AB0.487 ± 0.003 A
Different uppercase letters in a column indicate significant differences (p < 0.05).
Table 2. Determination coefficient (R2) on the validation set for different ML methods based on experimental data with MMF, TG, and CLG as additives to improve FSC surimi gel * [19].
Table 2. Determination coefficient (R2) on the validation set for different ML methods based on experimental data with MMF, TG, and CLG as additives to improve FSC surimi gel * [19].
Quality Attributes (QAs)Performance SummaryML Methods
Linear RegressionPolynomial Regression **Ridge Regression **SVRDecision TreeRandom ForestGradient BoostingBayesian Ridge **ARDMLPXGB
Gel StrengthMean± SD0.272 ± 0.3250.369 ± 0.3400.402 ± 0.2480.304 ± 0.3020.403 ± 0.2650.464± 0.2180.406 ± 0.2640.406 ± 0.2640.410 ± 0.2640.005 ± 0.6180.409 ± 0.262
Best Case0.6200.8240.7240.7320.8250.8040.8240.7290.7340.7280.825
HardnessMean± SD0.054 ± 0.3110.492 ± 0.2970.502 ± 0.2560.493 ± 0.2730.588 ± 0.2520.605± 0.2200.588 ± 0.2550.508 ± 0.2620.526 ± 0.2470.115 ± 0.8040.586 ± 0.259
Best Case0.4670.8370.8170.8850.8620.8890.8620.8260.8170.8850.873
ChewinessMean± SD−0.173 ± 0.2200.199 ± 0.4090.270 ± 0.2720.577 ± 0.2630.595 ± 0.2780.617± 0.1960.606 ± 0.2400.266 ± 0.2880.260 ± 0.425−0.120 ± 0.9810.584 ± 0.327
Best Case0.1220.5950.6420.8970.9190.9060.9190.6420.6410.8900.919
* This dataset contained 60 data points, randomly separated into training (42) and validation (18) groups. The random state of the model is tested from 0 to 99. ** The polynomial degree used for Polynomial Regression, Ridge Regression, Bayesian Ridge, and ARD was 2.
Table 3. Mean determination coefficient (R2) ± SD on the testing set for different sizes of synthetic datasets using the SDDML model.
Table 3. Mean determination coefficient (R2) ± SD on the testing set for different sizes of synthetic datasets using the SDDML model.
QAsSize of Synthetic Dataset Added to Experimental Data (42) in Training
0601202406001200
Gel Strength0.464 ± 0.218 A0.470 ± 0.209 A0.471 ± 0.216 A0.476 ± 0.206 A0.481 ± 0.215 A0.485 ± 0.208 A
Hardness0.605 ± 0.220 A0.646 ± 0.193 B0.645 ± 0.195 B0.648 ± 0.195 B0.646 ± 0.197 B0.646 ± 0.196 B
Chewiness0.617 ± 0.196 A0.670 ± 0.137 B0.671 ± 0.143 B0.679 ± 0.136 B0.679 ± 0.134 B0.678 ± 0.137 B
Different uppercase letters in a row indicate significant differences (p < 0.05).
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Yang, J.; Chen, S.; Tong, T.; Yu, C. Assessment of Frozen Stored Silver Carp Surimi Gel Quality Using Synthetic Data-Driven Machine Learning (SDDML) Model. Gels 2025, 11, 810. https://doi.org/10.3390/gels11100810

AMA Style

Yang J, Chen S, Tong T, Yu C. Assessment of Frozen Stored Silver Carp Surimi Gel Quality Using Synthetic Data-Driven Machine Learning (SDDML) Model. Gels. 2025; 11(10):810. https://doi.org/10.3390/gels11100810

Chicago/Turabian Style

Yang, Jingyi, Shuairan Chen, Tianjian Tong, and Chenxu Yu. 2025. "Assessment of Frozen Stored Silver Carp Surimi Gel Quality Using Synthetic Data-Driven Machine Learning (SDDML) Model" Gels 11, no. 10: 810. https://doi.org/10.3390/gels11100810

APA Style

Yang, J., Chen, S., Tong, T., & Yu, C. (2025). Assessment of Frozen Stored Silver Carp Surimi Gel Quality Using Synthetic Data-Driven Machine Learning (SDDML) Model. Gels, 11(10), 810. https://doi.org/10.3390/gels11100810

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop