Next Article in Journal
Deciphering Flavor Signatures of Early-Maturing Table Grapes: A Synergistic Multi-Sensor Approach Using E-Nose, GC-MS, and GC-IMS
Previous Article in Journal
L-Proline-Mediated Modulation of Astringency in Black Chokeberry Puree: Molecular Interactions, Process Optimization, and Quality Preservation
Previous Article in Special Issue
Mass Spectrometry-Based Lipidomics in Coffee: Linking Lipid Transformation to Flavor Formation and Quality Control
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Review

Research Progress in Multi-Omics Analysis of Dairy Products: Nutritional Quality, Safety Evaluation, and Health Functions

Key Laboratory of Microbiological Metrology, Measurement & Bio-Product Quality Security, State Administration for Market Regulation, College of Life Sciences, China Jiliang University, Hangzhou 310018, China
*
Author to whom correspondence should be addressed.
Foods 2026, 15(13), 2389; https://doi.org/10.3390/foods15132389 (registering DOI)
Submission received: 11 May 2026 / Revised: 22 June 2026 / Accepted: 3 July 2026 / Published: 4 July 2026

Abstract

This review evaluates multi-omics applications in dairy research across nutrition, safety, and health. Through multi-omics integration, we reveal nutrient differences driven by species, rearing practices, and processing techniques, identify protein patterns and allergen profiles, and construct adulteration detection fingerprints and species-specific peptide markers, thereby improving the timeliness and accuracy of safety assessment. The coupling of metagenomics and metabolomics effectively predicts spoilage-related microbial risks, enabling better risk control. Furthermore, multi-omics approaches systematically elucidate the functional mechanisms of bioactive peptides (e.g., ACE-inhibitory peptides), clarify the prebiotic effects of functional oligosaccharides, and build interaction networks between dairy components and gut microbiota. The introduction of machine learning enables origin and shelf-life prediction, as well as the discovery of novel biomarkers, promoting personalized nutrition and precision fermentation strategies. However, the field is currently constrained by severe reproducibility issues arising from the absence of standardized operating procedures, excessive optimism regarding machine learning models that rarely generalize across laboratories or product matrices, and a persistent disconnect between laboratory-scale biomarker discovery and industrial implementation. Without rigorous cross-platform validation and openly shared multi-omics reference datasets, most published markers remain unfit for regulatory or industrial application. Future efforts should establish standardized workflows and expand the evidence base to drive the dairy industry toward safer, healthier, and more traceable directions.

1. Introduction

Dairy products are globally important food resources. Their nutritional value encompasses not only basic nutrients, but also a range of bioactive molecules that collectively influence growth, immunity, and metabolic regulation [1]. Although mammalian species are numerous worldwide, commercial dairy products are primarily derived from a limited number of species, such as cattle, goats, and camels [2]. The milk of different species exhibits systematic differences in proteins, lipids, minerals, and other components [3]. These differences provide a molecular basis for the development of customized nutrition and specific processing technologies.
However, the dairy industry faces significant safety and quality challenges. The health risks posed by adulteration practices (such as blending low-cost milk sources with high-value ones) have made efficient and traceable detection systems a regulatory focus [4,5]. Traditional methods such as sensory evaluation, electrophoresis, and immunoassays have limitations in resolution and standardization when dealing with complex adulteration scenarios or deeply processed samples [6].
Against this background, multi-omics technologies represented by proteomics, metabolomics, and metagenomics, combined with bioinformatics analyses (e.g., machine learning, network modeling, molecular docking, etc.), are driving a paradigm shift in dairy research from a “single molecule–single function” to a “multi-molecule network–multi-dimensional function” perspective. These technologies provide systematic tools for non-targeted compositional characterization, source identification, and functional evaluation. For example, LC-MS/MS-based metabolomics, lipidomics, and 4D label-free proteomics have successfully revealed significant differences in metabolites, lipids, and proteins among different dairy products; techniques such as mass spectrometry imaging also play important roles in glycosylation analysis [7,8].
Although multi-omics technologies have achieved significant progress in various scenarios such as dairy species identification, adulteration quantification, thermal damage assessment, fermentation monitoring, and probiotic function prediction, their large-scale application still faces challenges. The main bottlenecks include poor cross-platform data comparability; a lack of international standard operating procedures covering the entire workflow from sampling and sample preparation to analysis and annotation; and an unclear pathway for translating laboratory discoveries into regulatory compliance, process optimization, and health claims.
To overcome the above challenges and fully realize the potential of bioinformatics-driven multi-omics, future research should focus on two major strategic directions. The first is dissecting the molecular mechanism of nutritional quality and safety authentication, i.e., using multi-omics data to identify species-specific protein markers, characteristic lipid clusters, metabolic fingerprints, and functional gene profiles of microbiota, thereby supporting high-resolution traceability and risk early warning. The second is the construction of a systematic validation framework for health functions that combines multi-omics with in vitro cell models, organoids, and clinical cohorts to move from correlation to causal inference.
This review will systematically summarize the latest advances in bioinformatics-driven multi-omics integration for dairy nutritional quality, safety assessment, and health functions in the above directions and propose a research framework for standardization and industrial translation. Multi-omics integration encompasses data-level fusion, methodological integration, and industrial translation.
This review follows the principles of a systematic literature review. Peer-reviewed articles published between January 2015 and March 2026 were retrieved from the PubMed database. The search strategy combined the following keywords: (i) dairy products (such as “milk”, “dairy products”, “yogurt”, “cheese”); (ii) multi-omics technologies (such as “proteomics”, “metabolomics”, “lipidomics”, “metagenomics”, “peptidomics”, “foodomics”); and (iii) application fields (such as “nutritional quality”, “safety”, “adulteration”, “allergens”, “bioactive peptides”, “gut microbiota”).
Inclusion criteria were as follows: (1) original research or review articles; (2) in Chinese or English; (3) using dairy products as the main matrix; (4) applying at least one omics technology; (5) results involving nutritional quality, safety assessment, or health functions. Exclusion criteria were as follows: (1) non-dairy matrix; (2) unpublished conference abstracts, dissertations, or preprints; (3) only focusing on the detection of a single metabolite, without covering a multi-omics scope. Quality assessment of original research was performed using the observational study standards adapted from the Newcastle–Ottawa Scale and clinical trial standards derived from CONSORT.

2. Multi-Omics Analysis for Dissecting the Nutritional Quality of Dairy Products

2.1. Species, Environment, and Processing: Three Major Factors Influencing Dairy Product Nutritional Quality

The nutritional quality of dairy products is a complex systemic concept, encompassing multiple dimensions from macronutrients to trace bioactive components. It is mainly influenced by three major factors: species differences, rearing practices (environment), and processing technologies [9]. Milk from different species exhibits inherent differences in proteins, lipids, and minerals, providing a molecular basis for customized nutrition. Rearing practices and environmental stresses can reshape milk composition by modulating rumen microorganisms and host metabolism [10,11]. Thermal treatments, fermentation, and other processing steps further alter protein structures and metabolite profiles, and research to date has failed to comprehensively reveal the holistic impact of the above factors on the molecular composition of dairy products.
Multi-omics technologies (metabolomics, proteomics, lipidomics, and metagenomics) provide systematic profiling by enabling high-throughput, systems-level profiling (Figure 1) [12,13,14,15,16]. Table 1 summarizes the core targets, application directions, and distinctive advantages of each omics approach.

2.2. Nutritional Differences Among Milks from Different Sources: Multi-Omics Fingerprinting with Machine Learning Discrimination

The integrated application of multi-omics technologies, particularly metabolomics, lipidomics, and proteomics, has become a key tool for systematically resolving nutritional differences in dairy products from different milk sources and points to discriminative features amenable to machine learning [15]. By integrating multi-dimensional molecular information from proteomics, metabolomics, and lipidomics, it is possible to construct milk-source-specific molecular fingerprints, providing a high-quality data foundation for subsequent target discrimination based on machine learning.
At the metabolomics level, the application of UHPLC-QTOF-MS, Orbitrap high-resolution mass spectrometry, and 4D label-free quantification technology has enabled deep characterization of metabolite profiles across milk sources from different species [26]. Through systematic analysis of human, porcine, bovine, yak, and buffalo milk, a total of 1992 metabolites were identified, with lipids (21.08%), organic acids and derivatives (18.62%), heterocyclic compounds (14.56%), benzenoids (13.91%), and organooxygen compounds (9.29%) as the major components, providing a high-quality chemical atlas for deciphering the nutritional differences and key biomolecular associations among different milk sources [27]. This study also revealed species-specific molecular networks within individual species, laying a solid data foundation for the subsequent construction of milk-source-specific discriminant models.
At the level of differential comparison, there are systematic differences in protein profiles, metabolite composition, and lipid structures among different milk sources. Rocchetti and O’Callaghan point out that metabolomics technologies have been widely applied in milk discrimination based on factors such as breed, feeding methods, and processing techniques and emphasize that technologies like LC-MS and NMR are driving the evolution from univariate statistical modeling toward multivariate statistics and machine learning modeling, which are more suitable for high-dimensional metabolomics data [28]. At the proteomics level, the expression patterns of casein subtypes and whey proteins exhibit significant species specificity. Their differences not only provide a basis for species identification, but are also closely related to the digestion and absorption characteristics, as well as the functional activity, of the proteins [29]. At the lipidomics level, goat milk is rich in medium-chain fatty acids and polyunsaturated fatty acids, while the lipid profile of cow milk exhibits different distribution characteristics. Similarly, Li et al. employed a multi-omics strategy integrating proteomics, metabolomics, and lipidomics to compare the characteristics of raw milk from Normandy and Holstein cattle [30]. They found systematic differences in proteins, fats, bioactive components, and metabolic pathways and successfully constructed breed-specific molecular fingerprints.
Based on the above multi-omics fingerprint data, the integration of machine learning technologies has fundamentally transformed milk source discrimination from descriptive analysis to predictive modeling. A study reports a novel dairy authentication method based on MALDI-TOF MS lipid–peptide dual-omics fingerprints combined with a machine learning ensemble: the ensemble model constructed from the combined lipid–peptide dataset achieved up to 100% accuracy in organic milk authentication, significantly outperforming models based solely on single-omics feature sets [31]. This result strongly demonstrates that the combination of data fusion strategies and machine learning represents an important trend in multi-omics discrimination systems. Karamoutsios et al. summarize the past decade of studies on species identification and adulteration detection using proteomics and chemometrics, highlighting that multivariate methods such as PCA and PLS-DA are widely used, and that many protein-based biomarkers have achieved detection limits as low as 0.1% or even lower [32].
Collectively, the multidimensional data described above establish milk-source-specific molecular fingerprints, providing a systematic basis for the nutritional quality assessment of dairy products [30,33]. The nutritional differences among dairy products from different milk sources are summarized in Table 2.

2.3. Rearing Practices and Environmental Factors: Multi-Omics Joint Analysis Revealing Microbe–Metabolite Associations

The nutritional quality of dairy products is influenced not only by species-related factors, but also by rearing practices and environmental conditions [40]. Multi-omics technologies provide essential tools for elucidating the effects of these exogenous factors on milk composition [41].
Environmental stress, particularly heat stress, can significantly affect the molecular composition of dairy products [42]. Integrated multi-omics studies indicate that heat stress reduces the antioxidant capacity of bovine milk (decreased SOD and GSH-Px activities, increased MDA levels), disrupts energy, amino acid, and lipid metabolism, and leads to a reduction in nutrients such as essential amino acids, unsaturated fatty acids, and polar lipids, while increasing the abundance of spoilage-related microorganisms and significantly elevating off-flavor volatile compounds [43].
Feeding systems also have a significant impact on milk composition. Using metabolomics techniques, studies have found that milk from cows grazing on functionally diverse pastures is enriched with various metabolites associated with enhanced consumer health, whereas milk from cows grazing on single-species pastures does not show this phenomenon, indicating that metabolomics can provide strong evidence for identifying characteristic metabolites of “grass-fed” diets and their health benefits [44]. In terms of feed type, urine metabolomics studies have shown that cows fed with alfalfa (high-quality roughage) exhibit higher milk performance than those fed with corn stover (low-quality roughage). Hippuric acid (HUA) and N-methylglutamic acid (NML-Glu) are key urinary metabolites that distinguish the two feed types, with HUA serving as a biomarker for the metabolic utilization of low-quality roughage [12,45].
At the rumen microbiome level, multi-omics techniques have revealed a close association between the rumen microecology and the nutritional quality of dairy products. Through integrated analysis of metagenomics and metabolomics, researchers found significant differences in the composition and function of the rumen microbiome between cows with high milk protein yield and those with low milk protein yield: the high-yield group was enriched in Prevotella and branched-chain amino acid biosynthesis pathways and exhibited significantly higher concentrations of rumen metabolites (amino acids, carboxylic acids, and volatile fatty acids) [46]. Using linear mixed-effects models, it was estimated that rumen metabolites contributed more to the variation in milk protein yield (29.76%) than microbial composition (17.81%) or microbial function (21.56%) [47].
In terms of functional feed regulation, metabolomics studies have found that supplementation with perilla leaves increases the abundance of pyrimidine nucleotides in rumen fluid and milk and enriches pyrimidine metabolism and unsaturated fatty acid biosynthesis pathways, providing new insights for the development of functional dairy products [48]. Furthermore, feed processing methods also affect milk protein composition: diet type can regulate milk protein synthesis and secretion by influencing rumen microbial protein, yield and the uptake and utilization of amino acids by the mammary gland [49].
In summary, the application of multi-omics technologies has systematically revealed the multi-level mechanisms through which rearing practices and environmental factors influence the nutritional quality of dairy products. Integrated multi-omics strategies provide a systematic tool for gaining a deeper understanding of the associations between these exogenous factors and milk components.

2.4. Remodeling of Nutrients by Processing: Multi-Omics Dynamic Network Analysis

During the processing and storage of dairy products, their molecular composition undergoes significant changes. Multi-omics approaches provide powerful tools for systematically dissecting the dynamic changes in nutrients and functional components during processing, as well as for identifying potential markers and reaction pathways [49]. As mentioned earlier, the nutritional quality of dairy products is comprehensively influenced by three major factors—species, rearing environment, and processing technology—and processing technology serves as a key downstream link that determines the quality of the final product.
The use of high-throughput, untargeted multi-omics monitoring strategies during storage has become a cutting-edge approach in this field. A study utilizing a combined 1H NMR and UHPLC-QToF/MS platform to analyze metabolite changes in raw milk during freeze-drying and subsequent storage indicated that, during ambient storage, the levels of orotic acid, riboflavin, and acetyl-carbohydrates continuously decreased, while the levels of fatty acids, threonic acid, and uridine showed an upward trend [50]. Subsequent studies further revealed the metabolic profile evolution of spray-dried milk powder under constant temperature and accelerated storage conditions, confirming that storage temperature is a key environmental factor driving this process, thus laying a data foundation for establishing reasonable shelf life and optimizing drying process parameters [50,51].
In the thermal processing stage, multi-omics integration analysis provides molecular-level insights into the effects of different sterilization techniques on milk quality. One study systematically evaluated the differential impacts of industrial-scale direct (130 °C/0.5 s) and indirect (75 °C/15 s) heat treatments on milk protein, metabolite, and lipid profiles, revealing that direct heating is superior to conventional indirect heating in preserving milk proteins and endogenous functional metabolites [52].
In the context of fermented dairy products, the characteristics of lactic acid bacterial strains significantly influence metabolite generation. A study using UPLC-QE-MS untargeted metabolomics to track the fermentation process of a multi-species probiotic culture found that metabolite changes were mainly concentrated in the early stage of fermentation, with various metabolites showing significant correlations with new colony formation, as well as flavor and nutritional quality [53]. Another study compared the contributions of two strains, including Lactococcus lactis LA1 and Lactobacillus helveticus Lh59, to fermentation, revealing strain-specific signature metabolites and interactions, indicating that directed accumulation of effective functional small molecules through the selection of functional lactic acid bacterial strains is a well-founded strategy [54].
During cheese ripening, metabolomics analyses have revealed that tyrosine and its derivatives are closely associated with the degree of ripening, quality, and flavor formation. Biochemical studies indicate that the catabolism of tyrosine (Tyr) and phenylalanine (Phe) by lactic acid bacteria is closely related to the generation of flavor defect compounds such as “barn-utensil” and “floral” notes in cheese [55]. Emerging targeted quantitative multi-omics techniques, such as the LC-MS/MS high-throughput analytical panel covering all 56 glutamyl dipeptides from the α-Glu-X, X-Glu, and γ-Glu-X subgroups, have been successfully applied to precisely assess the effects of cheese ripening, geographic origin, and starter cultures on the contribution to kokumi taste [56].
Furthermore, the initial quality of raw milk is also critical for maintaining processing stability. Targeted lipidomics analysis has revealed that, during cold storage of raw milk at 4 °C for 6 days, lipid molecules such as triglycerides, phospholipids, and free fatty acids undergo metabolic changes that are directly linked to preserving the quality of subsequent processed products [57]. The integrated use of metaproteomics and metabolomics is expected to comprehensively reveal the mechanism of quality evolution during cold storage of raw milk at both the protein and metabolite levels, providing a decision-making basis for developing more effective preservation technologies and optimizing downstream processing techniques [58].
In summary, multi-omics technologies can systematically elucidate the effects of protein degradation, aggregation, and post-translational modifications on the structural properties and nutritional value of dairy products during processing, as well as reveal the dynamic remodeling networks of molecular composition at different processing stages, thereby providing a systematic tool for process optimization and quality enhancement.

2.5. Horizontal Comparison: Applicability and Limitations of Different Omics Technologies in Nutritional Quality Research

Each omics technology has its own strengths and weaknesses when applied to assessing the nutritional quality of dairy products. Metabolomics covers small molecules such as organic acids, amino acids, and nucleotides, offering high detection throughput and good sensitivity, making it suitable for large-scale screening. However, it has limited capability for structural identification of unknown metabolites and is susceptible to matrix interference [59]. Proteomics targets caseins and whey proteins, providing high specificity and quantitative accuracy, and is particularly useful for species identification and allergen analysis. Nevertheless, its dynamic range is limited, making the detection of low-abundance proteins challenging. Lipidomics focuses on fatty acids and phospholipids, which are directly related to dairy functionality (e.g., conjugated linoleic acid), but this approach suffers from low standardization, leading to large inter-laboratory variations in lipid identification [23]. Metagenomics reveals microbial composition and functional genes, offering mechanistic insights, but cannot directly reflect the nutritional components of the final product and is relatively costly [60]. Therefore, nutritional quality research should rationally select or integrate multiple omics technologies based on specific objectives: metabolomics for preliminary screening, proteomics for validation, lipidomics for functional targeting, and metagenomics for mechanistic exploration.

3. Multi-Omics for Constructing a Dairy Safety Assessment Chain

3.1. From Authenticity to Risk Grading: The Progressive Logic of Dairy Safety Assessment

The previous chapter discussed the application of multi-omics technologies in analyzing the nutritional quality of dairy products. This chapter focuses on recent advances in safety assessment. Multi-omics technologies have shown significant potential in dairy safety assessment, offering insights into quality and traceability (Figure 2) [16]. Their applications mainly cover milk authenticity and adulteration identification, species-specific peptide markers, and allergen recognition and risk assessment [61]. By integrating metabolomics, proteomics, genomics, and metagenomics, a comprehensive evaluation of the nutritional quality, safety authentication, and health functions of dairy products can be achieved. This review summarizes key methods, markers, platforms, models, applicable products, processing conditions, and scenarios for each area, as presented in Table 3.

3.2. Adulteration Detection: Multi-Omics Fingerprinting with Database Search for Species-Specific Peptides

As globally consumed nutritional commodities with high economic value, dairy products are frequently subjected to adulteration, especially by mixing with lower-cost milk types [32]. Multi-omics technologies integrate multidimensional molecular information from the proteome, peptidome, metabolome, genome, and others to construct milk-source-specific molecular fingerprinting profiles. Combined with database searching and machine learning algorithms, this approach enables high-sensitivity detection of adulterants and species origin tracing [72].
Mass spectrometry-based proteomics and peptidomics are among the core technologies for adulteration detection [73]. Whey proteins (such as β-lactoglobulin and α-lactalbumin) and casein subtypes exhibit systematic differences across species. Owing to their high thermal stability and processing tolerance, peptides offer greater value than intact proteins for species identification in highly processed dairy products [74]. Using quantitative proteomics and metabolomics, Ji et al. found that protein biomarkers (e.g., OPN, TF) detected bovine milk adulteration at 1% in pasteurized mare milk but only 10% in mare milk powder, whereas N6-methyladenosine detected 0.1% in powder. They also noted that processing reduces biomarker sensitivity, necessitating method adaptation for different product forms [63]. Another study utilized proteomics combined with artificial neural network technology to analyze thousands of mass spectra of cow, goat, and sheep milk collected by MALDI-TOF MS. Through machine learning algorithms, it effectively distinguished the species-specific mass spectral features, providing a high-precision solution for dairy product authentication [75].
In terms of integrating peptidomics and proteomics, a MALDI-TOF-MS-based integrated platform enables simultaneous peptidomic and proteomic analysis. By identifying protein or peptide markers specific to buffalo, goat, or sheep milk, it can rapidly detect whether undeclared cow milk has been adulterated and estimate the extent of adulteration [76,77]. This method has also been extended to characterize heat treatment markers. Compared with earlier approaches that relied solely on protein mass spectrometry, complementary peptide profiling measurements further validate the results and broaden the applicability [78].
Metabolomics also provides important clues for adulteration identification [28]. The species-specific profiles of endogenous small molecules (such as nucleotides, organic acids, and lipids) can be captured by NMR, GC-MS, and LC-MS platforms. Combined with multivariate statistical methods such as partial least squares discriminant analysis, they can effectively distinguish milk subjected to different intensities of heat treatment, thereby aiding in the identification of adulteration [26]. A study systematically reviewed the effects of intrinsic factors (cattle breed, lactation stage) and extrinsic factors (feed, season, region, processing, and storage) on milk metabolites, noting that these metabolite changes can serve as potential biomarkers for evaluating milk origin traceability and quality [26,79].
At the genomics level, by targeting mitochondrial DNA or single-nucleotide polymorphism sites in nuclear genes (such as the casein gene CSN1S1 and the whey protein gene LALBA), precise species tracing and genetic background verification of dairy products can be achieved [80]. Metagenomics can detect residual host cell DNA and symbiotic microbial community structure in raw milk, assisting in ruling out illegal mixing or dilution [81].
Based on the aforementioned multi-omics fingerprints, the integration of machine learning algorithms shifts adulteration identification from descriptive analysis to predictive modeling [82]. Chemometric methods, such as principal component analysis and partial least squares discriminant analysis, are increasingly being integrated into proteomics workflows to handle high-dimensional datasets [32]. Recent studies have explored the application of various machine learning techniques in dairy adulteration detection. For example, a multi-functional detection method using hyperspectral imaging combined with models such as logistic regression, decision trees, and support vector machines achieved high discrimination accuracy on validation sets [83]. Gradient boosting algorithms, represented by XGBoost and LightGBM, have demonstrated classification performance in adulteration detection [84]. However, the lack of standardized protocols, high variability in sample preparation, and insufficient cross-breed and cross-region validation remain core challenges currently faced.
In terms of machine learning algorithm selection, different models exhibit significant differences. Support vector machine (SVM) is robust with small sample sizes but has limited capacity for fitting nonlinear relationships [85]. Random forest (RF) can handle high-dimensional features but is prone to overfitting, especially when the number of features far exceeds the number of samples [86]. XGBoost and LightGBM achieve the highest AUC in most datasets, yet they are sensitive to hyperparameters and require relatively large training sets [87]. Deep learning models (e.g., CNN) demonstrate feature extraction capabilities for spectral and peptidomic data; however, their “black-box” nature, dependence on massive training samples, and poor cross-platform transferability limit their application in regulatory scenarios. Currently, there is no universally optimal algorithm. It is recommended to adopt ensemble strategies or multi-model voting mechanisms based on sample size, feature dimensionality, and detection context. Furthermore, explainable artificial intelligence (e.g., SHAP values) is emerging as an effective tool for screening key biomarkers and reducing model complexity [88].
In summary, multi-omics fingerprints (metabolomics, proteomics, peptidomics, genomics, metagenomics) combined with database searching and machine learning discrimination provide an integrated analytical pipeline from molecular characterization to machine learning classification model for dairy product adulteration detection. In the future, it is necessary to further establish standardized workflows and multi-center validation systems to promote the practical application of this technology in production enterprises and regulatory authorities [75,78].

3.3. Allergen Identification: Multi-Omics Combined with Immunoinformatics for Sequence Alignment and Epitope Prediction

Allergen identification is key to the accurate diagnosis of and effective intervention for allergic diseases [89]. Dairy allergy, particularly cow’s milk allergy (CMA), is one of the most common food allergies in infants worldwide. The major allergens include the casein family (αS1-, αS2-, β-, and κ-casein) and whey proteins (β-lactoglobulin, α-lactalbumin) [90]. In recent years, multi-omics technologies, by integrating data from different levels such as proteomics, metabolomics, and metagenomics, have provided systematic tools for the comprehensive characterization of allergens in dairy products and the assessment of their allergenicity [91,92].
Mass spectrometry-based proteomics has become a core technology for the accurate detection and quantification of allergens. Xu et al. employed bottom-up proteomics combined with high-resolution mass spectrometry to develop a targeted quantitative method for the simultaneous detection of six cow’s milk allergens (α-lactalbumin, β-lactoglobulin, αS1-casein, αS2-casein, β-casein, and κ-casein) in hypoallergenic formulas. They screened and validated 15 signature peptides and ultimately selected 6 peptides for quantitative analysis [93]. This method outperformed both literature-reported and VITAL thresholds and successfully detected allergen residues in commercially available partially and extensively hydrolyzed formulas, providing a direct tool for the quality control of hypoallergenic formula products [92,93].
At the level of epitope recognition and allergenicity assessment, the application of multi-omics approaches has achieved significant advances. Shen et al. developed a highly sensitive, wide-range quantitative method based on IgE multi-epitope-specific antibodies for detecting bovine αS1-casein in foods and predicting potential milk allergenicity [94]. Using polyclonal antibodies against nine IgE epitopes of αS1-casein as detection probes, they established both a conventional sandwich enzyme-linked immunosorbent assay (ELISA) and an Fe-N-C single-atom nanozyme probe sandwich ELISA. The method demonstrated good performance in terms of sensitivity, specificity, accuracy, precision, and recovery, with a linear detection range spanning four to five orders of magnitude [94]. A study involving 118 Spanish children with CMA used a peptide microarray to detect specific IgE/IgG4 linear epitope binding of five major cow’s milk allergens. Combined with baseline serological variables, an XGBoost algorithm was employed to establish a tolerance prediction model [95]. The model achieved AUCs of 0.883 and 0.833 at 6 and 30 months, respectively, indicating that baseline epitope binding profiles can effectively predict when affected children will acquire tolerance [95].
At the level of multi-omics-integrated risk assessment, the integration of multi-omics data with machine learning models has opened new avenues for personalized allergy risk assessment. Hendrickx et al. employed a late-integration multi-view learning approach to fuse clinical data, microbiome data, metaproteomic data, immune data, and metabolomic data, constructing a machine learning classifier capable of predicting CMA remission in infants [91]. The results showed that integrating multi-omics data significantly improved classification performance for CMA remission compared to using single data types alone. Notably, the integration of microbiome data with clinical, immune, metaproteomic, and metabolomic data substantially enhanced classification accuracy in infants with CMA [91]. Furthermore, pathways previously associated with CMA development were also linked to allergy remission, providing new insights into intervening in the allergic process through modulation of host–microbe interactions.
In summary, mass spectrometry-based proteomics provides reliable technical means for high-throughput, high-sensitivity quantitative detection of allergens; the integration of metabolomics and metagenomics offers new molecular-level insights into understanding allergic mechanisms and identifying biomarkers; and risk assessment models integrating multi-omics with machine learning are driving allergy research from association analysis toward clinical applications and personalized prediction.

3.4. Risk Assessment: Integration of Multi-Omics Markers and Microbial Early-Warning Models

Dairy product spoilage and pathogenic microorganism contamination pose serious challenges to global food safety [96]. Traditional risk assessment methods rely on conventional microbiological approaches based on shelf-life studies or evaluations of specific spoilage organisms [97]. However, these data are typically generated using traditional methods and often overlook critical information such as antimicrobial resistance, biofilm formation, and virulence factors, as well as the complex interactions among the biochemical characteristics of the food matrix [98]. Moreover, microbial risks in food are diverse and heterogeneous, potentially resulting from the growth and activity of multiple microbial populations rather than contamination by a single species, which limits the efficiency of traditional risk assessment [99]. Against this background, the integration of multi-omics technologies provides revolutionary systemic tools for risk assessment in the dairy industry [100]. By employing metagenomics to identify the composition and function of spoilage and pathogenic microorganisms, metabolomics to synchronously monitor the accumulation kinetics of spoilage markers, and proteomics to elucidate microbe-driven protein degradation mechanisms, multi-omics integrated strategies are shifting from traditional qualitative or quantitative risk assessment toward molecular evidence-based precision risk assessment [61].
In addressing the issue of microbe-driven deterioration of the quality of raw milk during cold storage, multi-omics technologies have demonstrated great potential for in-depth analysis of the community structure and function of spoilage microorganisms [101]. A study using metagenomics and proteomics to analyze raw milk samples stored at 4 °C for 6 days revealed that the relative abundance of Pseudomonas increased significantly with extended cold storage time, and, together with Acinetobacter, it became the dominant bacterial genus in milk after 6 days [102]. The analysis showed that differential proteins in the early stage of cold storage were mainly involved in immune regulation and signal transduction, while those in the later stage were primarily associated with carbohydrate and lipid metabolism [102]. Another study using coupled metagenomic and metabolomic analysis found that, during storage of raw milk at 4 °C for 3–4 days, the microbial community shifts from Acinetobacter and other species toward Pseudomonas, with the most significant metabolic changes coinciding with the rapid growth phase of psychrophilic bacteria, representing a critical window for quality control [103].
Regarding the dynamic interactions between microbiota and metabolites during the storage and processing of raw milk and dairy products, a study using metagenomic next-generation sequencing combined with LC-MS and GC-MS analyzed the changes in microbiota and metabolites during storage of raw milk, pasteurized milk, and UHT milk. The results showed that the succession trends differed significantly between raw milk and heat-treated dairy products. In refrigerated pasteurized milk, Microbacterium, unclassified Actinomycetia, and Micrococcus showed the most significant increases in abundance and were highly correlated with multiple metabolites, indicating that these genera are the predominant proliferating and metabolically active microbiota [104]. This integrated multi-omics analysis strategy provides a new perspective for systematically studying interacting factors during dairy processing and storage. A study using metagenomics and proteomics validated that the microbe–metabolite–protein association network can be used to construct a risk assessment model for quality deterioration of refrigerated milk. Metagenomics revealed an increase in Pseudomonas abundance, while proteomics showed that proteins in the later stage of cold storage were enriched in carbohydrate and amino acid metabolic pathways [102].
In summary, the deep integration of multi-omics with metagenomics, metabolomics, and proteomics is driving the transformation of dairy risk assessment from traditional, empirical-based linear models toward a precision assessment framework grounded in molecular evidence. By integrating multi-level data from metagenomics, proteomics, and metabolomics to construct microbe–metabolite–protein association networks, spoilage-related microbial and metabolic biomarkers can be identified, providing early warning signals for dairy spoilage risk. However, matrix effects from complex food matrices, insufficient cross-region and cross-breed validation, lack of data standardization, and detection stability in industrial application scenarios remain core challenges that must be overcome for practical implementation in this field [15,96].

4. Multi-Omics for Revealing the Mechanisms of Dairy Health Functions

4.1. From Molecules to Gut Microbiota: Hierarchical Dissection of Dairy Health Functions

From a foodomics perspective, research on the health functions of dairy products is shifting from the identification of single bioactive components toward multi-level, multi-scale systematic analysis. Components such as bioactive peptides, functional oligosaccharides, and lipids found in dairy products can exert various health benefits—including antioxidant, antihypertensive, immunomodulatory, and intestinal barrier repair effects—by directly acting on host targets or indirectly modulating the gut microbiota and its metabolites [105].
This chapter will elaborate on the latest advances in multi-omics applied to the study of dairy health functions from three progressive levels. First, at the molecular level, the integration of peptidomics with molecular docking simulations enables efficient mining of bioactive peptides with angiotensin-converting enzyme (ACE)-inhibitory, antioxidant, and other activities, as well as prediction of their binding modes to target proteins. Second, at the metabolic pathway level, the combined analysis of metagenomics and metabolomics can elucidate the prebiotic effects of functional oligosaccharides, revealing how they modulate gut microbial metabolism to produce beneficial metabolites such as short-chain fatty acids (SCFAs). Finally, at the ecosystem level, multi-omics co-occurrence networks and causal inference models can construct complex interaction maps between dairy components and the gut microbiota, identifying key microbial taxa and metabolic signaling pathways that drive host health. This hierarchical strategy provides a scientific basis for developing precision nutrition strategies and functional dairy products [106].
To provide a concise overview of the multi-omics approaches discussed in the following Section 4.2, Section 4.3 and Section 4.4, Table 4 summarizes the multi-omics technologies and strategies applied at the molecular, metabolic pathway, and ecosystem levels to elucidate the health functions of dairy products.

4.2. Bioactive Peptide Mining: Multi-Omics Integrated Molecular Docking and Function Prediction

Milk-derived bioactive peptides are natural functional fragments embedded within the amino acid sequences of milk proteins, typically composed of 2 to 20 amino acid residues [107]. Released through gastrointestinal digestion, in vitro enzymatic hydrolysis, or microbial fermentation, they can exert various physiological functions such as antioxidant, antihypertensive, antimicrobial, and immunomodulatory effects [108]. Traditional approaches rely on an orthogonal strategy involving in vitro enzymatic hydrolysis, chromatographic separation and purification, and activity screening, which suffer from inherent limitations including long turnaround times, low throughput, and high costs [109,110]. The integrated strategy combining omics and computational simulation is profoundly reshaping this paradigm, upgrading the discovery of active components—such as antihypertensive (ACE inhibitory) peptides, hypoglycemic (DPP-IV inhibitory) peptides, and antioxidant peptides—from empirical experimental screening to data-driven rational design [111].
In the realm of databases and computational prediction, the BIOPEP-UWM database is the most widely used bioactive peptide information platform in food science [112]. It comprises four main modules––a protein sequence database, a bioactive peptide sequence database, an allergenic peptide and epitope database, and a sensory peptide database––thereby laying the foundation for systematic knowledge of milk-derived active peptides [113]. A study using this database tool predicted that bovine milk proteins, after treatment with the three enzymes, could generate 59 bioactive peptides with ACE-inhibitory, DPP-IV-inhibitory, and antioxidant activities [114]. Further validation using the INFOGEST in vitro simulated digestion model identified 36 of these peptides across three milk protein fractions, confirming good correspondence between the computational predictions and the in vitro digestion results [115]. Meanwhile, machine learning-based functional prediction models are also being gradually integrated into this workflow. One study trained five machine learning models—GBDT, XGBoost, LightGBM, CatBoost, and random forest—to predict DPP-IV-inhibitory activity, with the LightGBM model achieving the best performance [116]. This model was then applied to large-scale prediction of DPP-IV inhibitory peptides generated from in silico enzymatic hydrolysis of bovine milk proteins, successfully identifying two candidate peptides, GPVRGPF and HPHPHL, with promising activity [116].
In the experimental identification and characterization of peptide sequences, liquid chromatography–tandem mass spectrometry (LC-MS/MS) platforms based on high-resolution mass spectrometry have become the mainstream tool for identifying the sequences of milk-derived bioactive peptides [117]. Labeled quantitative peptidomics analysis of human milk, bovine colostrum, and donkey colostrum compared the differences in major proteins and their released endogenous peptide profiles, providing a basis for understanding the composition patterns of bioactive peptides in colostrum from different species. On this basis, a study integrated simulated gastrointestinal digestion, proteomics, and computational prediction tools to identify three novel peptides with distinct biological activities from caseins. Among them, EMPFPKY (βCN) exhibited antioxidant activity in vitro and was predicted by molecular docking to bind to the Keap1 protein; FVAPFPEVFG (αCN) showed dual ACE-inhibitory and anticancer activities against MCF-7 breast cancer cells; and NLLRF (αCN) demonstrated selective toxicity toward cancer cells while preserving normal cell activity. These findings illustrate the synergistic advantage of combining experimental and computational approaches to identify and characterize multifunctional peptides [118].
Molecular docking and mechanism validation are central to verifying biological activity and elucidating molecular mechanisms. In a study on the synthesis and in vitro activity characterization of ACE-inhibitory tripeptides and pentapeptides derived from bovine milk proteins, molecular docking analysis not only revealed the binding modes of these peptides to the S1 and S2 pocket regions and the zinc ion domain of ACE, but also elucidated the mechanism by which they block ACE catalytic function through competitive inhibition. Hydrogen bonds were identified as the main driving force maintaining molecular interactions, and all three peptides maintained relatively stable ACE inhibitory activity after simulated gastrointestinal digestion [119]. In a study on the ACE inhibitory activity and molecular docking of heptapeptides derived from bovine κ-casein hydrolyzed by chymosin, trypsin, and pepsin, molecular docking simulations further indicated that oxyanion and cationic hydrogen bonds play important roles in the stable binding of the heptapeptides to the ACE active center, and the proline residue at the C-terminus is crucial for maintaining the inhibitory activity of the peptides [120]. The docking and simulation results of YKVPQLEIVP derived from bovine milk β-casein and its analogs further indicate that the contributions of key amino acid residues Leu, Val, and Pro to DPP-IV inhibitory activity cannot be neglected, providing theoretical guidance at the structural level for subsequent rational design and structural optimization [121].
In terms of peptide function screening approaches, the three mainstream strategies each present their own strengths and limitations. Molecular docking provides structural information on the binding modes between peptides and target proteins (e.g., ACE, DPP-IV), identifying key residues involved in hydrogen bonds, hydrophobic interactions, etc. [122]. However, it relies on known crystal structures and cannot predict novel targets or dynamic conformational changes [123]. Machine learning-based prediction (e.g., LightGBM, random forest) enables high-throughput screening of thousands of peptides with low cost and high speed, but the predicted results require in vitro validation, and the generalizability of models varies considerably depending on the training dataset [124]. In vitro experiments (including enzyme activity inhibition assays and cell models) represent the gold standard for functional validation, yet they suffer from low throughput, long time consumption, and high cost [125]. The integrated strategy combining these three approaches—machine learning-based preliminary screening to narrow down candidates, followed by molecular docking to decipher the binding mechanism, and finally in vitro experiments to validate activity—is currently recognized as the optimal pathway [126]. This strategy has been successfully applied to the efficient discovery of ACE-inhibitory peptides and DPP-IV-inhibitory peptides.
In summary, the integrated multi-omics computational-assisted mining paradigm—using machine learning models and peptidomics as high-throughput prediction and experimental identification tools, and molecular docking as a mechanism validation approach—not only accelerates the identification efficiency of bioactive peptides with potential functions such as ACE-inhibitory, DPP-IV-inhibitory, and antioxidant activities, but also provides reliable methodological support for systematic structure–activity relationship analysis and the precision development of functional dairy products.

4.3. Prebiotic Effects of Functional Oligosaccharides: Multi-Omics Combined Metabolic Pathway Enrichment Analysis

Functional oligosaccharides are a class of carbohydrates that are not digested or absorbed by the host’s gastrointestinal tract but can be selectively utilized by gut microbiota and confer health benefits to the host [127]. They include human milk oligosaccharides (HMOs), fructo-oligosaccharides (FOSs), and galacto-oligosaccharides (GOSs), among others (see Figure 3 for HMO structure, and Figure 4a and Figure 4b for FOS and GOS structures, respectively). In recent years, multi-omics technologies, particularly the combined application of metagenomics and metabolomics, have become core research tools for systematically elucidating the prebiotic effects of functional oligosaccharides and their patterns of enrichment in metabolic pathways [128].
As the third most abundant solid component in breast milk after lactose and fat, HMOs play an indispensable regulatory role in the assembly of the infant gut microbiota and the shaping of metabolic functions. A study integrating metagenomics and metabolomics found that human milk oligosaccharides (2′-FL, LNT, 3′-SL) can modulate the infant gut microbiota (increasing Bifidobacterium, Lactobacillus, and Enterococcus, while reducing Escherichia–Shigella), significantly perturb amino acid, purine, and lipid metabolic pathways, and elevate short-chain fatty acid levels, revealing that HMOs are associated with prebiotic effects, as indicated by enrichment of microbiota-linked metabolic pathways [129].
Fructo-oligosaccharides (FOSs) selectively promote beneficial gut microbes, enhance immune responses, and increase SCFA production. A study integrating metagenomics and metabolomics in an FOS supplementation trial found substantial inter-individual variability in gut microbial composition and function: the increase in Bifidobacterium abundance was significantly greater in some participants than in others, which was associated with differences in the expression of genes encoding enzymes involved in fructose metabolic pathways; metabolomics further revealed individualized metabolic responses in fructose utilization patterns [130]. Pulse metatranscriptomics, by tracking the transcriptional response dynamics of gut microbiota to fructo-oligosaccharides, elucidated an induced expression cascade of CAZymes and identified multiple atypical FOS-metabolizing strains, providing key support for deciphering their metabolic profiling and prebiotic effects [131].
In terms of functional elucidation of galacto-oligosaccharides (GOSs), a double-blind randomized controlled trial showed that, after 4 weeks of intervention with GOS-enriched low-lactose milk, responders had reduced symptom scores, while non-responders showed increased Bifidobacterium abundance; plasma metabolomics identified 12 differential metabolites (mainly involved in amino acid and lipid metabolic pathways), indicating that individualized baseline gut microbiota composition determines the response to GOS intervention, providing feasibility evidence for precision prebiotic intervention [132].
In summary, multi-omics studies on human milk oligosaccharides, fructo-oligosaccharides, and galacto-oligosaccharides collectively demonstrate that the combined analysis of metagenomics and metabolomics can systematically reveal the regulatory effects of oligosaccharides on gut microbiota composition, short-chain fatty acid metabolism, and amino acid and lipid pathways, thereby providing a technical framework for elucidating the molecular mechanisms of prebiotic effects and for precision nutrition intervention.

4.4. Dairy–Gut Microbiota Interactions: Multi-Omics Co-Occurrence Networks and Causal Inference

The interaction between dairy components and the gut microbiota is not a simple linear relationship, but rather involves multi-level synergistic changes within the microbiota, between the microbiota and metabolites, and between metabolites and host phenotypes [133]. Traditional correlation analysis can only capture pairwise statistical dependencies and is insufficient to reveal the network structure of multi-party interactions in complex ecosystems, often misinterpreting indirect associations as direct causality [134]. The integration of multi-omics co-occurrence networks and causal inference methods provides a systematic tool to address this challenge [135].
Co-occurrence network analysis, by constructing an association map among microbes, metabolites, and host indicators, can identify key functional modules driving the health effects of dairy products. A randomized controlled study in healthy adults showed that, after an 8-week intervention with probiotic fermented milk, the density of the gut microbiota–metabolite co-occurrence network and the number of connections related to anti-inflammatory metabolites increased, and the Lactobacillus–Bifidobacterium–SCFA module was positively correlated with IL-10. This indicates that fermented milk enhances the functional stability of the microbiota by remodeling the network structure [136]. In a study of gut microbiota enterotypes in dairy goats, co-occurrence network analysis also revealed complex interactions between different microbial community patterns and lactation performance, as well as differences in key species [136]. A systematic review further integrated multi-omics evidence, indicating that fermented dairy products synergistically enhance microbial diversity, strengthen the intestinal epithelial barrier, and modulate immune signaling pathways through live cultures and their metabolites. However, current research is limited by heterogeneous experimental designs and short intervention periods, highlighting the urgent need to integrate longitudinal multi-omics data to establish persistence and causal validation [128].
In terms of causal inference, Mendelian randomization uses genetic variants as instrumental variables to infer the causal direction between dairy consumption and gut microbiota in observational studies. A bidirectional Mendelian randomization study in a European population, using the LCT gene variant as an instrumental variable, found that a genetically predicted increase in milk intake led to increased abundances of Bifidobacterium and Lactobacillus and decreased abundances of Streptococcus and Escherichia/Shigella, with no significant reverse causality, supporting a unidirectional causal relationship of “dairy intake driving microbiota remodeling.” This study further extended the analysis of the association between LCT variants and gut microbiota as a function of dairy intake [137]. Furthermore, a study using two-sample Mendelian randomization analyzed the causal relationship between milk intake and body mass index, as well as blood lipids, providing genetic evidence for causal inference regarding the impact of dairy consumption on metabolic phenotypes [138].
Structural equation modeling can simultaneously evaluate direct and indirect pathways among multiple latent variables and is suitable for dissecting the multi-level causal chain of “dairy products–gut microbiota–host health.” A review by Kinkpe et al. systematically summarized the application of network analysis combined with multi-omics data in identifying key species, emphasizing that multi-omics integration is a key approach to separating true ecological relationships from statistical noise and linking microbes to functions, thereby providing methodological support for constructing a microbiota–metabolite–host health network framework [139]. Martinez-Boggio et al. utilized structural equation modeling and mediation analysis to investigate the effects of the host genome and rumen microbiome on feed efficiency in dairy cows, demonstrating a practical application of mediation-based structural equation modeling in inferring microbe-mediated hierarchical causal pathways [138]. Li et al. employed a multi-omics approach combined with mediation analysis and causal network inference to systematically dissect the microbial metabolic pathways and causal links through which probiotic fermented milk remodels the gut microbiota and exerts angiotensin-converting enzyme (ACE) inhibitory functions, demonstrating the feasibility of the multi-omics causal analysis framework in moving from correlation to mechanistic validation [140].
Most studies cited in this section report correlational associations rather than definitive causal mechanisms. Mendelian randomization and structural equation modeling can suggest directionality, but they rely on strong assumptions (e.g., no horizontal pleiotropy) that are rarely fully met in dairy nutrition studies. Gold-standard causal evidence would require randomized controlled trials with multi-omics endpoints, which remain scarce. In summary, co-occurrence network analysis reveals functional association patterns between dairy components and the gut microbiota, while causal inference methods such as Mendelian randomization and structural equation modeling translate these associations into testable causal pathways. The multi-omics integrated causal analysis framework is advancing research on dairy–gut microbiota interactions from correlational description toward mechanistic validation, providing actionable scientific evidence for precision nutrition interventions based on the gut ecosystem and for the development of functional dairy products.

5. Data Integration and Standardization Challenges: Toward Reproducible Dairy Omics Research

5.1. Cross-Platform Data Standardization and Multi-Omics Workflow Construction

Moving from laboratory discoveries to industrial applications faces multiple challenges [140]. One of the most prominent bottlenecks is the lack of standardization in multi-omics data preprocessing. Aside from the absence of unified standard operating procedures, persistent unresolved technical obstacles including unpredictable batch effects stemming from long-term instrument drift without adequate pooled reference samples or randomized sequencing arrangements, inconsistent sample preparation workflows across labs that compromise result repeatability, divergent outcomes from varied peak-processing algorithms and insufficient documentation of relevant software settings, uneven metabolite database annotation stemming from disparate spectral repositories and insufficient verification against authentic reference compounds, and poor cross-platform compatibility among LC-MS, NMR, and GC-MS datasets paired with benchmark-deficient integration approaches reliant on subjective parameter adjustment for dairy-related samples collectively demand joint efforts from the research community to develop standardized reference materials, enforce compliance with established reporting guidelines such as MIAPE and metabolomics standards initiative checklists, and launch dedicated interlaboratory ring tests targeting dairy matrices to achieve effective resolution [141].
Closely related to this is the specialized demand for bioinformatics expertise: multi-omics datasets are typically large in scale and highly heterogeneous, and their storage, processing, and analysis require interdisciplinary knowledge of topics including biostatistics, machine learning, programming, and biology [15]. Developing customized bioinformatics pipelines that incorporate different methods, flexible parameter settings, and robust version control remains a major bottleneck in this field [61].
Furthermore, the selection of integration methods is also considerably complex. Currently, a variety of multi-omics integration methods exist (such as MOFA, DIABLO, SNF, etc.), each employing different algorithmic strategies [142]. The lack of a general framework to guide which method is most suitable for a given dataset or biological question often leaves researchers confused [143]. Even when data integration is successfully completed, translating algorithmic outputs into actionable biological insights remains a significant bottleneck. In dairy research, this specifically manifests as difficulty in directly linking multi-omics findings to milk quality indicators, health functions, or production process optimization [144].
To overcome the above challenges, a standardized framework for dairy multi-omics research needs to be established. At the data level, a cross-platform molecular reference database (e.g., LactoBase) should be built to collect species-specific protein markers, characteristic lipid clusters, metabolic fingerprints, and functional gene profiles of microbiota, while promoting unified formats for public data repositories to ensure that data are Findable, Accessible, Interoperable, and Reusable (FAIR principles) [145]. At the methodological level, joint efforts by ISO and the International Dairy Federation (IDF) should be promoted to develop multi-omics standard operating procedures (SOPs) covering the entire workflow from sample collection, pretreatment, and mass spectrometry analysis to data annotation, with differentiated pretreatment and analysis parameters tailored to different dairy matrices (liquid milk, milk powder, fermented milk) [146]. At the application level, a closed-loop translation pathway from omics discovery to biomarker validation, regulatory approval, and industrial adaptation needs to be established, simplifying complex omics workflows into low-cost, easy-to-operate rapid detection solutions that can be adopted by small and medium-sized dairy enterprises [147].
Despite the proliferation of high-accuracy claims, the reality is that the vast majority of published multi-omics biomarkers fail in independent validation. This is rarely disclosed in the literature. The reasons are structural: (i) batch effects across mass spectrometry platforms are consistently underestimated, with day-to-day instrument drift alone often exceeding biological variation [148]; (ii) sample sizes in most dairy omics studies are grossly insufficient for the dimensionality of features (thousands of metabolites/peptides), leading to massively inflated classification performance through overfitting [149]; (iii) machine learning models are frequently trained and tested on data from the same batch or same farm, yet their performance collapses when applied to milk from different seasons, regions, or processing conditions [148]; (iv) publication bias strongly favors positive results, while negative validation studies remain largely unpublished or buried in supplementary materials [150].
Consequently, claims such as “achieved 100% accuracy for organic milk authentication” or “LOD as low as 0.1% for adulteration” must be read with caution: they are typically derived from convenience samples under optimal laboratory conditions, not from the heterogeneous, uncontrolled environment of routine industrial quality control. The field urgently needs mandatory cross-laboratory ring trials and pre-registered validation protocols before any multi-omics signature can be considered for standardization [151].

5.2. Multi-Omics Machine Learning Prediction Models: From Biomarkers to Personalized Nutrition and Precision Fermentation

Integrating machine learning with multi-omics is one of the key pathways to overcome the above bottlenecks. Traditional correlation analysis primarily captures linear relationships and struggles to represent the complex nonlinear interactions in biological systems. Machine learning algorithms (e.g., random forest, support vector machines, and XGBoost) can handle high-dimensional data and identify more robust and biologically relevant combinations of biomarkers [152]. Deep learning models (e.g., deep autoencoders, hypergraph convolutional networks) can automatically learn complex features and patterns in data, making them particularly suitable for processing high-dimensional, nonlinear multi-omics data [153]. In dairy research, deep learning can be used to predict molecular changes in products under different processing conditions, identify key biomarker combinations affecting quality and safety, and build end-to-end prediction models from multi-omics data to final product attributes [142].
However, the practical reality of machine learning in dairy multi-omics is far more sobering. Published models rarely report performance on truly independent external test sets (e.g., from different countries or different processing plants) [154]. When such tests are performed, the area under the curve (AUC) typically drops by 0.2–0.4 compared to internal cross-validation [155]. Moreover, most models are not reproducible: the combination of feature selection, hyperparameter tuning, and random seed variance leads to entirely different sets of “key biomarkers” when the same analysis pipeline is re-run on the same data [156]. This “instability of discovery” directly undermines the credibility of multi-omics for regulatory use. A few critical limitations that are systematically underreported include (i) the fact that the sample-to-feature ratio rarely exceeds 1:100, making deep learning models almost certain to memorize noise; (ii) the lack of prospective validation—almost no dairy omics model has been tested in a real production line with continuous on-line monitoring; (iii) the black-box nature of the analysis– even when SHAP values are provided, they do not guarantee biological causality, and regulatory agencies remain reluctant to accept non-mechanistic models for safety decisions. Future work must prioritize small, interpretable feature sets validated across at least three independent laboratories and on at least two different mass spectrometry platforms before any industrial deployment is considered.
Based on the above analysis, future research should focus on two major strategic directions. The first direction is the molecular mechanism dissection of nutritional quality and safety authentication, namely, using machine learning to build a machine learning classification model for species-specific protein markers, characteristic lipid clusters, and metabolic fingerprints [157]. Recent studies have demonstrated the effectiveness of machine learning algorithms such as Random Forest and Support Vector Machine in identifying unknown protein adulterants in dairy products [158]. The development of explainable artificial intelligence can further screen sparse biomarker combinations, reducing detection costs and improving model generalizability [159]. The second direction is the construction of a systematic validation framework for health functions, which requires combining multi-omics data with in vitro cell models, organoids, and clinical cohort studies to move from correlation to causal inference [160]. Methods such as Mendelian randomization and mediation analysis can be used to validate the actual physiological effects of bioactive peptides, functional oligosaccharides, and other components [161]. Molecular docking simulations can predict the binding affinity of peptides to targets (e.g., angiotensin-converting enzyme), while metabolic pathway enrichment analysis elucidates their mechanisms of action. Ultimately, co-occurrence networks linking dairy components and gut microbiota can be constructed to identify key microbial metabolites (e.g., short-chain fatty acids) that drive host health [28].
Looking to the future, dairy multi-omics research is entering a critical turning point toward industrialization. Priority should be given to the development of real-time detection technologies, transforming validated stable biomarkers into rapid detection kits (e.g., ELISA, lateral flow test strips) or on-line monitoring sensors to meet the demands of production sites and market surveillance [16]. Meanwhile, a comprehensive dairy multi-omics database should be constructed to integrate data including identified bioactive peptide sequences, allergen epitopes, and species-specific peptide biomarkers, which can serve as a unified benchmark for relevant detection. Furthermore, cross-disciplinary cooperation among dairy science, computational biology, clinical medicine, and regulatory policy must be reinforced, and interdisciplinary professionals skilled in both dairy manufacturing and data science should be trained—these two measures are fundamental prerequisites to advance the translation of relevant technologies [28]. Finally, efforts should be made to incorporate multi-omics detection methods into national and international food safety standards (e.g., AOAC, ISO), so that omics markers can become officially recognized markers for adulteration detection and quality assessment [153]. The roadmap at the end of this chapter (Figure 5) summarizes the complete pathway from data standardization and cross-platform validation to industrial application, providing an actionable guide for the implementation of multi-omics technologies in the dairy industry.

5.3. Economic and Industrial Feasibility: Barriers to Routine Implementation

Most multi-omics techniques described in this review—particularly high-resolution LC-MS/MS, 4D-label-free proteomics, and metagenomic shotgun sequencing—remain prohibitively expensive and require specialized personnel and infrastructure [162]. This places them beyond the routine quality control budgets of most dairy processing plants, especially small and medium-sized enterprises (SMEs). Furthermore, sample turnaround times (often days to weeks) are incompatible with real-time production line decisions. Therefore, a pragmatic pathway should prioritize (i) converting validated multi-omics biomarkers into low-cost, rapid formats such as ELISA kits, lateral flow strips, or targeted qPCR assays; (ii) developing simplified, portable mass spectrometers or near-infrared sensors calibrated against multi-omics reference data; and (iii) reserving full-scale omics for reference laboratories, dispute resolution, and regulatory enforcement rather than daily screening. Without such translation strategies, the industrial impact of dairy multi-omics will remain marginal.

5.4. Multi-Omics Data Fusion and Machine Learning Classification Model Analysis Strategies

Multi-omics integration is not a simple concatenation of data, but rather requires the selection of appropriate fusion levels, algorithmic strategies, and validation schemes based on the research objectives and data characteristics.

5.4.1. Three-Level Architecture for Multi-Omics Data Fusion

Low-level fusion (early integration) directly concatenates raw data or preprocessed features and is suitable for same-platform data (e.g., metabolomics and lipidomics simultaneously acquired by LC-MS); it offers minimal information loss but suffers from the curse of dimensionality, difficulty in aligning heterogeneous data, and a high risk of overfitting [163]. Intermediate-level fusion first performs dimensionality reduction or feature extraction on each omics dataset separately (e.g., PCA, sparse PLS, and autoencoders) before integration, with representative methods including MOFA, DIABLO, and sPLS-DA; this approach is suitable for cross-platform data (e.g., 16S rRNA sequencing + LC-MS metabolomics) and represents the current mainstream strategy for multi-omics integration [164]. High-level fusion (late integration) builds independent models for each omics layer and then combines them via voting or weighted averaging at the decision level, which is suitable for highly heterogeneous data (e.g., genomics + metabolomics + clinical phenotypes); it provides good robustness, such that noise in one omics layer does not easily affect the overall result, but it ignores cross-omics interactions. It is recommended that dairy multi-omics studies prioritize intermediate-level fusion and report the performance improvement (e.g., classification accuracy, AUC) before and after fusion.

5.4.2. Chemometrics and Machine Learning Fusion for Omics Spectral Data

Spectral data generated by techniques such as Raman spectroscopy, near-infrared (NIR) spectroscopy, and hyperspectral imaging (HSI) are essentially “pseudo-omics” data characterized by high dimensionality, high collinearity, and variable signal-to-noise ratios. Commonly used chemometric algorithms include PCA, PLS-DA, SVM, RF, CNN, and one-dimensional convolutional neural networks (1D-CNN). In terms of fusion strategies, inputting spectral features together with mass spectrometry-based omics markers into a model can significantly improve generalization ability. For example, Zhou et al. fused lipid fingerprints and peptide fingerprints from MALDI-TOF MS and combined them with an ensemble learning model, achieving 100% accuracy in organic milk authentication, which was far superior to single-omics models [31].

5.4.3. Role of Explainable Artificial Intelligence (XAI) in Multi-Omics Integration

Although black-box models (e.g., XGBoost, deep learning) offer high predictive accuracy, they cannot meet regulatory requirements for interpretability [163]. Explainable artificial intelligence (XAI) methods, including SHAP, LIME, and attention mechanisms, can be used to identify key markers, simplify detection panels, and improve cross-laboratory reproducibility [165]. In dairy adulteration detection, using SHAP values to select the most stable five peptides allows compression of the model from >200 features to fewer than 10, while maintaining a cross-platform validation AUC > 0.90 [166]. Introducing XAI is a critical step toward moving multi-omics from the laboratory to regulatory acceptance.

5.4.4. Smart Sensing Technologies and On-Line Monitoring: From Laboratory to Industrialization

Smart sensing technologies such as Raman spectroscopy, NIR, HSI, electronic noses, and electronic tongues enable non-destructive, real-time, on-line detection [167]. The integration strategy with multi-omics data involves first discovering markers (e.g., specific metabolites, peptides) through multi-omics, then training spectral models, and finally deploying them on production lines. The AI-integrated Raman platform developed by Bhowmik et al. achieved a classification accuracy exceeding 93% for microbial spoilage in milk, demonstrating commercial potential [168]. However, matrix effects, long-term sensor drift, and insufficient validation in diverse scenarios remain major challenges in real-world production environments. The future path to industrialization should reserve full-scale omics for reference laboratories and dispute arbitration, while converting validated markers into low-cost, rapid detection kits (ELISA, dipsticks, and qPCR) or portable sensors [153,167].

5.5. Current Knowledge Gaps and Priority Actions

Despite the significant progress in multi-omics in dairy research, the following four core research gaps, which are directly related to industrial implementation and regulatory acceptance, remain unresolved. These gaps are summarized in Table 5.

6. Conclusions

Multi-omics technologies have evolved from single-omics analysis to multi-omics integration, providing systematic molecular insights into the nutritional quality, safety assessment, and health functions of dairy products. In terms of nutritional quality, the combined application of proteomics, metabolomics, and lipidomics has revealed the systematic effects of species, feeding practices, and processing technologies on the molecular composition of dairy products, successfully constructing specific molecular fingerprints for different milk origins.
In terms of safety assessment, adulteration detection methods based on species-specific peptide markers have been able to accurately identify adulteration at extremely low levels. The coupling of metagenomics and metabolomics effectively supports risk warning of spoilage microorganisms and shelf-life prediction.
In terms of health functions, multi-omics technologies have systematically elucidated the targets and signaling pathways of bioactive peptides, clarified the prebiotic pathways of functional oligosaccharides in regulating gut microbiota and producing short-chain fatty acids, and revealed the systemic effects of gut microbiota metabolites as signaling molecules in regulating host immunity and metabolism.
Despite existing research advances, five critical knowledge gaps remain (as detailed in Section 5.5): (1) a lack of publicly available reference datasets with raw LC-MS/MS data from interlaboratory studies; (2) unvalidated false discovery rates for species-specific peptide markers across different geographical regions, heat treatments, and blended dairy products; (3) the absence of randomized controlled trials establishing causal links between multi-omics-identified bioactive peptides and clinical outcomes such as blood pressure reduction or glycemic control; (4) no cost-effectiveness comparison between multi-omics-based adulteration screening and conventional industrial methods (ELISA, qPCR); and (5) a lack of officially recognized regulatory thresholds for machine learning classification algorithms used in dairy authenticity verification.
Future efforts must address economic viability by developing low-cost, rapid assays derived from validated omics biomarkers, making routine application feasible even for small dairies. Future efforts also should prioritize standardized workflows, cross-laboratory validation, and explainable AI-driven biomarker screening, alongside low-cost solutions suitable for small and medium-sized dairies.

Author Contributions

Writing—original draft preparation, M.X. and B.M.; writing—review and editing, B.M. and K.Z.; project administration, W.T., C.L. and P.H.; funding acquisition, B.M. and M.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Key Research and Development Program of China (2023YFF1104704) and the Zhejiang Provincial Department of Agriculture and Rural Affairs Project (2025SNJF021).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

No new data were created or analyzed in this study. Data sharing is not applicable to this article.

Acknowledgments

We thank the researchers who facilitated the completion of this study, and we also acknowledge the open-access website BioGDP (https://biogdp.com/zh, accessed on 26 April 2026) for providing some visual elements used in Figure 1, Figure 2 and Figure 5, for which we have obtained proper publication licenses.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
ACEAngiotensin-Converting Enzyme
CATCatalase
DIAData-Independent Acquisition
ELISAEnzyme-Linked Immunosorbent Assay
FOSFructo-Oligosaccharides
Foxp3Forkhead Box P3
GOSGalacto-Oligosaccharides
GPRG Protein-Coupled Receptor
GSH-PxGlutathione Peroxidase
HDACHistone Deacetylase
HMOsHuman Milk Oligosaccharides
IgEImmunoglobulin E
LC-MS/MSLiquid Chromatography–Tandem Mass Spectrometry
LODLimit of Detection
MDAMalondialdehyde
NMRNuclear Magnetic Resonance
NSLABNon-Starter Lactic Acid Bacteria

References

  1. Karoui, R.; Bouaicha, I. A review on nutritional quality of animal and plant-based milk alternatives: A focus on protein. Front. Nutr. 2024, 11, 1378556. [Google Scholar] [CrossRef] [PubMed]
  2. Assen, K.A.; Abegaz, M.A. Review on quality attributes of milk and commonly produced dairy products in Ethiopia. Heliyon 2024, 10, e40089. [Google Scholar] [CrossRef] [PubMed]
  3. Sarda, B.; Kesse-Guyot, E.; Srour, B.; Deschasaux-Tanguy, M.; Fialon, M.; Fezeu, L.K.; Galan, P.; Hercberg, S.; Touvier, M.; Julia, C. How far can reformulation participate in improving the nutritional quality of diets at population level? A modelling study using real food market data in France. BMJ Glob. Health 2024, 9, e014162. [Google Scholar] [CrossRef] [PubMed]
  4. Bednarski, M.; Kupczyński, R. Factors Affecting Milk Productivity, Milk Quality and Dairy Cow Health. Animals 2024, 14, 3707. [Google Scholar] [CrossRef] [PubMed]
  5. Nagraik, R.; Sharma, A.; Kumar, D.; Chawla, P.; Kumar, A.P. Milk adulterant detection: Conventional and biosensor based approaches: A review. Sens. Bio-Sens. Res. 2021, 33, 100433. [Google Scholar] [CrossRef]
  6. Sarkar, D.; Naha, S. Advancement in species specific adulteration identification in camel milk. J. Food Compos. Anal. 2024, 130, 106168. [Google Scholar] [CrossRef]
  7. Veenstra, T.D. Systems biology and multi-omics. Proteomics 2021, 21, e2000306. [Google Scholar] [CrossRef] [PubMed]
  8. Baysoy, A.; Bai, Z.; Satija, R.; Fan, R. The technological landscape and applications of single-cell multi-omics. Nat. Rev. Mol. Cell Biol. 2023, 24, 695–713. [Google Scholar] [CrossRef] [PubMed]
  9. Hayes, C.N.; Nakahara, H.; Ono, A.; Tsuge, M.; Oka, S. From Omics to Multi-Omics: A Review of Advantages and Tradeoffs. Genes 2024, 15, 1551. [Google Scholar] [CrossRef] [PubMed]
  10. Liu, X.; Peng, T.; Xu, M.; Lin, S.; Hu, B.; Chu, T.; Liu, B.; Xu, Y.; Ding, W.; Li, L. Spatial multi-omics: Deciphering technological landscape of integration of multi-omics and its applications. J. Hematol. Oncol. 2024, 17, 72. [Google Scholar] [CrossRef] [PubMed]
  11. Mohr, A.E.; Ortega-Santos, C.P.; Whisner, C.M.; Klein-Seetharaman, J.; Jasbi, P. Navigating Challenges and Opportunities in Multi-Omics Integration for Personalized Healthcare. Biomedicines 2024, 12, 1496. [Google Scholar] [CrossRef] [PubMed]
  12. Hui-Zeng, S.; Mi, Z.; Ou, W.; Yanhong, C.; Jian-Xin, L.; Luo, G.L. Multi-omics reveals functional genomic and metabolic mechanisms of milk production and quality in dairy cows. Bioinformatics 2019, 8, 2530–2537. [Google Scholar] [CrossRef] [PubMed]
  13. Sattin, E.; Andreani, N.A.; Carraro, L.; Lucchini, R.; Fasolato, L.; Telatin, A.; Balzan, S.; Novelli, E.; Simionati, B.; Cardazzo, B. A multi-omics approach to evaluate the quality of milk whey used in ricotta cheese production. Front. Microbiol. 2016, 7, 1272. [Google Scholar] [CrossRef] [PubMed]
  14. Ren, A.H.; Diamandis, E.P.; Kulasingam, V. Uncovering the Depths of the Human Proteome: Antibody-based Technologies for Ultrasensitive Multiplexed Protein Detection and Quantification. Mol. Cell. Proteom. 2021, 20, 100155. [Google Scholar] [CrossRef] [PubMed]
  15. Becchi, P.P.; Rocchetti, G.; Lucini, L. Advancing dairy science through integrated analytical approaches based on multi-omics and machine learning. Curr. Opin. Food Sci. 2025, 63, 101289. [Google Scholar] [CrossRef]
  16. Su, G.; Yu, C.; Liang, S.; Wang, W.; Wang, H. Multi-omics in food safety and authenticity in terms of food components. Food Chem. 2024, 437, 137943. [Google Scholar] [CrossRef] [PubMed]
  17. Qin, C.; Liu, L.; Wang, Y.; Leng, T.; Zhu, M.; Gan, B.; Xie, J.; Yu, Q.; Chen, Y. Advancement of omics techniques for chemical profile analysis and authentication of milk. Trends Food Sci. Technol. 2022, 127, 114–128. [Google Scholar] [CrossRef]
  18. Hu, S.; Liu, C.; Liu, X. Innovative Application of Metabolomics on Bioactive Ingredients of Foods. Foods 2022, 11, 2974. [Google Scholar] [CrossRef] [PubMed]
  19. Feng, X.; Ma, R.; Wang, Y.; Tong, L.; Wen, W.; Mu, T.; Tian, J.; Yu, B.; Gu, Y.; Zhang, J. Non-targeted metabolomics identifies biomarkers in milk with high and low milk fat percentage. Food Res. Int. 2024, 179, 113989. [Google Scholar] [CrossRef] [PubMed]
  20. Tigga, A.; Mallappa, R.H.; Muniyappa, S.K.; Kadyan, S.; Pradhan, D.; Niharika, E.S.; Grover, S. 16S metagenomics and metabolomics unveil the microbial compositions and metabolite profiles in Dahi, a traditional Indian fermented milk product prepared by the backslopping method. J. Food Sci. Technol. 2024, 62, 584–597. [Google Scholar] [CrossRef] [PubMed]
  21. Martínez, A.; Abanto, M.; Días, N.B.; Olate, P.; Pérez Nuñez, I.; Díaz, R.; Sepúlveda, N.; Paz, E.A.; Quiñones, J. Recent Trends in Food Quality and Authentication: The Role of Omics Technologies in Dairy and Meat Production. Int. J. Mol. Sci. 2025, 26, 4405. [Google Scholar] [CrossRef] [PubMed]
  22. Ma, Q.; Chai, Y.; Huang, A. Untargeted metabolomics revealed the metabolites profile and conjugated linoleic acid formation mechanisms of fermented milk produced by novel Limosilactobacillus fermentum L1. LWT 2025, 237, 118686. [Google Scholar] [CrossRef]
  23. Zhao, Y.; Yu, S.; Zhao, H.; Li, L.; Li, Y.; Tu, Y.; Jiang, L.; Zhao, G. Lipidomic profiling using GC and LC-MS/MS revealed the improved milk quality and lipid composition in dairy cows supplemented with citrus peel extract. Food Res. Int. 2022, 161, 111767. [Google Scholar] [CrossRef] [PubMed]
  24. Daddam, J.R.; Hammon, H.M.; Tröscher, A.; Vogel, L.; Gnott, M.; Kra, G.; Levin, Y.; Sauerwein, H.; Zachut, M. Phosphoproteomic Analysis of Subcutaneous and Omental Adipose Tissue Reveals Increased Lipid Turnover in Dairy Cows Supplemented with Conjugated Linoleic Acid. Int. J. Mol. Sci. 2021, 22, 3227. [Google Scholar] [CrossRef] [PubMed]
  25. Zhang, Y.; Yan, H.; Xu, X.; Xiao, X.; Yuan, Y.; Guo, N.; Zhang, T.; Li, M.; Zhu, L.; Zhao, C.; et al. Metagenomic/Metaproteomic Investigation of the Microbiota in Dongbei Suaicai, a Traditional Fermented Chinese Cabbage. Fermentation 2024, 10, 185. [Google Scholar] [CrossRef]
  26. Zhu, D.; Kebede, B.; McComb, K.; Hayman, A.; Chen, G.; Frew, R. Milk biomarkers in relation to inherent and external factors based on metabolomics. Trends Food Sci. Technol. 2021, 109, 51–64. [Google Scholar] [CrossRef]
  27. Li, Q.; Wang, X.; Zhang, Q.; Wu, Y.; Chen, R.; Sun, Y.; Pan, Y.; Li, S.; Wang, Z. Multi-omics analysis of five species of milk and specific composition links within each species. Food Chem. 2024, 457, 140028. [Google Scholar] [CrossRef] [PubMed]
  28. Rocchetti, G.; O’Callaghan, T.F. Application of metabolomics to assess milk quality and traceability. Curr. Opin. Food Sci. 2021, 40, 168–178. [Google Scholar] [CrossRef]
  29. Agregán, R.; Echegaray, N.; Nawaz, A.; Hano, C.; Gohari, G.; Pateiro, M.; Lorenzo, J.M. Foodomic-Based Approach for the Control and Quality Improvement of Dairy Products. Metabolites 2021, 11, 818. [Google Scholar] [CrossRef] [PubMed]
  30. Li, Y.; Jia, H.; Zheng, Y.; Sun, X.; Zhang, Y.; Liu, W.; Zhang, W.; Jiang, Y.; Zhao, Q.; Guo, L. The Nutritional Composition and Multi-Omics Analysis of Raw Milk From Normande and Holstein Cows. Food Front. 2025, 6, 3042–3058. [Google Scholar] [CrossRef]
  31. Zhou, M.; Li, T.; Shen, L.; Zhong, Q.; Huang, T.; Zhou, T. Enhanced authentication of organic milk using MALDI-TOF MS with combined lipid-peptide fingerprinting and machine learning integration. Food Chem. 2025, 475, 143346. [Google Scholar] [CrossRef] [PubMed]
  32. Karamoutsios, A.; Lekka, P.; Voidarou, C.C.; Dasenaki, M.; Thomaidis, N.S.; Skoufos, I.; Tzora, A. Assessing Milk Authenticity Using Protein and Peptide Biomarkers: A Decade of Progress in Species Differentiation and Fraud Detection. Foods 2025, 14, 2588. [Google Scholar] [CrossRef] [PubMed]
  33. Yang, Y.; Bu, D.; Zhao, X.; Sun, P.; Wang, J.; Zhou, L. Proteomic Analysis of Cow, Yak, Buffalo, Goat and Camel Milk Whey Proteins: Quantitative Differential Expression Patterns. J. Proteome Res. 2013, 12, 1660–1667. [Google Scholar] [CrossRef] [PubMed]
  34. Chen, L.; Hong, T.; Li, Z.; Shen, G.; Gu, Y.; Han, J. A comparison of milk fat globule membranes and whey proteomes: New insight into variation nutrient differences between Buffalo, Cow, Goat, and Yak. Food Chem. 2023, 429, 136845. [Google Scholar] [CrossRef] [PubMed]
  35. Ma, Y.; Zhang, L.; Wu, Y.; Zhou, P. Changes in milk fat globule membrane proteome after pasteurization in human, bovine and caprine species. Food Chem. 2019, 279, 209–215. [Google Scholar] [CrossRef] [PubMed]
  36. Yang, Y.; Zheng, N.; Zhao, X.; Zhang, Y.; Han, R.; Ma, L.; Zhao, S.; Li, S.; Guo, T.; Wang, J. Proteomic characterization and comparison of mammalian milk fat globule proteomes by iTRAQ analysis. J. Proteom. 2015, 116, 34–43. [Google Scholar] [CrossRef] [PubMed]
  37. Luo, J.; Huang, Z.; Liu, H.; Zhang, Y.; Ren, F. Yak milk fat globules from the Qinghai-Tibetan Plateau: Membrane lipid composition and morphological properties. Food Chem. 2018, 245, 731–737. [Google Scholar] [CrossRef] [PubMed]
  38. Li, S.; Yang, Y.; Chen, C.; Li, L.; Valencak, T.G.; Ren, D. Differences in milk fat globule membrane proteins among Murrah, Nili-Ravi and Mediterranean buffaloes revealed by a TMT proteomic approach. Food Res. Int. 2021, 139, 109847. [Google Scholar] [CrossRef] [PubMed]
  39. Chen, L.; Li, X.; Li, Z.; Deng, L. Analysis of 17 elements in cow, goat, buffalo, yak, and camel milk by inductively coupled plasma mass spectrometry (ICP-MS). RSC Adv. 2020, 10, 6736–6742. [Google Scholar] [CrossRef] [PubMed]
  40. Yang, X.; Meng, L.; Rahman, A.; Wang, J.; Zheng, N. Comparison of fatty acid characteristics in human milk, bovine milk, and infant formula for better emulation. Curr. Opin. Food Sci. 2025, 61, 101240. [Google Scholar] [CrossRef]
  41. McWhorter, T.M.; Sargolzaei, M.; Sattler, C.G.; Utt, M.D.; Tsuruta, S.; Misztal, I.; Lourenco, D. Single-step genomic predictions for heat tolerance of production yields in US Holsteins and Jerseys. J. Dairy Sci. 2023, 106, 7861–7879. [Google Scholar] [CrossRef] [PubMed]
  42. Tadesse, A.; Gebremichael, D.; Hailay, B.; Hailemariam, F.; Hadgu, H.; Kalayu, G. The effect of season and agro-ecology on physicochemical properties of cow’s raw milk in Central and North-Western Zone of Tigray, Ethiopia. Heliyon 2024, 10, e39050. [Google Scholar] [CrossRef] [PubMed]
  43. Zhao, Y.; Wang, F.; Wang, Y.; Tan, J.; Niu, H.; Guo, G.; Fang, L.; Jiang, L. Heat stress compromises nutritional quality and flavor of bovine raw milk: Evidence from multi-omics analyses. Food Chem. X 2025, 32, 103361. [Google Scholar] [CrossRef] [PubMed]
  44. Fleming, A.; Wescombe, P.; Gregorini, P. Connecting Grazing, Forage, and Milk Metabolomes to Enhance Consumer Health. Dairy 2025, 6, 33. [Google Scholar] [CrossRef]
  45. Sun, H.; Wang, B.; Wang, J.; Liu, H.; Liu, J. Biomarker and pathway analyses of urine metabolomics in dairy cows when corn stover replaces alfalfa hay. J. Anim. Sci. Biotechnol. 2016, 7, 49. [Google Scholar] [CrossRef] [PubMed]
  46. Xue, M.-Y.; Sun, H.-Z.; Wu, X.-H.; Liu, J.-X.; Guan, L.L. Multi-omics reveals that the rumen microbiome and its metabolome together with the host metabolome contribute to individualized dairy cow performance. Microbiome 2020, 8, 64. [Google Scholar] [CrossRef] [PubMed]
  47. Zhang, C.; Wang, M.; Liu, H.; Jiang, X.; Chen, X.; Liu, T.; Yin, Q.; Wang, Y.; Deng, L.; Yao, J.; et al. Multi-omics reveals that the host-microbiome metabolism crosstalk of differential rumen bacterial enterotypes can regulate the milk protein synthesis of dairy cows. J. Anim. Sci. Biotechnol. 2023, 14, 63. [Google Scholar] [CrossRef] [PubMed]
  48. Wang, B.; Sun, Z.; Tu, Y.; Si, B.; Liu, Y.; Yang, L.; Luo, H.; Yu, Z. Untargeted metabolomic investigate milk and ruminal fluid of Holstein cows supplemented with Perilla frutescens leaf. Food Res. Int. 2021, 140, 110017. [Google Scholar] [CrossRef] [PubMed]
  49. Shen, J.S.; Song, L.J.; Sun, H.Z.; Wang, B.; Chai, Z.; Chacher, B.; Liu, J.X. Effects of corn and soybean meal types on rumen fermentation, nitrogen metabolism and productivity in dairy cows. Asian-Australas. J. Anim. Sci. 2015, 28, 351–359. [Google Scholar] [CrossRef] [PubMed]
  50. Zhu, D.; Kebede, B.; Chen, G.; McComb, K.; Frew, R. Impact of freeze-drying and subsequent storage on milk metabolites based on 1H NMR and UHPLC-QToF/MS. Food Control 2020, 116, 107017. [Google Scholar] [CrossRef]
  51. Kovalenko, E.; Sheludchenko, M.; Shoshina, O.; Odintsova, V.; Koshechkin, S.; Volokh, O. Peptidome and Metabolome profiling of the fermented milk products revealed accumulation of bioactive compounds during two weeks of cold storage. bioRxiv 2025. [Google Scholar] [CrossRef]
  52. Chen, M.; Xiao, R.; Du, S.; Rahman, A.; Zheng, N.; Zhang, Y.; Niu, T.; Wang, J. Proteomic, metabolomic and lipidomic profiles provide insights on milk quality differences during heat treatment. Food Chem. 2025, 496, 146695. [Google Scholar] [CrossRef] [PubMed]
  53. Sun, Y.; Guo, S.; Wu, T.; Zhang, J.; Kwok, L.-Y.; Sun, Z.; Zhang, H.; Wang, J. Untargeted mass spectrometry-based metabolomics approach unveils biochemical changes in compound probiotic fermented milk during fermentation. npj Sci. Food 2023, 7, 21. [Google Scholar] [CrossRef] [PubMed]
  54. Álvarez, J.R.; Suárez, P.R.; Álvarez-Buylla, J.R.; Vázquez, L.; González-Iglesias, H.; Pérez, B.M.; Flórez, A.B. Differential Contribution of Three Lactic Acid Bacteria Strains from a Microbial Consortium to Milk Fermentation by Untargeted Metabolomics and Analysis of Volatile Compounds; Zenodo: Geneva, Switzerland, 2025. [Google Scholar] [CrossRef]
  55. Gummalla, S.; Broadbent, J.R. Tyrosine and Phenylalanine Catabolism by Lactobacillus Cheese Flavor Adjuncts1. J. Dairy Sci. 2001, 84, 1011–1019. [Google Scholar] [CrossRef] [PubMed]
  56. Fröhlich, S.M.; Jünger, M.; Mittermeier-Kleßinger, V.K.; Dawid, C.; Hofmann, T.F.; Somoza, V.; Dunkel, A. Towards prediction of maturation-dependent kokumi taste in cheese by comprehensive high throughput quantitation of glutamyl dipeptides. Food Chem. 2025, 463, 141130. [Google Scholar] [CrossRef] [PubMed]
  57. Gao, Y.; Bian, Y.; Zhuang, J.; Qi, J.; Chen, Y.; Shi, Y.; Ju, N. Targeted lipidomic analysis of the changes in lipid metabolism of raw milk during refrigeration. Trans. Chin. Soc. Agric. Eng. 2023, 39, 306–315. [Google Scholar] [CrossRef]
  58. Gao, Y.; Zhuang, J.; Chen, Y.; Qi, J.; Gu, X.; He, X.; Liu, Y.; Zhang, Y.; Guo, S.; Ju, N. Contribution of microorganisms and proteins to quality changes in refrigerated raw milk determined using macrogenomics and proteomics. Food Biosci. 2025, 69, 106904. [Google Scholar] [CrossRef]
  59. Rocchetti, G.; Gallo, A. F3-metabolomics: Integrating feed, fluid, and food metabolomics in dairy production. Anim. Feed. Sci. Technol. 2026, 334, 116653. [Google Scholar] [CrossRef]
  60. Zhang, Q.; Li, Q.; Wang, X.; Wu, Y.; Chen, R.; Sun, Y.; Pan, Y.; Li, S.; Wang, Z. Comprehensive analysis through multi-omics integration to compare and elucidate the specific substances in milk of donkey, horse, camel, human and pig. Food Chem. X 2025, 28, 102592. [Google Scholar] [CrossRef] [PubMed]
  61. Joshi, A.; Bhardwaj, D.; Kaushik, A.; Juneja, V.K.; Taneja, P.; Thakur, S.; Taneja, N.K. Advances in multi-omics based quantitative microbial risk assessment in the dairy sector: A semi-systematic review. Food Res. Int. 2022, 156, 111323. [Google Scholar] [CrossRef] [PubMed]
  62. Xie, Y.; Nie, X.; Wang, C.; Xu, X.; Zhang, F. A metabolomics approach for monitoring thermal processing temperature of bovine milk using ultra-performance liquid chromatography tandem Q-Exactive mass spectrometry and multivariate data analysis. J. Future Foods 2024, 4, 83–90. [Google Scholar] [CrossRef]
  63. Ji, Z.; Zhang, J.; Deng, C.; Guo, T.; Han, R.; Yang, Y.; Zang, C.; Chen, Y. Identification of pasteurized mare milk and powder adulteration with bovine milk using quantitative proteomics and metabolomics approaches. Food Chem. X 2024, 22, 101265. [Google Scholar] [CrossRef] [PubMed]
  64. Ji, Z.; Zhang, J.; Deng, C.; Hu, Z.; Du, Q.; Guo, T.; Wang, J.; Fan, R.; Han, R.; Yang, Y. Identification of mare milk adulteration with cow milk by liquid chromatography-high resolution mass spectrometry based on proteomics and metabolomics approaches. Food Chem. 2023, 405, 134901. [Google Scholar] [CrossRef] [PubMed]
  65. Couvillion, S.P.; Mostoller, K.E.; Williams, J.E.; Pace, R.M.; Stohel, I.L.; Peterson, H.K.; Nicora, C.D.; Nakayasu, E.S.; Webb-Robertson, B.-J.M.; McGuire, M.A.; et al. Interrogating the role of the milk microbiome in mastitis in the multi-omics era. Front. Microbiol. 2023, 14, 1105675. [Google Scholar] [CrossRef] [PubMed]
  66. Wang, T.; Wei, G.; Chen, F.; Ma, Q.; Huang, A. Integrated metabolomics and peptidomics to delineate characteristic metabolites in milk fermented with novel Lactiplantibacillus plantarum L3. Food Chem. X 2023, 18, 100732. [Google Scholar] [CrossRef] [PubMed]
  67. Ning, J.; Li, M.; Chen, W.; Zhao, H.; Chen, J.; Yang, M.; Cao, X.; Yue, X. Peptidomics as a tool to analyze endogenous peptides in milk and milk-related peptides. Food Biosci. 2022, 50, 102199. [Google Scholar] [CrossRef]
  68. Martini, S.; Solieri, L.; Tagliazucchi, D. Peptidomics: New trends in food science. Curr. Opin. Food Sci. 2021, 39, 51–59. [Google Scholar] [CrossRef]
  69. Suh, J.H. Critical review: Metabolomics in dairy science—Evaluation of milk and milk product quality. Food Res. Int. 2022, 154, 110984. [Google Scholar] [CrossRef] [PubMed]
  70. Westermann, A.J.; Vogel, J. Cross-species RNA-seq for deciphering host–microbe interactions. Nat. Rev. Genet. 2021, 22, 361–378. [Google Scholar] [CrossRef] [PubMed]
  71. Agregán, R.; Echegaray, N.; López-Pedrouso, M.; Kharabsheh, R.; Franco, D.; Lorenzo, J.M. Proteomic Advances in Milk and Dairy Products. Molecules 2021, 26, 3832. [Google Scholar] [CrossRef] [PubMed]
  72. Xiao, L.; Li, Z.; Dou, X.; Long, Y.; Li, Z.; Liu, Y.; Dong, Q. Advancing microbial risk assessment: Perspectives from the evolution of detection technologies. npj Sci. Food 2025, 9, 157. [Google Scholar] [CrossRef] [PubMed]
  73. Galli, B.D. Sustainability implications and relevance of using omics sciences to investigate cheeses with protected designation of origin. J. Sci. Food Agric. 2024, 104, 6388–6396. [Google Scholar] [CrossRef] [PubMed]
  74. Cabrera, D.; Fraser, K.; Roy, N.C. A metabolomics analysis of interspecies and seasonal trends in ruminant milk: The molecular difference between bovine, caprine, and ovine milk. J. Dairy Sci. 2024, 107, 6511–6527. [Google Scholar] [CrossRef] [PubMed]
  75. Karamoutsios, A.; Oikonomou, E.D.; Voidarou, C.; Hatzizisis, L.; Fotou, K.; Nikolaou, K.; Gouva, E.; Gkiza, E.; Giannakeas, N.; Skoufos, I.; et al. Innovations in Proteomic Technologies and Artificial Neural Networks: Unlocking Milk Origin Identification. BioTech 2025, 14, 33. [Google Scholar] [CrossRef] [PubMed]
  76. Wang, H.-L.; Chen, X.; Peng, X.; Gao, Y.; Wang, J.; Li, W.; Tan, C.; Liu, Y.f.; Zou, W.; Xu, Y.-J. Lipidomics revealed the geographical origin and breed variability of sheep milk. Food Biosci. 2025, 72, 107536. [Google Scholar] [CrossRef]
  77. Liu, H.; Guo, X.; Zhao, Q.; Qin, Y.; Zhang, J. Lipidomics analysis for identifying the geographical origin and lactation stage of goat milk. Food Chem. 2020, 309, 125765. [Google Scholar] [CrossRef] [PubMed]
  78. Sassi, M.; Arena, S.; Scaloni, A. MALDI-TOF-MS Platform for Integrated Proteomic and Peptidomic Profiling of Milk Samples Allows Rapid Detection of Food Adulterations. J. Agric. Food Chem. 2015, 63, 6157–6171. [Google Scholar] [CrossRef] [PubMed]
  79. Noce, A.; Pazzola, M.; Dettori, M.L.; Amills, M.; Castelló, A.; Cecchinato, A.; Bittante, G.; Vacca, G.M. Variations at regulatory regions of the milk protein genes are associated with milk traits and coagulation properties in the Sarda sheep. Anim. Genet. 2016, 47, 717–726. [Google Scholar] [CrossRef] [PubMed]
  80. Papadimitriou, K.; Anastasiou, R.; Georgalaki, M.; Bounenni, R.; Paximadaki, A.; Charmpi, C.; Alexandraki, V.; Kazou, M.; Tsakalidou, E. Comparison of the Microbiome of Artisanal Homemade and Industrial Feta Cheese through Amplicon Sequencing and Shotgun Metagenomics. Microorganisms 2022, 10, 1073. [Google Scholar] [CrossRef] [PubMed]
  81. Salazar, J.K.; Carstens, C.K.; Ramachandran, P.; Shazer, A.G.; Narula, S.S.; Reed, E.; Ottesen, A.; Schill, K.M. Metagenomics of pasteurized and unpasteurized gouda cheese using targeted 16S rDNA sequencing. BMC Microbiol. 2018, 18, 189. [Google Scholar] [CrossRef] [PubMed]
  82. Simões, P.H.C.; Martins, A.M.B.; Oliveira, M.F.; Siqueira, J.C.; Luna, A.S.; Pinto, L. Coupled models for food fraud evaluation of unripe banana flour: Enhanced prediction with XGBoost and variable importance assessed by SHAP. Food Control 2026, 183, 111936. [Google Scholar] [CrossRef]
  83. Rojas-Gómez, P.; Pariyani, R.; McCarthy, N.A.; Bateman, L.M.; O’Mahony, J.A.; O’Callaghan, T.F. The milk metabolome and its potential utilisation for enhanced dairy processing. Int. J. Dairy Technol. 2025, 78, e70058. [Google Scholar] [CrossRef]
  84. Mhapsekar, R.; Kilbane, D.; Davy, S.; Abraham, L.; Fenelon, M.; O’Shea, N. A systematic review of the internet of things and artificial intelligence applications in milk quality monitoring and analysis. Int. J. Dairy Technol. 2025, 78, e70049. [Google Scholar] [CrossRef]
  85. Ke, T.; Zhang, L.; Ge, X.; Lv, H.; Li, M. A Robust Least Squares Support Vector Machine Based on L∞-norm. Neural Process. Lett. 2020, 52, 2371–2397. [Google Scholar] [CrossRef]
  86. Fouodo, C.J.; Kronziel, L.L.; König, I.R.; Szymczak, S. Effect of hyperparameters on variable selection in random forests. arXiv 2023, arXiv:2309.06943. [Google Scholar] [CrossRef]
  87. Bentéjac, C.; Csörgő, A.; Martínez-Muñoz, G. A comparative analysis of gradient boosting algorithms. Artif. Intell. Rev. 2021, 54, 1937–1967. [Google Scholar] [CrossRef]
  88. Đurasević, M.; Gil-Gala, F.J.; Đumić, M. Application of electoral systems for decision making in ensembles of heuristics generated by genetic programming. Appl. Soft Comput. 2025, 186, 114050. [Google Scholar] [CrossRef]
  89. Banerjee, R.; Maheswarappa, N.B.; Biswas, S.; Dasoju, S.; Mohan, K.; Ramanathan, R. Peptidomics approaches for meat authentication employing rapid in-solution GELFrEE fractionation and MALDI-TOF MS. J. Food Compos. Anal. 2023, 121, 105349. [Google Scholar] [CrossRef]
  90. Imperiale, S.; Morozova, K.; Ferrentino, G.; Scampicchio, M. Analysis of milk with liquid chromatography–mass spectrometry: A review. Eur. Food Res. Technol. 2023, 249, 861–902. [Google Scholar] [CrossRef]
  91. Hendrickx, D.M.; Savova, M.V.; Zhu, P.; An, R.; Boeren, S.; Klomp, K.; Mutte, S.K.; PRESTO study team; Wopereis, H.; van der Molen, R.G.; et al. A multi-omics machine learning classifier for outgrowth of cow’s milk allergy in children. Mol. Omics 2025, 21, 343–352. [Google Scholar] [CrossRef] [PubMed]
  92. Xu, D.; Huang, H.; Yang, Y.; Ying, Y.; Zong, L.; Liu, Z.; Wang, Y.; Hu, W.; Liu, X.; Yang, J. Quantitative detection of hidden milk allergens in hypoallergenic formulas through bottom-up proteomics, high-resolution mass spectrometry and the mechanism of D-ribose in reducing the allergenicity in vitro. LWT 2025, 217, 117446. [Google Scholar] [CrossRef]
  93. Xu, D.K.; Yang, Y.; Huang, H.L.; Sun, X.J.; Hu, W.Y.; Liu, X.M.; Yang, J. Quantitation of cow’s milk allergen in hypoallergenic infant formulas using ultra-high performance liquid chromatography- tandem mass spectrometry. Se Pu 2025, 43, 1244–1251. [Google Scholar] [CrossRef] [PubMed]
  94. Shen, C.; Li, L.; Long, C.; Luo, Y.; Dong, Y.; Yu, J.; Yao, Y.; Hu, G.; Li, X.; Chen, H. Highly sensitive and wide-range quantification of bovine αS1-casein in foods for predicting potential milk allergenicity using an IgE multi-epitope-specific antibody. Food Chem. 2025, 495, 146394. [Google Scholar] [CrossRef] [PubMed]
  95. Fernández-Lozano, C.; Olmos-Piñero, S.; Sánchez-Ruano, L.; Terrados, S.; Diéguez, M.d.C.; Fernández-Rivas, M.; Vlaicu, C.; Cerecedo, I.; Gonzalo-Fernandez, A.; de la Hoz, B.; et al. Predicting Tolerance to Cow’s Milk Allergy in Children Using IgE and IgG4 Peptide Binding Profiles. Cells 2025, 14, 344. [Google Scholar] [CrossRef] [PubMed]
  96. Abhyankar, W.; Stelder, S.; de Koning, L.; de Koster, C.; Brul, S. ‘Omics’ for microbial food stability: Proteomics for the development of predictive models for bacterial spore stress survival and outgrowth. Int. J. Food Microbiol. 2017, 240, 11–18. [Google Scholar] [CrossRef] [PubMed]
  97. Hao, L.; Wang, S.; Ji, W. Cow’s milk protein allergy: A comprehensive review of epidemiology, pathogenesis, clinical manifestations, diagnostics, and management strategies. Asia Pac. J. Clin. Nutr. 2025, 34, 298. [Google Scholar] [CrossRef] [PubMed]
  98. Lajnaf, R.; Ayadi, M.A.; Attia, H.; Feki, S.; Masmoudi, H. Characteristics of Cow Milk Proteins and the Effect of Processing on Their Allergenicity. In Milk Protein—New Research Approaches; Chaiyabutr, N., Ed.; IntechOpen: London, UK, 2022. [Google Scholar]
  99. Haddad, N.; Johnson, N.; Kathariou, S.; Métris, A.; Phister, T.; Pielaat, A.; Tassou, C.; Wells-Bennik, M.H.J.; Zwietering, M.H. Next generation microbiological risk assessment—Potential of omics data for hazard characterisation. Int. J. Food Microbiol. 2018, 287, 28–39. [Google Scholar] [CrossRef] [PubMed]
  100. Sanches, P.H.G.; de Melo, N.C.; Porcari, A.M.; de Carvalho, L.M. Integrating Molecular Perspectives: Strategies for Comprehensive Multi-Omics Integrative Data Analysis and Machine Learning Applications in Transcriptomics, Proteomics, and Metabolomics. Biology 2024, 13, 848. [Google Scholar] [CrossRef] [PubMed]
  101. Kaulich, P.T.; Jeong, K.; Kohlbacher, O.; Tholey, A. Influence of different sample preparation approaches on proteoform identification by top-down proteomics. Nat. Methods 2024, 21, 2397–2407. [Google Scholar] [CrossRef] [PubMed]
  102. He, X.; Yang, F.; Qu, G.; Zhang, H.; Yi, M.; Wang, X.; Sun, S. Integrated microbial and proteomic analysis elucidates quality degradation mechanisms of fresh milk through the industrial processing stage. Food Chem. X 2025, 31, 103062. [Google Scholar] [CrossRef] [PubMed]
  103. Zhuang, J.; Hou, Y.; Wang, Y.; Gao, Y.; Chen, Y.; Qi, J.; Li, P.; Bian, Y.; Ju, N. Relationship between microorganisms and milk metabolites during quality changes in refrigerated raw milk: A metagenomic and metabolomic exploration. Int. J. Food Microbiol. 2024, 425, 110891. [Google Scholar] [CrossRef] [PubMed]
  104. Zhang, Y.; Yu, P.; Tao, F. Dynamic Interplay between Microbiota Shifts and Differential Metabolites during Dairy Processing and Storage. Molecules 2024, 29, 2745. [Google Scholar] [CrossRef] [PubMed]
  105. Panchaud, A.; Affolter, M.; Kussmann, M. Mass spectrometry for nutritional peptidomics: How to analyze food bioactives and their health effects. J. Proteom. 2012, 75, 3546–3559. [Google Scholar] [CrossRef] [PubMed]
  106. Bhattacharya, M.; Salcedo, J.; Robinson, R.C.; Henrick, B.M.; Barile, D. Peptidomic and glycomic profiling of commercial dairy products: Identification, quantification and potential bioactivities. npj Sci. Food 2019, 3, 4. [Google Scholar] [CrossRef] [PubMed]
  107. Akbarian, M.; Khani, A.; Eghbalpour, S.; Uversky, V.N. Bioactive Peptides: Synthesis, Sources, Applications, and Proposed Mechanisms of Action. Int. J. Mol. Sci. 2022, 23, 1445. [Google Scholar] [CrossRef] [PubMed]
  108. Huang, C.; Cheng, L. Unlocking Casein Bioactivity: Lactic Acid Bacteria and Molecular Strategies for Peptide Release. Int. J. Mol. Sci. 2025, 26, 8119. [Google Scholar] [CrossRef] [PubMed]
  109. De Fazio, R.; Di Francesco, A.; Di Ciccio, P.A.; Cunsolo, V.; Britti, D.; Lomagistro, C.; Roncada, P.; Piras, C. Detection of Bioactive Peptides’ Signature in Podolica Cow’s Milk. Foods 2025, 14, 877. [Google Scholar] [CrossRef] [PubMed]
  110. Yamada, A.; Sakurai, T.; Ochi, D.; Mitsuyama, E.; Yamauchi, K.; Abe, F. Antihypertensive effect of the bovine casein-derived peptide Met-Lys-Pro. Food Chem. 2015, 172, 441–446. [Google Scholar] [CrossRef] [PubMed]
  111. Le, T.H. Next-generation foodomics: Integrated multi-omics approaches for structural and functional profiling of plant-based proteins and polyphenols. J. Food Compos. Anal. 2025, 148, 108634. [Google Scholar] [CrossRef]
  112. Minkiewicz, P.; Iwaniak, A.; Darewicz, M. BIOPEP-UWM Database of Bioactive Peptides: Current Opportunities. Int. J. Mol. Sci. 2019, 20, 5978. [Google Scholar] [CrossRef] [PubMed]
  113. Minkiewicz, P.; Iwaniak, A.; Darewicz, M. BIOPEP-UWM Virtual—A Novel Database of Food-Derived Peptides with In Silico-Predicted Biological Activity. Appl. Sci. 2022, 12, 7204. [Google Scholar] [CrossRef]
  114. Brodkorb, A.; Egger, L.; Alminger, M.; Alvito, P.; Assunção, R.; Ballance, S.; Bohn, T.; Bourlieu-Lacanal, C.; Boutrou, R.; Carrière, F.; et al. INFOGEST static in vitro simulation of gastrointestinal food digestion. Nat. Protoc. 2019, 14, 991–1014. [Google Scholar] [CrossRef] [PubMed]
  115. Iwaniak, A.; Minkiewicz, P.; Mogut, D.; Borawska-Dziadkiewicz, J.; Żulewska, J.; Darewicz, M. Bovine Milk Protein-Derived Preparations and Their Hydrolysates as Sources of ACE-Inhibitory, DPP-IV-Inhibitory, and Antioxidative Peptides Analyzed Using in Silico and in Vitro Protocols. Int. J. Mol. Sci. 2025, 26, 4323. [Google Scholar] [CrossRef] [PubMed]
  116. Zhang, Y.; Zhu, Y.; Bao, X.; Dai, Z.; Shen, Q.; Wang, L.; Xue, Y. Mining Bovine Milk Proteins for DPP-4 Inhibitory Peptides Using Machine Learning and Virtual Proteolysis. Research 2024, 7, 0391. [Google Scholar] [CrossRef] [PubMed]
  117. Zhao, Y.; Zhao, H.; Li, L.; Tan, J.; Wang, Y.; Liu, M.; Jiang, L. Multi-omics analysis reveals that the metabolite profile of raw milk is associated with dairy cows’ health status. Food Chem. 2023, 428, 136813. [Google Scholar] [CrossRef] [PubMed]
  118. Amarasekara, Y.; Pathak, R.; Reddy, N.; Nevalainen, H.; Te’o, J.; Sunna, A. Integrated proteomics and in silico analysis of casein-derived peptides with antioxidant, antihypertensive, and anticancer activities. Food Chem. 2025, 493, 145911. [Google Scholar] [CrossRef] [PubMed]
  119. Vukic, V.R.; Vukic, D.V.; Milanovic, S.D.; Ilicic, M.D.; Kanuric, K.G.; Johnson, M.S. In silico identification of milk antihypertensive di-and tripeptides involved in angiotensin I–converting enzyme inhibitory activity. Nutr. Res. 2017, 46, 22–30. [Google Scholar] [CrossRef] [PubMed]
  120. Wu, N.; Wuhanqimuge; Shuang, Q. Screening, Characterization, and Mechanistic Evaluation of Angiotensin Converting Enzyme Inhibitory Peptides Derived from Milk Fermented with Lactobacillus delbrueckii QS306 with and Without Ultrahigh-Pressure Treatment. J. Agric. Food Chem. 2023, 71, 17420–17431. [Google Scholar] [CrossRef] [PubMed]
  121. Elisha, C.; Bhagwat, P.; Pillai, S. In silico and in vitro analysis of dipeptidyl peptidase-IV and angiotensin-converting enzyme inhibitory peptides derived from milk lactoferrin. Int. Dairy J. 2025, 160, 106092. [Google Scholar] [CrossRef]
  122. Chen, J.; Yu, X.; Chen, Q.; Wu, Q.; He, Q. Screening and mechanisms of novel angiotensin-I-converting enzyme inhibitory peptides from rabbit meat proteins: A combined in silico and in vitro study. Food Chem. 2022, 370, 131070. [Google Scholar] [CrossRef] [PubMed]
  123. Zhang, S.; Liu, L.; Gao, L.; Zhao, Y.; Shi, N.; Wang, H.; Xu, H. Bioinformatics and computer simulation approaches to the discovery and analysis of bioactive peptides. Curr. Pharm. Biotechnol. 2022, 23, 1541–1555. [Google Scholar] [CrossRef] [PubMed]
  124. Du, Z.; Ding, X.; Hsu, W.; Munir, A.; Xu, Y.; Li, Y. pLM4ACE: A protein language model based predictor for antihypertensive peptide screening. Food Chem. 2024, 431, 137162. [Google Scholar] [CrossRef] [PubMed]
  125. Li, J.; Liu, X.; Li, W.; Wu, D.; Zhang, Z.; Chen, W.; Yang, Y. A screening strategy for identifying umami peptides with multiple bioactivities from Stropharia rugosoannulata using in silico approaches and SPR sensing. Food Chem. 2024, 431, 137057. [Google Scholar] [CrossRef] [PubMed]
  126. He, Y.; Deng, Z.; Lyu, Y.; Yan, Y.; Liu, J.; Liu, H. Integration of pre-trained GRU and molecular docking for virtual screening of quinoa seed derived ACE inhibitory peptides: An innovative prediction strategy. Food Chem. 2025, 492, 145591. [Google Scholar] [CrossRef] [PubMed]
  127. Ferrocino, I.; Rantsiou, K.; McClure, R.; Kostic, T.; De Souza, R.S.C.; Lange, L.; FitzGerald, J.; Kriaa, A.; Cotter, P.; Maguin, E. The need for an integrated multi-OMICs approach in microbiome science in the food system. Compr. Rev. Food Sci. Food Saf. 2023, 22, 1082–1103. [Google Scholar] [CrossRef] [PubMed]
  128. Gao, Y.; Liu, Y.; Ma, T.; Liang, Q.; Sun, J.; Wu, X.; Song, Y.; Nie, H.; Huang, J.; Mu, G. Fermented Dairy Products as Precision Modulators of Gut Microbiota and Host Health: Mechanistic Insights, Clinical Evidence, and Future Directions. Foods 2025, 14, 1946. [Google Scholar] [CrossRef] [PubMed]
  129. Luo, L.; Yang, B.; Zhao, J.; Li, X.; Qiao, W.; Zhang, M.; Tian, L.; Chen, L. Regulation of infant gut microbiota and metabolic pathways by human milk oligosaccharides: A multi-omics investigation. RSC Adv. 2026, 16, 5641–5651. [Google Scholar] [CrossRef] [PubMed]
  130. Kato, T.; Kagawa, M.; Suda, W.; Tsuboi, Y.; Inoue-Suzuki, S.; Kikuchi, J.; Hattori, M.; Ohta, T.; Ohno, H. Integrated Multi-Omics Analysis Reveals Differential Effects of Fructo-Oligosaccharides (FOS) Supplementation on the Human Gut Ecosystem. Int. J. Mol. Sci. 2022, 23, 11728. [Google Scholar] [CrossRef] [PubMed]
  131. Prattico, C.; Gonzalez, E.; Dridi, L.; Jazestani, S.; Low, K.E.; Abbott, D.W.; Maurice, C.F.; Castagner, B. Identification of novel fructo-oligosaccharide bacterial consumers by pulse metatranscriptomics in a human stool sample. mSphere 2025, 10, e00668-24. [Google Scholar] [CrossRef] [PubMed]
  132. Siegwald, L.; Cherta-Murillo, A.; Christen, S.; Boulangé, C.L.; Chou, C.J.; Foata, F.; Lahiry, A.; Frézal, A.; Giner, M.P.; Godin, J.-P. The impact of low-lactose, high galacto-oligosaccharides (GOS) milk on gut microbiome and plasma metabolome in healthy adults: A randomized, double-blind, controlled clinical trial complemented by ex vivo experiments. Curr. Dev. Nutr. 2025, 9, 107506. [Google Scholar] [CrossRef] [PubMed]
  133. Zhu, C.; Zhang, Q.; Zhao, X.; Yang, Z.; Yang, F.; Yang, Y.; Tang, J.; Laghi, L. Metabolomic Analysis of Multiple Biological Specimens (Feces, Serum, and Urine) by 1H-NMR Spectroscopy from Dairy Cows with Clinical Mastitis. Animals 2023, 13, 741. [Google Scholar] [CrossRef] [PubMed]
  134. Ames, S.R.; Lotoski, L.C.; Azad, M.B. Comparing early life nutritional sources and human milk feeding practices: Personalized and dynamic nutrition supports infant gut microbiome development and immune system maturation. Gut Microbes 2023, 15, 2190305. [Google Scholar] [CrossRef] [PubMed]
  135. Sadurski, J.; Polak-Berecka, M.; Staniszewski, A.; Waśko, A. Step-by-Step Metagenomics for Food Microbiome Analysis: A Detailed Review. Foods 2024, 13, 2216. [Google Scholar] [CrossRef] [PubMed]
  136. Wang, D.; Yin, Q.; Tang, G.; Sun, L.; Yu, J.; Cao, Y.; Yao, J. Gut microbiome enterotypes drive divergent lactation performance in dairy goats through host-microbe metabolic cross-talk. Microbiol. Spectr. 2025, 13, e01367-25. [Google Scholar] [CrossRef] [PubMed]
  137. Qin, Y.; Havulinna, A.S.; Liu, Y.; Jousilahti, P.; Ritchie, S.C.; Tokolyi, A.; Sanders, J.G.; Valsta, L.; Brożyńska, M.; Zhu, Q. Combined effects of host genetics and diet on human gut microbiota and incident disease in a single population cohort. Nat. Genet. 2022, 54, 134–142. [Google Scholar] [CrossRef] [PubMed]
  138. Vimaleswaran, K.S.; Zhou, A.; Cavadino, A.; Hyppönen, E. Evidence for a causal association between milk intake and cardiometabolic disease outcomes using a two-sample Mendelian Randomization analysis in up to 1,904,220 individuals. Int. J. Obes. 2021, 45, 1751–1762. [Google Scholar] [CrossRef] [PubMed]
  139. Kinkpe, L.; Solomon, A.I.; Niu, Y.; Goswami, N.; Ikele, C.M.-C.; Hu, D.; Abdessan, R.; Zhigang, H.; Xia, W. A guide to network analysis, multi-omics integration, and applications in livestock microbiome research. World J. Microbiol. Biotechnol. 2026, 42, 17. [Google Scholar] [CrossRef] [PubMed]
  140. Yuan, L.; Li, Y.; Wang, Z.; Xie, X.; Wu, Q. Gut Microbiota-Mediated Antihypertensive Effects of Probiotic Fermented Milk: A Multi-Omics Study. J. Food Sci. 2025, 90, e70654. [Google Scholar] [CrossRef] [PubMed]
  141. Santiago-Rodriguez, T.M.; Hollister, E.B. Multi ‘omic data integration: A review of concepts, considerations, and approaches. Semin. Perinatol. 2021, 45, 151456. [Google Scholar] [CrossRef] [PubMed]
  142. Fonseca, P.A.d.S.; Suárez-Vega, A.; Casas, L.; Marina, H.; Gutiérrez-Gil, B.; Arranz, J.J. Integrating omics and functional data via representation learning to prioritize candidate genes for pleiotropic effect in dairy sheep. PNAS Nexus 2025, 4, pgaf361. [Google Scholar] [CrossRef] [PubMed]
  143. Zheng, W.; Zhang, J.; Huang, J.; Han, B.; Liu, Y.; Sun, D. A pilot multi-omics study reveals genetic mechanisms regulating milk component traits in dairy cattle. Commun. Biol. 2025, 8, 1150. [Google Scholar] [CrossRef] [PubMed]
  144. Wang, M.; Zhang, L.; Jiang, X.; Song, Y.; Wang, D.; Liu, H.; Wu, S.; Yao, J. Multiomics analysis revealed that the metabolite profile of raw milk is associated with the lactation stage of dairy cows and could be affected by variations in the ruminal microbiota. J. Dairy Sci. 2024, 107, 8709–8721. [Google Scholar] [CrossRef] [PubMed]
  145. Wilkinson, M.D.; Dumontier, M.; Aalbersberg, I.J.; Appleton, G.; Axton, M.; Baak, A.; Blomberg, N.; Boiten, J.-W.; da Silva Santos, L.B.; Bourne, P.E.; et al. The FAIR Guiding Principles for scientific data management and stewardship. Sci. Data 2016, 3, 160018. [Google Scholar] [CrossRef] [PubMed]
  146. Okoye, C.O.; Jiang, H.; Nazar, M.; Tan, X.; Jiang, J. Redefining modern food analysis: Significance of omics analytical techniques integration, chemometrics and bioinformatics. TrAC Trends Anal. Chem. 2024, 175, 117706. [Google Scholar] [CrossRef]
  147. Wadood, A.A.; Bordbar, F.; Zhang, X. Integrating omics approaches in livestock biotechnology: Innovations in production and reproductive efficiency. Front. Anim. Sci. 2025, 6, 1551244. [Google Scholar] [CrossRef]
  148. Yu, Y.; Mai, Y.; Zheng, Y.; Shi, L. Assessing and mitigating batch effects in large-scale omics studies. Genome Biol. 2024, 25, 254. [Google Scholar] [CrossRef] [PubMed]
  149. Lee, K.H.; Assassi, S.; Mohan, C.; Pedroza, C. Addressing statistical challenges in the analysis of proteomics data with extremely small sample size: A simulation study. BMC Genom. 2024, 25, 1086. [Google Scholar] [CrossRef] [PubMed]
  150. Richardson, R.; Tejedor Navarro, H.; Amaral, L.A.N.; Stoeger, T. Meta-research: Understudied genes are lost in a leaky pipeline between genome-wide assays and reporting of results. eLife 2024, 12, RP93429. [Google Scholar] [CrossRef]
  151. Sopić, M.; Devaux, Y.; de Gonzalo-Calvo, D. Navigating the path of reproducibility in microRNA-based biomarker research with ring trials. Clin. Chem. Lab. Med. (CCLM) 2024, 62, 2393–2397. [Google Scholar] [CrossRef] [PubMed]
  152. Xiao, X.; Fu, L.; Zhang, Z.; Tu, Z.; Shen, N.; Fan, S. Recent Advances in Integrating Machine Learning with Omics Approaches in Food Science and Nutrition Research. J. Agric. Food Chem. 2025, 73, 29998–30025. [Google Scholar] [CrossRef] [PubMed]
  153. Bansal, S.; Rodriguez, C.Z.; Thompson-Witrick, K.A.; Wang, Y.; Taft, D.H.; Zhang, B. Machine learning-powered multi-omics for food microbiology and smarter food safety. Trends Food Sci. Technol. 2025, 163, 105145. [Google Scholar] [CrossRef]
  154. Desaire, H. How (not) to generate a highly predictive biomarker panel using machine learning. J. Proteome Res. 2022, 21, 2071–2074. [Google Scholar] [CrossRef] [PubMed]
  155. Pestarino, L.; Fortner, R.T.; Nøst, T.H.; Fotopoulos, I.; Urbarova, I.; Røe, O.D.; Langseth, H.; Rounge, T.B. Recommendations for validating omics prediction models: Insights from a lung cancer RNA biomarker study. Cancer Epidemiol. Biomark. Prev. 2025, 34, 1852–1860. [Google Scholar] [CrossRef] [PubMed]
  156. Vos, G.; van Eijk, L.; Sarnyai, Z.; Azghadi, M.R. Stabilizing machine learning for reproducible and explainable results: A novel validation approach to subject-specific insights. Comput. Methods Programs Biomed. 2025, 269, 108899. [Google Scholar] [CrossRef] [PubMed]
  157. Patel, S.S.; Bains, A.; Kaushik, R.; Dhull, S.B.; Nagraik, R.; Fareed, M.; Janghu, S.; Chawla, P. Leveraging Smart Technologies to Enhance Safety in Milk and Milk-based Products. Food Control 2025, 182, 111871. [Google Scholar] [CrossRef]
  158. Yang, H.; Wang, Y.; Zhao, J.; Li, P.; Li, Z.; Li, L.; Fan, B.; Wang, F. Data-driven pipeline modeling for predicting unknown protein adulteration in dairy products. Food Chem. 2025, 471, 142736. [Google Scholar] [CrossRef] [PubMed]
  159. Goyal, K.; Kumar, P.; Verma, K. XAI-empowered IoT multi-sensor system for real-time milk adulteration detection. Food Control 2024, 164, 110495. [Google Scholar] [CrossRef]
  160. Gu, J.; Yue, S.; Yang, X.; Lu, Z.; Wang, Z.; Wei, X.; Zhou, L.; Zhao, Z.; Shi, Z.; Zhao, Z. Unlocking novel angiotensin-converting enzyme inhibitory peptides from Nili Ravi buffalo milk β-casein: A synergistic strategy combining targeted hydrolysis, 3D-QSAR modeling, and mechanistic insights. Food Biosci. 2025, 71, 107346. [Google Scholar] [CrossRef]
  161. Zhao, W.-J.; Xiao, Q.-A. Effects of different types of milk consumption on Crohn’s disease and the mediating effects of amino acids: A Mendelian randomization study. J. Dairy Sci. 2025, 108, 2199–2205. [Google Scholar] [CrossRef] [PubMed]
  162. Angel, T.E.; Aryal, U.K.; Hengel, S.M.; Baker, E.S.; Kelly, R.T.; Robinson, E.W.; Smith, R.D. Mass spectrometry-based proteomics: Existing capabilities and future directions. Chem. Soc. Rev. 2012, 41, 3912–3928. [Google Scholar] [CrossRef] [PubMed]
  163. Wang, X.; Jahagirdar, S.; Bakker, W.; Lute, C.; Kemp, B.; Knegsel, A.V.; Saccenti, E. Discrimination of Lipogenic or Glucogenic Diet Effects in Early-Lactation Dairy Cows Using Plasma Metabolite Abundances and Ratios in Combination with Machine Learning. Metabolites 2024, 14, 230. [Google Scholar] [CrossRef] [PubMed]
  164. Batavani, R.A.; Asri, S.; Naebzadeh, H. The effect of subclinical mastitis on milk composition in dairy cows. Iran. J. Vet. Res. 2007, 8, 205–211. [Google Scholar] [CrossRef]
  165. Ding, H.; Xie, Z.; Yu, W.; Cui, X.; Wilson, D.I. Artificial intelligence enhances food testing process: A comprehensive review. Food Biosci. 2025, 68, 106404. [Google Scholar] [CrossRef]
  166. Zhang, W.; Xue, H.; Pan, X.; Liu, J. Research on an improved black winged kite algorithm for detecting adulteration in camel milk powder. J. Food Compos. Anal. 2026, 153, 109101. [Google Scholar] [CrossRef]
  167. Chen, Z.; Zhang, G.; Zhang, F. Multimodal AI for Real-Time Food Safety and Quality: From Sensors to Foundation Models, Edge Deployment, and Regulation. Food Sci. Nutr. 2026, 14, e71534. [Google Scholar] [CrossRef] [PubMed]
  168. Bhowmik, D.; Rickard, J.J.S.; Goldberg Oppenheimer, P. Advanced Real-Time, Non-Destructive Spectral Fingerprinting for Early microbial Spoilage Detection: AI-Integrated Raman Biosensing Platform for Scalable Food Safety and Quality. Food Res. Int. 2026, 231, 118745. [Google Scholar] [CrossRef] [PubMed]
  169. Liao, X.; Ederveen, T.H.A.; Niehues, A.; de Visser, C.; Huang, J.; Badmus, F.; Doornbos, C.; Orlova, Y.; Kulkarni, P.; van der Velde, K.J.; et al. FAIR Data Cube, a FAIR data infrastructure for integrated multi-omics data analysis. J. Biomed. Semant. 2024, 15, 20. [Google Scholar] [CrossRef] [PubMed]
  170. Niehues, A.; de Visser, C.; Hagenbeek, F.A.; Kulkarni, P.; Pool, R.; Karu, N.; Kindt, A.S.D.; Singh, G.; Vermeiren, R.R.J.M.; Boomsma, D.I.; et al. A multi-omics data analysis workflow packaged as a FAIR Digital Object. GigaScience 2024, 13, giad115. [Google Scholar] [CrossRef] [PubMed]
  171. Abbatiello, S.E.; Schilling, B.; Mani, D.R.; Zimmerman, L.J.; Hall, S.C.; MacLean, B.; Albertolle, M.; Allen, S.; Burgess, M.; Cusack, M.P.; et al. Large-Scale Interlaboratory Study to Develop, Analytically Validate and Apply Highly Multiplexed, Quantitative Peptide Assays to Measure Cancer-Relevant Proteins in Plasma. Mol. Cell. Proteom. 2015, 14, 2357–2374. [Google Scholar] [CrossRef] [PubMed]
  172. Wang, Z.; Li, Y.; Fu, W.; Wu, J. Unraveling the mechanisms of action of food-derived bioactive peptides: Insights from Omics approaches. Food Nutr. 2025, 1, 100019. [Google Scholar] [CrossRef]
  173. Chen, L.G.; Tubbs, J.D.; Liu, Z.; Thach, T.-Q.; Sham, P.C. Mendelian randomization: Causal inference leveraging genetic data. Psychol. Med. 2024, 54, 1461–1474. [Google Scholar] [CrossRef] [PubMed]
  174. Radio, S.; Di Marsico, M.; Bersani, C.; Malinverni, R.; Casacuberta, J.; Corpetti, C.; Cigliano, R.A.; Sanseverino, W. Development of a roadmap for action on the application of Omics and associated Bioinformatics Approaches in Risk Assessment. EFSA Support. Publ. 2024, 21, 9086E. [Google Scholar] [CrossRef]
  175. Rao, P.P. Foodomics for Food Safety and Quality: A Critical Performance Analysis and Implementation Framework. Food Rev. Int. 2026, 1–62. [Google Scholar] [CrossRef]
  176. Jackson, D.; Gousiopoulos, A.; Soldatos, T.G. Language Models and Food–Health Evidence: Challenges, Opportunities, and Implications. BioMedInformatics 2026, 6, 13. [Google Scholar] [CrossRef]
Figure 1. Schematic diagram of the multi-omics detection and analysis process for dairy products. This workflow consists of four sequential stages: sample collection, sample preparation, instrumental testing, and data analysis. Arrows indicate the direction of the analytical pipeline. Different colors distinguish the four stages.
Figure 1. Schematic diagram of the multi-omics detection and analysis process for dairy products. This workflow consists of four sequential stages: sample collection, sample preparation, instrumental testing, and data analysis. Arrows indicate the direction of the analytical pipeline. Different colors distinguish the four stages.
Foods 15 02389 g001
Figure 2. Integrated application framework of multi-omics in dairy product safety assessment. This framework consists of three modules: multimodal technology input (metabolomics, proteomics, and metagenomics), integrated analysis (multivariate statistical biomarker screening, authenticity and falsification identification, species-specific peptide labeling, and allergen risk assessment), and three major application outputs. Arrows indicate the data flow from input through integration to final applications. Color-coded blocks differentiate technology types, analytical methods, and application outcomes.
Figure 2. Integrated application framework of multi-omics in dairy product safety assessment. This framework consists of three modules: multimodal technology input (metabolomics, proteomics, and metagenomics), integrated analysis (multivariate statistical biomarker screening, authenticity and falsification identification, species-specific peptide labeling, and allergen risk assessment), and three major application outputs. Arrows indicate the data flow from input through integration to final applications. Color-coded blocks differentiate technology types, analytical methods, and application outcomes.
Foods 15 02389 g002
Figure 3. HMO structure diagram (ac).
Figure 3. HMO structure diagram (ac).
Foods 15 02389 g003
Figure 4. Schematic diagrams of FOS and GOS structures. (a) Schematic diagram of FOS structure. (b) Schematic diagram of GOS structure.
Figure 4. Schematic diagrams of FOS and GOS structures. (a) Schematic diagram of FOS structure. (b) Schematic diagram of GOS structure.
Foods 15 02389 g004
Figure 5. Dairy multi-omics research translation roadmap: standardization, validation, and application. This roadmap illustrates the three sequential stages from research to industrial practice: standardization (data standards and SOPs), validation (biomarker/model robustness assessment with explainable AI), and application (detection kits, rapid sorting, management systems, and functional product development).
Figure 5. Dairy multi-omics research translation roadmap: standardization, validation, and application. This roadmap illustrates the three sequential stages from research to industrial practice: standardization (data standards and SOPs), validation (biomarker/model robustness assessment with explainable AI), and application (detection kits, rapid sorting, management systems, and functional product development).
Foods 15 02389 g005
Table 1. Comparison of multi-omics approaches.
Table 1. Comparison of multi-omics approaches.
Omics TypeTarget MoleculesApplicationKey FindingsAdvantageSample Type/Validation Design
MetabolomicsOrganic acids, amino acidsSpecies discriminationIdentified biomarkers (e.g., uric acid)High sensitivityMilk from cow, goat, buffalo; validated by PCA/OPLS-DA cross-validation [17,18,19,20]
ProteomicsCaseins, whey proteinsAllergen profilingSpecies-specific peptidesHigh specificityRaw/pasteurized milk; validated by LC-MS/MS with spectral library matching [21]
LipidomicsFatty acids, phospholipidsNutritional evaluationCLA variationFunctional insightRaw milk from grass- vs. grain-fed cows; validated by GC-MS and orthogonal partial least squares [22,23,24]
MetagenomicsMicrobiotaFermentation impactMicrobial metabolism linkMechanisticFermented milk (dahi, yogurt); validated by 16S rRNA sequencing with repeated sampling [17,20,25]
Table 2. Comparison of nutritional characteristics and health effects of different milk sources.
Table 2. Comparison of nutritional characteristics and health effects of different milk sources.
Milk Source TypeMain Nutritional AdvantageKey Functional ComponentsPotential Health EffectsSample Type/Validation Designs
Human MilkComprehensive essential nutrients, rich in functional proteinsImmune factors, growth factors, hormones, whey proteinsSupports overall infant development, establishes immune systemPooled mature milk samples (n = 30); validated by longitudinal cohort [34,35,36]
Yak MilkHigh content of functional proteins, high nutrient densityHigh levels of lactoferrin, osteopontin, immunoglobulinsEnhances immunity, promotes bone healthTibetan plateau samples (n = 25); cross-validated with ELISA and proteomics [33,37]
Buffalo MilkHigh mineral and fat content, high probiotic survival rateHigh cholesterol, sphingomyelin, protein, mineralsSupports bone health, beneficial for fermented dairy productionMurrah buffalo milk (n = 20); repeated measures across lactation stages [38,39]
Sow MilkRich in neurodevelopment-related components and special lipidsN-acetylneuraminic acid, polar lipids, long-chain polyunsaturated fatty acidsPromotes neural development, supports cell membrane functionColostrum and mature milk (n = 15); validated by targeted lipidomics [36]
Cow MilkHigh protein content, high yield, wide applicationCasein, β-lactoglobulinProvides high-quality proteinBulk tank milk from 10 farms; validated by repeated monthly sampling [35]
Table 3. Application of multi-omics approaches in dairy safety assessment: authenticity, peptide markers, and allergen risk.
Table 3. Application of multi-omics approaches in dairy safety assessment: authenticity, peptide markers, and allergen risk.
Assessment AreaOmics TechnologyDetection PlatformModel/AlgorithmApplicable ProductsReferences
Milk Authenticity and Adulteration IdentificationMetabolomicsUPLC-HRMSPCA, OPLS-DA, SVMCow milk, goat milk, horse milk, pasteurized milk, UHT milk[32,62,63,64,65]
ProteomicsLC-MS/MS, microLC-IM-QTOFPCA, PLS-DARaw milk, pasteurized milk, milk powder
GenomicsqPCR, high-throughput sequencingPhylogenetic analysisRaw milk, fermented milk, milk powder
Metagenomics16S rRNA gene sequencing, shotgun metagenomic sequencingAlpha/beta diversity analysisRaw milk, cheese, fermented milk
Species-Specific Peptide MarkersProteomics/PeptidomicsmicroLC-IM-QTOF, LC-MS/MSDatabase matching, de novo sequencingRaw milk, pasteurized milk, UHT milk, fermented milk[66,67,68]
Allergen Detection and Risk AssessmentProteomicsLC-MS/MS, ELISAImmunoinformatics algorithmsRaw milk, processed dairy products
MetabolomicsUPLC-HRMSCorrelation analysisProcessed dairy products[69,70,71]
Metagenomics16S rRNA gene sequencing, shotgun metagenomic sequencingMachine learning (e.g., random forest)Infant formula
Table 4. Multi-omics technologies and strategies for investigating dairy product health functions.
Table 4. Multi-omics technologies and strategies for investigating dairy product health functions.
Research LevelCore Omics Technologies/MethodsResearch Strategy and ContentTarget Health FunctionsSample Type/Validation Design
Molecular LevelPeptidomics, Molecular Docking SimulationHigh-throughput mining of milk-derived bioactive peptides; predicting binding modes and structure–activity relationships between peptides and target proteinsAntioxidant, angiotensin-converting enzyme (ACE) inhibition, immunomodulation, etc.In vitro digests of casein/whey; docking validated with known ACE structure [102,103,104,105,106,107,108,109,110,111,112,113,114,115,116]
Metabolic Pathway LevelMetagenomics, Metabolomics (Integrated Analysis)Elucidating the effects of functional oligosaccharides on gut microbiota structure and metabolic pathways; revealing the production pathways of beneficial metabolites such as short-chain fatty acidsPrebiotic effects, modulating microbial metabolism to improve gut barrier and immunity, etc.FOS/GOS intervention in human/infant cohorts; validated by paired metagenomics–metabolomics [117,118,119,120,121,122]
Ecosystem LevelMulti-omics Co-occurrence Network Analysis, Causal Inference Models (e.g., Mendelian Randomization, Structural Equation Modeling)Constructing interaction networks of dairy components–gut microbes–metabolites–host phenotypes; identifying key microbial taxa and metabolic signaling pathways driving health; moving from association to causal verificationPrecision modulation of gut microecology for targeted health interventionsFermented milk intervention (8 weeks, n = 60); validated by Mendelian randomization using LCT variant [123,124,125,126,127,128,129,130]
Table 5. The four major unresolved core issues.
Table 5. The four major unresolved core issues.
Knowledge Gap CategorySpecific ProblemPriority ActionReferences
Data standardization and accessibilityLack of cross-laboratory, cross-platform, publicly available reference datasets, particularly raw LC-MS/MS data; most studies do not adhere to FAIR principlesEstablish a dairy multi-omics reference database, and enforce data sharing and standardized formats[15,169,170]
Cross-regional validation of markersThe vast majority of peptide markers have only been validated in a single laboratory, in a single breed, or under a single processing condition, and the false discovery rate (FDR) remains unknownPromote multi-center ring trials, requiring validation by at least three independent laboratories and two or more mass spectrometry platforms[32,56,171]
Lack of causal evidenceMost studies report associations (e.g., co-occurrence networks, Spearman correlations) and lack of randomized controlled trials (RCTs) to validate the clinical effects of bioactive peptidesConduct RCTs with multi-omics endpoints, combined with Mendelian randomization for causal inference[9,172,173]
Regulatory acceptanceThere are no officially recognized standards for multi-omics testing, performance thresholds, or guidelines for machine learning model validationPromote the initiation of projects within international standardization organizations such as AOAC, ISO, and IDF and establish an approval process for multi-omics biomarkers[172,174,175,176]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Xu, M.; Ma, B.; Zhu, K.; Tu, W.; Li, C.; Hao, P.; Zhang, M. Research Progress in Multi-Omics Analysis of Dairy Products: Nutritional Quality, Safety Evaluation, and Health Functions. Foods 2026, 15, 2389. https://doi.org/10.3390/foods15132389

AMA Style

Xu M, Ma B, Zhu K, Tu W, Li C, Hao P, Zhang M. Research Progress in Multi-Omics Analysis of Dairy Products: Nutritional Quality, Safety Evaluation, and Health Functions. Foods. 2026; 15(13):2389. https://doi.org/10.3390/foods15132389

Chicago/Turabian Style

Xu, Mengqi, Biao Ma, Kaichen Zhu, Wenke Tu, Chenjia Li, Peiying Hao, and Mingzhou Zhang. 2026. "Research Progress in Multi-Omics Analysis of Dairy Products: Nutritional Quality, Safety Evaluation, and Health Functions" Foods 15, no. 13: 2389. https://doi.org/10.3390/foods15132389

APA Style

Xu, M., Ma, B., Zhu, K., Tu, W., Li, C., Hao, P., & Zhang, M. (2026). Research Progress in Multi-Omics Analysis of Dairy Products: Nutritional Quality, Safety Evaluation, and Health Functions. Foods, 15(13), 2389. https://doi.org/10.3390/foods15132389

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop