Integrated Transcriptomic Analysis Identifies Core Hub Genes Regulating Mammary Gland Traits (Milk Quality/Lactation) in Dairy Livestock: Bos taurus and Ovis aries
Abstract
1. Introduction
2. Materials and Methods
3. Results
3.1. Transcriptomic Data Characteristics and Differential Gene Screening
3.2. WGCNA Co-Expression Network Construction and Core Module Screening
3.3. Machine Learning Model Performance and High-Contribution Gene Screening
3.4. Core Gene Intersection, Functional Enrichment, and Hub Gene Identification
4. Discussion
5. Conclusions
Supplementary Materials
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Cabiddu, A.; Delgadillo-Puga, C.; Decandia, M.; Molle, A.G. Extensive Ruminant Production Systems and Milk Quality with Emphasis on Unsaturated Fatty Acids, Volatile Compounds, Antioxidant Protection Degree and Phenol Content. Animals 2019, 9, 771. [Google Scholar] [CrossRef]
- Oltjen, J.W.; Beckett, J.L. Role of ruminant livestock in sustainable agricultural systems. J. Anim. Sci. 1996, 74, 1406–1409. [Google Scholar] [CrossRef]
- Haug, A.; Høstmark, A.T.; Harstad, O.M. B. taurus milk in human nutrition—A review. Lipids Health Dis. 2007, 6, 25. [Google Scholar] [CrossRef]
- Law, F.M.; Moate, P.J.; Leaver, D.D.; Diefenbach-Jagger, H.; Grill, V.; Ho, P.W.; Martin, T.J. Parathyroid hormone-related protein in milk and its correlation with B. taurus milk calcium. J. Endocrinol. 1991, 128, 21–26. [Google Scholar] [CrossRef]
- Palupi, E.; Jayanegara, A.; Ploeger, A.; Kahl, J. Comparison of nutritional quality between conventional and organic dairy products: A meta-analysis. J. Sci. Food Agric. 2012, 92, 2774–2781. [Google Scholar] [CrossRef]
- Singh, M.; Thomson, P.C.; Sheehy, P.A.; Raadsma, H.W. Comparative transcriptome analyses reveal conserved and distinct mechanisms in ovine and B. taurus lactation. Funct. Integr. Genom. 2013, 13, 115–131. [Google Scholar] [CrossRef] [PubMed]
- Beckett, L.; Xie, S.; Thimmapuram, J.; Tucker, H.A.; Donkin, S.S.; Casey, T. Mammary transcriptome reveals cell maintenance and protein turnover support milk synthesis in early-lactation cows. Physiol. Genom. 2020, 52, 435–450. [Google Scholar] [CrossRef]
- Bionaz, M.; Loor, J.J. Gene networks driving B. taurus mammary protein synthesis during the lactation cycle. Bioinform. Biol. Insights 2011, 5, 83–98. [Google Scholar] [CrossRef] [PubMed]
- Rainard, P.; Riollet, C. Mobilization of neutrophils and defense of the B. taurus mammary gland. Reprod. Nutr. Dev. 2003, 43, 439–457. [Google Scholar] [CrossRef]
- Fu, Y.; Shen, N.; Wang, S.; Liu, Y.; Peng, P.; Shi, L.; Han, B.; Wang, K.; Sun, D. Integrating transcriptomic and metabolomic profiles of primiparous Holstein cows across multiple lactation periods reveals the regulatory mechanism underlying milk component traits. J. Dairy Sci. 2025, 108, 10377–10390. [Google Scholar] [CrossRef] [PubMed]
- Fonseca, P.A.S.; Suárez-Vega, A.; Esteban-Blanco, C.; Pelayo, R.; Marina, H.; Gutiérrez-Gil, B.; Arranz, J.J. Epigenetic regulation of functional candidate genes for milk production traits in dairy sheep subjected to protein restriction in the prepubertal stage. BMC Genom. 2023, 24, 511. [Google Scholar] [CrossRef]
- Zhou, Y.; Gong, W.; Xiao, J.; Wu, J.; Pan, L.; Li, X.; Wang, X.; Wang, W.; Hu, S.; Yu, J. Transcriptomic analysis reveals key regulators of mammogenesis and the pregnancy-lactation cycle. Sci. China Life Sci. 2014, 57, 340–355. [Google Scholar] [CrossRef]
- Suchyta, S.P.; Sipkovsky, S.; Halgren, R.G.; Kruska, R.; Elftman, M.; Weber-Nielsen, M.; Vandehaar, M.J.; Xiao, L.; Tempelman, R.J.; Coussens, P.M. B. taurus mammary gene expression profiling using a cDNA microarray enhanced for mammary-specific transcripts. Physiol. Genom. 2003, 16, 8–18. [Google Scholar] [CrossRef]
- Mumtaz, P.T.; Bhat, B.; Ibeagha-Awemu, E.M.; Taban, Q.; Wang, M.; Dar, M.A.; Bhat, S.A.; Shabir, N.; Shah, R.A.; Ganie, N.A.; et al. Mammary epithelial cell transcriptome reveals potential roles of lncRNAs in regulating milk synthesis pathways in Jersey and Kashmiri cattle. BMC Genom. 2022, 23, 176. [Google Scholar] [CrossRef]
- Robinson, M.D.; McCarthy, D.J.; Smyth, G.K. edgeR: A Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 2010, 26, 139–140. [Google Scholar] [CrossRef] [PubMed]
- Sun, H.Z.; Zhou, M.; Wang, O.; Chen, Y.; Liu, J.X.; Guan, L.L. Multi-omics reveals functional genomic and metabolic mechanisms of milk production and quality in B. taurus. Bioinformatics 2020, 36, 2530–2537. [Google Scholar] [CrossRef] [PubMed]
- Eklund, A.C.; Szallasi, Z. Correction of technical bias in clinical microarray data improves concordance with known biological information. Genome Biol. 2008, 9, R26. [Google Scholar] [CrossRef]
- Hou, P.; Bao, S.; Fan, D.; Yan, C.; Su, J.; Qu, J.; Zhou, M. Machine learning-based integrative analysis of methylome and transcriptome identifies novel prognostic DNA methylation signature in uveal melanoma. Brief. Bioinform. 2021, 22, bbaa371. [Google Scholar] [CrossRef]
- Moutsopoulos, I.; Maischak, L.; Lauzikaite, E.; Vasquez Urbina, S.A.; Williams, E.C.; Drost, H.G.; Mohorianu, I.I. noisyR: Enhancing biological signal in sequencing datasets by characterizing random technical noise. Nucleic Acids Res. 2021, 49, e83. [Google Scholar] [CrossRef]
- Farhadian, M.; Rafat, S.A.; Hasanpur, K.; Ebrahimi, M.; Ebrahimie, E. Cross-Species Meta-Analysis of Transcriptomic Data in Combination with Supervised Machine Learning Models Identifies the Common Gene Signature of Lactation Process. Front. Genet. 2018, 9, 235. [Google Scholar] [CrossRef] [PubMed]
- Li, S.; Guo, Y.; Huo, C.; Zhu, L.; Shi, C.; Na, R.; Gu, M.; Zhang, W. Machine Learning-Based Analysis of Differentially Expressed Genes in the Muscle Transcriptome Between Beef Cattle and Dairy Cattle. Int. J. Mol. Sci. 2025, 26, 5046. [Google Scholar] [CrossRef] [PubMed]
- Filloux, C.; Cédric, M.; Romain, P.; Lionel, F.; Christophe, K.; Dominique, R.; Abderrahman, M.; Daniel, P. An integrative method to normalize RNA-Seq data. BMC Bioinform. 2014, 15, 188. [Google Scholar] [CrossRef]
- Mu, T.; Hu, H.; Ma, Y.; Yang, C.; Feng, X.; Wang, Y.; Liu, J.; Yu, B.; Zhang, J.; Gu, Y. Identification of critical lncRNAs for milk fat metabolism in B. taurus using WGCNA and the construction of a ceRNAs network. Anim. Genet. 2022, 53, 740–760. [Google Scholar] [CrossRef]
- Langfelder, P.; Horvath, S. WGCNA: An R package for weighted correlation network analysis. BMC Bioinform. 2008, 9, 559. [Google Scholar] [CrossRef]
- Tang, D.; Huang, Y.; Che, Y.; Yang, C.; Pu, B.; Liu, S.; Li, H. Identification of platelet-related subtypes and diagnostic markers in pediatric Crohn’s disease based on WGCNA and machine learning. Front. Immunol. 2024, 15, 1323418. [Google Scholar] [CrossRef]
- Ferreira, A.M.; Bislev, S.L.; Bendixen, E.; Almeida, A.M. The mammary gland in domestic ruminants: A systems biology perspective. J. Proteom. 2013, 94, 110–123. [Google Scholar] [CrossRef]
- Zheng, W.; Zhang, J.; Huang, J.; Han, B.; Liu, Y.; Sun, D. A pilot multi-omics study reveals genetic mechanisms regulating milk component traits in dairy cattle. Commun. Biol. 2025, 8, 1150. [Google Scholar] [CrossRef] [PubMed]
- Liu, P.; Ewald, J.; Galvez, J.H.; Head, J.; Crump, D.; Bourque, G.; Basu, N.; Xia, J. Ultrafast functional profiling of RNA-seq data for nonmodel organisms. Genome Res. 2021, 31, 713–720. [Google Scholar] [CrossRef]
- Ritchie, M.E.; Phipson, B.; Wu, D.; Hu, Y.; Law, C.W.; Shi, W.; Smyth, G.K. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 2015, 43, e47. [Google Scholar] [CrossRef]
- Leek, J.T.; Scharpf, R.B.; Bravo, H.C.; Simcha, D.; Langmead, B.; Johnson, W.E.; Geman, D.; Baggerly, K.; Irizarry, R.A. Tackling the widespread and critical impact of batch effects in high-throughput data. Nat. Rev. Genet. 2010, 11, 733–739. [Google Scholar] [CrossRef] [PubMed]
- Abraham, A.; Pedregosa, F.; Eickenberg, M.; Gervais, P.; Mueller, A.; Kossaifi, J.; Gramfort, A.; Thirion, B.; Varoquaux, G. Machine learning for neuroimaging with scikit-learn. Front. Neuroinform. 2014, 8, 14. [Google Scholar] [CrossRef]
- Mamoshina, P.; Volosnikova, M.; Ozerov, I.V.; Putin, E.; Skibina, E.; Cortese, F.; Zhavoronkov, A. Machine Learning on Human Muscle Transcriptomic Data for Biomarker Discovery and Tissue-Specific Drug Target Identification. Front. Genet. 2018, 9, 242. [Google Scholar] [CrossRef]
- Cai, C.; Li, M.; Zhang, Y.; Meng, S.; Yang, Y.; Gao, P.; Guo, X.; Cao, G.; Li, B. Comparative Transcriptome Analyses of Longissimus thoracis Between Pig Breeds Differing in Muscle Characteristics. Front. Genet. 2020, 11, 526309. [Google Scholar] [CrossRef] [PubMed]
- Toneyan, S.; Tang, Z.; Koo, P.K. Evaluating deep learning for predicting epigenomic profiles. Nat. Mach. Intell. 2022, 4, 1088–1100. [Google Scholar] [CrossRef]
- Wang, B.; Men, J.; Wang, C.; Hou, X.; Zhao, F. Laminin-dependent integrin β1 signaling regulates milk protein synthesis via prolactin/STAT5 pathway in B. taurus mammary epithelial cells. Biochem. Biophys. Res. Commun. 2020, 524, 288–294. [Google Scholar] [CrossRef]
- Perugini, J.; Smorlesi, A.; Acciarini, S.; Mondini, E.; Colleluori, G.; Pirazzini, C.; Kwiatkowska, K.M.; Garagnani, P.; Franceschi, C.; Zingaretti, M.C.; et al. Adipo-Epithelial Transdifferentiation in In Vitro Models of the Mammary Gland. Cells 2024, 13, 943. [Google Scholar] [CrossRef]
- Gruden, Š.; Poklar Ulrih, N. Diverse Mechanisms of Antimicrobial Activities of Lactoferrins, Lactoferricins, and Other Lactoferrin-Derived Peptides. Int. J. Mol. Sci. 2021, 22, 11264. [Google Scholar] [CrossRef]
- Szklarczyk, D.; Franceschini, A.; Wyder, S.; Forslund, K.; Heller, D.; Huerta-Cepas, J.; Simonovic, M.; Roth, A.; Santos, A.; Tsafou, K.P.; et al. STRING v10: Protein-protein interaction networks, integrated over the tree of life. Nucleic Acids Res. 2015, 43, D447–D452. [Google Scholar] [CrossRef] [PubMed]
- Zhao, Q.; Zhang, Y.; Shao, S.; Sun, Y.; Lin, Z. Identification of hub genes and biological pathways in hepatocellular carcinoma by integrated bioinformatics analysis. PeerJ 2021, 9, e10594. [Google Scholar] [CrossRef] [PubMed]
- Shen, X.; Qin, C.; Wang, Z.; Jiang, J.; Liu, Q.; Zeng, Q.; Li, L.; Li, Z. Transcriptomic and Proteomic Insights Into Buffalo Milk Fat Synthesis and the Role of IGFBP4 in BMECs. FASEB J. 2025, 39, e70971. [Google Scholar] [CrossRef]
- Yan, S. Integrative analysis of promising molecular biomarkers and pathways for coronary artery disease using WGCNA and MetaDE methods. Mol. Med. Rep. 2018, 18, 2789–2797. [Google Scholar] [CrossRef]
- Wang, F.; Liang, Y.; Wang, Q.W. Interpretable machine learning-driven biomarker identification and validation for Alzheimer’s disease. Sci. Rep. 2024, 14, 30770. [Google Scholar] [CrossRef]
- Guo, Y.; Li, S.; Na, R.; Guo, L.; Huo, C.; Zhu, L.; Shi, C.; Na, R.; Gu, M.; Zhang, W. Comparative Transcriptome Analysis of B. taurus, Porcine, and Sheep Muscle Using Interpretable Machine Learning Models. Animals 2024, 14, 2947. [Google Scholar] [CrossRef]
- Liu, J.; Jiang, P.; Iqbal, A.; Ali, S.; Gao, Z.; Pan, Z.; Xia, L.; Yin, F.; Zhao, Z. MiR-485 targets the DTX4 gene to regulate milk fat synthesis in B. taurus mammary epithelial cells. Sci. Rep. 2021, 11, 7623. [Google Scholar] [CrossRef]
- Keel, B.N.; Lindholm-Perry, A.K. Recent developments and future directions in meta-analysis of differential gene expression in livestock RNA-Seq. Front. Genet. 2022, 13, 983043. [Google Scholar] [CrossRef] [PubMed]
- Suravajhala, P.; Kogelman, L.J.; Kadarmideen, H.N. Multi-omic data integration and analysis using systems genomics approaches: Methods and applications in animal production, health and welfare. Genet. Sel. Evol. 2016, 48, 38. [Google Scholar] [CrossRef]
- Moghaddassi, S.; Eyestone, W.; Bishop, C.E. TALEN-mediated modification of the B. taurus genome for large-scale production of human serum albumin. PLoS ONE 2014, 9, e89631. [Google Scholar] [CrossRef] [PubMed]
- Zhang, H.; Mi, S.; Brito, L.F.; Hu, L.; Wang, L.; Ma, L.; Xu, Q.; Guo, G.; Yu, Y.; Wang, Y. Genomic and transcriptomic analyses enable the identification of important genes associated with subcutaneous fat deposition in Holstein cows. J. Genet. Genom. 2023, 50, 385–397. [Google Scholar] [CrossRef] [PubMed]




| Model Name | Accuracy | Precision | Recall | F1_Score | ROC_AUC |
|---|---|---|---|---|---|
| Random Forest | 0.983 | 0.968 | 1 | 0.984 | 1 |
| SVM | 1 | 1 | 1 | 1 | 1 |
| Naive Bayes | 0.967 | 1 | 0.933 | 0.966 | 0.997 |
| Decision Tree | 0.967 | 0.938 | 1 | 0.968 | 0.967 |
| Centrality Metric | Top 10 Hub Genes (Rank, Name) | Score(s) | Proportion of Ribosomal Proteins in Top 10 |
|---|---|---|---|
| Degree | 1.RPL29; 2.RPLP0; 2.RPS5; 2.RPL28; 2.RPS9; 2.RPS28; 2.RPL24; 2.RPSA; 9.RPS15; 9.RPL7A | 24, 23 (n = 7), 22 (n = 2) | 100% (10/10) |
| DMNC | 1.RPLP2; 1.RPL12; 3.RPL38; 3.RPL13A; 5.RPLP1; 6.RPL18A; 6.RPL8; 8.RPS16; 9.RPS15; 9.RPL7A | 1.139115600700526 (n = 2); 1.1299697840265273 (n = 2); 1.1239012963372081; 1.1096551236319298 (n = 2); 1.0852465843984804; 1.0758424166431555 (n = 2) | 100% (10/10) |
| EPC | 1.RPS9; 2.RPSA; 3.RPS5; 4.RPLP0; 5.RPL28; 6.RPS11; 7.RPL7A; 8.RPS15; 9.RPL24; 10.RPL29 | 9.726999999999996; 9.694999999999975; 9.593999999999976; 9.564999999999976; 9.485999999999985; 9.445999999999993; 9.361999999999986; 9.332999999999998; 9.293999999999974; 9.269999999999982 | 100% (10/10) |
| MNC | 1.RPL29; 2.RPLP0; 2.RPS5; 2.RPL28; 2.RPS9; 2.RPS28; 2.RPL24; 2.RPSA; 9.RPS15; 9.RPL7A | 24, 23 (n = 7), 22 (n = 2) | 100% (10/10) |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Zhang, Q.; Yang, L.; Li, Y.; Gu, P.; Si, R.; Li, S.; Zhu, L.; Zhang, W. Integrated Transcriptomic Analysis Identifies Core Hub Genes Regulating Mammary Gland Traits (Milk Quality/Lactation) in Dairy Livestock: Bos taurus and Ovis aries. Genes 2025, 16, 1483. https://doi.org/10.3390/genes16121483
Zhang Q, Yang L, Li Y, Gu P, Si R, Li S, Zhu L, Zhang W. Integrated Transcriptomic Analysis Identifies Core Hub Genes Regulating Mammary Gland Traits (Milk Quality/Lactation) in Dairy Livestock: Bos taurus and Ovis aries. Genes. 2025; 16(12):1483. https://doi.org/10.3390/genes16121483
Chicago/Turabian StyleZhang, Qiang, Lulu Yang, Yunhan Li, Pengbo Gu, Riguleng Si, Shuai Li, Lin Zhu, and Wenguang Zhang. 2025. "Integrated Transcriptomic Analysis Identifies Core Hub Genes Regulating Mammary Gland Traits (Milk Quality/Lactation) in Dairy Livestock: Bos taurus and Ovis aries" Genes 16, no. 12: 1483. https://doi.org/10.3390/genes16121483
APA StyleZhang, Q., Yang, L., Li, Y., Gu, P., Si, R., Li, S., Zhu, L., & Zhang, W. (2025). Integrated Transcriptomic Analysis Identifies Core Hub Genes Regulating Mammary Gland Traits (Milk Quality/Lactation) in Dairy Livestock: Bos taurus and Ovis aries. Genes, 16(12), 1483. https://doi.org/10.3390/genes16121483
