Next Article in Journal
Bioinformatic Analysis of Autism-Related miRNAs and Their PoTential as Biomarkers for Autism Epigenetic Inheritance
Previous Article in Journal
Ethical and Practical Considerations in Implementing Population-Based Reproductive Genetic Carrier Screening
Previous Article in Special Issue
GRM1 as a Candidate Gene for Buffalo Fertility: Insights from Genome-Wide Association Studies and Its Role in the FOXO Signaling Pathway
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Genome-Wide Association Study Identifies Potential Regulatory Loci and Pathways Related to Buffalo Reproductive Traits

1
Laboratory of Buffalo Genetics, Reproduction and Breeding, Guangxi Buffalo Research Institute, Chinese Academy of Agricultural Sciences, Nanning 530001, China
2
Laboratory of Animal Breeding, Disease Control and Prevention, College of Animal Science & Technology, Guangxi University, Nanning 530004, China
*
Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
Genes 2025, 16(4), 422; https://doi.org/10.3390/genes16040422
Submission received: 13 February 2025 / Revised: 6 March 2025 / Accepted: 28 March 2025 / Published: 31 March 2025
(This article belongs to the Special Issue Buffalo Genetics and Genomics)

Abstract

:
Background: The reproductive performance of water buffalo significantly impacts the economic aspects of production. Traditional breeding methods are constrained by low heritability and numerous influencing factors, making it difficult to effectively improve reproductive efficiency. Genome-wide association studies (GWAS) offer new possibilities for exploring reproductive traits in water buffalo, opening up new avenues for efficient breeding. Methods: Using whole-genome resequencing, we identified quantitative trait loci (QTLs) associated with four suggestive reproductive traits: calving interval (CI), calf birth weight (CBW), dam birth weight (BW), and age at first calving (FCA). The study focused on identifying genetic variants that influence these reproductive traits. Results: Our research identified 52 suggestive regulatory loci associated with reproductive traits in water buffalo. Based on a 50 kb interval, we annotated these loci to 58 candidate genes. These loci involve genes such as AGBL4, GRM1, NCKAP5, and NRXN1, which are primarily enriched in pathways including the FOXO signaling pathway, calcium ion pathways, estrogen signaling pathway, and phospholipase D signaling pathway. These pathways directly or indirectly regulate the reproductive efficiency of water buffalo. Conclusions: This study has revealed suggestive regulatory genes (AGBL4, GRM1, NCKAP5, NRXN1) associated with reproductive traits in water buffalo. This not only enhances our understanding of the molecular mechanisms underlying complex traits but also points towards strategies for improving the reproductive capacity of water buffalo. These findings provide a solid foundation for future breeding programs aimed at enhancing water buffalo productivity.

1. Introduction

Water buffalo plays a pivotal role in human agriculture, serving not only as a robust force for cultivation but also yielding essential products such as meat, milk, leather, and fertilizer. Reproductive traits in water buffalo, representing vital economic factors, are intricate quantitative traits. Traditional breeding methods for enhancing reproductive qualities in water buffalo have been hindered by low heritability and various influencing factors. The advent of whole-genome association analysis (GWAS) has opened new avenues for exploring these quantitative traits [1]. GWAS methods have unearthed numerous single nucleotide polymorphisms (SNPs) and quantitative trait loci (QTLs) linked to the growth, reproduction, and meat quality traits of livestock and poultry [2,3,4,5]. Over the past two decades (2000–2020), GWAS has identified a multitude of QTLs associated with reproductive traits in both cattle and water buffalo [6].
The water buffalo genome assembly forms the cornerstone for genome-wide association studies (GWAS). Since the unveiling of the human genome in 2001, there has been significant progress in genomics, paralleled by advancements in high-throughput sequencing technologies that have lowered sequencing costs, enabling broader species sequencing. Reference genomes for various species, including chicken [7], cattle [8], pigs [9], sheep [10], ducks [11], and geese [12], have emerged successively.
GWAS is a powerful tool used to identify genomic regions associated with specific traits. By genotyping large numbers of individuals and correlating this data with phenotypic information, researchers can pinpoint genes and loci linked to reproductive traits such as calving interval, birth weight, and age at first calving. For example, researchers conducted a genome-wide association study (GWAS) on reproductive traits in Hanwoo cattle, focusing on age at first calving (AFC), calving interval (CI), gestation length (GL), and number of artificial inseminations per conception (NAIPC). Through GWAS, the researcher identified 10 candidate genes associated with age at first calving (AFC) [13]. Several studies have utilized genome-wide association studies (GWAS) to explore potential genetic factors influencing reproductive traits such as oocyte and embryo numbers. These studies have identified the protein-coding genes ARNT, EGR1, HIF1A, AHR, and PAX2 as strong markers for oocyte and embryo production in Gir cattle. The discovery of these genes provides important insights into the molecular mechanisms underlying reproductive performance in Gir cattle and offers scientific support for optimizing reproductive management strategies [14,15].
Our study elucidates the regulatory mechanisms underlying reproductive traits in water buffalo. Through whole-genome resequencing and association analysis of reproductive traits in water buffalo, we have identified suggestive molecular loci (AGBL4, GRM1, NCKAP5, and NRXN1) closely associated with these traits. Further investigation revealed that these candidate genes are primarily enriched in pathways such as the FOXO signaling pathway, estrogen signaling pathway, and calcium ion pathways. These findings provide effective support for enhancing assisted breeding programs aimed at improving water buffalo reproduction and offer a guide for future scientific breeding strategies.

2. Materials and Methods

2.1. Phenotypes and Animal Resources

This study utilized data collected from 120 water buffaloes at the Guangxi Buffalo Research Institute in China from 2000 to 2021. The sample comprised 1 local water buffalo (D), 46 hybrid water buffaloes (Z), 31 Nili-Ravi water buffaloes (N), and 42 Murrah water buffaloes (M). The investigation focused on four reproductive traits: calving interval (CI), calf birth weight (CBW), dam birth weight (BW), and age at first calving (FCA).
FCA represents the days from birth to the initial calving in water buffaloes, and CI signifies the days between successive calvings. CBW pertains to calf birth weight, while BW is the weight of the dam at birth.

2.2. Sample Collection and Sequencing

Blood samples from water buffalo were collected via tail vein puncture using a vacuum blood collector. Genomic DNA extraction from the blood was performed using the phenol/chloroform method, and the integrity and yield of the genomic DNA were evaluated through agarose gel electrophoresis.
The genomic DNA was utilized to create paired-end short-insert libraries (350 bp) and subsequently sequenced using the Illumina HiSeq X Ten system (Illumina, San Diego, CA, USA).

2.3. Alignments and Variant Identification

The clean reads were aligned to the reference genome (UOA_WB_1) using BWA-MEM with default settings. Subsequently, Samtools, Picard tools, and Genome Analysis Toolkit (GATK, version 4.0) were employed for single nucleotide polymorphism (SNP) detection. All SNPs underwent filtering through the “Variant Filtration” module of GATK, employing standard parameters as follows: Variants with Quality Depth (QD) < 2; FS (Phred-scaled p-value using Fisher’s exact test for strand bias detection) > 60; MQRankSum < −12.5; ReadPosRankSum < −8; MQ < 40.0; mean sequencing depth of variants (across all individuals) < 1/3× and > 3×; SOR > 3.0; maximum missing rate < 0.1; and SNPs were limited to two alleles.

2.4. Variation Filtering

Markers (SNPs/Indels) with allele number larger than 2, missing data greater than 50%, minor allele frequency lesser than 5%, and heterozygosity greater than 80% were removed, resulting in 691,729 SNPs and 53,761 Indels.
The remaining SNPs were used for population genetic analysis and all remaining markers (including SNPs and Indels) were used for GWAS.

2.5. Principal Component Analysis (PCA)

Principal component analysis (PCA) was performed based on filtered SNPs using GCTA [16] (v1.92.2) software.

2.6. Kinship Analysis

Pair-wise relationship matrix (kinship matrix) was calculated with all filtered SNPs using GCTA [16] (v1.92.2) software.

2.7. Phylogenetic Tree Construction

The neighbor-joining (NJ) tree based on filtered SNPs was constructed using MEGA-X [17] software v10.2 (p-distance model) with a total of 1000 bootstrap replicates.

2.8. Genome-Wide Association Mapping

Genome-wide association mapping was conducted in TASSEL [18] v.5.2.54, employing a mixed linear model (MLM(Q+K)) that incorporates both population structure and kinship as fixed and random effects, respectively. Population structure was addressed by utilizing the first three principal components, while kinship was considered using the kinship matrix generated through GCTA software. Significance thresholds were set at Bonferroni-adjusted genome-wide significance (0.05/745,490 = 6.71 × 10−8) and suggestive significance (5 × 10−6) to detect meaningful associations. Candidate genes (CAGs) located within the 50 kb region upstream or downstream of significantly associated markers were then identified.

3. Results

3.1. Phenotypic Parameters and Marker Information

We conducted a statistical analysis of four suggestive reproductive traits in 120 water buffaloes (summarized in Table 1). The phenotypic distributions of these traits are distinct and generally align with expected biological patterns. This alignment facilitates the identification of variant loci that exhibit significant differences in phenotypes, suggesting that nearby genes may play a role in controlling these traits. After performing quality control on SNP loci, we identified a total of 745,490 variant loci, including 691,729 SNPs and 53,761 Indels.
To enhance accuracy and minimize false positives, the GWAS analysis incorporates adjustments based on the genetic background and stratification of the study materials. Therefore, conducting a population structure analysis is essential, taking into account the filtered markers. This ensures that the model accurately reflects the genetic diversity and structure of the population, thereby improving the reliability of the GWAS results.

3.2. Population Structure Analysis

We generated a population structure plot using PCA (Figure 1). The plot illustrates clear stratification among the sample populations, suggesting potential population differences. After conducting PCA on the 120 buffalo samples, covering the four reproductive traits, the first three principal components (PCs) explained 4.45%, 4.39%, and 2.33% of the variation, collectively capturing around 11% of the total variation. The samples are distinctly grouped into three clusters, indicating significant population stratification. This underscores the presence of population differences among the sample groups, potentially leading to false positives in subsequent association analyses. Consequently, we incorporated the first three PCs as covariates in the analysis model to mitigate the risk of false positives and enhance result accuracy.
The results highlight significant familial correlations among some samples, necessitating further adjustments to the association analysis model. In unrelated groups or populations with unclear pedigrees, kinship represents the relative genetic similarity between two specific materials compared to the genetic similarity between any two materials. In our experiment, the heat map derived from the kinship matrix demonstrates evident familial correlations among specific samples (Figure 2). Consequently, we included the kinship matrix as a covariate in the analysis model to enhance result accuracy.

3.3. Results of the Genome-Wide Associations

GWAS was performed using logistic regression under an additive genetic model, incorporating five covariates: the first three principal components and the kinship matrix. A total of 52 SNPs were identified as significant (P < 5 × 10−6; two-tailed), as shown in the Manhattan plot (Figure 3) and the quantile–quantile (QQ) plot (Figure 4). Detailed information, including p-values, and percentage of variance explained (PVE), is provided in Table 2, which lists the SNPs within genes. Our research identified 52 suggestive regulatory loci associated with reproductive traits in water buffalo. Based on a 50 kb interval, we annotated these loci to 58 candidate genes. Notably, one significant SNP is located within the GRM1 gene locus, and several gene loci contain two or more independent SNPs: NCKAP5 contains 3 SNPs, AGBL4 contains 2 SNPs, and NRXN1 contains 2 SNPs.
To better visualize the associated loci, we generated a plot depicting the gene locations on chromosomes (Figure 5). A total of 58 genes linked to SNPs were identified across 15 chromosomes, with 14 of these genes containing SNPs within their genomic regions. Notably, the X chromosome exhibits sparse SNP density, while the highest SNP densities are observed on chromosomes 2, 8, and 13.

3.4. Kyoto Encyclopedia of Genes and Genomes Pathway Analysis of Candidate Genes

Through pathway analysis of these candidate genes, we found that they are significantly enriched in the FOXO signaling pathway, calcium ion pathways, estrogen pathways, phospholipase D signaling pathway, and glutamate metabolism. These signaling pathways are closely related to water buffalo reproductive traits either directly or indirectly (Figure 6). This suggests that the gene loci (AGBL4, GRM1, NCKAP5, NRXN1) may be important regulatory sites for controlling reproductive traits in water buffalo, providing strong support for improving future water buffalo reproduction rates.

4. Discussion

4.1. Control of Effective Parameters in GWAS Models

In our study, we used principal component analysis (PCA) results as covariates to genetically control genotype-by-environment (G×E) interactions. However, incorporating covariance from kinship relationships into the model can lead to overcorrection, reducing false positives but potentially overlooking loci with small effects. Despite this limitation, our study, which encompasses diverse water buffalo breeds and their hybrids, significantly enhances the efficacy of identifying genetic loci influencing phenotypes through GWAS. Importantly, we recognize that a well-characterized population structure is crucial for improving the reliability of data analysis. This aspect will be a key consideration in our future research efforts.

4.2. Suggestive Influencing Conditions for Water Buffalo Reproductive Traits and Insights from This Study

Previous research has established a significant link between reproductive traits and cow reproductive health. Researchers used classical repeatability and random regression models to analyze the variation in reproductive efficiency among 12,554 cows of the Retinta Spanish breed. They estimated deviations between optimal female performance and actual parity at different stages of the cows’ lives. Their analysis revealed that BTA4 and BTA28 significantly influence the reproductive efficiency of dairy cows [19,20].
Three of the four suggestive loci we identified are linked to the regulation of cow reproduction, while one is directly associated with sperm production and embryonic development. The most strongly associated locus is located on chromosome 10 in water buffalo within the gene GRM1 (metabotropic glutamate receptor 1). This gene encodes mGluRs, which are typically found as dimers on cell membranes. Glutamate, an excitatory neurotransmitter, plays a role in synaptic modulation during reproduction by integrating signals from wakefulness and reproductive-promoting neuromodulators via Homer1a (a suggested reproductive regulatory factor) and mGluR1/5 (a subtype group of mGluRs) [21]. NCKAP5 (NCK Associated Protein 5), predicted to be involved in microtubule formation and disassembly, is located on chromosome 2 in water buffalo, with three SNPs within its locus. Previous GWAS studies have associated NCKAP5 with age at first calving (AFC). By analyzing 2400 genotyped animals, these studies identified 25 candidate genes for reproductive traits, including CHD7, CLVS1, EVX2, MAT2B, NUDCD2, GPR39, NCKAP5, LYPD1, HOXD13, SEMA5B, CCNG1, SEMA5A, BRF1, PSEN2, CACHD1, SUGT1, ELF1, SNORA70, AKT1, TM2D1, SLF1, MCTP1, PABPC1, MTRF1, and ADCY2 [22]. Additionally, other studies have indicated that NCKAP5 is a regulatory gene for one of the selection traits in Limousin cattle [23]. AGBL4 (ATP/GTP-binding protein-like 4) is a metallo-carboxypeptidase involved in the deglutamylation process of proteins, particularly in the modification of tubulin. It is also associated with antiviral defense responses. The modification of tubulin is crucial for cell division, which is especially important in the generation of germ cells and early embryonic development [24]. Therefore, AGBL4 may indirectly influence reproductive performance by regulating tubulin function.

4.3. Limitations and Future Directions

Our study aims to preliminarily investigate ways to enhance the reproductive performance of water buffalo and identify genetic loci associated with the age at first calving (AFC). However, we have identified several limiting factors that may affect the accuracy of our GWAS (genome-wide association study) results. Firstly, the small sample size limits the reliability of our findings, allowing us to provide only indicative rather than definitive conclusions [25,26,27]. Therefore, in future research, we plan to increase the sample size to address this limitation. Additionally, literature reviews suggest that factors such as photoperiod, breeding season, and parity also significantly impact AFC [28,29,30,31]. Our current study lacks data on these variables, which constitutes another limitation of our preliminary analysis. To better understand the mechanisms underlying reproductive traits in water buffalo, we aim to incorporate these variables into our subsequent studies [32,33]. We are committed to providing more robust data support and technical guidance for the development of water buffalo breeding programs.

5. Conclusions

In this study, our research identified 52 suggestive regulatory loci associated with reproductive traits (age at first calving (FCA)) in water buffalo. Based on a 50 kb interval, we annotated these loci to 58 candidate genes. These candidate genes are significantly enriched in the FOXO signaling pathway, estrogen pathways, and calcium ion signaling pathways. Among these, four suggestive genes (AGBL4, GRM1, NCKAP5, and NRXN1) were found to be involved in embryonic development and the regulation of mammalian reproduction.
These genes not only provide theoretical support for understanding water buffalo reproduction but also offer potential target genes for practical breeding programs. In summary, by identifying QTLs associated with water buffalo reproductive traits, our study enhances the understanding of the reproductive mechanisms in water buffalo and provides scientific basis and technical support for future genetic breeding strategies.

Author Contributions

W.L. and Q.X. participated in writing the initial draft of the paper. H.Z., A.D., L.H., C.F., J.S., C.Y., W.L. and Q.X. contributed to the writing, review, and editing of the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This work was financially supported by grants from Joint Funds of the National Natural Science Foundation of China (U20A2051), Guangxi Natural Science Foundation (2024GXNSFAA010044), and the National Modern Agricultural Industry Technology System Guangxi Dairy Buffalo Innovation Team Project (nycytxgxcxtd-2021-21-01).

Institutional Review Board Statement

All finished work was conducted in accordance with national and international guidelines. The protocol for this study was approved by the Attitude of the Animal Care & Welfare Committee of the Guangxi Buffalo Research Institute (Approval Code: GXU2019-021).

Data Availability Statement

Study content data may be obtained by contacting the corresponding author.

Conflicts of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  1. Khoury, M.J.; Yang, Q. The future of genetic studies of complex human diseases: An epidemiologic perspective. Epidemiology 1998, 9, 350–354. [Google Scholar] [PubMed]
  2. Van den Berg, I.; Boichard, D.; Lund, M.S. Sequence variants selected from a multi-breed GWAS can improve the reliability of genomic predictions in dairy cattle. Genet. Sel. Evol. 2016, 48, 83. [Google Scholar] [PubMed]
  3. Wang, Y.; Cao, X.; Luo, C.; Sheng, Z.; Zhang, C.; Bian, C. Multiple ancestral haplotypes harboring regulatory mutations cumulatively contribute to a QTL affecting chicken growth traits. Commun. Biol. 2020, 3, 472. [Google Scholar] [CrossRef] [PubMed]
  4. Ji, J.; Zhou, L.; Huang, Y.; Zheng, M.; Liu, X.; Zhang, Y.; Huang, C.; Peng, S.; Zeng, Q.; Zhong, L.; et al. A whole-genome sequence based association study on pork eating quality traits and cooking loss in a specially designed heterogeneous F6 pig population. Meat Sci. 2018, 146, 160–167. [Google Scholar]
  5. Zhao, Y.; Hou, Y.; Xu, Y.; Luan, Y.; Zhou, H.; Qi, X.; Hu, M.; Wang, D.; Wang, Z.; Fu, Y.; et al. A compendium and comparative epigenomics analysis of cis-regulatory elements in the pig genome. Nat. Commun. 2021, 12, 2217. [Google Scholar]
  6. Shao, B.; Sun, H.; Ahmad, M.J.; Ghanem, N.; Abdel-Shafy, H.; Du, C.; Deng, T.; Mansoor, S.; Zhou, Y.; Yang, Y.; et al. Genetic features of reproductive traits in bovine and Buffalo: Lessons from bovine to Buffalo. Front. Genet. 2021, 12, 617128. [Google Scholar]
  7. International Chicken Genome Sequencing Consortium. Sequence and comparative analysis of the chicken genome provide unique perspectives on vertebrate evolution. Nature 2004, 432, 695–716. [Google Scholar] [CrossRef]
  8. Bovine Genome Sequencing and Analysis Consortium; Elsik, C.G.; Tellam, R.L.; Worley, K.C.; Gibbs, R.A.; Muzny, D.M.; Weinstock, G.M.; Adelson, D.L.; Eichler, E.E.; Elnitski, L.; et al. The genome sequence of taurine cattle: A window to ruminant biology and evolution. Science 2009, 324, 522. [Google Scholar]
  9. Groenen, M.A.; Archibald, A.L.; Uenishi, H.; Tuggle, C.K.; Takeuchi, Y.; Rothschild, M.F.; Schook, L.B. Analyses of pig genomes provide insight into porcine demography and evolution. Nature 2012, 491, 393–398. [Google Scholar]
  10. Dong, Y.; Xie, M.; Jiang, Y.U.; Xiao, N.; Du, X.; Zhang, W.; Wang, W. Sequencing and automated whole-genome optical mapping of the genome of a domestic goat (Capra hircus). Nat. Biotechnol. 2013, 31, 135–141. [Google Scholar]
  11. Huang, Y.; Li, Y.; Burt, D.W.; Chen, H.; Zhang, Y.; Qian, W.; Li, N. The duck genome and transcriptome provide insight into an avian influenza virus reservoir species. Nat. Genet. 2013, 45, 776–783. [Google Scholar] [CrossRef] [PubMed]
  12. Lu, L.; Chen, Y.; Wang, Z.; Li, X.; Chen, W.; Tao, Z.; Wang, J. The goose genome sequence leads to insights into the evolution of waterfowl and susceptibility to fatty liver. Genome Biol. 2015, 16, 89. [Google Scholar] [CrossRef] [PubMed]
  13. Haque, M.A.; Lee, Y.M.; Ha, J.J.; Jin, S.; Park, B.; Kim, N.Y.; Kim, J.J. Genome-wide association study identifies genomic regions associated with key reproductive traits in Korean Hanwoo cows. BMC Genom. 2024, 25, 496. [Google Scholar] [CrossRef]
  14. de Fátima Bretanha Rocha, R.; Garcia, A.O.; Otto, P.I.; Santos, M.G.D.; da Silva, M.V.B.; Martins, M.F.; Machado, M.A.; Panetto, J.C.D.C.; Guimarães, S.E.F. Single-step genome-wide association studies and post-GWAS analyses for the number of oocytes and embryos in Gir cattle. Mamm. Genome 2023, 34, 497–508. [Google Scholar] [CrossRef]
  15. Panigrahi, M.; Rajawat, D.; Nayak, S.S.; Jain, K.; Vaidhya, A.; Prakash, R.; Dutt, T. Genomic insights into key genes and QTLs involved in cattle reproduction. Gene 2024, 917, 148465. [Google Scholar] [CrossRef]
  16. Yang, J.; Lee, S.H.; Goddard, M.E.; Visscher, P.M. GCTA: A tool for genome-wide complex trait analysis. Am. J. Hum. Genet. 2011, 88, 76–82. [Google Scholar] [CrossRef]
  17. Kumar, S.; Stecher, G.; Li, M.; Knyaz, C.; Tamura, K. MEGA X: Molecular evolutionary genetics analysis across computing platforms. Mol. Biol. Evol. 2018, 35, 1547–1549. [Google Scholar] [CrossRef]
  18. Bradbury, P.J.; Zhang, Z.; Kroon, D.E.; Casstevens, T.M.; Ramdoss, Y.; Buckler, E.S. TASSEL: Software for association mapping of complex traits in diverse samples. Bioinformatics 2007, 23, 2633–2635. [Google Scholar] [CrossRef]
  19. Jiménez, J.M.; Morales, R.M.; Menéndez-Buxadera, A.; Demyda-Peyrás, S.; Laseca, N.; Molina, A. Estimation of the genetic components of (Co) variance and preliminary genome-wide association study for reproductive efficiency in Retinta beef cattle. Animals 2023, 13, 501. [Google Scholar] [CrossRef]
  20. Li, W.; Zeng, X.; Wang, L.; Yin, L.; Wang, Q.; Yang, H. Comparative Analysis of Gut Microbiota Diversity Across Different Digestive Tract Sites in Ningxiang Pigs. Animals 2025, 15, 936. [Google Scholar] [CrossRef]
  21. Wu, H.; Wang, C.; Gregory, K.J.; Han, G.W.; Cho, H.P.; Xia, Y.; Stevens, R.C. Structure of a class C GPCR metabotropic glutamate receptor 1 bound to an allosteric modulator. Science 2014, 344, 58–64. [Google Scholar] [CrossRef] [PubMed]
  22. Tourchi, J.J.; Alijani, S.; Rafat, S.A.; Abbasi, M.A. Bayesian-genome-wide association study and post-GWAS on reproductive traits of Holstein dairy cattle. J. Dairy Res. 2024, 91, 146–155. [Google Scholar]
  23. Mariadassou, M.; Ramayo-Caldas, Y.; Charles, M.; Féménia, M.; Renand, G.; Rocha, D. Detection of selection signatures in Limousin cattle using whole-genome resequencing. Anim. Genet. 2020, 51, 815–819. [Google Scholar] [PubMed]
  24. Wu, H.Y.; Rong, Y.; Correia, K.; Min, J.; Morgan, J.I. Comparison of the enzymatic and functional properties of three cytosolic carboxypeptidase family members. J. Biol. Chem. 2015, 290, 1222–1232. [Google Scholar]
  25. Brzáková, M.; Čítek, J.; Svitáková, A.; Veselá, Z.; Vostrý, L. Genetic Parameters for Age at First Calving and First Calving Interval of Beef Cattle. Animals 2020, 10, 2122. [Google Scholar] [CrossRef]
  26. George, L.; Alex, R.; Sukhija, N.; Jaglan, K.; Vohra, V.; Kumar, R.; Verma, A. Genetic improvement of economic traits in Murrah buffalo using significant SNPs from genome-wide association study. Trop. Anim. Health Prod. 2023, 55, 199. [Google Scholar] [CrossRef]
  27. Easa, A.A.; El-Aziz, A.H.A.; El Barbary, A.S.A.; Kostomakhin, N.M.; Nasr, M.A.F.; Imbabi, T.A. Genetic parameters of production and reproduction traits of Egyptian buffaloes under subtropical conditions. Trop. Anim. Health Prod. 2022, 54, 270. [Google Scholar]
  28. Setiaji, A.; Oikawa, T. Genetic parameters of reproductive traits from artificial insemination records of Japanese Black cows. Livest. Sci. 2019, 229, 85–89. [Google Scholar]
  29. Castro-Montoya, J.M.; González, F.L.; Mendoza, M.V.; Harper, K.; Corea, E.E. Interrelationship between diseases and calving season and their impact on reproductive parameters and milk production of tropical dairy cows. Trop. Anim. Health Prod. 2022, 54, 158. [Google Scholar] [CrossRef]
  30. Joaquim, L.A.; Changule, A.P.; da Glória Taela, M.; Novela, M.; Pinto, S.C.; Bila, C.G. The generation interval and season of birth do not affect age at first calving, birth weight and calving interval of Mozambican Angoni cattle. Trop. Anim. Health Prod. 2024, 56, 177. [Google Scholar]
  31. Hatamian, A.; Levine, L.; Oskouie, H.E.; Sarrafzadeh, M. Exploring a Datasets Statistical Effect Size Impact on Model Performance, and Data Sample-Size Sufficiency. arXiv 2025, arXiv:2501.02673. [Google Scholar]
  32. Sasaki, Y.; Kaneda, S.; Minamino, T.; Honkawa, K.; Horii, Y. Effects of parity, calving season, and previous lactation’s milk yield on fertility of dairy cows on a commercial farm. Anim. Sci. J. 2024, 95, e13998. [Google Scholar] [PubMed]
  33. Pushpa Magotra, A.; Sindhu, V.; Chaudhary, P. Evaluation of parity and its interaction effect with season of calving, period of calving on production and reproduction traits in Hardhenu (Bos taurus × Bos indicus) dairy cattle. Reprod. Domest. Anim. 2024, 59, e14537. [Google Scholar]
Figure 1. Principal component analysis (PCA) results based on filtered SNP markers. The PCA analysis was conducted using filtered SNP markers to obtain the variance explained by each principal component (PC) and the score matrix of samples on each PC. The population structure is visualized through pairwise scatter plots (AC) and a 3D plot (D) of the top three principal components. The colored circles represent four populations: D, M, N, and Z, each comprising 1 local buffalo, 42 Murrah buffalo, 31 Nili-Ravi buffalo, and 46 hybrid buffalo. The axes of the plots represent the scores of each PC, with the percentage values in parentheses indicating the variance explained by each PC, representing the proportion of total variance contributed by that PC in relation to all PCs. Orange: Represents group D. Green: Represents group M. Blue: Represents group N. Purple: Represents group Z.
Figure 1. Principal component analysis (PCA) results based on filtered SNP markers. The PCA analysis was conducted using filtered SNP markers to obtain the variance explained by each principal component (PC) and the score matrix of samples on each PC. The population structure is visualized through pairwise scatter plots (AC) and a 3D plot (D) of the top three principal components. The colored circles represent four populations: D, M, N, and Z, each comprising 1 local buffalo, 42 Murrah buffalo, 31 Nili-Ravi buffalo, and 46 hybrid buffalo. The axes of the plots represent the scores of each PC, with the percentage values in parentheses indicating the variance explained by each PC, representing the proportion of total variance contributed by that PC in relation to all PCs. Orange: Represents group D. Green: Represents group M. Blue: Represents group N. Purple: Represents group Z.
Genes 16 00422 g001
Figure 2. Heatmaps and neighbor-joining phylogenetic tree (NJ) of kinship among samples. Using filtered markers, we conducted kinship analysis with the GCTA software (v1.92.2) to obtain the kinship matrix for pairwise relationships among samples. A heatmap was generated based on this kinship matrix, where the color gradient from blue to red signifies increasing proximity in kinship relationships. Furthermore, a neighbor-joining phylogenetic tree was constructed using MEGA-X software (model: p-distance; bootstrap: 1000 replicates). The cattle populations D, M, N, and Z represent local buffalo, Murrah buffalo, Nili-Ravi buffalo, and hybrid buffalo, respectively.
Figure 2. Heatmaps and neighbor-joining phylogenetic tree (NJ) of kinship among samples. Using filtered markers, we conducted kinship analysis with the GCTA software (v1.92.2) to obtain the kinship matrix for pairwise relationships among samples. A heatmap was generated based on this kinship matrix, where the color gradient from blue to red signifies increasing proximity in kinship relationships. Furthermore, a neighbor-joining phylogenetic tree was constructed using MEGA-X software (model: p-distance; bootstrap: 1000 replicates). The cattle populations D, M, N, and Z represent local buffalo, Murrah buffalo, Nili-Ravi buffalo, and hybrid buffalo, respectively.
Genes 16 00422 g002
Figure 3. Manhattan plots illustrate GWAS results for reproductive traits. The x-axis represents chromosomal positions, while the y-axis displays -log10 p-values indicating allele-phenotype associations. The blue dashed line signifies the suggestive significance cutoff, with green spots denoting SNPs reaching suggestive significance. The red line corresponds to the Bonferroni thresholds (p = 0.05), and red spots indicate SNPs with significance. Traits, arranged from inner to outer lanes, include calving interval (CI), calf birth weight (CBW), dam birth weight (BW), and age at first calving (FCA). Green dots represent SNPs meeting the suggested threshold. Red dots represent SNPs meeting the significance threshold. The color scale in the top right indicates SNP density on the chromosome, with green showing low density and red showing high density.
Figure 3. Manhattan plots illustrate GWAS results for reproductive traits. The x-axis represents chromosomal positions, while the y-axis displays -log10 p-values indicating allele-phenotype associations. The blue dashed line signifies the suggestive significance cutoff, with green spots denoting SNPs reaching suggestive significance. The red line corresponds to the Bonferroni thresholds (p = 0.05), and red spots indicate SNPs with significance. Traits, arranged from inner to outer lanes, include calving interval (CI), calf birth weight (CBW), dam birth weight (BW), and age at first calving (FCA). Green dots represent SNPs meeting the suggested threshold. Red dots represent SNPs meeting the significance threshold. The color scale in the top right indicates SNP density on the chromosome, with green showing low density and red showing high density.
Genes 16 00422 g003
Figure 4. Quantile–Quantile (QQ) Plots. The QQ plots illustrate the negative logarithms of expected p-values on the x-axis and observed p-values on the y-axis (right panel). (A) calving interval (CI), (B) calf birth weight (CBW), (C) dam birth weight (BW), and (D) age at first calving (FCA).
Figure 4. Quantile–Quantile (QQ) Plots. The QQ plots illustrate the negative logarithms of expected p-values on the x-axis and observed p-values on the y-axis (right panel). (A) calving interval (CI), (B) calf birth weight (CBW), (C) dam birth weight (BW), and (D) age at first calving (FCA).
Genes 16 00422 g004
Figure 5. Plot of significant genes on chromosomes. The gene density heatmap visually represents the distribution of genes, ranging from sparse (blue) to dense (red). Red gene symbols indicate the presence of SNPs within the genes, while others are located within 50 kb upstream and downstream of the SNP. In total, 58 genes associated with SNPs were identified on 15 chromosomes. These genes are the candidate genes identified in the analysis.
Figure 5. Plot of significant genes on chromosomes. The gene density heatmap visually represents the distribution of genes, ranging from sparse (blue) to dense (red). Red gene symbols indicate the presence of SNPs within the genes, while others are located within 50 kb upstream and downstream of the SNP. In total, 58 genes associated with SNPs were identified on 15 chromosomes. These genes are the candidate genes identified in the analysis.
Genes 16 00422 g005
Figure 6. Bubble plots for KEGG enrichment analysis. The y-axis denotes the entry name, while the x-axis’s rich factor signifies the ratio of listed genes within the entry to the total genes constituting it. The size of each filled circle corresponds to the number of candidate genes in the entry, and the color indicates the saliency of the entry.
Figure 6. Bubble plots for KEGG enrichment analysis. The y-axis denotes the entry name, while the x-axis’s rich factor signifies the ratio of listed genes within the entry to the total genes constituting it. The size of each filled circle corresponds to the number of candidate genes in the entry, and the color indicates the saliency of the entry.
Genes 16 00422 g006
Table 1. Statistical description of study traits *.
Table 1. Statistical description of study traits *.
TraitsRecordsMeanSDMinMax
CI51581.68 (Days)194.21 4141336
CBW10635.02 (Months)4.89 18.546
BW11535.47 (Months)5.16 2046
FCA10544.41 (Months)15.50 5.5120.5
* CI, calving interval; CBW, calf birth weight; BW, dam birth weight; FCA, age at first calving. Calving interval (CI) is measured in days. Calf birth weight (CBW), dam birth weight (BW), and age at first calving (FCA) are measured in months.
Table 2. Genome-wide significant SNPs associated with the trait of age at first calving *.
Table 2. Genome-wide significant SNPs associated with the trait of age at first calving *.
SNPChrPosition(bp)Candidate Genep-ValuePVE
1116038140TRNAC-GCA1.06 × 10−60.51
2294758634GALNT131.54 × 10−60.50
32116842309NCKAP53.08 × 10−60.37
42116842310NCKAP53.08 × 10−60.37
52116842325NCKAP53.08 × 10−60.37
62173941889LOC102412580,KIAA1522,
SYNC,LOC112582948,
RBBP4
2.62 × 10−60.36
74164836341LOC1125844238.04 × 10−80.53
8549042788LOC102401328,KIF26B1.37 × 10−60.36
9684593264.8.09 × 10−70.55
10684593519.4.45 × 10−70.54
11696791388LOC102392171,AGBL4,9.2 × 10−80.51
12696811758LOC1023921714.69 × 10−60.33
12696811758AGBL44.69 × 10−60.33
13728031213.5.72 × 10−70.45
14759963243TBC1D18.62 × 10−70.42
157103706036.2.29 × 10−70.60
16817664369.2.95 × 10−60.59
178112491377ZNF746,ZNF7771.58 × 10−60.32
188113607118LOC112586634,ASIC3,
LOC102416003,FASTK,
TMUB1,ABCB8,
CDK5,SLC4A2,
AGAP3
4.77 × 10−60.31
1994404838LOC1024081341.51 × 10−70.51
2094404850LOC1024081341.21 × 10−70.51
2194404918LOC1024081343.6 × 10−60.38
2294404920LOC1024081344.29 × 10−60.37
23109287335LOC102407001,GTF2H5,
SERAC1
2.86 × 10−60.31
241013862680LOC1024058181.76 × 10−60.42
251016312432.6.3 × 10−70.37
261020660384GRM14 × 10−80.50
271045617271.4.97 × 10−60.35
281045617310.4.97 × 10−60.35
291132169042.3.18 × 10−60.34
301157697044PCLAF,LOC102410100,
TRIP4,CSNK1G1
4.69 × 10−60.33
31123176008TMEM1312.72 × 10−70.54
321231988278LOC112578245,NRXN11.92 × 10−60.50
331232801224LOC102414114,LOC102414446,
NRXN1
7.54 × 10−80.54
341234019714.4.38 × 10−60.38
351239801133.9.78 × 10−70.61
361250669172.5.07 × 10−80.43
371281013839LOC1125781141.11 × 10−70.49
381281118534.1.49 × 10−60.35
391367984517COG63.56 × 10−60.42
401386238940.3.38 × 10−70.54
411388603805LOC1023926322.9 × 10−60.40
421828221520.4.57 × 10−60.37
431828221526.4.57 × 10−60.37
441828221527.4.57 × 10−60.37
451839920231HYDIN1.22 × 10−60.44
461858127554LOC102396098,LOC102395776,
LOC102400904,LOC102395456,
PPP2R1A
1.86 × 10−70.52
471858127556LOC102396098,LOC102395776,
LOC102400904,LOC102395456,
PPP2R1A
1.86 × 10−70.52
481858127575LOC102396098,LOC102395776,
LOC102400904,LOC102395456,
PPP2R1A
1.78 × 10−70.52
491858127583LOC102396098,LOC102395776,
LOC102400904,LOC102395456,
PPP2R1A
1.87 × 10−70.52
501859367852LOC112580441,LOC102410721,
LOC102411053,LOC102411606
2.08 × 10−60.41
512033712610LOC1125809131.63 × 10−60.43
522036725181LINGO1,B4GALT62.05 × 10−60.32
* PVE, percentage of variance explained.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Li, W.; Xie, Q.; Zheng, H.; Duan, A.; Huang, L.; Feng, C.; Shang, J.; Yang, C. Genome-Wide Association Study Identifies Potential Regulatory Loci and Pathways Related to Buffalo Reproductive Traits. Genes 2025, 16, 422. https://doi.org/10.3390/genes16040422

AMA Style

Li W, Xie Q, Zheng H, Duan A, Huang L, Feng C, Shang J, Yang C. Genome-Wide Association Study Identifies Potential Regulatory Loci and Pathways Related to Buffalo Reproductive Traits. Genes. 2025; 16(4):422. https://doi.org/10.3390/genes16040422

Chicago/Turabian Style

Li, Wangchang, Qiyang Xie, Haiying Zheng, Anqin Duan, Liqing Huang, Chao Feng, Jianghua Shang, and Chunyan Yang. 2025. "Genome-Wide Association Study Identifies Potential Regulatory Loci and Pathways Related to Buffalo Reproductive Traits" Genes 16, no. 4: 422. https://doi.org/10.3390/genes16040422

APA Style

Li, W., Xie, Q., Zheng, H., Duan, A., Huang, L., Feng, C., Shang, J., & Yang, C. (2025). Genome-Wide Association Study Identifies Potential Regulatory Loci and Pathways Related to Buffalo Reproductive Traits. Genes, 16(4), 422. https://doi.org/10.3390/genes16040422

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop