Next Article in Journal
Fibroblast Growth Factor-7 and Hair Biology: Bridging Basic Science and Therapeutic Applications
Previous Article in Journal
SGPP2 Ameliorates Chronic Heart Failure by Attenuating ERS via the SIRT1/AMPK Pathway
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Genome-Wide Insights into Intermittent Milking Behavior of Pandharpuri Buffalo

1
Division of Animal Genetics and Breeding, ICAR-National Dairy Research Institute, Karnal 132001, Haryana, India
2
Animal Genetics and Breeding, ICAR-Central Institute for Research on Buffalo, Hisar 125001, Haryana, India
3
ICAR-National Bureau of Animal Genetic Resources, Karnal 132001, Haryana, India
*
Author to whom correspondence should be addressed.
Curr. Issues Mol. Biol. 2026, 48(1), 101; https://doi.org/10.3390/cimb48010101
Submission received: 27 November 2025 / Revised: 31 December 2025 / Accepted: 1 January 2026 / Published: 19 January 2026
(This article belongs to the Section Biochemistry, Molecular and Cellular Biology)

Abstract

Buffaloes (Bubalus bubalis) are central to the dairy and agricultural economy, contributing high-quality milk, meat, draft power, and manure. Rich milk composition, the ability to utilize low-quality roughage, and strong disease resistance make buffaloes indispensable across diverse production systems. Among India’s major dairy breeds—Murrah, Nili-Ravi, Jaffarabadi, Surti, Bhadawari, Mehsana, and Nagpuri, none exhibit the distinctive trait of intermittent milking, which is uniquely observed in the Pandharpuri buffalo, a registered indigenous breed of Maharashtra. Despite coexisting with dominant dairy breeds such as Murrah, Pandharpuri buffalo is considered to possess primitive riverine ancestry and may represent one of the ancestral lineages from which several Indian breeds evolved. Its evolutionary relevance and unique intermittent milking capacity underscore the need to understand its genomic architecture. To address this, we applied whole-genome resequencing and the De-Correlated Composite of Multiple Signals (DCMS) approach to identify within-breed selection signatures. Our analyses identified 1337 candidate genes, including several linked to milk production, particularly those relevant to the physiological capacity for intermittent milking. Notable genes included ERBB4, ESR1, SYK, INSR, PTPN11, VAV3, MAPK3, and PRKG1. These signatures provide insights into genomic regions and biological pathways that may be involved in lactation-related processes relevant to intermittent milking. The identified genomic regions offer promising targets for functional validation and future genome-informed breeding strategies aimed at conserving this unique indigenous germplasm while improving lactation efficiency and resilience.

1. Introduction

The advent of Next-Generation Sequencing (NGS) technologies, particularly whole-genome sequencing (WGS), has transformed livestock genetics. With hundreds of thousands of genomes now sequenced worldwide, WGS has become the predominant tool for genomic analysis [1]. By enabling the identification of genome-wide variation, WGS has unraveled the genetic architecture of complex traits and facilitated the discovery of candidate genes and molecular markers for livestock improvement [2,3]. Beyond these applications, WGS data can be exploited to detect genomic footprints shaped by evolutionary forces such as genetic drift and selection, with genome-wide scans enabling the identification of regions influenced by both natural and artificial pressures [4,5,6]. Positive selection drives the increase in frequency of advantageous alleles, leaving distinct genomic signatures that deviate from neutral expectations [7,8,9,10]. Detecting these signatures helps pinpoint genomic regions that have undergone past or ongoing selection, many of which harbor loci linked to economically important and adaptive traits. Such candidate regions provide valuable targets that can be further explored and incorporated into livestock genetic improvement strategies [11].
In livestock species, genome-wide selection signature analyses have revealed candidate genes for production traits, disease resistance, and environmental adaptation, while also uncovering breed-specific adaptive mechanisms, particularly in indigenous and locally adapted populations [12,13,14,15,16,17,18]. Beyond trait-focused applications, such analyses enhance genetic characterization of livestock populations, support the conservation of native germplasm, and provide a framework for understanding the evolutionary forces shaping phenotypic diversity [6]. Several statistical approaches have been developed to detect selection signatures, including frequency spectrum–based tests such as Tajima’s D [19], nucleotide diversity (π) [20], Fay and Wu’s H statistic [21], and the composite likelihood ratio test (CLR) [22], as well as haplotype-based statistics such as integrated Haplotype Score (iHS) [23], and number of Segregating Sites by Length (nSL) [24]. While informative, individual statistics capture only specific aspects of the selection process, and reliance on a single method may reduce sensitivity to weak or complex selection signals, resulting in inconsistent detection across the genome [25]. To overcome these limitations, composite approaches integrating multiple complementary statistics have been developed. Among these, the De-Correlated Composite of Multiple Signals (DCMS) framework accounts for correlations among individual tests, thereby improving detection power and positional accuracy [26]. The effectiveness of DCMS has been demonstrated in several livestock studies, where it successfully identified biologically meaningful selection signals associated with adaptation and economically important traits in livestock species [27,28,29,30,31,32,33]. Such integrative approaches are particularly valuable for small, localized, and indigenous breeds that harbor unique functional variation and distinct genomic diversity [34].
The Pandharpuri buffalo, a registered breed native to Maharashtra, India, represents a distinctive and valuable component of Indian germplasm (Figure 1). Characterized by its sword-shaped horns and a remarkable ability to tolerate intermittent milking, this breed plays an important role in rural dairy systems and retains traits of biological and evolutionary importance. In traditional practice, farmers often carry the animal from house to house, supplying milk directly to customers; the buffalo is milked multiple times throughout the route according to demand, and despite these irregular and repeated milking intervals, it continues to produce milk efficiently. This unique behavioural adaptation is well-documented in the breed descriptor and forms a defining feature of the Pandharpuri buffalo. This milking adaptability may be supported by physiological mechanisms unique to buffaloes. One such example is Doka, a phenomenon involving transient teat engorgement during estrus, commonly used by farmers as an indicator of reproductive status [34]. This engorgement results from PGF2α-induced oxytocin release even in the absence of external stimulation [35], reflecting a tightly coordinated reproductive–mammary hormonal axis. While not directly related to milking behaviour, this heightened endocrine responsiveness suggests a mammary system capable of rapid physiological adjustment.
From an evolutionary perspective, these physiological features align with the domestication history of river buffaloes in the Indian subcontinent. Historical and archaeological evidence indicates that regions including the Mehsana, Surti, and Pandharpuri tracts were important centers for domestication and diversification of riverine buffaloes [36]. Accordingly, the Pandharpuri breed likely represents a primitive population that contributed to the genetic foundation of other Indian breeds. Despite its evolutionary relevance and distinctive milking phenotype, the genomic basis underlying intermittent milking in Pandharpuri buffalo remains largely unexplored. Considering its intermittent milking pattern, evolutionary significance, localized adaptation, and genetic distinctiveness, the present study applied whole-genome resequencing and the DCMS framework to detect within-breed selection signatures in Pandharpuri buffalo. To the best of our knowledge, this is the first investigation to employ a DCMS-based genome-wide selection scan in this breed. By integrating multiple complementary selection statistics, this approach enables the identification of genomic regions and candidate genes potentially involved in adaptive processes and breed-specific characteristics, including traits related to intermittent milking. These findings provide insights into the genetic architecture of this unique germplasm and offer a foundation for future functional validation, conservation strategies, and genome-informed breeding programs.

2. Materials and Methods

2.1. Whole-Genome Resequencing and Variant Detection

The dataset comprised 15 Pandharpuri buffaloes, including four newly sequenced individuals and eleven publicly available whole-genome datasets reported by Dutta et al. (2020) [37]. The newly sequenced animals were four adult buffaloes sampled from four different herds belonging to distinct owners across villages within the Solapur district of Maharashtra, India, which lies centrally within the recognized Pandharpuri breeding tract encompassing Kolhapur, Solapur, and Sangli districts. Animals were selected based on established breed descriptors, and farmer-reported pedigree and ownership records were used to minimize the inclusion of closely related individuals. In addition, potential cryptic relatedness was assessed using genome-wide identity-by-descent (IBD) estimates, and no evidence of close relationships was detected among the sampled individuals. The eleven previously published genomes were derived from Pandharpuri buffaloes sampled within the recognized native breeding tract of Maharashtra, as reported in the original study [37]. Although fine-scale village-level metadata were not available for these publicly archived genomes, they represent the same indigenous population.
Genomic DNA was extracted using the phenol–chloroform protocol [38], and quality-checked prior to sequencing on the Illumina HiSeq platform (150 bp paired-end; ~44.5× coverage). Raw reads were subjected to quality control (FastQC; [39]) and adapter/quality trimming (Fastp; [40]). High-quality reads were aligned to the Murrah reference genome (GCF_019923935.1) using BWA-MEM [41]), with indexing [42] and dictionary creation performed via PicardTools. Post-alignment processing included sorting, duplicate removal, and read group assignment. Single Nucleotide Polymorphisms (SNPs) were identified with GATK HaplotypeCaller [43] followed by joint genotyping across all samples. These SNPs were used for downstream analyses.

2.2. High-Confidence Variant Selection and Phasing

Variant filtering retained only high-confidence, biallelic SNPs. Filtering was implemented with BCFtools v1.10.2 (minimum read depth ≥ 10; maximum depth 500; QUAL ≥ 30), and additional hard filters applied in GATK VariantFiltration v4.6.0.0 excluded SNPs with QD < 2.0, FS > 60.0, SOR > 4.0, MQ < 40.0, MQRankSum < –12.5, or ReadPosRankSum < –8.0. For downstream selection signature analyses, only autosomal SNPs were retained. Additional filtering was conducted in PLINK v1.9 [44] using thresholds of sample call rate > 0.9, SNP call rate > 0.9, and Minor Allele Frequency (MAF) > 0.05. For haplotype-based methods, iHS and nSL, the chromosome-wise SNP datasets were phased with SHAPEIT v2.17 [45].

2.3. Computation of Intra-Population Selection Statistics

To identify genomic regions under selection in Pandharpuri buffalo, we applied the DCMS framework, which integrates multiple statistics into a single composite score. Four intra-population statistics—Tajima’s D [19], nucleotide diversity (π) [20], iHS [23], and nSL [24] were used. Tajima’s D and π were estimated in non-overlapping 50 kb windows using VCFtoolsv0.1.16 [46]. VCFtools was selected because it provides efficient and validated implementations of site frequency spectrum–based summary statistics directly from Variant Call Format (VCF) files, the standard output of whole-genome resequencing pipelines. Its ability to handle large-scale datasets, support window-based analyses, and its widespread adoption in livestock genomic diversity and selection signature studies make it a robust and appropriate tool for population genomic analyses [15,28,31].
The iHS and nSL statistics were computed from phased SNP data using selscan v1.3.0 with the—ihs and—nsl commands, respectively and normalized per chromosome using the—norm option in selscan [47]. Although Tajima’s D and nucleotide diversity (π) were estimated using non-overlapping 50 kb windows, their values were assigned to all SNPs falling within the corresponding window to enable SNP-level integration within the DCMS framework. In contrast, iHS and nSL were directly computed at the SNP level from phased haplotypes. This approach ensured that all four selection statistics were represented at the SNP level for DCMS calculation, consistent with previous DCMS-based selection signature studies.

2.4. Composite Selection Signature Analysis Using DCMS

To detect genomic regions under positive selection, we applied the DCMS approach [26] using four statistics: iHS, nSL, Tajima’s D, and nucleotide diversity (π). A SNP-level input file was processed in R v4.1.1 with the MINOTAUR, rrcovNA, MASS, data.table, and qvalue packages. Genome-wide p-values for each statistic were derived using stat_to_pvalue () (MINOTAUR) with one-tailed tests (right-tailed for iHS and nSL; left-tailed for Tajima’s D and π). A robust correlation matrix among statistics was then estimated using CovNAMcd() (rrcovNA) with the Minimum Covariance Determinant method (α = 0.75, nsamp = 50,000). This matrix was supplied to the DCMS () function (MINOTAUR) to calculate composite DCMS values for each SNP. The DCMS statistics is computed at any given loci l as follows:
D C M S l =   t = 1 n l o g ( 1 p l t p l t ) i = 1 n r i t
where, plt represents the p-value of the tth statistic at the lth genomic position, while rit corresponds to the weight assigned to each statistic, derived from the genome-wide correlation coefficients among pairs of univariate tests. Composite DCMS values for each SNP were fitted to a normal distribution using the rlm () function (MASS package, v7.3-65), and the estimated mean (μ) and standard deviation (σ) were used to derive right-tailed p-values with the pnorm () function.
To identify genomic regions under selection, DCMS scores were converted into SNP-level p-values by fitting a normal distribution using a robust linear model, followed by one-sided (right-tailed) testing. Multiple testing correction was applied using the False Discovery Rate (FDR) procedure implemented in the R using qvalue package. SNPs with q-values < 0.05 were considered statistically significant, resulting in 26,814 significant SNPs across the autosomal genome. Significant SNPs were then consolidated into putative selection sweep regions using a 50 kb genomic binning strategy. Specifically, consecutive significant SNPs (q < 0.05) located within the same or adjacent 50 kb intervals were grouped to define core candidate regions. To account for local linkage disequilibrium (LD) and prevent fragmentation of biologically linked signals, each core region was extended bi-directionally until flanked by SNPs with q-values > 0.10, methodological strategy broadly consistent with approaches adopted in previous DCMS-based and genome-wide selection signature studies [6,15,28]. Additionally, adjacent extended regions were merged if the intervening distance was ≤50 kb, ensuring continuous sweep regions reflective of LD blocks. This multi-tiered approach led to the identification of 1405 candidate selection sweep regions across the genome.
DCMS-derived p- and q-values were used to rank SNPs genome-wide, and the resulting 1405 selection sweep regions were converted into BED format and intersected with the Bubalus bubalis reference annotation (GTF) using BEDTools [48] to identify candidate genes within selected regions.

2.5. Functional Annotation and Hub Gene Identification

Candidate genes detected within selection sweep regions were subjected to functional enrichment analysis using DAVID v6.8 [49] to provide insights into their biological roles and overrepresented KEGG pathways; enrichment results are reported using nominal p-values from Fisher’s exact test. The background gene set used for DAVID enrichment consisted of annotated Bubalus bubalis genes. Gene–gene interaction networks were then generated through the STRING database [50], and the resulting networks were imported into Cytoscape v3.9.0 [51]. The CytoHubba plugin [52] was employed to prioritize and identify hub genes within these networks.

3. Results

Whole-genome resequencing of 15 Pandharpuri buffaloes generated a total of 13.89 billion raw reads, of which 9.11 billion high-quality reads passed quality control and 9.01 billion were successfully aligned to the Murrah reference genome, achieving an average mapping rate of ~98.9%. Across the 15 Pandharpuri buffalo genomes analyzed, sequencing coverage varied according to data source. Among the publicly available genomes, four individuals were sequenced at an average coverage of ~8×, while the remaining eleven genomes (including all newly sequenced individuals) exhibited higher coverage ranging from ~28× to ~45× (Supplementary Table S1). Overall, the dataset was dominated by high coverage genomes, ensuring reliable variant detection for within-breed selection signature analyses. The mean mapping quality across samples was 56.68, confirming the reliability of alignments. Initial variant calling identified 27.0 million SNPs. Following rigorous quality control, including depth-based filtering, hard filtering with GATK, and subsequent filtering in PLINK based on sample call rate greater than 0.9, SNP call rate greater than 0.9, and MAF greater than 0.05, a final dataset of 20,492,591 high-confidence biallelic SNPs was retained for downstream selection signature analysis.

3.1. Genome-Wide Detection of Putative Selection Signatures Using DCMS

A genome-wide scan employing the DCMS method identified 26,814 SNPs exhibiting significant empirical q-values (q < 0.05), indicative of putative selection signatures in the Pandharpuri buffalo population (Figure 2). Integration of DCMS statistics delineated 1405 discrete genomic regions under selection, with a mean length of 25.63 kb (approximately 0.0256 Mb) and a cumulative span of 36.01 Mb across the autosomes. The Manhattan plot depicting the chromosomal distribution of these selection signals is illustrated in Figure 3, revealing heterogeneous clustering with pronounced peaks on multiple chromosomes. Examination of the genome-wide distribution of DCMS scores indicated appropriate calibration, with no evidence of pronounced inflation or deflation, supporting the robustness of the DCMS-based selection signals. The strongest selection signal was detected on chromosome 8 (CHR 8: 93.75–93.95 Mb), encompassing 131 annotated genes distributed across adjacent intervals. Additional prominent regions included chromosome 12 (66.04–66.10 Mb), chromosome 20 (64.40–64.45 Mb), and chromosome 16 (8.73–8.99 Mb), as detailed in Table 1.
The 1405 putative selective sweep regions harboured 1337 unique genes, derived from the within-population DCMS distribution. The full list of 1405 DCMS-based selection sweep regions is provided in Supplementary Table S2. The top 20 genomic regions showing the strongest selection signals in Pandharpuri buffalo are summarized in Table 1. Notable candidate genes encompassed PTPN11 (protein tyrosine phosphatase non-receptor type 11), INSR (insulin receptor), SYK (spleen-associated tyrosine kinase), ERBB4 (erb-b2 receptor tyrosine kinase 4), ESR1 (estrogen receptor 1), VAV3 (vav guanine nucleotide exchange factor 3), MAPK3 (mitogen-activated protein kinase 3), PRKG1 (cGMP-dependent protein kinase).

3.2. Biological Processes and Pathways Enriched Among Candidate Genes

DAVID-based functional enrichment analysis of the 1337 candidate genes yielded 58 enriched terms (Fisher’s exact test, p < 0.05), comprising 17 biological processes (BP), 7 cellular components (CC), 16 molecular functions (MF), and 18 Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways (Supplementary Table S3). Prominent GO terms included intracellular signal transduction (GO:0035556; 23 genes, fold enrichment = 2.00), calcium ion transmembrane transport (GO:0070588; 7 genes, fold enrichment = 3.50), endoplasmic reticulum tubular network organization (GO:0071786; 3 genes, fold enrichment = 9.22), cell surface (GO:0009986; 12 genes, fold enrichment = 2.10), calcium ion binding (GO:0005509; 46 genes, fold enrichment = 1.67), and ATP binding (GO:0005524; 78 genes, fold enrichment = 1.41), T cell receptor signaling pathway (GO:0050852, 5 genes, 3.84). These annotations underscore cellular mechanisms pivotal for metabolic homeostasis and stress resilience.
KEGG pathway enrichment highlighted significant overrepresentation in the Wnt signaling pathway (bbub04310; 20 genes, fold enrichment = 2.47), cAMP signaling pathway (bbub04024; 22 genes, fold enrichment = 1.91), insulin secretion (bbub04911; 10 genes, fold enrichment = 2.49), GnRH signaling pathway (bbub04912; 10 genes, fold enrichment = 2.25), and aldosterone synthesis and secretion (bbub04925; 10 genes, fold enrichment = 2.16). Collectively, these pathways implicate the identified regions in regulatory cascades governing milk production (e.g., via insulin and cAMP-mediated glucose uptake) and adaptation (e.g., via aldosterone for electrolyte balance under heat stress), mirroring observations in temperate and tropical dairy breeds [15,28].

3.3. Protein-Protein Interaction Network Analysis and Identification of Key Hub Genes

To elucidate functional interconnectivity among the candidate proteins, a protein-protein interaction (PPI) network was constructed using the STRING database (v12.0) at a high-confidence threshold (interaction score > 0.70). Of the 1337 input genes, 856 were successfully mapped, forming a network with 856 nodes and 360 edges. The average node degree was 0.84, with the observed interactions exceeding random expectations (PPI enrichment p-value = 1.61 × 10−10), suggesting biologically cohesive modules. Hub genes were prioritized using the Maximum Clique Centrality (MCC) algorithm implemented in CytoHubba (Cytoscape v3.10.2), revealing key nodes potentially driving network topology and trait associations (Figure 4). The PPI analysis was used primarily to support and prioritize candidate genes consistently identified across DCMS-based selection scans and functional enrichment analyses, rather than as an independent source of biological inference. The top 15 hub genes identified from the STRING protein–protein interaction network, ranked using the Maximum Clique Centrality (MCC) method, are provided in Supplementary Table S4.

4. Discussion

Understanding the genetic mechanisms underlying breed-specific adaptation requires moving beyond phenotypic observation to unravel the genomic architecture shaped by natural and artificial selection. Livestock breeds maintained under diverse ecological and management pressures, such as thermal stress, feed scarcity, or irregular milking, accumulate unique combinations of alleles that confer survival and productivity advantages. Uncovering such adaptive signatures necessitates a genome-wide approach capable of detecting both common and rare variants across the entire spectrum of genetic diversity. WGS provides the most comprehensive platform for variant discovery, enabling the identification of multiple forms of genetic variation, including SNPs, insertions and deletions (InDels), structural variants (SVs), and copy number variations (CNVs) [53]. Unlike SNP arrays or reduced-representation approaches such as ddRAD-seq, WGS offers superior resolution by capturing rare, low-frequency, and breed-specific variants that are often missed by conventional genotyping platforms. This ability to detect the full spectrum of genomic variation is particularly valuable for indigenous breeds, where adaptive alleles underlying environmental resilience and unique production traits frequently reside in less-characterized regions of the genome.
The Pandharpuri buffalo, is traditionally managed under intermittent milking schedules, which reflect a combination of management practices and intrinsic lactation biology. Such milking patterns have direct implications for milk yield, mammary gland physiology, and metabolic homeostasis, highlighting adaptive strategies that allow this breed to maintain lactation despite irregular milking intervals. Ruminant species exhibit diverse lactation strategies; for example, the ATM (“Anytime Milk”) goat is renowned for nearly continuous milk production, demonstrating a genetic predisposition for sustained lactation. While not directly comparable, these differences illustrate the spectrum of lactation adaptability among domesticated ruminants and underscore the importance of breed-specific physiological and genetic mechanisms in shaping milking responses. Considering the significance of this unique breed and the advantages of high-resolution genomic analysis, the present study employed whole-genome resequencing of 15 Pandharpuri buffaloes to identify genome-wide signatures of selection that may have arisen through generations of adaptation. The majority of genomes exhibited coverage exceeding 30×, which is particularly advantageous for studies involving indigenous breeds with naturally small breeding populations and limited sample sizes, as it enhances variant detection accuracy, minimizes false negatives, and enables robust population-level genomic inference [54].
Selection signatures were detected using the DCMS framework [26], which integrated multiple complementary statistics—Tajima’s D, Nucleotide diversity, iHS and nSL to capture distinct and subtle signatures of selection across the genome. Tajima’s D contrasts the number of segregating sites with the average pairwise nucleotide differences, with deviations from neutrality reflecting selection or demographic events [19]. Nucleotide diversity (π) quantifies the average number of nucleotide differences per site between sequences, with reductions often indicative of selective sweeps [20]. Both metrics were estimated in 50 kb non-overlapping windows using VCFtools v0.1.16 [46]. Haplotype-based methods, including iHS and nSL, detect recent positive selection by analyzing extended haplotype homozygosity. iHS compares haplotype lengths between ancestral and derived alleles [23], whereas nSL evaluates the number of segregating sites within haplotypes, offering enhanced sensitivity to incomplete sweeps [24]. Specifically, iHS is tailored to detect incomplete hard sweeps, while nSL adjusts for recombination variation and is better suited for partial or soft sweeps. By integrating complementary statistics, the DCMS framework generates a robust composite score that enhances detection power and resolution while reducing false positives.
In Pandharpuri buffalo, DCMS identified 1405 putative selective sweep regions containing 1337 candidate genes, several of which are involved in biological pathways relevant to lactation and mammary gland function. Key lactation-related genes, including ERBB4, ESR1, SYK, INSR, PTPN11, VAV3, MAPK3, and PRKG1, are located within these regions and are known to participate in epithelial proliferation, hormonal signaling, and mammary tissue remodeling. Together, these genomic signals highlight molecular processes plausibly relevant to lactation biology in Pandharpuri buffalo.
ERBB4 emerged as a key candidate in Pandharpuri buffalo, consistent with its role in regulating mammary epithelial proliferation, differentiation, and ductal development [55,56]. ERBB4 signaling can induce hyperproliferation in ducts and lobuloalveoli, highlighting that membrane-associated ERBB4 interactions are critical for regulating epithelial growth [57,58]. In this study, ERBB4 likely contributes to the ability of Pandharpuri buffalo to sustain lactation under intermittent milking by promoting epithelial survival and tissue remodeling, and preserving milk-secreting capacity. Through its integration with downstream RAS/ERK, PI3K/AKT, and STAT pathways, ERBB4 may coordinate proliferation, differentiation, and survival signals in response to cycles of milk removal and retention, thereby supporting continuous milk production despite irregular milking intervals. Notably, transcriptomic and functional studies in dairy goats have identified ERBB4 as a hub regulator of mammary gland involution and remodeling via PI3K/AKT signaling, influencing epithelial cell survival, proliferation, and apoptosis during non-lactating stages [59]. Thus selection on ERBB4 and related network genes may reflect adaptive processes related to mammary gland function in the context of intermittent milking.
ESR1, encoding the estrogen receptor α, emerged as a key candidate in Pandharpuri buffalo. In mammary epithelial cells, ESR1 regulates ductal elongation, epithelial proliferation, and parenchymal expansion during prepubertal and pubertal stages [60,61]. In the context of intermittent milking, ESR1 signaling may support the maintenance and remodeling of mammary tissue during cycles of milk removal and retention, ensuring sustained epithelial function and milk-secreting capacity. Dynamic coordination of ESR1 with prolactin and growth hormone pathways, as observed in small ruminants, likely facilitates lactogenic differentiation and casein gene activation [62,63], which could help Pandharpuri buffalo maintain milk production even under irregular milking schedules. Selection on ESR1 may therefore contribute to the breed’s distinctive ability to balance tissue proliferation, differentiation, and milk yield during intermittent milking.
Among the candidate genes identified, SYK, a non-receptor tyrosine kinase, is particularly relevant to intermittent milking due to its role in regulating mammary epithelial proliferation and remodeling. Although traditionally recognized for hematopoietic signaling, SYK is also expressed in the bovine mammary gland, where it influences milking cycles and supports milk production. Its elevated expression during the dry period suggests a role in preparing the mammary tissue for subsequent lactation, ensuring epithelial integrity and lactation persistency [64]. Although SYK is known for its role in mastitis resistance through modulation of pro- and anti-inflammatory responses in bovine mammary epithelial cells, its signal in this breed may also reflect adaptation to intermittent milking practices, where enhanced immune resilience is advantageous for maintaining udder health under irregular milking intervals [65,66].
The VAV3 gene has emerged as a versatile candidate in livestock, particularly in mammary gland biology. In cattle, it was identified among genes upregulated in lactating mammary tissue, co-expressed with regulators such as c-MYC, CASP8, and LASP1, suggesting roles in gland growth, remodeling, and secretory function [67]. Among the genes identified, PTPN11, a key regulator of the JAK-STAT signaling pathway, plays a central role in lactation biology. In Tharparkar cattle, PTPN11 appears within selection-signature regions linked to milk yield and milk composition traits [68], and transcriptome-wide analyses in Murrah buffalo and Holstein Friesian confirm its importance in mammary function [69,70]. Functionally, MAPK3 also enhances milk protein synthesis through its integration with the mTOR signaling pathway [71] and PRKG1 has been associated with milk yield and fatty acid composition in dairy animals [72,73]. Together, these results uncover the genomic basis and highlight key candidate genes that may govern lactation traits in Pandharpuri buffalo, particularly milk production, lactation efficiency, and the distinctive intermittent milking capacity. The PP interaction analysis further supported these findings by prioritizing genes that consistently overlapped across selection scans and functional enrichment analyses.
Complementing the candidate genes identified in selection scans, the functional enrichment analysis revealed biological processes and pathways directly linked to mammary gland activity and lactational physiology. While a small number of immune-related terms were enriched, these signals are interpreted cautiously, as several implicated genes have well-established pleiotropic roles in mammary remodeling, cellular signaling, and stress adaptation rather than immune-specific selection alone. Key enriched processes included intracellular signal transduction (VAV3, SYK, MAPK3), protein phosphorylation (MAPK3, PRKG1, ERBB4), and calcium ion transport (RYR2), which are essential for epithelial signaling, milk secretion, and mammary remodeling. Enriched pathways such as Wnt signaling (MAPK3), cAMP signaling (VAV3, MAPK3), insulin signaling (INSR), and GnRH signaling (MAPK3) regulate hormonal responsiveness, glucose uptake, and milk-component biosynthesis. Notably, insulin- and cAMP-mediated pathways (INSR, MAPK3, VAV3) may support mammary metabolism and maintain milk synthesis during extended inter-milking intervals. Functional pathway–level interpretations are emphasized only where they are supported by concordant DCMS-based selection signals and well-established biological functions of the implicated genes. Consistent with the present findings, DCMS-based genome-wide selection signature studies in livestock have repeatedly identified genomic regions enriched for genes associated with milk production and lactation-related traits across livestock populations” [6,11,15,28,31].

Future Directions: Validation and Functional Implementation

This study presents the first DCMS-based genome-wide analysis of Pandharpuri buffalo and identifies genomic signals associated with its distinctive intermittent milking phenotype. The within-breed DCMS framework enabled the detection of selection signals specific to this population; however, complementary between-breed or comparative population analyses would further help distinguish breed-specific adaptations from shared or long-term selection effects.
Although the integration of multiple complementary statistics enhances the robustness of DCMS, within-population selection scans remain influenced by demographic history and cannot fully separate selection from demography in the absence of explicit demographic modeling. In addition, as individual-level phenotypic records on intermittent milking were not available for the sequenced animals, the genetic signals are interpreted in the context of established biological functions rather than direct genotype–phenotype associations. Consequently, the identified genomic regions represent candidate selection signals, and further functional validation, including transcriptomic or mammary tissue–specific expression studies, will help clarify their biological relevance. Future studies incorporating larger, independent populations, comparative genomic approaches, and additional layers of genomic variation, including structural variants using long-read sequencing, will strengthen the interpretation of these findings and support genome-informed conservation and genetic improvement of Pandharpuri buffalo.

5. Conclusions

This study employed whole-genome resequencing and the DCMS framework to identify within-breed selection signatures in the Pandharpuri buffalo, an indigenous breed characterized by its unique intermittent milking ability. By integrating multiple complementary selection statistics, we identified putative genomic regions harboring candidate genes involved in mammary gland function, hormonal signaling, and milk secretion. Key genes, including ERBB4, ESR1, SYK, INSR, PTPN11, VAV3, MAPK3, and PRKG1, highlight molecular pathways that may support lactation under irregular milking intervals. Overall, these findings provide a genomic basis for the distinctive adaptation of Pandharpuri buffalo and offer a valuable resource for future functional studies, conservation efforts, and genome-informed breeding strategies.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/cimb48010101/s1.

Author Contributions

V.V. contributed to conceptualization, methodology, funding acquisition, and supervision; A.P. performed formal analysis, investigation, and wrote the original draft; P.P. and P.G. carried out formal analysis, investigation, and writing—review and editing; R.A. and S.C. were involved in methodology and writing—review and editing; and R.K.G. provided supervision. All authors have read and agreed to the published version of the manuscript.

Funding

The authors acknowledge the funding support received through the Network Project on Buffalo Improvement (Project code: NWP, 2001/3007/14552/1010476) for completion of this work.

Institutional Review Board Statement

The study was approved by the Institutional Animal Ethics Committee (IAEC) of ICAR–National Dairy Research Institute, Karnal (IAEC approval no. 53/IAEC/24/12, 2024-09-30). No experimental procedures were performed on the animals during the course of the study. All methods were conducted in accordance with the guidelines of the Committee for the Purpose of Control and Supervision of Experiments on Animals (CPCSEA).

Informed Consent Statement

Not applicable.

Data Availability Statement

The data newly generated in this study are available from the corresponding author upon reasonable request. The publicly available datasets used in this study were obtained from the European Nucleotide Archive (ENA) at the European Bioinformatics Institute under accession number PRJEB39591 (https://www.ebi.ac.uk/ena/browser/view/PRJEB39591) (accessed on 18 September 2024).

Acknowledgments

The authors express their sincere gratitude to Director, ICAR-National Dairy Research Institute and Director, ICAR-Central Institute for Research on Buffalo, for providing the logistical support necessary to complete the study.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Lappalainen, T.; Scott, A.J.; Brandt, M.; Hall, I.M. Genomic analysis in the age of human genome sequencing. Cell 2019, 177, 70–84. [Google Scholar] [CrossRef] [PubMed]
  2. Li, Y.; Liu, X.; Li, Z.; Wang, W.; Li, Y.; Cao, Z.; Luan, P.; Xiao, F.; Gao, H.; Guo, H.; et al. A combination of genome-wide association study and selection signature analysis dissects the genetic architecture underlying bone traits in chickens. Animal 2021, 15, 100322. [Google Scholar] [CrossRef] [PubMed]
  3. Zhang, S.; Yao, Z.; Li, X.; Zhang, Z.; Liu, X.; Yang, P.; Chen, N.; Xia, X.; Lyu, S.; Shi, Q.; et al. Assessing genomic diversity and signatures of selection in Pinan cattle using whole-genome sequencing data. BMC Genom. 2022, 23, 460. [Google Scholar] [CrossRef]
  4. Bao, J.; Xiong, J.; Huang, J.; Yang, P.; Shang, M.; Zhang, L. Genetic Diversity, Selection Signatures, and Genome-Wide Association Study Identify Candidate Genes Related to Litter Size in Hu Sheep. Int. J. Mol. Sci. 2024, 25, 9397. [Google Scholar] [CrossRef]
  5. Peng, W.; Zhang, Y.; Gao, L.; Shi, W.; Liu, Z.; Guo, X.; Zhang, Y.; Li, B.; Li, G.; Cao, J.; et al. Selection signatures and landscape genomics analysis to reveal climate adaptation of goat breeds. BMC Genom. 2024, 25, 420. [Google Scholar] [CrossRef] [PubMed]
  6. Le, S.V.; Alexandri, P.; van der Werf, J.H.; Olmo, L.; Walkden-Brown, S.W.; de Las Heras-Saldana, S. Signature of Selection Analysis Reveals Candidate Genes Related to Climate Adaptation and Production Traits in Lao Native Goats. J. Anim. Breed. Genet. 2025, 142, 152–171. [Google Scholar] [CrossRef]
  7. Smith, J.M.; Haigh, J. The hitch-hiking effect of a favourable gene. Genet. Res. 1974, 23, 23–35. [Google Scholar] [CrossRef]
  8. Kreitman, M. Methods to detect selection in populations with applications to the human. Annu. Rev. Genom. Hum. Genet. 2000, 1, 539–559. [Google Scholar] [CrossRef]
  9. Nielsen, R. Statistical tests of selective neutrality in the age of genomics. Heredity 2001, 86, 641–647. [Google Scholar] [CrossRef]
  10. Jensen, J.D.; Foll, M.; Bernatchez, L. The past, present and future of genomic scans for selection. Mol. Ecol. 2016, 25, 1. [Google Scholar] [CrossRef]
  11. Yurchenko, A.A.; Daetwyler, H.D.; Yudin, N.; Schnabel, R.D.; Jagt, C.J.V.; Soloshenko, V.; Lhasaranov, B.; Popov, R.; Taylor, J.F.; Larkin, D.M. Scans for signatures of selection in Russian cattle breed genomes reveal new candidate genes for environmental adaptation and acclimation. Sci. Rep. 2018, 8, 12984. [Google Scholar] [CrossRef]
  12. Bai, Y.; Sartor, M.; Cavalcoli, J. Current status and future perspectives for sequencing livestock genomes. J. Anim. Sci. Biotechnol. 2012, 3, 8. [Google Scholar] [CrossRef]
  13. Cesarani, A.; Sorbolini, S.; Criscione, A.; Bordonaro, S.; Pulina, G.; Battacone, G.; Marletta, D.; Gaspa, G.; Macciotta, N.P.P. Genome-wide variability and selection signatures in Italian island cattle breeds. Anim. Genet. 2018, 49, 371–383. [Google Scholar] [CrossRef]
  14. Georges, M.; Charlier, C.; Hayes, B. Harnessing genomic information for livestock improvement. Nat. Rev. Genet. 2019, 20, 135–156. [Google Scholar] [CrossRef]
  15. Illa, S.K.; Mukherjee, S.; Nath, S.; Mukherjee, A. Genome-wide scanning for signatures of selection revealed putative genomic regions and candidate genes controlling milk composition and coat color traits in Sahiwal cattle. Front. Genet. 2021, 12, 699422. [Google Scholar] [CrossRef]
  16. Chavan, S.P.; Khan, K.D.; Yadav, A.; Gahlyan, R.K.; Vohra, V.; Alex, R.; Gowane, G.R. Exploring genomic signatures of selection for adaptation and resilience in nomadic Belahi cattle in the tropical climate. Mol. Biol. Rep. 2025, 52, 676. [Google Scholar] [CrossRef] [PubMed]
  17. Liu, X.; Peng, Y.; Zhang, X.; Chen, W.; Chen, Y.; Wei, L.; Zhu, Q.; Khan, M.Z.; Wang, C. Potential genetic markers associated with environmental adaptability in herbivorous livestock. Animals 2025, 15, 748. [Google Scholar] [CrossRef] [PubMed]
  18. Jaglan, K.; Sukhija, N.; KK, K.; Verma, A.; Vohra, V.; Alex, R.; George, L. Surveying selection signatures in Murrah buffalo using genome-wide SNP data. Cytol. Genet. 2025, 59, 307–320. [Google Scholar] [CrossRef]
  19. Tajima, F. Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics 1989, 123, 585–595. [Google Scholar] [CrossRef]
  20. Nei, M.; Li, W.H. Mathematical model for studying genetic variation in terms of restriction endonucleases. Proc. Natl. Acad. Sci. USA 1979, 76, 5269–5273. [Google Scholar] [CrossRef]
  21. Fay, J.C.; Wu, C.I. Hitchhiking under positive Darwinian selection. Genetics 2000, 155, 1405–1413. [Google Scholar] [CrossRef]
  22. Kim, Y.; Stephan, W. Detecting a local signature of genetic hitchhiking along a recombining chromosome. Genetics 2002, 160, 765–777. [Google Scholar] [CrossRef]
  23. Voight, B.F.; Kudaravalli, S.; Wen, X.; Pritchard, J.K. A map of recent positive selection in the human genome. PLoS Biol. 2006, 4, e72, Correction in PLoS Biol. 2006, 4, e154. Correction in PLoS Biol. 2007, 5, e147.. [Google Scholar]
  24. Ferrer-Admetlla, A.; Liang, M.; Korneliussen, T.; Nielsen, R. On detecting incomplete soft or hard selective sweeps using haplotype structure. Mol. Biol. Evol. 2014, 31, 1275–1291. [Google Scholar] [CrossRef] [PubMed]
  25. Lotterhos, K.E.; Whitlock, M.C. The relative power of genome scans to detect local adaptation depends on sampling design and statistical method. Mol. Ecol. 2015, 24, 1031–1046. [Google Scholar] [CrossRef]
  26. Ma, Y.; Ding, X.; Qanbari, S.; Weigend, S.; Zhang, Q.; Simianer, H. Properties of different selection signature statistics and a new strategy for combining them. Heredity 2015, 115, 426–436. [Google Scholar] [CrossRef]
  27. Yurchenko, A.A.; Deniskova, T.E.; Yudin, N.S.; Dotsev, A.V.; Khamiruev, T.N.; Selionova, M.I.; Egorov, S.V.; Reyer, H.; Wimmers, K.; Brem, G.; et al. High-density genotyping reveals signatures of selection related to acclimation and economically important traits in 15 local sheep breeds from Russia. BMC Genom. 2019, 20, 294. [Google Scholar] [CrossRef]
  28. Ghoreishifar, S.M.; Eriksson, S.; Johansson, A.M.; Khansefid, M.; Moghaddaszadeh-Ahrabi, S.; Parna, N.; Davoudi, P.; Javanmard, A. Signatures of selection reveal candidate genes involved in economic traits and cold acclimation in five Swedish cattle breeds. Genet. Sel. Evol. 2020, 52, 52. [Google Scholar] [CrossRef] [PubMed]
  29. Sweet-Jones, J.; Lenis, V.P.; Yurchenko, A.A.; Yudin, N.S.; Swain, M.; Larkin, D.M. Genotyping and whole-genome resequencing of Welsh sheep breeds reveal candidate genes and variants for adaptation to local environment and socioeconomic traits. Front. Genet. 2021, 12, 612492. [Google Scholar] [CrossRef]
  30. Al Abri, M.; Alfoudari, A.; Mohammad, Z.; Almathen, F.; Al-Marzooqi, W.; Al-Hajri, S.; Al-Amri, M.; Bahbahani, H. Assessing genetic diversity and defining signatures of positive selection on the genome of dromedary camels from the southeast of the Arabian Peninsula. Front. Vet. Sci. 2023, 10, 1296610. [Google Scholar]
  31. Muansangi, L.; Tiwari, J.; Ilayaraja, I.; Kumar, I.; Vyas, J.; Chitra, A.; Singh, S.P.; Pal, P.; Gowane, G.; Mishra, A.K.; et al. DCMS analysis revealed differential selection signatures in the transboundary Sahiwal cattle for major economic traits. Sci. Rep. 2025, 15, 15685. [Google Scholar] [CrossRef]
  32. Muthusamy, M.; Akinsola, O.M.; Pal, P.; Ramasamy, C.; Ramasamy, S.; Musa, A.A.; Thiruvenkadan, A.K. Comparative genomic insights into adaptation, selection signatures, and population dynamics in indigenous Indian sheep and foreign breeds. Front. Genet. 2025, 16, 1621960. [Google Scholar] [CrossRef]
  33. Utsunomiya, Y.T.; Pérez O’Brien, A.M.; Sonstegard, T.S.; Sölkner, J.; Garcia, J.F. Genomic data as the “hitchhiker’s guide” to cattle adaptation: Tracking the milestones of past selection in the bovine genome. Front. Genet. 2015, 6, 36. [Google Scholar] [CrossRef]
  34. Joshi, S.K.; Mohanty, T.K.; Kumaresan, A.; Bhakat, M.; Sathapathy, S. Changes in Teat Morphology (Doka Phenomenon) and Estrus Prediction in Riverine Buffaloes (Bubalus bubalis). Indian J. Anim. Res. 2020, 54, 16–19. [Google Scholar] [CrossRef]
  35. Grizelj, J.; Katana, B.; Dobranic, T.; Prvanovic, N.; Lipar, M.; Vince, S.; Stanin, D.; Djuricic, D.; Greguric-Gracner, G.; Samardzija, M. The efficacy of milk ejection induced by luteal oxytocin as a method of early pregnancy diagnostics in cows. Acta Vet. 2010, 60, 551–561. [Google Scholar] [CrossRef][Green Version]
  36. Kumar, S.; Nagarajan, M.; Sandhu, J.S.; Kumar, N.; Behl, V. Phylogeography and domestication of Indian river buffalo. BMC Evol. Biol. 2007, 7, 186. [Google Scholar] [CrossRef]
  37. Dutta, P.; Talenti, A.; Young, R.; Jayaraman, S.; Callaby, R.; Jadhav, S.K.; Dhanikachalam, V.; Manikandan, M.; Biswa, B.B.; Low, W.Y.; et al. Whole genome analysis of water buffalo and global cattle breeds highlights convergent signatures of domestication. Nat. Commun. 2020, 11, 4739. [Google Scholar] [CrossRef]
  38. Sambrook, J.; Russell, D.W. Purification of nucleic acids by extraction with phenol: Chloroform. Cold Spring Harb. Protoc. 2006, 2006, pdb.prot4455. [Google Scholar] [CrossRef]
  39. Andrews, S. FastQC: A Quality Control Tool for High Throughput Sequence Data. 2010. Available online: http://www.bioinformatics.babraham.ac.uk/projects/fastqc (accessed on 25 May 2025).
  40. Chen, S.; Zhou, Y.; Chen, Y.; Gu, J. fastp: An ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 2018, 34, i884–i890. [Google Scholar] [CrossRef]
  41. Li, H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. arXiv 2013, arXiv:1303.3997. [Google Scholar] [CrossRef]
  42. Li, H.; Durbin, R. Fast and accurate long-read alignment with Burrows–Wheeler transform. Bioinformatics 2010, 26, 589–595. [Google Scholar] [CrossRef]
  43. McKenna, A.; Hanna, M.; Banks, E.; Sivachenko, A.; Cibulskis, K.; Kernytsky, A.; Garimella, K.; Altshuler, D.; Gabriel, S.; Daly, M.; et al. The Genome Analysis Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010, 20, 1297–1303. [Google Scholar] [CrossRef]
  44. Purcell, S.; Neale, B.; Todd-Brown, K.; Thomas, L.; Ferreira, M.A.R.; Bender, D.; Maller, J.; Sklar, P.; de Bakker, P.I.W.; Daly, M.J.; et al. PLINK: A tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 2007, 81, 559–575. [Google Scholar] [CrossRef]
  45. Delaneau, O.; Marchini, J.; Zagury, J.F. A linear complexity phasing method for thousands of genomes. Nat. Methods 2012, 9, 179–181. [Google Scholar] [CrossRef]
  46. Danecek, P.; Auton, A.; Abecasis, G.; Albers, C.A.; Banks, E.; DePristo, M.A.; Handsaker, R.E.; Lunter, G.; Marth, G.T.; Sherry, S.T.; et al. The variant call format and VCFtools. Bioinformatics 2011, 27, 2156–2158. [Google Scholar] [CrossRef]
  47. Szpiech, Z.A.; Hernandez, R.D. selscan: An efficient multithreaded program to perform EHH-based scans for positive selection. Mol. Biol. Evol. 2014, 31, 2824–2827. [Google Scholar] [CrossRef]
  48. Quinlan, A.R.; Hall, I.M. BEDTools: A flexible suite of utilities for comparing genomic features. Bioinformatics 2010, 26, 841–842. [Google Scholar] [CrossRef]
  49. Huang, D.W.; Sherman, B.T.; Lempicki, R.A. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat. Protoc. 2009, 4, 44–57. [Google Scholar] [CrossRef]
  50. Szklarczyk, D.; Gable, A.L.; Lyon, D.; Junge, A.; Wyder, S.; Huerta-Cepas, J.; Simonovic, M.; Doncheva, N.T.; Morris, J.H.; Bork, P.; et al. STRING v11: Protein–protein association networks with increased coverage. Nucleic Acids Res. 2019, 47, D607–D613. [Google Scholar] [CrossRef]
  51. Shannon, P.; Markiel, A.; Ozier, O.; Baliga, N.S.; Wang, J.T.; Ramage, D.; Amin, N.; Schwikowski, B.; Ideker, T. Cytoscape: A software environment for integrated models of biomolecular interaction networks. Genome Res. 2003, 13, 2498–2504. [Google Scholar] [CrossRef]
  52. Chin, C.H.; Chen, S.H.; Wu, H.H.; Ho, C.W.; Ko, M.T.; Lin, C.Y. cytoHubba: Identifying hub objects and sub-networks from complex interactome. BMC Syst. Biol. 2014, 8, S11. [Google Scholar] [CrossRef]
  53. Fuentes-Pardo, A.P.; Ruzzante, D.E. Whole-genome sequencing approaches for conservation biology: Advantages, limitations and practical recommendations. Mol. Ecol. 2017, 26, 5369–5406. [Google Scholar] [CrossRef]
  54. Santhosh, A.; Vohra, V.; Alex, R.; Gowane, G. Genome-wide SNP identification and annotation from high-coverage whole-genome sequenced data of Bhadawari buffalo. Indian J. Dairy Sci. 2024, 77, 71–75. [Google Scholar] [CrossRef]
  55. Muraoka-Cook, R.S.; Feng, S.M.; Strunk, K.E.; Earp, H.S. ErbB4/HER4: Role in mammary gland development, differentiation and growth inhibition. J. Mammary Gland Biol. Neoplasia 2008, 13, 235–246. [Google Scholar] [CrossRef]
  56. Morato, A.; Accornero, P.; Hovey, R.C. ERBB receptors and their ligands in the developing mammary glands of different species. J. Mammary Gland. Biol. Neoplasia 2023, 28, 10. [Google Scholar] [CrossRef]
  57. Wali, V.B.; Gilmore-Hebert, M.; Mamillapalli, R.; Haskins, J.W.; Kurppa, K.J.; Elenius, K.; Booth, C.J.; Stern, D.F. Overexpression of ERBB4 JM-a CYT-1 and CYT-2 isoforms in transgenic mice reveals isoform-specific roles in mammary gland development and carcinogenesis. Breast Cancer Res. 2014, 16, 501. [Google Scholar] [CrossRef]
  58. Muraoka-Cook, R.S.; Sandahl, M.A.; Strunk, K.E.; Miraglia, L.C.; Husted, C.; Hunter, D.M.; Elenius, K.; Chodosh, L.A.; Earp, H.S. ErbB4 splice variants Cyt1 and Cyt2 differ by 16 amino acids and exert opposing effects on the mammary epithelium in vivo. Mol. Cell. Biol. 2009, 29, 4935–4948. [Google Scholar] [CrossRef]
  59. Xuan, R.; Chao, T.; Zhao, X.; Wang, A.; Chu, Y.; Li, Q.; Zhao, Y.; Ji, Z.; Wang, J. Transcriptome profiling of the nonlactating mammary glands of dairy goats reveals the molecular genetic mechanism of mammary cell remodeling. J. Dairy Sci. 2022, 105, 5238–5260. [Google Scholar] [CrossRef]
  60. Choudhary, R.K.; Choudhary, S.; Mukhopadhyay, C.S.; Pathak, D.; Verma, R. Deciphering the transcriptome of prepubertal buffalo mammary glands using RNA sequencing. Funct. Integr. Genom. 2019, 19, 349–362. [Google Scholar] [CrossRef]
  61. Tucker, H.L.M.; Parsons, C.L.M.; Ellis, S.; Rhoads, M.L.; Akers, R.M. Tamoxifen impairs prepubertal mammary development and alters expression of estrogen receptor α (ESR1) and progesterone receptors (PGR). Domest. Anim. Endocrinol. 2016, 54, 95–105. [Google Scholar] [CrossRef]
  62. Colitti, M.; Pulina, G. Expression profile of caseins, estrogen and prolactin receptors in mammary glands of dairy ewes. Ital. J. Anim. Sci. 2010, 9, e55. [Google Scholar] [CrossRef]
  63. Dessauge, F.; Finot, L.; Wiart, S.; Aubry, J.M.; Ellis, S.E. Effects of ovariectomy in prepubertal goats. J. Physiol. Pharmacol. 2009, 60, 127–133. [Google Scholar]
  64. Hou, X.; Lin, L.; Xing, W.; Yang, Y.; Duan, X.; Li, Q.; Gao, X.; Lin, Y. Spleen tyrosine kinase regulates mammary epithelial cell proliferation in mammary glands of dairy cows. J. Dairy Sci. 2016, 99, 3858–3868. [Google Scholar] [CrossRef]
  65. Yang, F.; Chen, F.; Li, L.; Yan, L.; Badri, T.; Lv, C.; Yu, D.; Zhang, M.; Jang, X.; Li, J.; et al. Three novel players: PTK2B, SYK, and TNFRSF21 involved in regulation of bovine mastitis susceptibility via GWAS and post-transcriptional analysis. Front. Immunol. 2019, 10, 1579. [Google Scholar] [CrossRef]
  66. Yang, F.; Yuan, L.; Xiang, M.; Jiang, Q.; Zhang, M.; Chen, F.; Tong, J.; Huang, J.; Cai, Y. A novel TLR4–SYK interaction axis plays an essential role in innate immunity response in bovine mammary epithelial cells. Biomedicines 2022, 11, 97. [Google Scholar] [CrossRef]
  67. Suchyta, S.P.; Sipkovsky, S.; Halgren, R.G.; Kruska, R.; Elftman, M.; Weber-Nielsen, M.; Vandehaar, M.J.; Xiao, L.; Tempelman, R.J.; Coussens, P.M. Bovine mammary gene expression profiling using a cDNA microarray enriched for mammary-specific transcripts. Physiol. Genom. 2003, 16, 8–18. [Google Scholar] [CrossRef]
  68. Nayak, S.S.; Panigrahi, M.; Rajawat, D.; Ghildiyal, K.; Sharma, A.; Parida, S.; Bhushan, B.; Mishra, B.P.; Dutt, T. Comprehensive selection signature analyses in dairy cattle exploiting purebred and crossbred genomic data. Mamm. Genome 2023, 34, 615–631. [Google Scholar] [CrossRef]
  69. Chhotaray, S.; Vohra, V.; Uttam, V.; Santhosh, A.; Saxena, P.; Gahlyan, R.K.; Gowane, G. TWAS revealed significant causal loci for milk production and composition in Murrah buffaloes. Sci. Rep. 2023, 13, 22401. [Google Scholar] [CrossRef]
  70. Krishna, N.; Vishwakarma, S.; Katara, P. Identification and annotation of milk-associated genes from milk somatic cells using expression and RNA-seq data. Bioinformation 2022, 18, 703–709. [Google Scholar] [CrossRef]
  71. Pan, F.; Li, P.; Hao, G.; Liu, Y.; Wang, T.; Liu, B. Enhancing milk production by nutrient supplements: Strategies and regulatory pathways. Animals 2023, 13, 419. [Google Scholar] [CrossRef]
  72. Shi, L.; Lv, X.; Liu, L.; Yang, Y.; Ma, Z.; Han, B.; Sun, D. A post-GWAS confirming effects of PRKG1 gene on milk fatty acids in a Chinese Holstein dairy population. BMC Genet. 2019, 20, 53. [Google Scholar] [CrossRef]
  73. Niyazbekova, Z.; Xu, Y.; Qiu, M.; Wang, H.P.; Primkul, I.; Nanaei, H.A.; Ussenbekov, Y.; Kassen, K.; Liu, Y.; Gao, C.Y.; et al. Whole-genome sequencing reveals genetic architecture and selection signatures of Kazakh cattle. Zool. Res. 2025, 46, 301–314. [Google Scholar] [CrossRef]
Figure 1. Home tract of the registered Indian breed Pandharpuri buffalo (Bubalus bubalis) in Kolhapur, Solapur, and Sangli districts of Maharashtra, India.
Figure 1. Home tract of the registered Indian breed Pandharpuri buffalo (Bubalus bubalis) in Kolhapur, Solapur, and Sangli districts of Maharashtra, India.
Cimb 48 00101 g001
Figure 2. Manhattan plot displaying the distribution of SNPs with significant DCMS q-values (q < 0.05) across bovine autosomes. Each point represents a SNP, color-coded by chromosome, with the y-axis indicating the negative log-transformed DCMS q-value (−log10).
Figure 2. Manhattan plot displaying the distribution of SNPs with significant DCMS q-values (q < 0.05) across bovine autosomes. Each point represents a SNP, color-coded by chromosome, with the y-axis indicating the negative log-transformed DCMS q-value (−log10).
Cimb 48 00101 g002
Figure 3. Distribution of q-Values Across Chromosomes in Pandharpuri Buffalo. (Boxplot displaying the distribution of q-values from the DCMS analysis across chromosomes (1–24) in Pandharpuri buffalo, with significant thresholds (q < 0.05) indicated. The y-axis represents −log10 (q-value), and boxes depict the interquartile range and whiskers showing variability).
Figure 3. Distribution of q-Values Across Chromosomes in Pandharpuri Buffalo. (Boxplot displaying the distribution of q-values from the DCMS analysis across chromosomes (1–24) in Pandharpuri buffalo, with significant thresholds (q < 0.05) indicated. The y-axis represents −log10 (q-value), and boxes depict the interquartile range and whiskers showing variability).
Cimb 48 00101 g003
Figure 4. Protein–protein interaction (PPI) network analysis of the top 15 hub genes identified among candidate selection regions. Node color intensity reflects connectivity, with red indicating highest degree centrality.
Figure 4. Protein–protein interaction (PPI) network analysis of the top 15 hub genes identified among candidate selection regions. Node color intensity reflects connectivity, with red indicating highest degree centrality.
Cimb 48 00101 g004
Table 1. Top 20 Genomic Regions with Strongest Selection Signals in Pandharpuri Buffalo.
Table 1. Top 20 Genomic Regions with Strongest Selection Signals in Pandharpuri Buffalo.
S.NoChromosomeStart Position (bp)End Position (bp)Window Length (bp)Number of SNPsMinimum q-ValuePosition of Top SNP (bp)
1893,753,22193,949,273196,0531791.66 × 10−993,799,967
21266,038,73966,099,93461,196382.16 × 10−966,099,317
32064,396,90164,449,43452,534363.16 × 10−964,439,325
4168,725,2428,990,446265,2052046.83 × 10−98,810,042
524,741,9284,799,86157,9343176.98 × 10−94,778,662
61573,748,47273,877,721129,2504461.93 × 10−873,843,490
72416,000,43516,104,795104,3611533.94 × 10−816,082,640
81637,625,33737,699,79874,4622404.73 × 10−837,668,255
9557,900,50158,048,083147,5831491.56 × 10−758,043,435
101575,550,02575,595,16345,139532.76 × 10−775,593,925
11612,778,65013,081,208302,5596913.10 × 10−712,794,066
121822,052,25722,131,81879,5621953.38 × 10−722,052,587
131821,198,35221,499,708301,3572847.43 × 10−721,394,367
1411.34 × 1081.34 × 108160,568767.92 × 10−71.34 × 108
151450,219,89550,260,96441,070631.02 × 10−650,242,895
162116,769,42016,996,452227,0333681.26 × 10−616,911,161
1731.38 × 1081.38 × 108683441.29 × 10−61.38 × 108
1831.31 × 1081.31 × 108194,2992861.38 × 10−61.31 × 108
191246,304,98546,343,13838,15441.87 × 10−646,304,985
20329,153,26529,158,280501662.91 × 10−629,158,239
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Patil, A.; Gaur, P.; Pal, P.; Alex, R.; Chhotaray, S.; Gandham, R.K.; Vohra, V. Genome-Wide Insights into Intermittent Milking Behavior of Pandharpuri Buffalo. Curr. Issues Mol. Biol. 2026, 48, 101. https://doi.org/10.3390/cimb48010101

AMA Style

Patil A, Gaur P, Pal P, Alex R, Chhotaray S, Gandham RK, Vohra V. Genome-Wide Insights into Intermittent Milking Behavior of Pandharpuri Buffalo. Current Issues in Molecular Biology. 2026; 48(1):101. https://doi.org/10.3390/cimb48010101

Chicago/Turabian Style

Patil, Akshata, Parth Gaur, Pritam Pal, Rani Alex, Supriya Chhotaray, Ravi Kumar Gandham, and Vikas Vohra. 2026. "Genome-Wide Insights into Intermittent Milking Behavior of Pandharpuri Buffalo" Current Issues in Molecular Biology 48, no. 1: 101. https://doi.org/10.3390/cimb48010101

APA Style

Patil, A., Gaur, P., Pal, P., Alex, R., Chhotaray, S., Gandham, R. K., & Vohra, V. (2026). Genome-Wide Insights into Intermittent Milking Behavior of Pandharpuri Buffalo. Current Issues in Molecular Biology, 48(1), 101. https://doi.org/10.3390/cimb48010101

Article Metrics

Article metric data becomes available approximately 24 hours after publication online.
Back to TopTop