1. Introduction
Sheep (
Ovis aries) represent one of the earliest domesticated livestock species, providing meat, milk, and wool for human needs [
1]. Following domestication, they exhibit pronounced phenotypic diversity in body conformation and productivity among breeds in different ecological and production system [
2]. In particular, the amount and distribution of adipose tissues have been the targets of breeding and selection [
3].
Different sheep breeds have different fattening efficiency, which affects meat quality, reproductive performance, and environmental adaptability [
4]. In sheep, adipose tissues include subcutaneous, perirenal, caul, intermuscular and tail fat [
5], and they differ in cellular composition, endocrine activity, and metabolic roles [
6], Thus, these various adipose tissues provide a foundational resources for dissecting the different molecular mechanisms underlying fat deposition [
7]. Different from other livestock and model animals, tail fat in sheep represents a specialized fat deposition with adaptive and economic importance [
8]. Sheep have been categorized into thin-tailed, short fat-tailed, and fat-rumped types based on the extent and distribution of fat deposition [
9]. The diversity of tail phenotype is widely considered to be the results of combined natural and artificial selection, reflecting the interplay among energy metabolism, environmental tolerance, and human preference [
10]. However, the molecular basis of tail fat deposition in sheep remains insufficiently understood, and only a limited number of studies have specifically revealed the transcriptomic profiles in different types of tail fats [
11]. Despite these advances, a systematic characterization for the transcriptomic difference among breeds of different tail types and with different fat deposition efficiency remains scarce, especially with adequate biological replicates to ensure robust statistical inference [
12]. Therefore, further exploration on the molecular mechanisms underlying tail fat deposition in sheep of different tail types will enhance the understanding of lipid metabolism biology and provide molecular tools for future sheep breeding and production.
To fill this gap, we conducted a large-scale transcriptomic analysis across five adipose depots (caul, subcutaneous, perirenal, tail, intermuscular fat) in five sheep breeds (Altay sheep, Tibetan sheep, Merino sheep, Wadi sheep, Small-tailed Han sheep), consisting of 250 RNA-seq samples, with 10 biological replicates per depot per breed, enabling a comprehensive examination of breed- and depot-specific transcriptional regulation. Differential expression gene and GO/KEGG enrichment analyses were implemented to explore shared and unique mechanisms associated with fat deposition. We highlight candidate genes and transcription factors (TFs) essential for the tail fat metabolism and tail-type divergence among breeds. Our results provide insights into genetic mechanisms of fat metabolism in different adipose tissues and lay an important foundation for sheep breeding in future.
3. Results
3.1. RNA-Seq Data and Processing
We conducted 250 publicly available RNA-seq samples from the NCBI SRA database, generating approximately 6 billion reads.
Table S1 presents the basic statistics for the RNA sequencing of the five adipose tissues across five breeds (i.e., Altay sheep, Small-tailed Han Sheep, Tibetan sheep, Wadi sheep and Merino sheep).
Figure 1A outlines the RNA-seq analysis pipeline, from data acquisition and quality control to read alignment (STAR), gene quantification (featureCounts), differential expression analysis (DESeq2), and functional enrichment of GOs. The unique mapping rate across all samples ranged from 80.15% to 95.15% when aligned with the sheep reference genome, with all values exceeding the typical threshold of 80%. Therefore, theses high-quality reads ensure the reliability of subsequent analyses and results.
To assess the impact of batch effects on transcriptomic patterns, we first performed batch correction on the raw count matrix using ComBat-seq and repeated the principal component analysis (PCA) and differential expression analysis. The corrected PCA results were consistent with the original analysis, with samples still clustering clearly by adipose tissue and breed, indicating that tissue and breed were the primary drivers of transcriptional variation. Furthermore, the numbers of differentially expressed genes (DEGs) and the patterns of enriched pathways from the batch-corrected data were highly concordant with the uncorrected analysis. The expression trends and significance of core candidate genes were not substantially altered. These results demonstrate that the breed- and tissue-specific transcriptional patterns identified in this study are robust despite potential batch confounding.
3.2. Differentially Expressed Gene Analysis
Following RNA-seq preprocessing and rigorous filtering, we obtained a finalized gene expression matrix comprising 19,440 genes. These genes were considered expressed and used for further analysis. To characterize the global expression patterns of all tissues, we first applied t-distributed stochastic neighbor embedding (t-SNE) analysis based on the gene expression profiles of samples. The resultant sample clustering recapitulated the different tissues and different breeds (
Figure 1B). Next, principal component analyses (PCA) were conducted individually for each adipose tissue type, revealing pronounced breed-specific differences across the different fat depots (
Figure 1C). Subsequent analyses focused on five adipose depots (i.e., caul fat, tail fat, intermuscular fat, subcutaneous fat, and perirenal fat), where pairwise comparisons were conducted across all the combinations of five sheep breeds. This comparison generated 10 sets of DEGs per adipose depot.
Figure 2A presents the comprehensive profile of DEGs, both up and downregulated, across all pairwise comparisons. Considerable variation exists in the number of DEGs for each adipose depot across different breed comparisons, highlighting distinct gene expression pattern in various adipose tissues among the five sheep breeds.
Figure 2B shows the number of DEGs identified in pairwise breed comparisons across five adipose tissues. A closer examination of individual adipose tissues revealed distinct patterns of breed-related transcriptional divergence. In the caul fat, the number of DEGs identified in pairwise breed comparisons was relatively modest compared to other adipose tissues. For example, the highest DEG count was observed in the comparison between Merino and Tibetan sheep, with 3797 DEGs (1797 upregulated, 2000 downregulated). Similarly, the comparisons of Small-tailed Han sheep vs. Altay sheep and Merino vs. Small-tailed Han sheep yielded 2433 and 3541 DEGs, respectively. However, several comparisons, such as Wadi sheep vs. Small-tailed Han sheep and Merino sheep vs. Altay sheep, showed very few DEGs, suggesting minimal transcriptional divergence in caul fat between those breeds. Overall, these results indicate that caul fat exhibits moderate breed-dependent expression differences, with notable divergence primarily involved with Altay sheep. Taken together, these findings highlight that the extent of differential gene expression is strongly influenced by both adipose tissue type and breed identity.
To ensure the statistical rigor of our differential expression analysis, we systematically evaluated the 10 pairwise comparisons within each adipose depot. Volcano plots visually illustrate the distribution of up- and down-regulated genes along with their significance (
Figures S1–S5), while MA plots depict the relationship between fold change and mean expression (
Figures S6–S10).
p-value histograms reveal a clear peak near zero in most comparisons, consistent with expected distributions for differential expression analysis (
Figures S11–S15). Detailed statistical summaries for each comparison, including the total number of genes tested and significant DEG counts, are provided in
Table S5. Together, these results confirm the reliability of our differential expression analysis under appropriate multiple testing control.
3.3. Pathway Enrichment Analysis of DEGs
GO enrichment analysis revealed that DEGs identified in each adipose tissue were significantly enriched in multiple biological processes (BPs), molecular functions (MFs), and cellular components (CCs) about fat metabolism. Notably, terms related to lipid biosynthetic process, fatty acid metabolic process, and organic acid metabolic process were recurrently enriched across tissues, albeit with tissue-specific patterns.
Further analysis of the specific enrichment results for each adipose tissue revealed that in subcutaneous fat, DEGs were predominantly associated with fatty acid metabolic process, organic acid metabolic process, and lipid biosynthetic process (GO:0006631—fatty acid metabolic process, GO:0008610—lipid biosynthetic process). Similarly, KEGG pathway analysis indicated strong enrichment in pathways such as fatty acid metabolism, fatty acid biosynthesis, and fatty acid elongation (oas01212—fatty acid metabolism). In contrast, caul fat did not show as many enriched terms. The more prominent GO terms in caul fat included regulation of immune system process, activation of immune response, and immune response-activating signaling pathway, suggesting a role in immune regulation. KEGG pathways such as intestinal immune network for IgA production and ribosome were more prominently enriched in these tissues. In addition, breed-level comparison revealed notable differences in enriched pathways. For instance, DEGs in Wadi sheep and Merino were frequently enriched in immune regulation. Meanwhile, comparisons involving Altay consistently showed strong enrichment for lipid metabolism and adipogenesis-related terms, being consistent with its adaptation to cold climates and fat deposition. In summary, the enrichment results revealed shared biological processes across different adipose tissues, such as fatty acid metabolic processes and lipid biosynthetic processes, among others. However, each tissue also exhibited unique enrichment signatures, reflecting its functional heterogeneity. For instance, terms related to immune system processes were strongly enriched in intermuscular fat. Full lists of enriched terms, including enrichment scores and gene counts, are provided in
Table S2.
The top five enriched GO terms in each category of DEG sets were visualized as bar plots. For each of the five adipose tissues, we selected pathways related to lipid metabolism from the ten comparison groups as representative sets, displaying only the five most significant results from each relevant subset. Key biological process (BP) terms revealed that the pathways involved in each adipose tissue were consistent with our previous findings (
Figure 3A). Further, perirenal adipose tissue was also associated with pathways such as ATP metabolic process. For the KEGG enrichment results, we also selected comparisons from each adipose tissue (
Figure 3B). The top five significantly enriched pathways were extracted from each selected comparison. These pathways included those related to lipid metabolism, such as oxidative phosphorylation and carbon metabolism, as well as the PPAR signaling pathway. Furthermore, several disease-associated pathways were observed, including non-alcoholic fatty liver disease.
Using subcutaneous fat as an example, we identified significantly enriched GO terms (Biological Process ontology,
p-adjusted < 0.05) and selected the top five enriched pathways from each comparison group. The 50 most frequently occurring genes across these pathways were subsequently analyzed. By integrating gene frequency and
p-adjusted values, a pathway–gene bubble plot was generated to visualize the relationships between key genes and biological pathways among different groups. (
Figure 4A). Due to the screening criteria applied, some comparison groups were excluded during this process. Subsequently, further filtering was performed among representative genes associated with lipid metabolism pathways, and their expression levels were compared between groups. In the subcutaneous fat tissues of the five sheep breeds, three key lipid metabolism-related genes (
ACO1,
GPI, and
MDH2) exhibited significant inter-breed differences (
Figure 4B). Specifically, ACO1 showed a significant difference between Altay sheep and Small-tailed Han sheep (
p < 0.05).
ACO1 encodes aconitase, a key enzyme of the tricarboxylic acid (TCA) cycle that catalyzes the reversible conversion between citrate and isocitrate. This step plays a central role in determining the allocation of carbon flux between energy metabolism and fatty acid synthesis, thereby influencing the efficiency of lipid deposition. The observed breed-specific expression suggests potential metabolic partitioning in energy utilization and lipid biosynthesis.
GPI displayed significant differences among Altay, Merino, and Small-tailed Han, as well as between Altay and Wadi (
p < 0.05).
GPI encodes glucose-6-phosphate isomerase, a rate-limiting enzyme in glycolysis and gluconeogenesis, which is crucial for regulating glucose utilization and energy supply. Variations in
GPI expression may affect the availability of glycolytic intermediates required for triglyceride synthesis, thereby modulating breed-specific fat deposition capacities. Notably,
MDH2 showed highly significant differences between Altay and the other four breeds (Merino sheep, Small-tailed Han sheep, Tibetan sheep, and Wadi sheep) (
p < 0.01 or
p < 0.001).
MDH2 encodes mitochondrial malate dehydrogenase, which catalyzes the interconversion of malate and oxaloacetate in the TCA cycle, playing a pivotal role in NADH production and energy homeostasis. Differences in
MDH2 expression may directly influence the energy metabolism capacity and redox balance of adipocytes, leading to breed-specific patterns of lipid deposition and metabolic traits. Taken together, these results not only reveal distinct gene expression profiles across breeds but also highlight the potential regulatory mechanisms linking carbon metabolism, energy production, and fat deposition in a breed-specific manner.
3.4. Cross-Tissue Integration of Significant Pathways
To systematically and comprehensively characterize the overlap and divergence of functional pathways across different adipose depots and breeds, we further integrated the GO and KEGG enrichment results of five adipose tissues based on our prior differential expression analyses. Specifically, for each tissue, we merged the sets of significantly enriched pathways from both GO and KEGG (threshold FDR ≤ 0.05) to form a union, thereby ensuring that potential biological signals would not be omitted due to redundancy between databases.
At the global level, this cross-tissue integration and UpSet visualization clearly revealed the common and breed-specific patterns of significantly enriched pathways across the five adipose depots. However, considering that tail fat represents the most distinctive and divergent adipose depot associated with fat deposition differences among sheep, and is directly linked to the morphological and metabolic basis of fat-rumped, short fat-tailed, and thin-tailed phenotypes, we focused subsequent analyses specifically on tail fat. More precisely, the five representative breeds were further grouped into three major phenotypic categories: fat-rumped, short fat-tailed, and thin-tailed breeds.
Figure 5A presents the overlap and divergence of significantly enriched GO/KEGG pathways (FDR ≤ 0.05) in the tail fat of the five sheep breeds. The tallest bar corresponds to 133 pathways shared by all five breeds, indicating a set of core pathways widely involved in lipid metabolism and signaling regulation, serving as a conserved molecular foundation across breeds. In addition, several other large intersection bars demonstrate that subsets of pathways are consistently present in the majority of breeds. Meanwhile, a number of smaller intersections correspond to pathways unique to specific breeds, or shared only between two breeds. Overall,
Figure 5A highlights that tail fat harbors both a broad spectrum of conserved metabolic pathways and a certain number of breed-specific or partially shared pathways. These unique pathways may be closely associated with the formation of different tail phenotypes and their metabolic features.
3.5. Identification of Tail Fat Specific Pathways and Phenotype Associated Genes
After classifying the enriched pathways in tail fat according to the three phenotypic groups, we first compared the overall distribution of pathway categories (
Figure 5B). The results showed that the majority of significant pathways belonged to the universally shared category, indicating the presence of a core set of conserved networks broadly involved in lipid metabolism and signaling regulation across all tail types. In pairwise comparisons, fat-rumped and short fat-tailed sheep exhibited the highest number of shared pathways, whereas overlaps involving thin-tailed sheep were markedly fewer, underscoring the closer functional resemblance between fat-rumped and short fat-tailed phenotypes. This pattern is consistent with their shared phenotypic trait of higher fat deposition. In contrast, thin-tailed sheep exhibited fewer overlapping pathways with the other two groups, highlighting its distinctiveness in tail fat metabolic pathways. Although phenotype-specific pathways were fewer in number, they often converged on a limited set of key pathways, suggesting that these unique signals may exert stronger phenotype-driving effects.
To further dissect these differences, we categorized the significant pathways of tail fat into major functional themes and compared their distribution across the three phenotypic groups (
Figure 5C). The detailed pathway lists are provided in
Table S3. Immune and inflammation-related pathways were broadly enriched across all phenotypes, with the vast majority falling into the shared category, indicating that low-grade inflammation and immune signaling constitute a common molecular background of tail fat. Lipid synthesis and storage pathways were predominantly enriched in the fat-rumped and short fat-tailed shared class, with additional fat-tail-specific signals, suggesting consistent enhancement of lipid droplet formation and triglyceride accumulation in these two groups, while such pathways were relatively attenuated in thin-tailed sheep. Mitochondrial oxidative phosphorylation pathways were also mainly shared, but with additional enrichment of sub-pathways in the fat-rumped group, reflecting a higher energy demand under increased synthetic load. Fatty acid oxidation pathways were enriched in both the shared category and fat-rumped and thin-tailed shared class, indicating that thin-tailed sheep retain a robust oxidative capacity and share certain features with fat-rumped sheep, while remaining distinct from short fat-tailed sheep. Extracellular matrix (ECM) and fibrosis-related pathways were generally present in the shared category, although short fat-tailed sheep displayed mild specificity in localized ECM remodeling pathways, potentially linked to structural tissue adaptation. Insulin and glucose metabolism pathways were the least represented and almost exclusively appeared in the shared category, suggesting they play a relatively stable role across all phenotypes and are unlikely to drive tail-type divergence. While these pathway-level differences highlight distinct molecular landscapes among phenotypes, their functional impacts are ultimately mediated through specific genes.
Subsequent analysis of genes associated with tail fat-specific pathways revealed multiple candidate genes potentially involved in tail-type determination. In fat-rumped specific pathways, 20 core genes were repeatedly identified across multiple enriched terms, such as
TAFAZZIN,
GPAM,
GCDH,
ETFDH,
COX5A, members of the
COQ family (
COQ4,
COQ7,
COQ8A,
COQ9), and
ATP5F1E (
Figure 5D). These genes are primarily involved in mitochondrial function, oxidative phosphorylation, fatty acid metabolism, and triglyceride biosynthesis, indicating that the formation of the fat-rumped phenotype may be closely associated with enhanced mitochondrial energy efficiency and lipid synthetic capacity. In contrast, short fat-tailed specific pathways yielded a distinct set of representative genes, mainly
MMP9,
MMP12,
MMP19 (
Figure 5E). The
MMP family genes are strongly associated with ECM remodeling and tissue reconstruction, suggesting that short fat-tailed sheep may rely more on ECM remodeling and localized structural adaptations rather than metabolic regulation alone to shape their tail fat phenotype. Additionally, fat-rumped sheep-specific genes predominantly cluster in energy metabolism and lipid storage processes, whereas short fat-tailed sheep-specific genes are more closely linked to structural and ECM regulation. Such mechanistic divergence may underlie the fundamental differences driving the development of these two tail types. Although thin-tailed sheep did not exhibit significantly enriched phenotype-specific genes in this analysis, our earlier findings suggest that their phenotype may be maintained through the absence of active lipid synthesis-associated genes, thereby restricting fat accumulation. Collectively, these results not only highlight distinct molecular mechanisms between fat-rumped and short fat-tailed phenotypes but also identify a set of promising candidate genes (
TAFAZZIN,
GPAM,
COQ family, and
MMP family) that may serve as potential targets for functional validation and molecular breeding aimed at regulating fat deposition and tail-type traits in sheep.
4. Discussion
In mammals, adipose tissue serves multiple physiological functions [
22]. As a major site of energy storage, it acts as an energy buffer by accumulating lipids during periods of nutrient availability and releasing fatty acids for systemic utilization during fasting or high energy demand [
23,
24]. This large-scale transcriptomic study of five sheep breeds and five adipose depots provided a comprehensive view of the transcriptional basis of adipose heterogeneity and tail phenotype diversification. We showed that variation in tail morphology, ranging from fat-rumped to short fat-tailed to thin-tailed types, appeared to driven by distinct molecular pathways. A conserved set of metabolic and immune pathways was shared across all phenotypes, while fat-rumped sheep were enriched in mitochondrial oxidative phosphorylation and triglyceride synthesis genes (
TAFAZZIN,
GPAM,
COQ family), and short fat-tailed sheep showed unique extracellular matrix remodeling signatures (
MMP9,
MMP12). These results suggest that similar fat deposition phenotypes can arise through either enhanced metabolic activity or adaptation. Thin-tailed sheep, in contrast, appeared to lacked these pro-lipogenic and ECM remodeling programs. Our findings deepened the understanding of tail fat biology and highlighted candidate genes potentially relevant for genetic improvement in sheep.
We observed marked differences in DEGs and enriched functions among caul, subcutaneous, perirenal, intermuscular, and tail fat highlighted distinct physiological roles [
25]. Subcutaneous and intermuscular fat showed abundant DEGs enriched in lipid metabolism, indicating high responsiveness to genetic background and major roles in regulating energy storage and mobilization [
26]. In contrast, caul and tail fat displayed greater transcriptional conservation, suggesting stricter functional regulation, with immune-related pathways in caul fat consistent with its role as a visceral depot involved in immune surveillance [
27]. Such depot-specificity is essential for interpreting breed differences, as the same comparison may yield divergent results depending on the tissue examined.
The tail represents a major site of fat storage in fat-rumped sheep, and uncovering the key genes involved in this process is essential for elucidating the molecular mechanisms underlying fat-tail formation in sheep [
20]. Focusing on tail fat, we discovered a complex interplay between conserved and breed-specific pathways. The large set of pathways shared across all five breeds represents the fundamental biological machinery required to establish and maintain a functional adipose tissue, including core lipid metabolism, energy production, and baseline immune function. This shared foundation underscores that tail fat, regardless of its ultimate size, is a bona fide adipose depot operating on common principles [
28]. However, the true drivers of phenotypic divergence lie in the differences superimposed upon this common background.
By grouping breeds into phenotypic categories, we detected a pronounced compelling molecular stratification. The fat-rumped phenotype, represented by Altay sheep, was associated with an upregulation of mitochondrial energy metabolism and lipid biosynthetic capacity. Key genes identified in this group included
TAFAZZIN, which is required for cardiolipin biosynthesis and maintaining mitochondrial membrane integrity [
29], and
GPAM, which catalyzes the first committed step in glycerolipid biosynthesis [
30]. In addition, several members of the coenzyme Q biosynthesis family, including
COQ4,
COQ7,
COQ8A, and
COQ9, were found to be critical for sustaining electron transport chain function and efficient ATP production [
31]. Previous studies have shown that
ETFDH plays an indispensable role in fatty acid β-oxidation, thereby linking energy metabolism directly to lipid utilization [
32]. The coordinated enhancement of these mitochondrial and lipid metabolic genes indicated that fat-rumped sheep may have acted as a highly efficient enhancement of the “metabolism engine”, in which robust oxidative phosphorylation not only provided sufficient energy for active lipogenesis but may also act as a regulatory signal to promote triglyceride accumulation and adipocyte hypertrophy [
33,
34]. These findings are consistent with previous studies emphasizing the role of transcriptional regulators such as
PPARγ and lipid biosynthetic pathways in promoting fat deposition in sheep [
35,
36]. Moreover, similar observations have been reported in other livestock species, where mitochondrial activity and coenzyme Q related pathways were shown to contribute to efficient fat storage and adaptive energy regulation [
37], highlighting the evolutionary significance of these mechanisms in animals exposed to cold or nutritionally challenging environments.
In contrast, the short fat-tailed phenotype, represented by Small-tailed Han sheep and Wadi sheep breeds, showed a distinct molecular profile dominated by Matrix Metalloproteinases (
MMP9,
MMP12 and
MMP19). These zinc-dependent endopeptidases are key regulators of extracellular matrix (ECM) degradation and remodeling [
38], suggesting that tail fat deposition in this phenotype depends not only on metabolism but also on structural plasticity [
39]. ECM breakdown and reorganization are essential for adipocyte hyperplasia and hypertrophy, enabling lipid accumulation and neovascularization [
40]. Thus, while fat-rumped sheep achieve efficient lipid storage through metabolism, short fat-tailed sheep rely on ECM remodeling to enhance adipocyte proliferation and expansion. This highlights ECM dynamics as an underexplored regulator of tail fat deposition, with additional evidence linking tissue microenvironment and mechanical properties to adipogenesis [
41]. By contrast, the thin-tailed phenotype exhibited fewer unique pathways, reflecting a lack of pro-lipogenic and ECM remodeling programs. Although fat-rumped and thin-tailed sheep shared some fatty acid oxidation pathways, Merino sheep appear to lack the anabolic drive, resulting in limited fat accumulation [
42].
Our study complemented previous transcriptomic research on tail fat in sheep and provided new mechanistic insights. We confirmed the involvement of triglyceride synthesis pathways, as reported in comparisons between Small-tailed sheep and Dorset sheep [
11], but further demonstrated that tail types differed in how strongly they activate these pathways, and that distinct biological processes such as extracellular matrix (ECM) remodeling also play crucial roles. Candidate genes such as
GPAM and
COQ family members highlighted enhanced lipid biosynthesis and mitochondrial metabolism in fat-rumped sheep, whereas
MMP family genes suggested to structural remodeling as a hallmark of short fat-tailed phenotypes [
43]. Biologically, our results support a dual-axis model of tail fat development: a metabolic axis (mitochondrial energy and lipid synthesis) and a structural axis (ECM remodeling) [
44]. Different breeds likely represent different evolutionary combinations of these mechanisms, leading to the observed diversity in tail phenotypes. From an applied perspective, these candidate genes provide promising molecular targets for breeding strategies, such as enhancing metabolic drivers (
GPAM,
COQ8A) to promote fat deposition or modulating ECM regulators (
MMP9,
MMP12) to optimize tissue composition. Nevertheless, limitations must be noted. Public RNA-seq data may contain batch effects, and transcriptomics alone could not establish causality [
17]. Future work should focus on functional validation of these genes and integrate multi-omics approaches under controlled environmental conditions to clarify regulatory mechanisms.