Next Article in Journal
Heavy Metal Contamination in Adaptogenic Herbal Dietary Supplements: Experimental, Assessment and Regulatory Safety Perspectives
Previous Article in Journal
Predicting the Potential Distribution of Galeruca daurica in Inner Mongolia Under Current and Future Climate Scenarios Using the MaxEnt Model
Previous Article in Special Issue
Microsatellite Markers Developed Based on Transcriptomic Data Reveal the Genetic Diversity and Population Genetic Structure of Angulyagra polyzonata in Guangxi, China
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Genome-Wide SNP Analysis Reveals the Unique Genetic Diversity Represented by Fat-Tailed Coarse-Wooled Sheep Breeds of Kazakhstan

by
Kairat Dossybayev
1,2,3,
Makpal Amandykova
1,2,3,*,
Daniya Ualiyeva
1,2,4,5,
Tilek Kapassuly
1,2,3,
Altynay Kozhakhmet
1,2,3,
Elena Ciani
6,
Bakytzhan Bekmanov
1,3 and
Rauan Amzeyev
7
1
Laboratory of Animal Genetics and Cytogenetics, RSE Institute of Genetics and Physiology, Committee of Science of the Ministry of Science and Higher Education of the Republic of Kazakhstan, 93 Al-Farabi Avenue, Almaty 050060, Kazakhstan
2
Laboratory of Molecular Genetic Examination, LLP «Kazakh Research Institute of Animal Husbandry and Fodder Production», 51 Zhandosov Street, Almaty 050071, Kazakhstan
3
Faculty of Biology and Biotechnology, Al-Farabi Kazakh National University, 71 Al-Farabi Avenue, Almaty 050040, Kazakhstan
4
Laboratory of Ornithology and Herpetology, Institute of Zoology, Committee of Science of the Ministry of Science and Higher Education of the Republic of Kazakhstan, 93 Al-Farabi Avenue, Almaty 050060, Kazakhstan
5
Chengdu Institute of Biology, Chinese Academy of Sciences, 23 Qunxian South Street, Chengdu 610041, China
6
Department of Biosciences, Biotechnologies and Environment, University of Bari, 1 Piazza Umberto I, 70121 Bari, Italy
7
Faculty of Natural Sciences and Geography, Abai Kazakh National Pedagogical University, 13 Dostyk Avenue, Almaty 050010, Kazakhstan
*
Author to whom correspondence should be addressed.
Biology 2025, 14(11), 1478; https://doi.org/10.3390/biology14111478
Submission received: 6 September 2025 / Revised: 12 October 2025 / Accepted: 21 October 2025 / Published: 23 October 2025
(This article belongs to the Special Issue Genetic Variability within and between Populations)

Simple Summary

Sheep farming is an important part of agriculture in Kazakhstan, especially fat-tailed breeds that are well adapted to the local environment. Namely, Kazakh fat-tailed coarse-wooled, Edilbay, and Gissar sheep breeds are valued for their ability to thrive in harsh pasture conditions, their early maturity, and production of high-quality meat, fat, and wool. Despite their economic and cultural importance, little is known about the genetic background of these animals. In this study, we analyzed genome-wide SNP data from 160 animals representing the three main local breeds to gain insights into their diversity and relationship with other sheep worldwide. Our aim was to explore what makes these breeds unique and how they have developed over time. Understanding their genetics is essential for improving breeding programs, protecting valuable traits, and ensuring the long-term sustainability of sheep farming in Kazakhstan. It also contributes to preserving the country’s agricultural heritage and supporting food security in the region. Our findings show that the Edilbay breed possesses particularly high genetic diversity and forms a distinct group, while some gene flow was also detected among the studied populations. Overall, the native sheep of Kazakhstan represent a significant reservoir of genetic variation in Central Asia, underscoring their importance for conservation and future breeding strategies.

Abstract

Background: The fat-tailed coarse-wooled sheep breeds exhibit excellent reproductive performance, exceptional adaptability to pasture conditions, and high precocity, contributing to enhanced meat, fat, and wool productivity in sheep breeding. Despite the significant role of these sheep breeds in Kazakhstan’s livestock production, their genetics remain poorly studied. This raises concerns about the potential loss of unique, breed-specific traits that could be important for the future development and resilience of Kazakh stan’s sheep farming sector. This study aimed to analyze genome-wide genotyping SNP data of local fat-tailed coarse-wooled sheep breeds (Kazakh fat-tailed coarse-wooled, Edilbay, and Gissar) to reveal their genetic diversity, breed characteristics, and phylogenetic relationships with worldwide domestic sheep breeds and wild sheep. Methods: The OvineSNP50 Genotyping BeadChip was used to obtain genome-wide SNP genotyping data from 160 fat-tailed coarse-wooled sheep from Kazakhstan. Population structure analysis, principal component analysis, phylogenetic and the maximum likelihood tree analysis were performed in comparison with foreign domestic sheep breeds and wild sheep populations. Results: Kazakh breeds exhibited high genetic diversity, with Edilbay showing the greatest allelic richness. PCA and Admixture revealed clear differentiation among the three breeds: Edilbay and Gissar formed homogeneous clusters, while Kazakh fat-tailed coarse-wooled sheep displayed admixture and substructure. Evidence of gene flow from Edilbay into other Kazakh populations supports its role as a genetic source for regional breeds. Phylogenetic analysis placed Kazakhstani sheep close to other Central Asian breeds, while clearly distinct from East Asian and European populations. Wild sheep (Argali and Urial) formed separate clades, with Kerman wild sheep clustering closer to Urial. Conclusions: Our results highlight the value of genotyping data for studying genetic diversity and population structure. Developing genetic resources for Kazakhstan’s native sheep breeds will help preserve their unique diversity and ensure it remains available for future use in breeding and adaptation efforts.

1. Introduction

Sheep breeding plays a significant role in the agricultural economy of Kazakhstan, contributing approximately 6% to the overall livestock production value [1], with the sheep population reaching around 19.4 million by 2024 [2]. The most important breeds in Kazakhstan’s sheep industry are the fat-tailed coarse-wooled sheep, which make up approximately 50–55% of the country’s total sheep population [3]. This group includes major breeds such as Edilbay, Kazakh fat-tailed coarse-wooled, and Gissar, as well as more recently developed derived breeds like Saryarka, Ordabasy, and Baisary created through targeted crossbreeding [3]. Fat-tailed coarse-wooled sheep are predominantly maintained for meat and fat production. Among them, Edilbay sheep is highly adapted to the harsh breeding environments of Kazakhstan’s dry steppe, semi-desert, and desert zones, and are recognized for their excellent slaughter yields and high meat quality [4,5]. The Kazakh fat-tailed coarse-wooled sheep breed, developed through folk selection, is widely distributed across nearly all meat-fat sheep breeding regions of Kazakhstan and is known for its endurance, adaptability, early maturity, and excellent meat-fat quality [6]. Additionally, the Gissar breed, the largest among all domesticated sheep breeds worldwide, is characterized by its remarkable resilience, distinctive body structure, and exceptionally large fat tail [6]. Originated in Tajikistan [7], the Gissar sheep was introduced to Kazakhstan, Uzbekistan and Kyrgyzstan during the Soviet era, where it was widely used as an improver breed for meat and fat production [8].
Despite the unique characteristics of Kazakh fat-tailed coarse-wooled sheep breeds, which have been used in selective breeding programs both nationally and internationally, their genetic background, structure, and diversity remain insufficiently characterized. Understanding the genetic background of Kazakhstan’s fat-tailed coarse-wooled sheep breeds is crucial for uncovering their unique adaptive traits, guiding effective conservation strategies, and preserving valuable genetic resources for future breeding improvements.
In general, domestic sheep (Ovis aries) are reported to have originated from the wild Asian mouflon (Ovis orientalis) [9], with domestication events dated between approximately 11,000 and 10,500 years before present [10]. Following domestication, sheep dispersed from Southwest Asia to other regions of the world [11]. Subsequent mitochondrial DNA (mtDNA) analyses have revealed that domestic sheep are derived from two distinct ancestral maternal sources, with no evidence of genetic contributions from wild species such as urial or argali sheep [12]. The Edilbay sheep breed was also included in the sudy of Hiendleder et al. (1998) [12], which highlighted its distinct phylogenetic position, as it clustered separately from other domestic breeds. This further increases the interest in conducting a more detailed investigation of the genetic origins and relationships of Kazakhstan’s fat-tailed breeds.
Recent advancements in chip array-based genotyping technologies have enabled the detection of thousands of genomic variations across a wide range of species [13,14,15], including sheep [16]. These technological advances have established genome-wide SNP genotyping as a highly informative and relatively efficient method for studying the genetic structure of animal populations.
Previous investigations into the genetic diversity and phylogenetic relationships of Kazakh sheep populations have utilized various approaches, including 12 microsatellite markers [17] and mitochondrial DNA D-loop sequences [18]. In a more recent study, Kazakh fat-tailed coarse-wooled sheep were analyzed using runs of homozygosity (ROH) data, which identified several candidate genes—such as BMP2 and CLOCK associated with productive traits [19]. Additionally, genome-wide SNP genotyping was applied to characterize the genetic structure of the newly developed Baisary fat-tailed breed, revealing a unique genome composition distinct from its progenitor breeds [20]. In the Saryarka fat-tailed coarse-wooled breed SNP-based genome-wide association studies (GWAS) identified genes linked to meat productivity traits [21]. Although Kijas et al. conducted a genome-wide SNP analysis involving numerous sheep breeds from around the world [16,22], their study did not include Central Asian breeds. Another investigation based on Whole-Genome Resequencing data included some fine-wool Kazakh breeds and the Edilbay breed [23]; however, because it encompassed 158 diverse domestic populations, it did not provide a detailed characterization of the genetic background and structure of Kazakh sheep breeds.
Hence, while previous studies have provided valuable insights into the genetic characteristics of Kazakh sheep, many relied on limited molecular markers or were narrowly focused on specific traits such as meat productivity. Moreover, large-scale international analyses often excluded Central Asian breeds or included Kazakh breeds only in broad, multi-breed comparisons that lacked detailed resolution. These limitations underscore the need for a more comprehensive investigation into the genetic background and structure of Kazakhstan’s fat-tailed coarse-wooled sheep breeds. In the present study, we analyzed genome-wide SNP genotyping data from 160 animals representing three native Kazakh breeds to gain deeper insights into their genetic diversity and breed-specific features. Additionally, we performed comparative analyses with wild sheep and globally distributed domestic breeds to explore their phylogenetic relationships and better understand the genetic structure of these important regional populations.

2. Materials and Methods

2.1. Animal Sampling and Data Collection

For this study, blood samples were obtained from 160 randomly selected 2–3 years old animals (Table 1) representing three sheep breeds—Edilbay (ED), Kazakh fat-tailed coarse-wooled (KFTCW), and Gissar (GISS)—raised in five farms across Kazakhstan (Figure 1) and according to FAO Animal Production and Health Guidelines (2011) [24].
Sheep under study were raised under traditional pastoral systems with predominantly natural mating. Blood samples from each animal were collected by an experienced veterinarian in accordance with all ethical standards and the animal study protocol approved by the Local Ethics Commission of the RSE Institute of Genetics and Physiology, Committee of Science, Ministry of Science and Higher Education of the Republic of Kazakhstan (Conclusion of the Local Ethics Commission No.6, 3 November 2022, Almaty, Kazakhstan). Blood samples were collected using EDTA vacuum tubes (Leuven, Belgium) and subsequently transported to the Laboratory of Molecular Genetic Examination of LLP “Kazakh Research Institute of Animal Husbandry and Fodder Production” for further analysis.
For the comparative genetic structure analysis of the studied sheep breeds with wild sheep representatives and worldwide domestic sheep breeds, SNP genotyping data from 11 both fat-tailed and thin-tailed domestic sheep breeds (across 8 countries and one geographic region) and three wild sheep (from seven countries) were obtained from the WIDDE sheep database [25], DRYAD [26], and Figshare [27] repositories (Supplementary Table S1). For the comparative genetic analysis, we first focused on fat-tailed sheep breeds from Central Asia. To assess the relatedness of Middle East breeds to those from Kazakhstan, we included breeds from Iran and Turkey. We then expanded the geographic scope of the study by adding sheep breeds from Europe and Africa, and finally incorporated one breed from Barbados and U.S. (BBB), which was previously reported by Kijas et al. (2012) [16] to have an admixed origin from African and European breeds, allowing us to test the reliability of our results.

2.2. DNA Isolation, Genome-Wide SNP Genotyping and SNP Data Filtering

Genomic DNA was extracted from sheep blood samples using the GeneJet Genomic DNA Purification Kit (Thermo Scientific, Waltham, MA, USA). DNA quality was assessed using NanoDrop One spectrophotometer (Thermo Scientific, Waltham, MA, USA) and quantified to a concentration of 50–100 ng/µL for subsequent SNP genotyping. SNP genotyping was performed using the OvineSNP50 Genotyping BeadChip, which includes 54,241 SNPs (Illumina, San Diego, CA, USA), on the iScan genotyping system (Illumina, San Diego, CA, USA).
The SNP datasets for all sheep included in this study were merged, and quality control filtering of SNPs was performed using PLINK 1.9 [28]. Only the SNPs common for all datasets were used for further population structure analysis. Several quality control parameters were applied to exclude low-quality SNPs, including marker missingness (--geno 0.05), individual missingness (--mind 0.05), Hardy–Weinberg equilibrium (--hwe hwe 1 × 10−6), and pruning of markers in approximate linkage equilibrium (--indep-pairwise 50 5 0.2).

2.3. Genetic Diversity and Phylogenetic Analyses

To estimate the genetic diversity levels in the studied domestic sheep populations the proportion of polymorphic SNPs (Pn), expected (He), and observed (Ho) heterozygosity and inbreeding coefficient (F) were observed using PLINK 1.9 [28] --het and --freq commands. Negative estimates usually arise from sampling variance or contrasting allele-frequency references [28,29] and were not further interpreted as biological signals. Allelic richness (Ar), which characterizes the overall genetic diversity and the presence of unique alleles within populations was calculated using rarefaction in ADZE 1.0 [30] to account for unequal sample sizes.
Phylogenetic relationships of three Kazakhstani sheep breeds with other domestic and wild sheep were determined using UPGMA method implemented in MEGA v.11 [31]. The visualization of the tree was performed in Figtree 1.4.4 [32].

2.4. Population Structure and Demographic Analyses

PLINK 1.9 [28] was used for principal components analysis and a graphical display of the PCA plots were generated using Python 3 [33]. Admixture 1.3 [34] was used to estimate genetic components of the studied populations and to determine their optimal number (K). The resulting plots were constructed on Clumpak [35].
Historical gene flow patterns among the studied domestic sheep breeds and wild sheep populations were depicted via the maximum likelihood tree-based approach implemented in TreeMix v.1.13 [36]. We removed SNPs with more than 5% missingness, then used the --freq option in PLINK to generate the required allele-count input. We tested models allowing 0 to 4 migration edges (−m 0 to −m 4), employing a block-jackknife of 500 SNPs (−k 500) to account for residual linkage. Argali was setted as an outgroup to root the tree, given its recognized genetic divergence of an extant wild representative toward domestic sheep. The model fit was assessed using residual plots generated by TreeMix, which display the residual covariance between observed and model-predicted allele frequencies. A well-fitting model exhibits minimal and symmetrically distributed residuals. The four-migration-edge model minimized these residuals and was selected accordingly.
Linkage disequilibrium (LD) decay was estimated using PLINK 1.9 [28], while effective population size (Ne) trajectories were inferred with SNeP 1.1 [37]. Before analysis, a more stringent LD pruning was applied (a 100-SNP window, step size of 50 SNPs, and r2 < 0.1). All plots were generated in R using the ggplot2 4.0.0 package [38].

3. Results

3.1. Genome-Wide SNP Genotyping Data Assessment

We generated novel genome-wide SNP genotyping data for 160 animals representing three fat-tailed coarse-wooled sheep breeds widely distributed across Kazakhstan. This dataset was used to investigate the genetic structure and phylogenetics of these populations in comparison with wild sheep and a range of global domestic sheep breeds.
SNP data analysis was conducted by dividing the dataset into three groups: (1) Kazakhstan sheep breeds only, (2) domestic sheep breeds only, and (3) all sheep under the study including domestic and wild sheep. Moreover, to obtain a clearer picture of the variation within domestic sheep breeds from Asia, we performed the PCA and Admixture including only Asian sheep breeds. For the analyses involving the domestic sheep groups and the combined group of all sheep (domestic and wild), only 20 animals from each Kazakhstan sheep breed (ED, KFTCW, and GISS) populations were randomly selected to balance the sample sizes across populations and to ensure the reliability of the analysis. A summary of the SNP quality control results is presented in Table 2.
Thus, the number of SNPs selected after the quality control assessment (column 7 of Table 2) was sufficient to conduct further analyses and ensure the reliability of the results obtained.

3.2. Genetic Diversity and Population Structure Analyses

Several parameters were calculated for each of the 14 domestic sheep populations in the study to evaluate within-breed genetic diversity. Wild sheep populations were excluded from this analysis due to their small sample sizes, which were insufficient to ensure reliable results (Table 3).
As shown in Table 3, the expected heterozygosity (He) varied from 0.316 in OUE-FRANCE to 0.384 in ED-KAZ, while the observed heterozygosity (Ho) ranged from 0.272 in OUE-FRANCE to 0.419 in NQA-NAMIBIA. The highest He values observed in ED-KAZ, KCW-KRG, GISS-KRG, QEZ-SW-ASIA and SOP-ITALY suggest relatively high genetic variability, whereas OUE-FRANCE and NQA-NAMIBIA showed reduced diversity. The proportion of polymorphic SNPs (Pn) was highest in ED-KAZ (98.90%), KFTCW-KAZ (98.54%), and GISS-KAZ (98.35%), indicating high levels of genetic diversity of sheep breeds from Kazakhstan. In contrast, non-Asian breeds such as OUE-FRANCE (79.97%) and NQA-NAMIBIA (74.21%) exhibited lower SNP polymorphism, suggesting reduced genetic variability or possibly historical bottlenecks or founder effects. Allelic richness (Ar) was highest in KCW-KRG (1.90427), QEZ-SW-ASIA (1.90433), and ED-KAZ (1.90702), reflecting a greater reservoir of genetic variation. In contrast, OUE-FRANCE (1.65567) and NQA-NAMIBIA (1.65979) exhibited notably lower allelic richness, consistent with their reduced levels of polymorphism and potentially indicating past demographic constraints or limited gene flow. The within-population inbreeding coefficient (FIS) values were generally low, ranging from –0.134 to 0.065, suggesting little or no inbreeding within most populations. Negative FIS values observed in Kazakh and Kyrgyz populations (ED-KAZ, KFTCW-KAZ, GISS-KAZ, and KCW-KRG) indicate a slight excess of heterozygotes, which may reflect balanced breeding strategies and large effective population sizes. In contrast, positive FIS values in OUE-FRANCE (0.154) and TIBQ-CHINA (0.065) suggest moderate inbreeding or substructuring effects within these populations. Pairwise population differentiation, expressed as mean FST, ranged from 0.126 (ED-KAZ) to 0.301 (OUE-FRANCE), indicating moderate to high genetic differentiation among global sheep breeds. The lowest FST values among Asian populations support their shared ancestry and relatively recent divergence compared to non-Asian groups.
Principal component analysis (PCA) was performed to examine the population structure of the studied sheep breeds (Figure 2).
As shown in Figure 2A, the ED-KAZ population forms a dense, compact cluster, indicating low intra-population variance. In contrast, the KFTCW-KAZ population displays a distinct vertical spread along PC2 on the left side of the plot, suggesting greater internal variability along this component and a clear separation from the other groups. The GISS-KAZ population is distributed horizontally along PC1 on the right side of the plot, indicating differentiation primarily along this axis. The PCA clearly distinguishes the three populations, revealing underlying genetic structure. The separation along the first two principal components highlights strong group-specific patterns of variation.
Subsequently, PCA was performed for worldwide domestic sheep populations (Figure 2B) and domestic sheep populations from Asia (Figure 2C). The first two principal components (PC1 and PC2) in Figure 2B reveal a clear separation of populations based on their geographical origin. Breeds from Europe (SOP-ITALY, OUE-FRANCE), Africa (NQA-NAMIBIA, RMA-KENYA), and from the Barbados and U.S. (BBB-USA) are well differentiated from Asian populations, reflecting strong genetic structure among the studied groups. African breeds, together with the BBB and SOP-ITALY populations, are distributed vertically along PC1 on the right side of the plot, whereas the OUE-FRANCE population is positioned distinctly on the left side along PC1, separate from the other populations.
Since all Asian sheep breeds clustered within a single region of the plot with substantial overlap in Figure 2B, a separate PCA was performed focusing exclusively on Asian breeds to explore their distribution in greater detail (Figure 2C). In this plot, TON-MONGOL and TIBQ-CHINA form two distinct and well-separated clusters, reflecting the unique genetic signatures of East Asian breeds compared with other Asian breeds in the study. Breeds from South-West Asia (KRKS-SW-ASIA and QEZ-SW-ASIA) also form distinct clusters. Populations from Kyrgyzstan (GISS-KRG and KCW-KRG) appear in proximity, suggesting relatedness with regional divergence. Interestingly, the Gissar breed from Kazakhstan (GISS-KAZ) is positioned far from the other sheep breeds from Kazakhstan (ED-KAZ and KFTCW-KAZ), but clusters closely with the Gissar breed from Kyrgyzstan (GISS-KRG). ED-KAZ and KFTCW-KAZ show clear genetic differentiation but cluster relatively close together, consistent with their shared geographic origin in Kazakhstan.
Finally, PCA was performed for all populations included in the study, encompassing both global wild sheep and domestic sheep breeds (Figure 2D). The PCA plot reveals clear genetic structuring among global sheep populations, including both wild and domestic sheep. The first principal component (PC1) primarily separates the wild Argali populations from all domestic breeds and Urial populations, indicating deep genetic divergence between the Argali lineage and domestic sheep. Meanwhile, Urial populations form a distinct cluster located between the Argali and domestic breeds, consistent with their phylogenetic status as a separate wild species. African sheep populations (NQA-NAMIBIA and RMA-KENYA) form clearly separated clusters along PC2, distinct from other domestic breeds, confirming their genetic differentiation and geographic origin from a separate continent. The BBB-USA breed is positioned between the Eurasian and African breeds, suggesting a potential admixture or intermediate genetic profile. Interestingly, European sheep breeds (SOP-ITALY and OUE-FRANCE) are genetically distinct and well-separated from Asian breeds, particularly along PC2, highlighting the presence of population-specific variation not captured by PC1 alone. Among the domestic populations, Asian breeds form compact, well-defined clusters, suggesting high within-group genetic homogeneity and marked between-group divergence, consistent with their shared geographic origin.
Next, Admixture analysis was performed to assess the genetic components of the sheep breeds from Kazakhstan and Asian domestic sheep breeds (Figure 3), as well as among the global domestic sheep breeds and wild sheep species (Supplementary Figure S1).
Admixture analysis shown in Figure 3A reveals the population structure of the three groups (ED-KAZ, KFTCW-KAZ, and GISS-KAZ) across K = 2 to K = 4 genomic components, with the lowest cross-validation error observed at K = 2. At K = 2, ED-KAZ and KFTCW-KAZ animals are predominantly assigned to a single genetic cluster (blue), while GISS-KAZ is largely represented by a distinct cluster (orange). At K = 3, a third cluster (purple) emerges, contributing mainly to KFTCW-KAZ, indicating internal substructure within this population. At K = 4, a fourth cluster (green) appears sporadically within ED-KAZ, though it does not constitute a major component in any group, potentially reflecting low-level admixture.
Admixture plot in Figure 3B illustrates the genetic components of Asian domestic sheep populations under varying assumptions of genetic components clusters. The Admixture results at K = 2 reveal a major division: KRKS-SW-ASIA is dominated by an orange cluster, while the remaining populations share a largely homogeneous blue cluster. At K = 3, a third (purple) cluster emerges, particularly prevalent in TON-MONGOL and TIBQ-CHINA, indicating deep genetic divergence between East Asian and Central/Western Asian breeds. At K = 4, a fourth cluster (green) appears, especially in KRKS-SW-ASIA, KCW-KRG, and QEZ-SW-ASIA, suggesting finer-scale differentiation among Central Asian and Middle Eastern populations. Admixture signals are evident in transitional populations such as KFTCW-KAZ and QEZ-SW-ASIA, reflecting gene flow or shared ancestral components.
Supplementary Figure S1A presents the Admixture results for global domestic breeds, including sheep breeds from Europe, Africa, and the Americas. Based on the results at the optimal value of K = 8, the domestic sheep populations exhibit distinct genetic clusters with varying degrees of admixture. East Asian populations such as TON-MONGOL and TIBQ-CHINA display relatively homogeneous genetic components, indicating limited recent gene flow from other groups. In contrast, Central Asian populations like ED-KAZ and GISS-KAZ show considerable admixture, suggesting historical or recent introgression from multiple genetic backgrounds. Notably, KRKS-SW-ASIA, OUE-FRANCE, NQA-NAMIBIA, RMA-KENYA, BBB-USA, and SOP-ITALY are characterized by unique and dominant ancestry components, pointing to their genetic divergence and possibly long-term isolated breeding.
The Admixture analysis of the global dataset, including both domestic and wild sheep populations, reveals clear genetic separation based on geographic origin (Supplementary Figure S1B). The optimal number of genetic components clusters was determined to be K = 8, based on the lowest cross-validation error. At K = 7, wild and domestic populations are well differentiated, with Argali and Urial sheep forming distinct genetic components. Domestic breeds exhibit a high degree of ancestry homogeneity, particularly among Asian populations, which are predominantly represented by a blue cluster. The KERMAN_WILD-IRAN population shows greater genetic affinity with Urial than with Argali, suggesting closer evolutionary relatedness. Overall, the population structure observed at K = 7 remained consistent at K = 8 and K = 9, without introducing substantial changes.

3.3. Phylogenetics and Demographic Analyses, and Breed Differentiation

To elucidate the genetic relationships among the studied wild and domestic sheep populations, an un-rooted neighbor-joining phylogenetic tree was constructed based on genome-wide SNP data (Figure 4A).
The circular phylogenetic tree displays distinct clustering patterns consistent with population identity and geographic origin. Animals from the same population clustered closely, reflecting strong within-group genetic similarity. The European sheep from France and Italy clustered together with the breed from USA reflecting the probable common ancestry or possible introgression and forming a monophyletic clade to the wild representatives from Pakistan, Mongolia and Central Asia. African sheep breeds from Namibia and Kenya clustered a separate branch sharing geographic proximity. The Middle Eastern breeds (Turkey and Iran) clustered into a monophyletic group that was closely related to Central Asian and Chinese breeds, respectively.
TreeMix analysis was performed to investigate the relationships and potential gene flow among wild and global domestic sheep populations. The resulting maximum likelihood tree (Figure 4B) reveals distinct clustering of major lineages. The most optimal value of m, demonstrating the highest reproducibility and consistency across replicates, was determined to be 3. The ARGALI wild sheep form a deeply divergent clade, well separated from both Urial and domestic breeds. The URIAL-PAKISTAN populations occupy a basal position relative to domestic sheep, indicating a closer genetic relationship to domesticated forms than ARGALI. The KERMAN_WILD-IRAN population shows a closer genetic affinity to domestic breeds than to other wild lineages. Asian domestic sheep populations cluster together with short branch lengths, reflecting recent common ancestry or limited differentiation. The close genetic relationship among Central Asian breeds, where the Kazakh (ED, KFTCW, GISS) and Kyrgyz (GISS, KCW) populations formed a distinct subcluster, reflecting their shared ancestry and genetic continuity across the region. More geographically isolated breeds, such as SOP-ITALY, BBB-USA, RMA-KENYA and NQA-NAMIBIA form peripheral branches.
Next, we constructed linkage disequilibrium (LD) decay and effective population size (Ne) graphs (Figure 5A and Figure 5B, respectively) to assess patterns of genetic diversity and historical demography across populations and reveal recent population dynamics and long-term genetic structure. Summary statistics of pairwise LD (r2) values for each population are provided in Supplementary Table S2, while estimates of effective population size (Ne) obtained from LD-based analysis are summarized in Supplementary Table S3. These supplementary data complement the graphical results by presenting detailed numerical summaries for all analyzed sheep populations.
Wild sheep populations were excluded from the linkage disequilibrium (LD) decay analysis due to their small sample size, which was insufficient for reliable estimation of LD decay. Therefore, the LD decay graph shown in Figure 5A includes only Asian domestic sheep breeds, while Supplementary Figure S2 presents the LD decay graph for all sheep populations, including non-Asian domestic breeds and selected wild sheep populations with sufficient sample sizes for inclusion in the analysis.
The LD decay analysis (Figure 5A) illustrates the decline of linkage disequilibrium (measured as mean r2) with increasing physical distance between SNPs across multiple Asian domestic sheep populations. A typical LD decay pattern was observed, where r2 values decreased sharply within the first 100–200 kb and plateaued at greater distances. Among the populations, KRKS-SW-ASIA exhibited the slowest LD decay and the highest overall r2 values, suggesting extended linkage disequilibrium. In contrast, ED-KAZ showed the fastest LD decay and the lowest r2 values, indicating shorter haplotype blocks and potentially greater historical recombination. Populations such as GISS-KAZ, QEZ-SW-ASIA, and TON-MONGOL showed intermediate LD decay rates, reflecting varying levels of genetic diversity and effective recombination.
Figure 5B presents the estimated historical effective population sizes (Ne) for domestic and wild sheep populations over the past 100 generations. Domestic sheep from Kazakhstan (ED-KAZ, GISS-KAZ, and KFTCW-KAZ) exhibited relatively high Ne values with a steady increase in the distant past, reaching estimates between 3500 and 4000 animals. In contrast, wild sheep populations such as URIAL-PAKIST and URIAL-TAJIK demonstrated much lower Ne values, particularly in recent generations.

4. Discussion

In this study, we present the first systematic analysis of the genetic structure, phylogenetics relationships, and demographic history of three fat-tailed coarse-wooled sheep breeds from Kazakhstan in a global context. Our findings reveal high levels of SNP polymorphism and allelic richness, particularly the Edilbay breed, reflecting substantial genetic diversity likely maintained through large effective population size and extensive breeding history. The high genetic diversity observed in Edilbay sheep in our study is consistent with previous findings [17,20,39], which reported high levels of allelic richness and heterozygosity in this breed.
Population structure analyses (PCA and Admixture) revealed the well-defined clustering and minimal overlap among three Kazakh breeds, with Edilbay and Gissar forming genetically homogeneous groups and KFTCW-KAZ displaying higher internal variability. Evidence of Edilbay-specific components in Gissar and KFTCW-KAZ suggests historical introgression, possibly due to shared ancestry or past breeding practices. These results are consistent with our previous findings, which demonstrated clear genetic differentiation between the Edilbay and Gissar sheep breeds [20]. However, the admixture results also revealed traces of Edilbay-specific genetic components in the Gissar and KFTCW-KAZ breeds, suggesting that introgression from the Edilbay breed may have occurred historically. In a previous study comparing Kazakh sheep breeds with global sheep populations, Edilbay sheep were identified as direct descendants of historical domestic sheep ancestors [39]. Consistent with previous findings, our results suggest that the Edilbay breed may represent a genetic ancestral source for both the Kazakh fat-tailed coarse-wooled sheep and the Gissar breed. Additional support for this hypothesis comes from the mitochondrial DNA analysis conducted by Hiendleder et al. (1998) [12], which demonstrated that the Edilbay sheep diverged from wild sheep ancestors earlier than both the Gissar and Astrakhan breeds. This finding reinforces the notion that the Edilbay breed has ancient genetic roots and may predate several modern Central Asian breeds in terms of its evolutionary history. However, Deniskova et al. (2018) [40] proposed that the Edilbay breed originated from a cross between Kazakh fat-tailed coarse-wooled sheep and the Astrakhan breed. Nevertheless, this hypothesis is not yet confirmed by direct genetic evidence and as far as now no genetic data are available to support such hybrid origin. Furthermore, the available literature provides no concrete information regarding the precise origin, timeline, or breeding methods used in the development of the Edilbay sheep. In general, the breed is regarded as a product of traditional or folk selection practices carried out by local pastoral communities. It is also likely that the name “Edilbay” was derived from the name of a tribe or clan historically associated with the region where the breed was developed. On the contrary, the Astrakhan breed of sheep is considered to be descendant from the Edilbay breed. During the Great Jura of 1879, Kazakh nomads temporarily moved to the Astrakhan region in search of pastures. The descendants of Edilbay sheep left in this area over time formed an Astrakhan breed. Thus, the modern Astrakhan breed of sheep is in fact a genetic continuation of the Edilbay breed [41,42,43,44].
The PCA results highlight pronounced genetic structuring among global sheep populations, reflecting both their domestication history and geographic distribution. The clear separation of Argali and Urial from domestic breeds supports their deep evolutionary divergence and distinct phylogenetic status. Moreover, the genetic differentiation observed among African, European, and Asian domestic populations underscores the influence of geographic isolation and localized breeding histories on the genetic makeup of modern sheep breeds. Both PCA and Admixture revealed strong geographic clustering of global domestic sheep, with clear continental differentiation and fine-scale structure within Asia. Analyses highlighted the distinctiveness of East Asian breeds (e.g., TON-MONGOL, TIBQ-CHINA) and showed a close genetic relationship between GISS-KAZ and GISS-KRG. These results reinforce the hypothesis of historical genetic continuity and introgression across West, Central, and East Asian sheep lineages [23]. Also, these findings are in agreement with the results of Wei et al. (2015), who demonstrated by Admixture analysis that Kazakh sheep are genetically distinct from those of Tibet and China [45]. The strong genetic affinity observed between GISS-KAZ and GISS-KRG aligns with previous studies [21], suggesting that the Gissar breed originated from a single historical and geographical region—the Pamir-Alai mountainous region [46]. During the Soviet era, active exchange of livestock among Kyrgyzstan, Tajikistan, Uzbekistan, and Kazakhstan [47,48] likely facilitated gene flow and contributed to the preservation of genetic similarity across regions. Additionally, breeding programs in these countries often targeted similar traits (e.g., live weight, fat deposition in the rump), which may have reinforced a shared genetic structure. It is also important to note that the SNP panel used was of medium density and did not cover the entire genome, potentially limiting the detection of highly polymorphic regions or signals of selection. On the other hand, the development of the Ovine SNP50 BeadChip did not adequately account for fat-tailed sheep, resulting in their underrepresentation in the SNP Discovery panel [49].
Additionally, Admixture analysis detected substructure within South-West Asian populations (KRKS-SW-ASIA and QEZ-SW-ASIA) that was not resolved by PCA, indicating its greater sensitivity to ancestral admixture and gene flow. The results confirm a stable genetic boundary between Iranian and Turkish breeds, shaped by geography and selection, consistent with previous studies [50,51]. It is known that Admixture models are generally more effective than PCA in detecting fine-scale ancestry and recent gene flow [52,53].
Both PCA and Admixture analyses consistently revealed strong genetic structuring among global domestic and wild sheep populations, particularly along geographic lines. The separation of wild species (Argali and Urial) from domestic breeds supports deep evolutionary divergence, while the intermediate position of Urial sheep aligns with their phylogenetic placement. The homogeneity of Asian domestic breeds observed in PCA is mirrored in Admixture by the dominance of a shared genetic cluster, highlighting their cohesiveness. Admixture and TreeMix further revealed common ancestry between Central Asian and Iranian sheep, corroborating the findings of Moosanezhad Khabisi et al. [54]. This shared ancestry likely reflects historical events such as nomadic expansions, military invasions, and trade along the ancient Silk Road.
The phylogenetic tree offers a clear visualization of the genetic divergence and genetic relationships among sheep populations. The deep separation of Argali from both Urial and domestic breeds supports its role as an evolutionarily distinct lineage, corroborating findings from TreeMix and PCA analyses. Meanwhile, Urial populations, while genetically distinct, are positioned closer to domestic lineages in spite of the fact that there is no evidence for contributions to domestication from urials [12]. Kerman wild sheep was determined by both genome-wide SNP and mtDNA analysis. SNP analysis showed that the Kerman sheep is an intermediate species between O. vignei and O. gimelini, while according to mtDNA close to O. vignei [55]. In our study, the Kerman wild sheep’s intermediate placement between wild and domestic clusters may indicate a historical admixture or transitional role in early sheep domestication in the Iranian Plateau. The tight clustering of Central Asian domestic breeds indicates genetic continuity, likely due to shared ancestry and gene flow, supporting earlier reports of a common genetic background among fat-tailed coarse-wooled sheep along the Silk Road [54].
Distinct branches of African (RMA-KENYA, NQA-NAMIBIA), American (BBB-USA), and European (SOP-ITALY, OUE-FRANCE) breeds point to geographic isolation and local adaptation, in line with phylogenetic clades reported in previous studies [16,23]. Overall, the phylogenetic results corroborate PCA and Admixture analyses, highlighting the complex history of domestication, divergence, and regional differentiation in sheep.
The TreeMix results provide valuable insight into the domestication and migration history of sheep. The deep divergence of Argali from all other groups supports its role as a genetically distinct wild ancestor not directly involved in sheep domestication. In contrast, the Urial populations, especially those from Pakistan, appear more closely related to domestic breeds. The genetic cohesion observed among Central Asian breeds supports a scenario of regional expansion from a common ancestral stock, while the longer branches of peripheral breeds may reflect geographic isolation and genetic drift. The detected migration event involving RMA-KENYA, NQA-NAMIBIA and BBB-USA into Central Asian domestic sheep implies historical gene flow, possibly mediated by transregional livestock trade or pastoralist movements between Africa and Asia. This signal of admixture highlights the complex demographic history of domestic sheep, shaped not only by domestication and divergence but also by later episodes of cross-continental gene flow.
The low and mostly negative FIS values in Central Asian breeds suggest that these populations maintain healthy levels of genetic variation, likely due to their large population sizes and traditional breeding practices that promote genetic mixing. This pattern aligns with the observed high heterozygosity and allelic richness in Kazakh and Kyrgyz sheep. Conversely, the elevated FIS in European and East Asian breeds (e.g., OUE-FRANCE, TIBQ-CHINA) may reflect population subdivision or recent inbreeding. The moderate FST values among Asian populations indicate ongoing gene flow and shared historical origins, whereas higher FST in geographically distant populations such as OUE-FRANCE and NQA-NAMIBIA reflects stronger genetic differentiation driven by geographic isolation and breeding system differences. These findings collectively emphasize the genetic cohesion within Asian sheep and their clear divergence from other continental groups.
The LD decay patterns among Asian domestic sheep populations reflect significant variation in demographic history and breeding structure. In KRKS-SW-ASIA, slow LD decay with high r2 values indicates reduced recombination and small effective population size, likely due to bottlenecks, inbreeding, or recent selection, consistent with Eydivandi et al. [50]. In contrast, rapid LD decay and lower r2 values in ED-KAZ suggest higher genetic diversity, larger effective population size, and more recombination, in line with its high polymorphism and allelic richness indicating that this population may have experienced fewer demographic constraints or more gene flow over time. These results agree with Pozharsky’s estimates of LD decay in Kazakh sheep breeds [37] and with ROH analyses showing that the Edilbay breed retains broad genetic diversity, likely due to its native status and balanced breeding strategies aimed at preserving genetic variability [56].
The Ne estimates further support this interpretation. Kazakhstani breeds (ED-KAZ, KFTCW-KAZ and GISS-KAZ) maintained large effective population sizes, reflecting stable breeding histories and the long-term preservation of genetic diversity. This pattern likely results from less intensive artificial selection, broad breeding networks, and the socio-economic importance of meat-fat–tailed sheep, which comprise over 70% of Kazakhstan’s sheep population [42]. These sheep are deeply rooted in traditional Kazakh pastoralism due to their exceptional adaptability to the harsh environmental conditions of arid and semi-arid zones. Among them, the Edilbay breed is the most widespread and serves as a key genetic resource for improving meat-fat traits in other native breeds. Its large Ne is directly linked to population size and management structure; for example, the “Birlik” farm maintains 16,512 Edilbay sheep, organized into flocks of 500–600 animals that promote genetic exchange and reduce inbreeding [17]. The breed’s wide distribution across diverse environments and the persistence of traditional folk selection further safeguard its gene pool. Similar results were reported by Deniskova et al. (2018) [40], with Central Asian breeds showing higher Ne compared to global populations [8], likely due to mass breeding, adaptability, and cultural importance. In contrast, the reduced Ne values observed in URIAL-PAKIST and URIAL-TAJIK likely reflect recent demographic declines driven by poaching, habitat loss, and competition with domestic livestock. Such pressures have fragmented urial populations into small, isolated groups, consistent with their documented conservation concern in both Pakistan and Tajikistan [57,58].
To sum up, the results of this study provide valuable insights into the demographic and evolutionary history of sheep populations in Central Asia, highlighting both diversity and connectivity in the region’s livestock genetic resources.

5. Conclusions

In summary, this study provides the first comprehensive genomic assessment of fat-tailed coarse-wooled sheep from Kazakhstan in the context of wild and globally distributed domestic sheep populations. Our results demonstrate that Kazakh breeds, particularly the Edilbay, harbor exceptionally high levels of genetic diversity, maintained through large effective population sizes and extensive breeding histories. Population structure analyses consistently revealed well-defined clustering among Kazakh breeds, with Edilbay and Gissar forming homogeneous groups, while Kazakh fat-tailed coarse-wooled sheep exhibited greater variability and signatures of admixture. The Edilbay breed emerged as a key ancestral genetic resource for Central Asian sheep, with evidence of historical introgression into Gissar and other regional breeds.
Comparative analyses further highlighted strong geographic structuring of global sheep populations, with clear separation between continental groups and fine-scale differentiation among Asian lineages. The genetic continuity observed across Central Asian, Iranian, and Chinese breeds underscores the role of historical migrations and trade along the Silk Road in shaping sheep diversity. Moreover, wild sheep species (Argali and Urial) were confirmed as deeply divergent lineages, with Kerman wild sheep positioned as a distinct urial subspecies closely related to domestic lineages on the Iranian Plateau. Together, these findings underscore Kazakhstan’s fat-tailed sheep as an important reservoir of genetic diversity and a cornerstone of Central Asian pastoral heritage. They provide a valuable genomic framework for future conservation, breeding, and sustainable management programs aimed at preserving both the productivity and adaptability of these historically significant breeds.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/biology14111478/s1, Figure S1: Admixture analysis of the studied domestic sheep populations: (A) worldwide domestic breeds (optimal K = 8). (B) domestic sheep breeds and wild sheep populations (optimal K = 8). (C) Delta K graph obtained with the CLUMPAK software; Figure S2: LD Decay across the studied sheep populations including Asian and non-Asian domestic breeds and selected wild sheep populations; Table S1: Worldwide domestic sheep breeds and wild sheep populations included in the study (references [59,60,61,62] are cited in Supplementary Materials). Table S2: Summary statistics of linkage disequilibrium (r2) across sheep populations. Table S3: Summary of estimated effective population size (Ne) across sheep populations.

Author Contributions

Conceptualization, K.D., M.A., D.U. and E.C.; methodology, K.D., M.A., T.K., A.K. and R.A.; software, M.A. and B.B.; validation, K.D. and M.A.; formal analysis, M.A. and B.B.; investigation, K.D., M.A., D.U. and R.A.; resources, K.D. and B.B.; data curation, K.D. and M.A.; writing—original draft preparation, M.A. and K.D.; writing—review and editing, K.D., B.B. and E.C.; visualization, M.A. and B.B.; supervision, K.D.; project administration, K.D., M.A. and D.U.; funding acquisition, K.D., M.A. and D.U. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by program-targeted funding for 2023–2025 of the Ministry of Education and Science of the Republic of Kazakhstan, BR21882201 “Improvement of meat productivity of fat-tailed sheep by new methods of selection, genetics and biotechnology” and by grant funding of young scientists for scientific and (or) scientific and technical projects for 2023–2025 years of the Ministry of Science and Higher Education of the Republic of Kazakhstan AP19577108 “Analysis of phylogenetic relationships and evolutionary demographic events of fat-tailed coarse-wool sheep breeds raised in Kazakhstan”.

Institutional Review Board Statement

The animal study protocol was approved by the Local Ethics Commission of the RSE Institute of Genetics and Physiology Committee of Science of the Ministry of Science and Higher Education of the Republic of Kazakhstan (Conclusion of the Local Ethics Commission №6, 3 November 2022, Almaty, Kazakhstan).

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Acknowledgments

We sincerely thank the staff of all sheep farms for providing data and assisting in the data collection process. Special thanks go to Birlik Mal Zauyty LLP and its head, Kanat Eshimov, for providing detailed information on the studied sheep breeds.

Conflicts of Interest

Authors Kairat Dossybayev, Makpal Amandykova, Daniya Ualiyeva, Tilek Kapassuly and Altynay Kozhakhmet were employed by LLP «Kazakh Research Institute of Animal Husbandry and Fodder Production». The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:
SNPSingle Nucleotide Polymorphism
RSERepublican State Enterprise
EDTAEthylenediaminetetraacetic acid
PCAPrincipal Component Analysis
LDLinkage Disequilibrium

References

  1. Kassenbayev, G.; Kerimova, U.; Rakhimzhanova, G.; Shalgimbayeva, K. Animal husbandry market in Kazakhstan: Dynamics and prognosis. Sci. Horiz. 2024, 27, 176–188. [Google Scholar] [CrossRef]
  2. Byuro Natsional’noy Statistiki (Office of National Statistics). Available online: https://stat.gov.kz/ru/industries/business-statistics/stat-forrest-village-hunt-fish/publications/287928/ (accessed on 22 July 2025).
  3. Zhumadillayev, N.K. Dissertation on the Topic “Scientific and Practical Justification for Improving Productive Traits and Creating New Genotypes of Fat-Tailed Sheep in Various Regions of Kazakhstan” (Nauchno-Prakticheskoe Obosnovanie Povysheniya Produktivnykh Kachestv i Sozdaniya Novykh Genotipov Kurdyuchnykh Ovets v Razlichnykh Regionakh Kazakhstana). 2025. Available online: https://mbook.kz/media/360/items/58/index.html (accessed on 23 July 2025).
  4. Isaeva, D.; Korotkevich, O. Characteristics of the Edilbay sheep breed of the Republic of Kazakhstan. Bull. NSAU (Novosib. State Agrar. Univ.) 2022, 157–163. [Google Scholar] [CrossRef]
  5. Katasheva, A.; Iskakov, K.; Kulataev, B.; Abdramanov, A.; Sattorov, S. Improving the efficiency of the production of mutton of kazakh fat-tailed coarse-wool sheep. Izdenister Natigeler 2024, 2, 55–62. [Google Scholar] [CrossRef]
  6. Zhumadillaev, N. Introductory crossing in the meat-fat sheep production of western Kazakhstan. Sheep Goats Woolen Bus. 2021, 4, 7–12. [Google Scholar] [CrossRef]
  7. Rákossy, Z.; Petrescu-Mag, I.V.; Kovacs, E.; Gavriloaie, C.; Coroian, C.O.; Păpuc, T.; Ungureanu, T.; Botha, M. Black wooled sheep breeds of the world, I. ABAH Bioflux 2019, 11, 69–78. [Google Scholar]
  8. Deniskova, T.; Dotsev, A.; Lushihina, E.; Shakhin, A.; Kunz, E.; Medugorac, I.; Reyer, H.; Wimmers, K.; Khayatzadeh, N.; Sölkner, J.; et al. Population Structure and Genetic Diversity of Sheep Breeds in the Kyrgyzstan. Front. Genet. 2019, 10, 1311. [Google Scholar] [CrossRef]
  9. Hiendleder, S.; Kaupe, B.; Wassmuth, R.; Janke, A. Molecular analysis of wild and domestic sheep questions current nomenclature and provides evidence for domestication from two different subspecies. Proc. Biol. Sci. 2002, 269, 893–904. [Google Scholar] [CrossRef]
  10. Zeder, M.A. Domestication and early agriculture in the Mediterranean Basin: Origins, diffusion, and impact. Proc. Natl. Acad. Sci. USA 2008, 105, 11597–11604. [Google Scholar] [CrossRef]
  11. Peters, J.; Helmer, D.; Von Den Driesch, A.; Saña Segui, M. Early animal husbandry in the Northern Levant. Paléorient 1999, 25, 27–48. [Google Scholar] [CrossRef]
  12. Hiendleder, S.; Mainz, K.; Plante, Y.; Lewalski, H. Analysis of mitochondrial DNA indicates that domestic sheep are derived from two different ancestral maternal sources: No evidence for contributions from urial and argali sheep. J. Hered. 1998, 89, 113–120. [Google Scholar] [CrossRef]
  13. Larsen, P.F.; Demontis, D.; Nielsen, V.H.; Thirstrup, J.P.; Loeschcke, V.; Pertoldi, C. The Transferability of Illumina Canine BeadChip Single-Nucleotide Polymorphisms (SNPs) to American Mink (Neovison vison). Biochem. Genet. 2012, 50, 717–721. [Google Scholar] [CrossRef]
  14. Haynes, G.D.; Latch, E.K. Identification of novel single nucleotide polymorphisms (SNPs) in deer (Odocoileus spp.) using the BovineSNP50 BeadChip. PLoS ONE 2012, 7, e36536. [Google Scholar] [CrossRef]
  15. More, M.; Gutiérrez, G.; Rothschild, M.; Bertolini, F.; Abel, F. Evaluation of SNP Genotyping in Alpacas Using the Bovine HD Genotyping Beadchip. Front. Genet. 2019, 10, 433129. [Google Scholar] [CrossRef]
  16. Kijas, J.W.; Lenstra, J.A.; Hayes, B.; Boitard, S.; Porto Neto, L.R.; San Cristobal, M.; Servin, B.; McCulloch, R.; Whan, V.; Gietzen, K.; et al. Genome-Wide Analysis of the World’s Sheep Breeds Reveals High Levels of Historic Mixture and Strong Recent Selection. PLoS Biol. 2012, 10, e1001258. [Google Scholar] [CrossRef]
  17. Dossybayev, K.; Orazymbetova, Z.; Mussayeva, A.; Saitou, N.; Zhapbasov, R.; Makhatov, B.; Bekmanov, B. Genetic diversity of different breeds of Kazakh sheep using microsatellite analysis. Arch. Anim. Breed. 2019, 62, 305–312. [Google Scholar] [CrossRef]
  18. Mukhametzharova, I.; Islamov, Y.; Shauyenov, S.; Ibrayev, D.; Atavliyeva, S.; Tarlykov, P. Genetic Characterization of Kazakh Native Sheep Breeds Using Mitochondrial DNA. OnLine J. Biol. Sci. 2018, 18, 341–348. [Google Scholar] [CrossRef]
  19. Amandykova, M.; Akhatayeva, Z.; Kozhakhmet, A.; Kapassuly, T.; Orazymbetova, Z.; Yergali, K.; Khamzin, K.; Iskakov, K.; Dossybayev, K. Distribution of Runs of Homozygosity and Their Relationship with Candidate Genes for Productivity in Kazakh Meat–Wool Sheep Breed. Genes 2023, 14, 1988. [Google Scholar] [CrossRef] [PubMed]
  20. Zhumadillayev, N.; Dossybayev, K.; Khamzina, A.; Kapasuly, T.; Khamzina, Z.; Tlevlesov, N. SNP Genotyping Characterizes the Genome Composition of the New Baisary Fat-Tailed Sheep Breed. Animals 2022, 12, 1468. [Google Scholar] [CrossRef] [PubMed]
  21. Dossybayev, K.; Amandykova, M.; Orakbayeva, A.; Adylkanova, S.; Kozhakhmet, A.; Yergali, K.; Kulboldin, T.; Kulataev, B.; Torekhanov, A. Genome-Wide Association Studies Revealed Several Candidate Genes of Meat Productivity in Saryarka Fat- Tailed Coarse-Wool Sheep Breed. Genes 2024, 15, 1549. [Google Scholar] [CrossRef] [PubMed]
  22. Kijas, J.W.; Townley, D.; Dalrymple, B.P.; Heaton, M.P.; Maddox, J.F.; McGrath, A.; Wilson, P.; Ingersoll, R.G.; McCulloch, R.; McWilliam, S.; et al. International Sheep Genomics Consortium. A genome wide survey of SNP variation reveals the genetic structure of sheep breeds. PLoS ONE 2009, 4, e4668. [Google Scholar] [CrossRef]
  23. Lv, F.H.; Cao, Y.H.; Liu, G.J.; Luo, L.Y.; Lu, R.; Liu, M.J.; Li, W.R.; Zhou, P.; Wang, X.H.; Shen, M.; et al. Whole-Genome Resequencing of Worldwide Wild and Domestic Sheep Elucidates Genetic Diversity, Introgression, and Agronomically Important Loci. Mol. Biol. Evol. 2022, 39, msab353. [Google Scholar] [CrossRef]
  24. FAO. Developing the Institutional Framework for the Management of Animal Genetic Resources. FAO Animal Production and Health Guidelines. No. 6. Rome. 2011. Available online: http://www.fao.org/docrep/014/ba0054e/ba0054e00.pdf (accessed on 22 July 2025).
  25. WIDDE. Available online: http://widde.toulouse.inra.fr/widde/widde/main.do?module=sheep (accessed on 22 July 2025).
  26. DRYAD. Available online: https://datadryad.org/ (accessed on 22 July 2025).
  27. Figshare. Available online: https://figshare.com/ (accessed on 22 July 2025).
  28. Purcell, S.; Neale, B.; Todd-Brown, K.; Thomas, L.; Ferreira, M.A.; Bender, D.; Maller, J.; Sklar, P.; de Bakker, P.I.; Daly, M.J.; et al. PLINK: A tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 2007, 81, 559–575. [Google Scholar] [CrossRef]
  29. Weir, B.S.; Cockerham, C.C. Estimating F-Statistics for the Analysis of Population Structure. Evolution 1984, 38, 1358–1370. [Google Scholar] [CrossRef]
  30. Szpiech, Z.A.; Jakobsson, M.; Rosenberg, N.A. ADZE: A rarefaction approach for counting alleles private to combinations of populations. Bioinformatics 2008, 24, 2498–2504. [Google Scholar] [CrossRef] [PubMed]
  31. Kumar, S.; Stecher, G.; Li, M.; Knyaz, C.; Tamura, K. MEGA X: Molecular Evolutionary Genetics Analysis across Computing Platforms. Mol. Biol. Evol. 2018, 35, 1547–1549. [Google Scholar] [CrossRef] [PubMed]
  32. Molecular Evolution, Phylogenetics and Epidemiology. Available online: http://tree.bio.ed.ac.uk/software/figtree/ (accessed on 22 July 2025).
  33. Van Rossum, G.; Drake, F.L. Python 3 Reference Manual; CreateSpace: Scotts Valley, CA, USA, 2009. [Google Scholar]
  34. Alexander, D.H.; Novembre, J.; Lange, K. Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 2009, 19, 1655–1664. [Google Scholar] [CrossRef] [PubMed]
  35. Kopelman, N.M.; Mayzel, J.; Jakobsson, M.; Rosenberg, N.A.; Mayrose, I. CLUMPAK: A program for identifying clustering modes and packaging population structure inferences across K. Mol. Ecol. Resour. 2015, 15, 1179–1191. [Google Scholar] [CrossRef]
  36. Pickrell, J.K.; Pritchard, J.K. Inference of population splits and mixtures from genome-wide allele frequency data. PLoS Genet. 2012, 8, e1002967. [Google Scholar] [CrossRef]
  37. Barbato, M.; Orozco-terWengel, P.; Tapio, M.; Bruford, M.W. SNeP: A tool to estimate trends in recent effective population size trajectories using genome-wide SNP data. Front. Genet. 2015, 6, 109. [Google Scholar] [CrossRef]
  38. Wickham, H. ggplot2: Elegant Graphics for Data Analysis; Springer: New York, NY, USA, 2009. [Google Scholar]
  39. Pozharskiy, A.; Khamzina, A.; Gritsenko, D.; Khamzina, Z.; Kassymbekova, S.; Karimov, N.; Karymsakov, T.; Tlevlesov, N. SNP genotyping and population analysis of five indigenous Kazakh sheep breeds. Livest. Sci. 2020, 241, 104252. [Google Scholar] [CrossRef]
  40. Deniskova, T.E.; Dotsev, A.V.; Selionova, M.I.; Kunz, E.; Medugorac, I.; Reyer, H.; Wimmers, K.; Barbato, M.; Traspov, A.A.; Brem, G.; et al. Population structure and genetic diversity of 25 Russian sheep breeds based on whole-genome genotyping. Genet. Sel. Evol. 2018, 50, 29. [Google Scholar] [CrossRef]
  41. Popov, I.N. Edilbay Sheep (Edilbayevskaya Ovtsa); Bastau: Almaty, Kazakhstan, 1951; p. 62. [Google Scholar]
  42. Kanapin, K.K. Edilbay Sheep (Edilbayevskaya Ovtsa); Bastau: Almaty, Kazakhstan, 2009; pp. 25–30. [Google Scholar]
  43. Kiyatkin, P.F. Fat-Tailed Sheep and Breeding Work with Them (Kurduychnyie Ovtsi i Plemennaya Rabota s Nimi); Fan: Tashkent, Uzbekistan, 1968; pp. 4–120. [Google Scholar]
  44. Lus, Y.Y.; Medvedev, N.N. Fat-tailed sheep of Semipalatinsk province (Kurduychnaya ovtsa Semipalatinskoy gubernii). In Domestic aniMals of the Semipalatinsk Province: Mater. Commission of the Expeditionary Research; Publishing House of the USSR Academy of Sciences: Leningrad, Russia, 1928; pp. 176–265. [Google Scholar]
  45. Wei, C.; Wang, H.; Liu, G.; Wu, M.; Cao, J.; Liu, Z.; Liu, R.; Zhao, F.; Zhang, L.; Lu, J.; et al. Genome-wide analysis reveals population structure and selection in Chinese indigenous sheep breeds. BMC Genom. 2015, 16, 194. [Google Scholar] [CrossRef] [PubMed]
  46. Slow Food Foundation for Biodiversity. Available online: https://www.fondazioneslowfood.com/en/ark-of-taste-slow-food/gissar-sheep/ (accessed on 22 July 2025).
  47. Schillhorn van Veen, T.W. The Kyrgyz Sheep Herders at a Crossroads. Pastor. Dev. Netw. Ser. 1995, 38, 1–14. [Google Scholar]
  48. Kerven, C.; Steimann, B.; Ashley, L.; Dear, C.; Rahim, I. Pastoralism and Farming in Central Asia’s Mountains: A Research Review; University of Central Asia: Bishkek, Kyrgyzstan, 2011. [Google Scholar]
  49. van Marle-Köster, E.; Lashmar, S.F.; Retief, A.; Visser, C. Whole-Genome SNP Characterisation Provides Insight for Sustainable Use of Local South African Livestock Populations. Front. Genet. 2021, 12, 714194. [Google Scholar] [CrossRef] [PubMed]
  50. Eydivandi, S.; Sahana, G.; Momen, M.; Moradi, M.H.; Schönherz, A.A. Genetic diversity in Iranian indigenous sheep vis-à-vis selected exogenous sheep breeds and wild mouflon. Anim. Genet. 2020, 51, 772–787. [Google Scholar] [CrossRef]
  51. Bayraktar, M. Analysing the genetic diversity of three sheep breeds in Turkey and nearby countries using 50 K SNPs data. Anim. Biotechnol. 2024, 35, 2329106. [Google Scholar] [CrossRef]
  52. Liu, C.C.; Shringarpure, S.; Lange, K.; Novembre, J. Exploring Population Structure with Admixture Models and Principal Component Analysis. In Statistical Population Genomics. Methods in Molecular Biology; Dutheil, J.Y., Ed.; Humana: New York, NY, USA, 2020; Volume 2090. [Google Scholar]
  53. van Waaij, J.; Li, S.; Garcia-Erill, G.; Albrechtsen, A.; Wiuf, C. Evaluation of population structure inferred by principal component analysis or the admixture model. Genetics 2023, 225, iyad157. [Google Scholar] [CrossRef]
  54. Khabisi, M.M.; Foozi, M.A.; Lv, F.H.; Esmailizadeh, A. Genome-wide DNA arrays profiling unravels the genetic structure of Iranian sheep and pattern of admixture with worldwide coarse-wool sheep breeds. Genomics 2021, 113, 3501–3511. [Google Scholar] [CrossRef]
  55. Dotsev, A.V.; Moradi, M.H.; Deniskova, T.E.; Esmailizadeh, A.; Bakoev, N.F.; Koshkina, O.A.; Griffin, D.K.; Romanov, M.N.; Zinovieva, N.A. Molecular Genetic Assessment Aids in Clarifying Phylogenetic Status of Iranian Kerman Wild Sheep. Animals 2025, 15, 238. [Google Scholar] [CrossRef]
  56. Khamzina, A.; Smagulov, D.; Dossybayev, K.; Kantanen, J.; Khamzin, K. Assessing runs of homozygosity reveals production traits of Kazakh sheep breeds. Braz. J. Biol. 2025, 85, e292980. [Google Scholar] [CrossRef]
  57. Ali Awan, G.; Festa-Bianchet, M.; Ahmad, T. Poaching, recruitment and conservation of Punjab urial Ovis vignei punjabiensis. Wildl. Biol. 2006, 12, 443–449. [Google Scholar] [CrossRef]
  58. Stefan, M. Conservation of Tajik Markhor (Capra falconeri heptneri) and Urial (Ovis vignei) in Tajikistan and adjacent Afghanistan. Galemys Span. J. Mammal. 2010, 22, 407–419. [Google Scholar]
  59. Yuan, Z.; Liu, E.; Liu, Z.; Kijas, J.; Zhu, C.; Hu, S.; Ma, X.; Zhang, L.; Du, L.; Wang, H.; et al. Selection signature analysis reveals genes associated with tail type in Chinese indigenous sheep. Anim. Genet. 2016, 48, 55–66. [Google Scholar] [CrossRef]
  60. Ciani, E.; Lasagna, E.; D’Andrea, M.; Alloggio, I.; Marroni, F.; Ceccobelli, S.; Delgado Bermejo, J.V.; Sarti, F.M.; Kijas, J.; Lenstra, J.A.; et al. Merino and Merino-derived sheep breeds: A genome-wide intercontinental study. Genet. Sel. Evol. 2015, 47, 64. [Google Scholar] [CrossRef]
  61. Rochus, C.M.; Tortereau, F.; Plisson-Petit, F.; Restoux, G.; Moreno-Romieux, C.; Tosser-Klopp, G.; Servin, B. Revealing the selection history of adaptive loci using genome-wide scans for selection: An example from domestic sheep. BMC Genom. 2018, 19, 71. [Google Scholar] [CrossRef]
  62. Dotsev, A.; Koshkina, O.; Kharzinova, V.; Deniskova, T.; Reyer, H.; Kunz, E.; Mészáros, G.; Shakhin, A.; Petrov, S.; Medvedev, D.; et al. Genome-Wide Insights into Intraspecific Taxonomy and Genetic Diversity of Argali (Ovis ammon). Diversity 2023, 15, 627. [Google Scholar] [CrossRef]
Figure 1. Geographic origins and sample localities of the studied sheep breeds from Kazakhstan (photographs were taken by the authors during farm visits for sample collection).
Figure 1. Geographic origins and sample localities of the studied sheep breeds from Kazakhstan (photographs were taken by the authors during farm visits for sample collection).
Biology 14 01478 g001
Figure 2. Principal Component Analysis (PCA) of the studied sheep populations. (A) PCA of the three studied sheep populations from Kazakhstan. (B) PCA of the studied global domestic sheep breeds. (C) PCA of the studied Asian domestic sheep breeds. (D) PCA of the studied wild sheep and global domestic sheep breeds.
Figure 2. Principal Component Analysis (PCA) of the studied sheep populations. (A) PCA of the three studied sheep populations from Kazakhstan. (B) PCA of the studied global domestic sheep breeds. (C) PCA of the studied Asian domestic sheep breeds. (D) PCA of the studied wild sheep and global domestic sheep breeds.
Biology 14 01478 g002
Figure 3. Admixture analysis of the studied sheep breed populations from Kazakhstan (A) (optimal K = 2) and of Asian domestic breeds (B) (optimal K = 2).
Figure 3. Admixture analysis of the studied sheep breed populations from Kazakhstan (A) (optimal K = 2) and of Asian domestic breeds (B) (optimal K = 2).
Biology 14 01478 g003
Figure 4. (A) Phylogenetic tree depicting relationships among the studied sheep populations. (B) TreeMix plot showing the maximum likelihood tree and inferred admixture events among the studied sheep populations.
Figure 4. (A) Phylogenetic tree depicting relationships among the studied sheep populations. (B) TreeMix plot showing the maximum likelihood tree and inferred admixture events among the studied sheep populations.
Biology 14 01478 g004
Figure 5. (A) Estimated linkage disequilibrium (LD) decay for Asian domestic sheep breeds under the study. (B) Estimated effective population size (Ne) for domestic sheep breeds and wild sheep under the study.
Figure 5. (A) Estimated linkage disequilibrium (LD) decay for Asian domestic sheep breeds under the study. (B) Estimated effective population size (Ne) for domestic sheep breeds and wild sheep under the study.
Biology 14 01478 g005
Table 1. Sheep breeds used for the study and their characteristics.
Table 1. Sheep breeds used for the study and their characteristics.
No.Sheep BreedFarmNumber of AnimalsOverall
1Edilbay (ED)“Birlik Mal Zauyty” LLP, Birlik village, Zhangaly district, West Kazakhstan region3397
LLP “Suyindik asyl tukymdy koy zauyty”, Suyindik village, Kurmangazy district, Atyrau region32
Peasant farm “Aigul”, Donenbay village, Ayagoz district, Abay region32
2Kazakh fat-tailed coarse-wooled (KFTCW)Peasant farm “Kundyzdy”, Kundyzdy village, Ulken Naryn district, East Kazakhstan region3232
3Gissar (GISS)Peasant farm “Darbaza”, Darbaza village, Saryagash district, Turkestan region3131
Overall160
Table 2. Summary of SNP quality control results for sheep groups under the study.
Table 2. Summary of SNP quality control results for sheep groups under the study.
Studied Sheep GroupsTotal Number of AnimalsTotal Number of SNPsNumber of Animals After QCNumber of SNPs After QCNumber of Animals After LD PruningNumber of SNPs After LD Pruning
Kazakhstani sheep breeds16051,10216047,96916037,969
Domestic sheep breeds22539,27318236,79818230,230
Domestic sheep breeds and wild sheep26738,66725831,33925827,045
Table 3. Within-breed genetic diversity parameters for the studied domestic sheep populations.
Table 3. Within-breed genetic diversity parameters for the studied domestic sheep populations.
No.PopulationNHeHoFFISMean FSTPn, %Ar
1ED-KAZ200.3839010.396967−0.034014−0.0080593600.12642537198.901.90702
2KFTCW-KAZ200.3795330.390952−0.030186−0.0057617070.13399535398.541.89864
3GISS-KAZ200.3781270.394446−0.043173−0.0169178770.13525799998.351.89474
4TON-MONGOL150.3761770.3692860.0189220.0480464520.14283055296.761.88441
5TIBQ-CHINA140.3753900.3623800.0351350.0655352230.14653951396.211.88215
6KCW-KRG130.3815210.405609−0.063152−0.0234692130.13656791197.621.90427
7GISS-KRG120.3808440.399926−0.049898−0.0086979620.13356861697.601.90648
8KRKS-SW-ASIA180.3729220.394711−0.058287−0.0273270930.15633886396.151.86975
9QEZ-SW-ASIA150.3820870.391640−0.0250530.007706401 0.13192936898.021.90433
10SOP-ITALY90.3823990.402810−0.053236−0.002107075 0.15892414296.211.65776
11OUE-FRANCE180.3165220.2715220.1423680.1543655640.30182227779.971.65567
12BBB-USA150.3626490.366821−0.0112580.019518782 0.19168470693.191.83533
13RMA-KENYA150.3524990.372089−0.055720−0.0208600070.19444160993.111.82236
14NQA-NAMIBIA120.3521810.418734−0.189218−0.1340633270.29003602374.211.65979
He—expected heterozygosity; Ho—observed heterozygosity; F—inbreeding coefficient; FIS—the inbreeding coefficient within populations; mean FST—mean pairwise genetic differentiation among populations; Pn—proportion of polymorphic SNPs; Ar—allelic richness. ED-KAZ—Edilbay breed from Kazakhstan; KFTCW-KAZ—Kazakh fat-tailed coarse-wooled breed from Kazakhstan; GISS-KAZ—Gissar breed from Kazakhstan; TON-MONGOL—Tong breed from Mongolia; TIBQ-CHINA—Tibetan breed from China; KCW-KRG—Kyrgyz coarse-wool breed from Kyrgyzstan; GISS-KRG—Gissar breed from Kyrgyzstan; KRKS-SW-ASIA—Karakas breed from South-West Asia; QEZ-SW-ASIA—Qezel breed from South-West Asia; SOP-ITALY—Sopravissana breed from Italy; OUE-FRANCE—Ouessant breed from France; BBB-USA—Barbados Black Belly breed from Barbados and US; RMA-KENYA—Red Maasai breed from Kenya; NQA-NAMIBIA—Namaqua Afrikaner breed from Namibia.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Dossybayev, K.; Amandykova, M.; Ualiyeva, D.; Kapassuly, T.; Kozhakhmet, A.; Ciani, E.; Bekmanov, B.; Amzeyev, R. Genome-Wide SNP Analysis Reveals the Unique Genetic Diversity Represented by Fat-Tailed Coarse-Wooled Sheep Breeds of Kazakhstan. Biology 2025, 14, 1478. https://doi.org/10.3390/biology14111478

AMA Style

Dossybayev K, Amandykova M, Ualiyeva D, Kapassuly T, Kozhakhmet A, Ciani E, Bekmanov B, Amzeyev R. Genome-Wide SNP Analysis Reveals the Unique Genetic Diversity Represented by Fat-Tailed Coarse-Wooled Sheep Breeds of Kazakhstan. Biology. 2025; 14(11):1478. https://doi.org/10.3390/biology14111478

Chicago/Turabian Style

Dossybayev, Kairat, Makpal Amandykova, Daniya Ualiyeva, Tilek Kapassuly, Altynay Kozhakhmet, Elena Ciani, Bakytzhan Bekmanov, and Rauan Amzeyev. 2025. "Genome-Wide SNP Analysis Reveals the Unique Genetic Diversity Represented by Fat-Tailed Coarse-Wooled Sheep Breeds of Kazakhstan" Biology 14, no. 11: 1478. https://doi.org/10.3390/biology14111478

APA Style

Dossybayev, K., Amandykova, M., Ualiyeva, D., Kapassuly, T., Kozhakhmet, A., Ciani, E., Bekmanov, B., & Amzeyev, R. (2025). Genome-Wide SNP Analysis Reveals the Unique Genetic Diversity Represented by Fat-Tailed Coarse-Wooled Sheep Breeds of Kazakhstan. Biology, 14(11), 1478. https://doi.org/10.3390/biology14111478

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop