Next Article in Journal
Histological Insights into Testicular Apoptosis Associated with Spermatogenesis in Pre-Pubertal and Adult Meagre (Argyrosomus regius)
Previous Article in Journal
Ultrasonographic Features of Gastropathy in Dogs with Acute Kidney Injury and Acute-on-Chronic Kidney Injury
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Uncovering Genetic Diversity and Adaptive Candidate Genes in the Mugalzhar Horse Breed Using Whole-Genome Sequencing Data

by
Shinara N. Kassymbekova
1,
Zhanat Z. Bimenova
1,
Kairat Z. Iskhan
2,
Przemyslaw Sobiech
3,
Jan P. Jastrzebski
4,*,
Pawel Brym
5,
Wiktor Babis
6,
Assem S. Kalykova
1,7,
Zhassulan M. Otebayev
2,
Dinara I. Kabylbekova
1,
Hasan Baneh
8,9 and
Michael N. Romanov
10,11,12,*
1
Department of Clinical Disciplines, Faculty of Veterinary and Zooengineering, Kazakh National Agrarian Research University, Almaty 050010, Kazakhstan
2
Department of Animal Biology Named after N.U. Bazanova, Faculty of Veterinary and Zooengineering, Kazakh National Agrarian Research University, Almaty 050010, Kazakhstan
3
Department and Clinic of Internal Diseases, Faculty of Veterinary Medicine, University of Warmia and Mazury in Olsztyn, 10-719 Olsztyn, Poland
4
Department of Plant Physiology Genetics and Biotechnology, Faculty of Biology and Biotechnology, University of Warmia and Mazury in Olsztyn, 10-719 Olsztyn, Poland
5
Department of Animal Genetics, Faculty of Animal Bioengineering, University of Warmia and Mazury in Olsztyn, 10-719 Olsztyn, Poland
6
Ecology and Genetics, Faculty of Science, University of Oulu, 90520 Oulu, Finland
7
Department of Fundamental Medicine, al-Farabi Kazakh National University, Almaty 050040, Kazakhstan
8
Project Center for Agro Technologies, Skolkovo Institute of Science and Technology (Skoltech), Moscow 121205, Russia
9
Animal Science Research Department, Kurdistan Agricultural and Natural Resources Research and Education Center, Agricultural Research, Education and Extension Organization (AREEO), Sanandaj 6616936311, Iran
10
School of Natural Sciences, University of Kent, Canterbury CT2 7NJ, UK
11
Animal Genomics and Bioresource Research Unit (AGB Research Unit), Faculty of Science, Kasetsart University, Chatuchak, Bangkok 10900, Thailand
12
L. K. Ernst Federal Research Center for Animal Husbandry, Dubrovitsy, Podolsk 142132, Russia
*
Authors to whom correspondence should be addressed.
Animals 2025, 15(18), 2667; https://doi.org/10.3390/ani15182667
Submission received: 6 July 2025 / Revised: 1 September 2025 / Accepted: 9 September 2025 / Published: 11 September 2025
(This article belongs to the Section Animal Genetics and Genomics)

Simple Summary

Mugalzhar horses are a native breed of Kazakhstan valued for their ability to produce milk and meat and adapt to harsh environments. This study explored the genetic diversity of these horses and identified regions of their DNA affected by natural selection using advanced genome analysis techniques. Using more than 21 million genetic variants, we found that most of them occurred in non-coding regions of the genome, with only a small fraction affecting certain genes directly as candidates for adaptation to a harsh climate. Despite the presence of rare genetic markers associated with traits like coat color and gait, no harmful segregating genetic mutations linked to diseases with Mendelian inheritance were identified. These results suggest that Mugalzhar horses have maintained a moderate genetic diversity, exhibiting traces of historical selection and no signs of inbreeding. This study provides useful insights into the genetic makeup of this breed, which can help to preserve and improve it in breeding programs.

Abstract

Mugalzhar horses are a relatively young native breed of Kazakhstan, prized for meat and milk production and adaptation. This study was conducted to investigate genetic diversity and pinpoint genomic regions associated with selection signatures in this breed using whole-genome sequence data. Variant calling yielded a total of 21,722,393 high-quality variants, including 19,495,163 SNPs and 2,227,230 indels. Most variants were located in introns and intergenic regions, while only 1.94% were exonic. Estimates of genetic diversity were moderate, with expected and observed heterozygosity and nucleotide diversity of 0.2325, 0.2402, and 0.0021, respectively. We identified nine adaptive candidate genes (SCAPER, FHAD1, MMP15, ADGRE1, CMKLR1, MRPL15, ZNF667, CCDC66, and LOC100055310), harboring high-impact exonic variants in the homozygote state for an alternative allele. No deleterious segregating variants associated with Mendelian traits were found in this population, while seven variants linked to coat color and gaitedness were detected in a low frequency heterozygous state. Our findings suggest that there are certain genomic regions subjected to ancient selection footprints during the ancestor breed formation and adaptation. The outcome of this study serves as a foundation for future genomic-driven strategies, a broader utilization of this breed, and a reference for genomic studies on other horse breeds.

1. Introduction

Recent genomic studies [1,2,3] suggested that the horse was domesticated around 5500 years ago, significantly later than the domestication of most other livestock species. Despite this, horses (Equus caballus) have considerably influenced human civilization [4,5,6], having made a significant contribution to cultural exchange between societies, economic development, and supporting the farmers in agricultural activities [7,8]. The recent discovery of ancient equine signature tracks in the horse genome has led scientists to propose Kazakhstan as a candidate site for the initial domestication of horses [9,10]. Horse breeding has been considered an important part of the traditional livestock production sector in Kazakhstan for centuries, which has led to generations of herding knowledge. According to Kosharov et al. [11], steppe horses from Kazakhstan were widely distributed as early as the fifth century B.C. The country’s vast territory of natural pastures (185 million hectares), the constant demand for horse products, and a high cultural affinity for traditional equestrian sports have provided a strong potential for the development of horse breeding [12,13]. As of 1 January 1980, there were 312,447 Kazakh horses, including 63,329 purebreds, which are kept on pastures all year round.
A native Mugalzhar breed (Figure 1A) [14,15], named after the area (i.e., Mugalzhar District, including the village of Mugalzhar) where it was bred, had been developed through a selective breeding program applied to Jabe-type Kazakh horses, between 1969 and 1998, in order to enhance the meat and milk production [16]. In 1998, this breed was officially recognized as a new breed by the Ministry of Agriculture of Kazakhstan (Order No. 156, dated 30 December 1998) [17,18,19,20,21]. By 2020, the breed population embraced 16,290 horses, including 7026 mares [12].
Mugalzhar horses, mainly of dun or bay coat colors without spots, exhibit a compact physique with a height at the withers not exceeding 145 cm. The Mugalzhar horse populations and their ancestral breed Jabe frequently exhibit a “zebra pattern” leg stripes (Figure 1B) and a “dorsal stripe” (Figure 1C). These phenotypic traits likely reflect the preservation of ancient equine genetic lineages [22]. This breed is distinguished by its potential to produce high-quality horsemeat and by substantial milk yield, even under a year-round pasture-grazing production system [23,24,25]. Iskhan et al. [16] claimed that this is the world’s first dual-purpose (meat and dairy) breed developed from a non-specialized local horse breed, which provides high-quality meat and kumis during year-long grassland farming [23]. It has a faster growth rate and, accordingly, a better meat productivity, with body weight being 100–120 kg more for stallions and 80–100 kg more for mares compared to its Jabe-type ancestor [26]. In particular, stallions typically weigh between 493.5 and 538.4 kg, while mares range from 452.7 to 469.3 kg [16]. Owing to these distinctive characteristics, the Mugalzhar horse is considered a valuable breed in Kazakhstan [27]. However, this breed is distributed across the whole country; they are mainly reared in the Karagandy region of central Kazakhstan, the Aktobe region of western Kazakhstan, and the Kyzylorda region of southwest Kazakhstan [28].
According to Iskhan et al. [16], there were three main intra-breed types of the Mugalzhar breed, i.e., Emba, Kulandy, and Kozhamberdy (till 2009 known as Saryarqa), as well as six lines and 55 families in 2019. The six lines listed by Iskhan et al. [16] were Maupas and Mesker (of Kozhamberdy type), and Patok, Zaliv, Aral, and Kulan (of Kulandy type), named after the respective outstanding stallion. In 2024, Kabylbekova et al. [12] stated that the modern breed structure consists of four intra-breed types (Emba, Kulandy, Saryarqa, and, since 2009, Kozhamberdy), six lines, and 55 families. In 2024, Iskhan et al. [25] also described a new Irtysh stud farm type of the Mugalzhar breed [29,30]. Within the Kozhamberdy intra-breed type of the Mugaldzhar breed, there are structurally two established stud farm subtypes, i.e., Saryarqa and Kaindy. Orazymbetova et al. [31] mentioned five Kozhamberdy lines. Exploring three of them, i.e., Maupas, Mesker, and Meiman, Orazymbetova et al. [31] showed that they are genetically divergent. These subtypes and lines are part of the selective breeding work aimed at improving the breed and developing its specialized herds. A more detailed account pertinent to the Mugalzhar breed history, its breeding, and production features is presented in Supplementary Information.
Adequate genetic diversity within the population is essential for a successful selective and sustainable breeding program and genetic progress in Mugalzhar horses. It also helps with monitoring inbreeding level in this population and reducing inbreeding depression, as a potential threat to production and climate resilience. Additionally, identifying the candidate genes linked to economic traits enables integrating genomic information into breeding decisions and consequently more efficient and breed-specific selection strategies. The genetic studies of Kazakh horse breeds over the past decade have mainly focused on assessing genetic diversity and identification of candidate genes associated with economically important traits using a limited set of microsatellite markers [28,31,32]. Investigations using single-nucleotide polymorphism (SNP) array genotypes have unraveled significant genetic differences between native Kazakh and other exotic horse breeds [33,34]. Although SNP chips are informative, with genetic markers dispersed over the whole genome [35,36,37,38,39], they cover a part of genomic variation and, in particular, are not able to detect structural variants.
Whole-genome sequencing (WGS) has revolutionized the investigation of genetic architecture in livestock and wild species [40,41,42,43], guiding their comprehensive assessments of genetic diversity [44,45,46], inbreeding [47,48,49], and population structure [50,51,52]. This high-resolution approach provides insights into evolutionary history and detection of the selection footprints and breed-specific variants [53,54,55]. Previous equine genomic studies have used WGS to assess the genetic diversity, evaluate the inbreeding level, and identify genomic regions underlying economically important traits in horses [56,57]. A recent investigation [58] of runs of homozygosity (ROH) using WGS revealed population-specific inbreeding patterns in different horse breeds. These findings highlighted the importance of understanding inbreeding dynamics, particularly in indigenous breeds, to guide conservation and breeding strategies. Despite the economic and cultural importance of the Mugalzhar horse breed in Kazakhstan, the genomic characteristics of this breed remain largely unexplored.
Previously, we suggested that the Mugalzhar DNA samples obtained for the microsatellite research [33] would enable performing their WGS to seek breed-specific SNP variants. In the present study, we presented the first whole-genomic sequence analysis of the native Mugalzhar horse breed, aiming to establish fundamental genomic characteristics, including the assessment of genetic diversity and level of inbreeding, and the identification of genome regions under selection that will lay a foundation for breeding strategies in this unique Kazakhstani breed.

2. Materials and Methods

2.1. Animals and Sample Collection and DNA Extraction

In this investigation, blood samples from 20 healthy Mugalzhar horses were collected at the ORDA breeding farmer enterprise headed by Amandyk Zhumagalyuly, in the vicinity of Khromtau (Shalkar District, Aktobe Region, western Kazakhstan; 50°15′04″ N, 58°26′24″ E; 429 m elevation above sea level). All 20 individuals were previously sampled for the microsatellite-based investigation [32]. They represented the local Aktobe population and belonged to the Meiman line of the Kozhamberdy intra-breed type that represents a selective breeding achievement developed within this intra-breed type. A more detailed account of the sampling site and sampled animals is provided in Supplementary Information.
Horse gender, birth year, and coat color of the collected samples were recorded and are provided in Supplementary Table S1 and in more detail in Supplementary Information. Genomic DNA isolation was performed using the phenol–chloroform procedure following Maniatis et al. [59]. According to DNA concentration and quality metrics, assessed by A260/A280 and A260/A230 ratios, the extracted genomic DNA for all samples, shown in Supplementary Table S2, was suitable for downstream genomic analyses.

2.2. WGS, Reads Preprocessing and Mapping

This high-molecular-weight DNA was then sequenced using Illumina’s (San Diego, CA, USA) NovaSeq 6000 sequencing technology that provided 150-bp paired ends. After quality controls of the raw reads, Trimmomatic software (version 0.39; [60]) was used to pre-process the reads and remove the adapter sequences (the first 10 nucleotides), low-quality, and short reads. We compared two sequence quality thresholds (Q20 and Q30), and applied Q20, which provides a suitable and acceptable efficient alignment, with a low rate of sequence removal, resulting, on average, in 380,480,205 reads per sample and ranging from 216,143,288 to 562,962,552 reads per horse. The average read sequencing quality Phred scores >35 and >20 were 94.84% and 99.86%, respectively.
High-quality sequence alignment was achieved, with trimmed reads mapping uniquely to the horse reference genome assembly EquCab3.0 (with annotation GCF_002863925.1 [61]), at an average rate of 99.79%, using Burrows–Wheeler Aligner (BWA) software (version 0.7.17-r1188; [62]) with default parameters. Herewith, average allele frequency, sequencing depth, and BaseQRankSum statistics of the identified variants were determined. The aligned mapping files (SAM files) were sorted and converted to binary format (BAM files) and merged to obtain individual files for each horse separately using SAMtools (version 1.6; [63]). The merged and sorted BAM files were then indexed using SAMtools for further analysis.

2.3. Variants Calling and Annotation

The Genome Analysis Toolkit GATK4-4.0.5.1-0 (GATK) pipeline using haplotypecaller [64] was used for variant calling. To reduce the risk of error and facilitate further analysis, the following two pipelines were used to generate the result files of all variants: (1) an analysis combining all BAM files and calling variants with all trials included; and (2) an independent analysis individually for each horse, combining then all variant files into a single file. Variant Effect Predictor (VEP) tools [65] were used to annotate the functional, location, and classification of identified variants. The variants identified in this study, along with their predicted effects by functional annotation classes, were visualized using Circos (version 0.69-9; [66]).
In order to identify Mugalzhar breed-specific variants, the following two criteria were applied for filtering those variants: (1) based on the variants with at least 10-nucleotide position coverage depth that were sequenced in each sample at least 10 times; and (2) variants with fixed (AF = 1) alternative alleles. In other words, we selected only those variants that were at homozygous state for alternative alleles in all 20 studied samples. The variants were functionally annotated using the Variant Effect Predictor (VEP) tool [65] and Ensembl ([67]; release 114, published on 7 May 2025 [68]) annotation of the horse genome assembly EquCab3.0 [61]. Only those genes harboring ≥5 exon variants with “high” or “moderate” impact (according to the VEP analysis) were retained and considered as prioritized candidate genes (PCGs).

2.4. Genetic Diversity Analysis and Population Structure

The genome-wide observed (Ho) and expected (He) heterozygosity values were computed using VCFtools (version 0.1.16; [69]) for processing the Variant Call Format (VCF) data. We also estimated nucleotide diversity (π) that reflects, on average, the quantity of nucleotide variations per site between two DNA sequences selected at random from the population under study [70]. The nucleotide diversity was estimated over the entire autosomal genome using a sliding window approach implemented in VCFtools (version 0.1.16; [69]).
In order to provide a more reliable and comprehensive overview of inbreeding in this population, three methods (-ibc function) implemented in the GCTA program (version 1.94.4; [71]) were applied to estimate the genomic inbreeding, as follows:
  • FGRM, the inbreeding coefficient driven from the genomic relationship matrix (GRM) and calculated as the deviation of the diagonal elements from unity:
    G R M = Z Z 2 p i ( 1     p i ) F G R M i = G R M i i 1 ,
    where Z is the standardized and centralized genotype file, p is the minor allele frequency, F G R M i is the inbreeding coefficient for ith sample, and G R M i i is the ith diagonal element of GRM corresponding to ith sample.
  • FHOM, the Wright’s inbreeding coefficient based on the proportion of the loci with higher observed homozygosity than expected homozygosity:
    F H O M = N O H o m N E H o m N N o n _ M i s s N E H o m
    where N O H o m , N E H o m and N N o n _ M i s s are numbers of observed homozygous, expected homozygous, and non-missing loci, respectively.
  • FUNI, the Wright’s inbreeding coefficient based on the correlation between alleles in uniting gametes:
    F U N I = 1 n i = 1 n x i 2 1 + 2 p i x i + 2 p i 2 2 p i ( 1 p i ) ,
    where x i is the number of copies of the reference allele for the ith SNP, n is the number of versions of the reference allele, and p i is the minor allele frequency for the ith SNP.
In order to investigate the population stratification of the studied samples, principal component analysis (PCA) was implemented based on GRM. Before constructing GRM, the dataset was filtered for biallelic SNPs with MAF > 0.05, resulting in 13,536,151 SNPs. GRM was constructed using GCTA (version 1.94.4; [71]). PCA plots were produced using Microsoft Excel and ggplot2 [72] in the R environment [73,74].

2.5. Segregating Variants from Online Mendelian Inheritance in Animals

To investigate variants associated with Mendelian traits in horses, we obtained the genomic position and the corresponding phenotypes of 116 trait-associated variants from the Online Mendelian Inheritance in Animals (OMIA) database [75]. Segregating variants was selected based on the number of heterozygote genotypes in the samples.

3. Results

3.1. WGS, Reads Preprocessing, Mapping, Variant Calling, and Annotation

High-quality extracted genomic DNA with enough DNA concentration led to high-quality sequencing and consequently efficient alignment of sequence reads (99.79%) over the reference genome, as summarized in Table 1. WGS information in more detail is available in the Supplementary Table S3.
Average allele frequency, BaseQRankSum statistics, and sequencing depth for the identified variants were visualized within each chromosome as shown in Figure 2.
The variant calling analysis resulted in sites of 21,722,393 variants, including 19,495,163 SNPs and 2,227,230 indels. Among these, 20,354,948 variants (93.7%), including 18,364,296 SNPs and 1,990,652 indel variants, were located on 32 chromosomes, both autosomes and sex chromosomes. The distribution of the variants in different classes based on the number of alleles is shown in Supplementary Table S4. As expected, most of the variants (n = 19,958,242; 91.88%) were biallelic loci, including 18,245,338 SNPs (91.49%) and 1,712,904 indels (8.51%). The distribution of these variants on 32 chromosomes is given in Table 2. Around 96% of the biallelic variants (19,123,873 variants, 17,495,490 SNPs, and 1,628,383 indels) were located on autosomes.
Functional annotation of the identified biallelic variants is summarized in Table 3. The results showed that the majority of both SNPs and indels are intergenic variants (75.34% and 72.62%, respectively) or located in intronic regions (18.41% and 20.55%, respectively). There were only 353,056 biallelic SNPs located in exon regions, which constituted 1.94% of the total identified biallelic SNPs in this breed. Of those SNPs, 62,111, 42,319, and 7112 variants were respectively missense, synonymous, and high-impact SNPs (splice donor, splice acceptor, stop gained, stop lost, and start lost variants). Among the missense variants, 1945 SNPs were identified as deleterious to protein function according to SIFT score criteria (SIFT < 0.05). Among the 6323 protein-coding indels identified, 5048 are high-impact variants, with frameshift variants being the most frequent class, comprising 4444 variants.

3.2. Genetic Diversity Analysis and Population Stratification

In terms of the examined population heterozygosity, the average genome-wide estimated Ho and He values were 0.2402 and 0.2325, respectively. In addition, the average genome-wide nucleotide diversity (π) within the Mugalzhar horse breed was estimated to be 0.0021 bp−1.
The descriptive metrics for the estimated inbreeding coefficients for the studied samples are provided in Table 4. The average inbreeding coefficients in all three methods were negative and were −0.038, −0.033, and −0.033 for FGRM, FHOM, and FUNI, respectively.
The PCA results and the respective plots for 20 individuals are presented in Figure 3 and show the sampled population structure. In this figure, principal component 1 (PC1) is plotted against second (PC2) and third (PC3) components (Figure 3A and Figure 3B, respectively). The PCA plot reveals the genetic diversity among 20 Mugalzhar horse samples, with PC1 (1.55%) and PC2 (1.38%) explaining the total variation. A wide range of principal components obtained for the samples indicates a high level of diversity in the population. However, there are some small subgroups of two or more individuals, indicating possible population stratifications. The scatterplot PC1 by PC2 is basically similar to PC1 by PC3, pointing out that, most likely, the first three PCs do not contrast any variation in the genome of the Mugalzhar breed and correspond to specific features of this breed.

3.3. Adaptation Footprints and Candidate Genes

Since the variants with high allele frequency are very important for breed-specific analysis, we kept only the variants whose alternative allele is fixed in this population. Therefore, 139,163 variants with allele frequency for the alternative allele equal to unity were identified. It means that the reference allele of those variants was not observed in all 20 samples analyzed. Therefore, these variants were kept for further analysis. In order to improve the reliability of the results and reduce the risk of random error, we applied a depth-of-coverage filter, retaining only variants supported by at least 10 sequencing reads (≥10X) in each sample, which resulted in 15,027 high-confidence variants. The selected threshold was because of the fact that the lower coverage is associated with reduced genotyping accuracy and higher false-positive rates. A graphical summary of these analyses is shown in Figure 4.
Among these variants, we retained only 9318 variants that are located on autosomal chromosomes. In total, 73 exon variants located on nine protein-coding genes passed the criteria for PCGs. These PCGs harboring high-impact homozygous exon variants that are fixed in Mugalzhar horses are listed in Table 5.
These genes were located on several autosomal chromosomes, including ECA1 (SCAPER), ECA2 (FHAD1), ECA3 (MMP15), ECA7 (ADGRE1), ECA8 (CMKLR1), ECA9 (MRPL15), ECA10 (ZNF667), ECA16 (CCDC66), and ECA23 (LOC100055310). PCGs have a high number of orthologous genes, in a range of 30 (LOC100055310) to 344 (CMKLR1). Among the detected genes, MRPL15 conformed to a remarkably high number of homozygous exon variants for the alternative allele (n = 15) with high impact.

3.4. OMIA Variants Segregating Analysis

To investigate variants associated with Mendelian traits in horses, we analyzed 116 previously reported variants across various breeds. The results revealed that seven variants in four genes (MC1R, KIT, MITF, and DMRT3) are segregating in the studied horse population. The detailed description of these variants, their associated genes, and phenotypes is presented in Table 6. A missense mutation (ECA3:g.36979560C > T) of the MC1R gene associated with coat color phenotype (OMIA 001199-9796 [75]) was found in a heterozygous state in four horses, indicating moderate segregation of this allele in the population. Three missense mutations (ECA3:g.79538738C > T, ECA3:g.79548220T > C, and ECA3:g.79566881T > C) of the KIT gene were also observed in the heterozygous state in only one horse. These variants are known to influence phenotypes such as white spotting and dominant white coat color (OMIA 000209-9796 [75]). In the MITF gene, one structural variant (ECA16:g.21555811delinsAAAT) and a regulatory variant (ECA16:g.21608936C > T) are segregating in this population, which are associated with the splashed white phenotype (OMIA 000214-9796 [75]). A stop-gain SNP (ECA23:g.22391254C > A) of the DMRT3 gene was identified in a heterozygote state in one horse, which is associated with gaitedness, a distinctive pattern of movement in horses (OMIA 001715-9796 [75]).

4. Discussion

4.1. WGS Outcome and Variant Characterization in the Mugalzhar Breed

Understanding the genetic diversity within livestock breeds is crucial as it influences the success of effective selective breeding programs [76,77,78,79,80] and the preservation of genetic potential for adaptive traits (e.g., survival in specific environmental conditions), helping prevent excessive rate of inbreeding [81,82,83,84,85]. Whole-genome sequences are a promising tool for investigating the evolutionary history, selection signatures, and breed-specific genomic profile, assessment of genetic diversity, and identifying genomic regions underlying economically important traits in livestock species [53,54,55,56,57]. In this study, we identified 21,722,393 variants, including 2,227,230 indels and 19,495,163 SNPs among 20 individuals of the Mugalzhar horse, representing a significant contribution to the current resources available for equine research. Although high-quality sequencing data provided valuable insights into the genomic structure of the breed, the limited sample size (n = 20) may reduce the power to detect rare genetic variants.
The total number of autosomal variants (19,502,882) identified in this breed was higher than those reported for North American [86] and Japanese Thoroughbred horses [87]. These differences could be due to the evolutionary history of the breeds, sampling criteria, sequencing technology, and applied bioinformatic procedure. However, similar findings have been reported by Al Abri et al. [88]. The proportion of bi-allelic indel variants (86.05%) was lower compared to that of bi-allelic SNPs (99.35%). It has been reported that indels are underlying some genetic disorders in horses, like lavender foal syndrome [89] and severe combined immunodeficiency [90]. The number of exonic SNPs (353,056; ~1.94%) was close to the value reported by Al Abri et al. [88]. The average genome-wide SNP density for autosomes was 1/130 nucleotides, ranging from 1/87 (ECA 12) to 1/147 (ECA 13). This was approximately 10 times higher than the density of indels, which had an average of 1/1205 and ranged from 1/873 (ECA 20) to 1/1328 (ECA 9).

4.2. Genomic Diversity, Inbreeding, and Population Stratification

The average of genome-wide nucleotide diversity, a metric providing valuable insight into the divergence, demographic history, and genetic diversity of populations [70], was estimated to be 0.0021 bp−1. This value indicates that, on average, there were ~2 nucleotide differences per 1000 base pairs between two randomly selected sequences in this horse population, corresponding to a total of 5,250,000–5,670,000 nucleotides across the 2.5–2.7 Gb horse genome size [91]. The estimates of these genomic diversity metrics indicate a moderate level of genetic variability in the Mugalzhar horse population. Our estimate is higher than the average diversity (0.0017) in the autosomal chromosomes reported for six other horse breeds (American Miniature, Percheron, Arabian, Mangalarga Marchador, Native Mongolian Chakouyi, and Tennessee Walking) by Al Abri et al. [88].
A higher average genome-wide Ho value (0.2402) compared to He (0.2325) indicates the preservation of genetic diversity and a relatively low level of inbreeding in this population from the Aktobe Region. Using SNP microarrays, Pozharskiy et al. [33] reported a high diversity level for 584 Mugalzhar horses, with Ho and He estimates being 0.345 and 0.340, respectively, which exceed our findings for a smaller Mugalzhar sample size. Genomic inbreeding coefficients reflect the actual level of homozygosity in the genome, which is not affected by the availability, depth, accuracy, and completeness of the pedigree information [92]. In this study, we applied three different methods to estimate inbreeding, which allows for comparison between estimates, minimizes method-specific biases, and provides more robust results. The negative average genomic inbreeding coefficients obtained in this study using three methods (i.e., FGRM, −0.038; FHOM, −0.033; and FUNI, −0.033) suggest that the Aktobe population of Mugalzhar horses exhibits a relatively high level of variability, supporting adequate genetic progress in a selection program. PC 1-3 explained a relatively low proportion of the variation in the studied population, which could be due to the high number of original variables (e.g., millions of WGS variants) and the genetic homogeneity of the breed. The distribution of individuals on PCA plots showed a wide scatter, while the similarity in PCA scatterplots, on the other hand, suggests that the observed population structure within the studied samples might be due to a relationship pattern among the animals over recent generations (e.g., due to the development of the Meiman line).
The moderate level of genetic variability could be due to the random mating, a relatively high ratio of breeding males to females, and early animal culling in a round-year grazing and meat production-oriented system. The genetic diversity in this breed, as indicated by several metrics outlined above, suggests that the horse population can be utilized to provide genetic diversity for other breeds and has the potential to further boost productivity. The previously published findings using 17 microsatellite markers in the same population [32] demonstrated higher Ho and lower inbreeding values, confirming the results of the current study. Additionally, in a wider microsatellite examination of Kazakh horses, including the Mugalzhar breed, Orazymbetova et al. [31] identified minimal genetic differentiation (0.05%), suggesting significant admixture and an ongoing lineage sorting process, which is consistent with our results.

4.3. PCGs

There were 9318 autosomal variants with a homozygous state for the alternative allele in all 20 studied samples. Among them, only the exonic variants with high or moderate impact, according to VEP, were retained. The genes harboring at least five variants were considered as potentially candidate genes (PCGs) associated with fitness/adaptation in this breed. The identified genes represent strong candidates for further investigation; however, functional and experimental validation is recommended to confirm their biological impacts on adaptation and economically important traits in Mugalzhar horses. The functions and potential relevance to horse biology, and particularly for the Mugalzhar breed’s adaptability, of these PCGs are outlined below.
SCAPER (S-phase cyclin A-associated protein in the endoplasmic reticulum) is mainly involved in nucleic acid binding, which has been reported to be associated with nonsyndromic intellectual disability [93] and retinal disease [94] in humans, male sterility and reduced female fertility in mice [95], sperm motility in Holstein cattle [96], adaptation in the cattle [97], growth and nervous system in goats [98], and as a potential deleterious gene selected in Tibetan pigs [99]. FHAD1 (forkhead associated phosphopeptide binding domain 1) encodes a protein that acts as a regulator of sperm motility and spermatocyte meiosis. This gene has been reported to be located in the genomic regions under potential selection in adaptation in the Tunisian Black Thibar sheep [100], highly expressed in remyelinating lesions [101], and associated with body weight and size in the Jabe horse breed, which is the ancestor of Mugalzhar horses [34]. The Mugalzhar horses are reared in a year-round grazing system, where the reproductive efficiency is an important factor for this production system. Meanwhile, a higher growth rate helps the animals, especially the young horses, become stronger and more resilient to harsh climates when food availability is limited. It appears that the SCAPER and FHAD1 genes may contribute to the Mugalzhar breed’s adaptability by enhancing the summer growth rates, thereby improving physiological robustness for survival during harsh winters.
MMP15 (matrix metallopeptidase 15) encodes a member of the peptidase M10 family that plays a role in the breakdown of extracellular matrix during both disease processes and normal physiological processes. It was reported [102] that this gene is located in the positive selection signature in the live-bearing fish Heterandria formosa, which contributes to endometrial tissue remodeling and placental labyrinth formation. Dierks [103], using GWAS in Hanoverian warmblood horses, reported a QTL associated with osteochondrosis and osteochondrosis dissecans, which is the genomic region with MMP15 homology in the human genome. This gene is not well-studied, even in the other livestock species, and its biological role in improving the adaptability of this horse breed requires further investigations.
ADGRE1 (adhesion G protein-coupled receptor E1) contains a domain similar to seven seven-transmembrane G protein-coupled receptor and plays a role in cell adhesion and interactions between cells, particularly immune system cells. This gene plays an important role in the resistance and resilience of the Mugalzhar horses, as a native breed, to environment-specific pathogens. Its function has been reported to be associated with defense against infections through the development of antigen-specific CD8+ regulatory T cells [104,105,106]. It has also been reported that it is involved in both innate and adaptive immune responses, and was identified as a target of positive selection against tropical parasites in African dogs [40].
CMKLR1 (chemerin chemokine-like receptor 1) regulates negatively NF-kappaB transcription factor activity, which positively regulates the macrophage chemotaxis, and regulation of calcium-mediated signaling, adipogenesis, and adipocyte metabolism. Dander et al. [107] suggested that the CMKLR1/chemerin axis controls intestinal graft-versus-host disease. While de Camargo et al. [108] found that this gene is associated with protein percentage in dairy buffaloes. Evidence suggests that the RARRES2-CMKLR1 axis may be involved in regulating metabolic processes related to obesity, influencing glucose and fat metabolism in humans and murine models [109,110].
MRPL15 (mitochondrial ribosomal protein L15) is a nuclear gene that encodes a protein, which helps in protein synthesis within the mitochondrion. The MRPL15 gene might be associated with carcass weight in pasture-finished beef cattle in Hawai’i [111] and in Korean Hanwoo cattle [112], milk fatty acids (C6:0) in Dual-Purpose Belgian Blue cows [113], residual feed intake in Australian Angus cattle [114], and body weight gain and feed intake in crossbred beef steers [115]. Given that the horses of this breed are kept on pastures year-round and face challenges related to food and forage availability during cold conditions, they have been under natural selection to regulate their energy for the long and cold winter season in Kazakhstan. Therefore, the CMKLR1 and MRPL15 genes most likely play crucial roles in adaptation of Mugalzhar horses to a year-round free-grazing system by mediating metabolic regulation and optimizing energy utilization to help sustain them through the winter when forage availability is restricted.
ZNF667 (zinc finger protein 667) facilitates DNA-binding transcription factor activity and sequence-specific binding to RNA polymerase II cis-regulatory regions. This gene, as a transcriptional regulator, helps to control the expression of resilience-related genes, which are critical for the cold climate conditions faced by the Mugalzhar breed in Kazakhstan. It has been reported to be under natural selection in the Bardigiano horse, a native Italian breed [116], the Rhenish German Draught Horse [117], Yorkshire pigs [118], and Tunchang pigs in China [119].
CCDC66 (coiled-coil domain containing 66) encodes a microtubule-associated protein essential for ciliogenesis and cell division. It mediates protein transport to cilia, regulates spindle assembly during mitosis, and facilitates cytokinesis through microtubule organization. This gene has been reported to be associated with early-onset progressive retinal atrophy in Portuguese Water Dogs [120], generalized progressive retinal atrophy in Schapendoes dogs [121], and testicle length in chickens [122]. Since there is a lack of reports on the function of this gene in farm animals, including horses, further research is needed to investigate how it may aid adaptation in Mugalzhar horses. LOC100055310 is not well annotated in the horse genome; however, it encodes a putative spermatogenesis-associated protein 31D3. This gene appears to be critical for reproduction and highly conserved, with two transcripts, 30 orthologues, and 25 paralogues.

4.4. OMIA Variants

Our findings showed that there is no deleterious variant segregating in this population. We identified seven variants, located within four genes, associated with Mendelian traits in horses that are segregating in this population. Three of the genes, including MC1R, KIT, and MITF, are associated with coat color in horses (OMIA 001199-9796, OMIA 000209-9796, and OMIA 000214-9796 [75]). Since coat color is not an economic trait in the Mugalzhar breed, the observed pattern is due to natural selection or most likely genetic drift during the breed formation. However, this breed is mainly dun or bay without spots.
Melanocortin 1 receptor (MC1R) is linked to the agouti-signaling-protein (ASIP) gene by a close epistasis relationship, such that relative amounts of melanin pigments in mammals are controlled by their antagonistic interaction [123]. The recessive missense mutation (ECA3:g.36979560C > T) of the MC1R gene determines the production of a red-yellow pigment (pheomelanin), while the other allele determines black pigment (eumelanin) [124]. If the dominant allele of ASIP, responsible for agouti signaling protein, is expressed, the melanocortin receptor 1 in the melanocytes is blocked, and consequently, pheomelanin synthesis happens. Hence, black, bay, and chestnut (three basic coat colors in horses) are results of the combination of specific genotypes of these two genes [124]. KIT (KIT proto-oncogene, receptor tyrosine kinase) regulates four depigmentation phenotypes: roan, sabino, tobiano, and dominant white. Haase et al. [125] reported that the dominant white color in different horse breeds is associated with multiple independent mutations inside this gene. However, this gene is crucial for the survival, proliferation, and differentiation of cells. Deficiency of KIT function causes lethal anemia, leading to prenatal or perinatal mortality [126].
MITF (melanocyte-inducing transcription factor) encodes a transcription factor containing basic helix-loop-helix and leucine zipper structural features. This transcription factor regulates pigment cell-specific expression of melanogenesis-related enzyme genes. The coat color pattern known as “splashed white” and defined by large white patterns on the legs, abdomen, and face, has been reported to be determined by this gene in several breeds of horse including American Paint [127,128], Quarter [129], Pura Raza Española horses [130], Thoroughbred [131], Menorca Purebred and Spanish Purebred [132] horses. Horses exhibiting this phenotype are typically deaf [127,129]. Since this breed relies on year-round grazing, the ability to evade predators (e.g., wolves) is critical for survival. This likely explains the significant decline in alleles associated with the phenotype over time.
A segregating stop-gain variant of the DMRT3 (doublesex and mab-3 related transcription factor 3) gene was identified, which is associated with gaitedness. A mutation in this gene is mainly associated with gaitedness in horses, which can be found in horse breeds worldwide, not limited to a geographical area, as was found in 68 of the 141 horse breeds investigated by Promerová et al. [133]. The authors reported the allele frequency of this mutation in the studied breeds in a range of 1% to 100%, where it is more frequent in gaited and harness racing horse breeds [133]. This gene encodes an essential transcription factor that orchestrates vertebrate locomotion, functioning within spinal cord neural circuits to coordinate limb movement synchronization [134,135,136]. The Mugalzhar horse descended from the Jabe horse lineage, whose movement has been evolutionarily optimized for energy-efficient travel across vast steppes [137]. Consequently, the reduced heterozygosity observed in the studied breed likely resulted from strong natural selection pressures in their ancestral population.
Overall, genomic analysis of Mugalzhar horses at the whole genome sequence level revealed selection signatures within their genome that indicate a high level of adaptability to harsh weather conditions for a year-round grazing system.

5. Conclusions

This study provides the first comprehensive whole-genome analysis of the Mugalzhar horse breed, uncovering valuable insights into its genetic diversity, selection history, and adaptation mechanisms. Despite the limited sample size and geographic distribution, over 21 million high-quality variants were detected in this population. The high-impact exonic variants in the homozygote state for the alternative allele are located in the genes associated with fitness, adaptation, and reproduction, suggesting ancient natural selection footprints for the year-round grazing in Kazakhstan climate conditions. While no deleterious Mendelian variants were found, seven low-frequency heterozygous variants associated with coat color and gaitedness were detected. Our findings not only contribute to the understanding of Mugalzhar horse genetics but also establish a reference point for future conservation and breeding strategies aimed at improving performance and resilience in native horse populations. From a practical point of view, the identified variants can be integrated into breeding programs to enhance climate adaptation while preserving diversity in this breed, and also to apply this genomic framework to conserve other vulnerable native horse breeds facing environmental challenges.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ani15182667/s1, Supplementary Information: Mugalzhar horse history, breeding, production characteristics and sampling, Table S1: Horse gender, birth year and coat color for the collected samples, Table S2: The quality and quantity metrics for the extracted genomic DNA samples, Table S3: Summary of trimmed read statistics pooled for each horse sampled, Table S4: Distribution of the variants in different classes based on number of alleles.

Author Contributions

Conceptualization, S.N.K. and J.P.J.; methodology, S.N.K. and J.P.J.; software, J.P.J., W.B. and H.B.; validation, J.P.J., P.B. and M.N.R.; formal analysis, J.P.J., W.B. and H.B.; investigation, S.N.K. and Z.Z.B.; resources, S.N.K. and Z.Z.B.; data curation, S.N.K., K.Z.I., Z.Z.B. and J.P.J.; writing—original draft preparation, S.N.K., Z.M.O., D.I.K., J.P.J., P.B., H.B. and M.N.R.; writing—review and editing, A.S.K., J.P.J., W.B., P.B., P.S., H.B. and M.N.R.; visualization, J.P.J., W.B. and H.B.; supervision, S.N.K.; project administration, S.N.K. and Z.Z.B.; funding acquisition, S.N.K. All authors have read and agreed to the published version of the manuscript.

Funding

This work was carried out within the framework of grant funding for scientific and technical projects for 2023–2025 that was supported by the Ministry Education and Science of the Republic of Kazakhstan (Grant No. AP19677892, “Preservation and Assessment of the Genetic Diversity of Horses of the Kazakh Breed Using Whole-genome Sequencing”).

Institutional Review Board Statement

The animal experiments were conducted in accordance with the ethical guidelines of the Kazakh National Agrarian Research University. The animal study protocol was approved by the Bioethics Committee of the University (Protocol No. 9, dated 10 August 2023).

Informed Consent Statement

Not applicable.

Data Availability Statement

The raw fastq reads for all horses analyzed in the current study are available in the European Nucleotide Archive (ENA) under the study number PRJEB82323 (https://www.ebi.ac.uk/ena/browser/view/PRJEB82323 (accessed on 5 July 2025)). The detected SNPs and indels are available for download at the European Variants Archive (EVA) (https://www.omicsdi.org/dataset/eva/PRJEB82323 (accessed on 5 July 2025)).

Acknowledgments

We express a special gratitude to the head, Amandyk Zhumagalyuly, and team of the ORDA breeding farmer enterprise (Shalkar District, Aktobe Region, Kazakhstan) for their valuable suggestions and assistance in carrying out this study. The authors are also grateful to Tolegen S. Assanbayev for sharing two photographs of the Mugalzhar horse breed coat variants used in Figure 1.

Conflicts of Interest

The authors declare no conflicts of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

References

  1. Schubert, M.; Jónsson, H.; Chang, D.; Der Sarkissian, C.; Ermini, L.; Ginolhac, A.; Albrechtsen, A.; Dupanloup, I.; Foucal, A.; Petersen, B.; et al. Prehistoric genomes reveal the genetic foundation and cost of horse domestication. Proc. Natl. Acad. Sci. USA 2014, 111, E5661–E5669. [Google Scholar] [CrossRef]
  2. Orlando, L. Ancient genomes reveal unexpected horse domestication and management dynamics. BioEssays 2020, 42, 1900164. [Google Scholar] [CrossRef]
  3. Orlando, L. The evolutionary and historical foundation of the modern horse: Lessons from ancient genomics. Annu. Rev. Genet. 2020, 54, 563–581. [Google Scholar] [CrossRef]
  4. Lawrence, E.A. Horses in society. Anthrozoös 1988, 1, 223–231. [Google Scholar] [CrossRef]
  5. Klecel, W.; Martyniuk, E. From the Eurasian steppes to the Roman circuses: A Review of early development of horse breeding and management. Animals 2021, 11, 1859. [Google Scholar] [CrossRef]
  6. Librado, P.; Khan, N.; Fages, A.; Kusliy, M.A.; Suchan, T.; Tonasso-Calvière, L.; Schiavinato, S.; Alioglu, D.; Fromentier, A.; Perdereau, A.; et al. The origins and spread of domestic horses from the Western Eurasian steppes. Nature 2021, 598, 634–640. [Google Scholar] [CrossRef]
  7. Kelekna, P. The Horse in Human History; Cambridge University Press: Cambridge, UK, 2009; Available online: https://www.cambridge.org/us/universitypress/subjects/archaeology/archaeology-europe-and-near-and-middle-east/horse-human-history (accessed on 5 July 2025).
  8. Librado, P.; Fages, A.; Gaunitz, C.; Leonardi, M.; Wagner, S.; Khan, N.; Hanghøj, K.; Alquraishi, S.A.; Alfarhan, A.H.; Al-Rasheid, K.A. The evolutionary origin and genetic makeup of domestic horses. Genetics 2016, 204, 423–434. [Google Scholar] [CrossRef] [PubMed]
  9. Outram, A.K.; Stear, N.A.; Bendrey, R.; Olsen, S.; Kasparov, A.; Zaibert, V.; Thorpe, N.; Evershed, R.P. The earliest horse harnessing and milking. Science 2009, 323, 1332–1335. [Google Scholar] [CrossRef]
  10. Outram, A.; Bendrey, R.; Evershed, R.P.; Orlando, L.; Zaibert, V.F. Rebuttal of Taylor and Barrón-Ortiz 2021 Rethinking the evidence for early horse domestication at Botai. Dataset Zenodo 2021. [Google Scholar] [CrossRef]
  11. Kosharov, A.N.; Pern, E.M.; Rozhdestvenskaya, G.A. Horses. In Animal Genetic Resources of the USSR; FAO Animal Production and Health Paper; Dmitriev, N.G., Ernst, L.K., Eds.; Food and Agriculture Organization of the United Nations: Rome, Italy, 1989; Volume 65, pp. 272–343. Available online: https://www.fao.org/4/ah759e/AH759E14.htm (accessed on 5 July 2025).
  12. Kabylbekova, D.; Assanbayev, T.S.; Kassymbekova, S.; Kantanen, J. Genetic studies and breed diversity of Kazakh native horses: A comprehensive review. Adv. Life Sci. 2024, 11, 18–27. Available online: https://www.als-journal.com/1113-24/ (accessed on 5 July 2025). [CrossRef]
  13. Sansyzbayev, B.; Sydykov, D.; Kozhanov, Z.; Akhmetov, U.; Zhenishbekov, A. Organization and analysis of production efficiency horse breeding products by breed and region of the Republic of Kazakhstan. [Oš Mamlekettik Univ. Žarčysy. Ajyl Čarba Agron. Vet. Žana Zootehniâ.] J. Osh State Univ. Agric. Agron. Vet. Zootech. 2024, 2, 219–226. [Google Scholar] [CrossRef]
  14. Rzabayev, S.S. Mugalzhar Horse Breed; LLP Information and Printing Center—Kokzhiyek: Aktobe, Kazakhstan, 2007. [Google Scholar]
  15. Satybaldin, A.A. Current State of Horse Breeding and Horse Sports in Kazakhstan. In Proceedings of the First International Conference; n.p.: Kostanay, Kazakhstan, 2002; Available online: https://scholar.google.com/scholar?q=Current+State+of+Horse+Breeding+and+Horse+Sports+in+Kazakhstan (accessed on 5 July 2025).
  16. Iskhan, K.Z.; Kalashnikov, V.V.; Akimbekov, A.R.; Mongush, S.D.; Demin, V.A.; Rzabayev, T.S.; Nesipbaeva, A.K.; Zhilkybaeva, M.M.; Zhikishev, Y.K. Zootechnic characteristics of modern populations of Mugalzhar horse breed. Bull. Natl. Acad. Sci. Rep. Kazakhstan 2019, 6, 75–82. [Google Scholar] [CrossRef]
  17. Baimukanov, D.A.; Iskhan, K.Z. Steppe Horse Breeds—Lecture 7.1; Agriexpert.ru: Moscow, Russia, 2022; Available online: https://agriexpert.ru/articles/555/stepnye-porody-losadei-lekciya-71 (accessed on 5 July 2025). (In Russian)
  18. Salkova, N.; Moldagaliev, B. Mugalzhar Horse Breed is 25 Years Old; TV channel 24KZ: Astana, Kazakhstan, 2023; Available online: https://24.kz/ru/news/social/620379-mugalzharskoj-porode-loshadej-25-let (accessed on 5 July 2025). (In Russian)
  19. Kalzhanov, A. Mugalzhar Horse Breed is 25 Years Old; Aktyubinskiy Vestnik: Aktobe, Kazakhstan, 2023; Available online: https://avestnik.kz/mugalzharskoj-porode-loshadej-25-let/ (accessed on 5 July 2025). (In Russian)
  20. Boss Agro. Mugalzhar Horse; Boss Agro: Ust-Kamenogorsk, Kazakhstan, 2023; Available online: https://bossagro.kz/glossary/mugalzharskaya-loshad/ (accessed on 5 July 2025). (In Russian)
  21. Baibolsyn. Mugalzhar Horse; Baibolsyn: Almaty, Kazakhstan, 2023; Available online: https://baibolsyn.kz/ru/zhivotnye/mugalzharskaya-loshad/ (accessed on 5 July 2025). (In Russian)
  22. Stachurska, A.M. Inheritance of primitive markings in horses. J. Anim. Breed. Genet. 1999, 116, 29–38. [Google Scholar] [CrossRef]
  23. Seleuova, L.A.; Naimanov, D.K.; Jaworski, Z.; Aubakirov, M.Z.H.; Mustafin, M.K.; Mustafin, B.M.; Safronova, O.S.; Baktybaev, G.T.; Turabaev, A.T.; Domatski, V.N. Population genetic characteristic of horses of Mugalzhar breed by STR-markers. Biomed. Res. 2018, 29, 3508–3511. [Google Scholar] [CrossRef]
  24. Akhmetov, U.A.; Kozhanov, Z.Y.; Sydykov, D.A. Optimal Timing of Slaughtering Horses of Different Breeds in the Southern Region of Kazakhstan. In Seifullin Readings—18(2): Science of the 21st Century—The Era of Transformation, Proceedings of the International Scientific-Practical Conference, Astana, Kazakhstan, 7 October 2022; Saken Seifullin Kazakh State Agrotechnical University: Astana, Kazakhstan, 2022; Volume I, Chapter II; pp. 169–172. Available online: https://kazatu.edu.kz/webroot/js/kcfinder/upload/files/наука/СЧ-18(2)/Akhmetov%20U.A..pdf (accessed on 5 July 2025).
  25. Shamshidin, A.S.; Beishova, I.S.; Alikhanov, O.; Aubakirov, K.A.; Shamekova, M.K.; Kargaeva, M.T.; Karibayeva, D.K.; Baimukanov, D.A. Productive longevity of Mugalzhar mares. [Ġylym Žἄne Bìlìm.] Sci. Educ. 2025, 2, 279–287. [Google Scholar] [CrossRef]
  26. Dyussegaliyev, M.Z. The genotypes of herd horses of the West Region of Kazakhstan. [Sel’skoe Hozâjstvo Èkosistemy Sovremennom Mire Reg. Mežstranovye Issledovaniâ.] Agric. Ecosyst. Mod. World Reg. Inter Countries Res. 2022, 1, 33–43, (In Russian with English summary). [Google Scholar]
  27. Nurushev, M. A Unique Breed of Horses Has Been Developed; Kazakhstanskaya Pravda: Astana, Kazakhstan, 2013; Available online: https://kazpravda.kz/n/vyvedena-unikalnaya-poroda-loshadey/ (accessed on 5 July 2025). (In Russian)
  28. Orazymbetova, Z.; Ualiyeva, D.; Dossybayev, K.; Torekhanov, A.; Sydykov, D.; Mussayeva, A.; Baktybayev, G. Genetic diversity of Kazakhstani Equus caballus (Linnaeus, 1758) horse breeds inferred from microsatellite markers. Vet. Sci. 2023, 10, 598. [Google Scholar] [CrossRef] [PubMed]
  29. Iskhan, K.; Uskenov, R.; Akimbekov, A.; Baymukanov, D.; Yuldashbayev, Y.; Orynaliev, K. The Irtysh factory type of the Mugalzhar breed and the line Zamana, Bakay. [Ìzdenìster Nἄtiželer.] Res. Results 2024, 4, 16–24. [Google Scholar] [CrossRef]
  30. Akim of the Abay Region. Presentation of the Mugalzhar Horse Breed “Irtysh” Took Place; Websites of Government Bodies, gov.kz: Semey City, Republic of Kazakhstan, 2024. Available online: https://www.gov.kz/memleket/entities/abay/press/news/details/825473?lang=ru (accessed on 5 July 2025)(In Russian and Kazakh).
  31. Orazymbetova, Z.S.; Sydykov, D.A.; Dossybayev, K.Z.; Razak, A.B. Analysis of the Population Structure of the Kozhamberdin Type Inside Mugalzhar Horse Breed Using Microsatellite Markers. In Scientific Support for Animal Husbandry in Siberia, Proceedings of the VII International Scientific and Practical Conference, Krasnoyarsk, Russia, 18–19 May 2023; Efimova, L.V., Tereshchenko, V.A., Eds.; KrasNIISKh FRC KSC SB RAS: Krasnoyarsk, Russia, 2023; pp. 177–182. Available online: https://sh.krasn.ru/upload/iblock/f5d/bmna0ji3o1516z19ezzz21wq46pdq2ca.pdf#page=178 (accessed on 5 July 2025).
  32. Kassymbekova, S.N.; Iskhan, K.Z.; Rzabaev, S.S.; Bimenova, Z.Z.; Kabylbekova, D.I.; Tursunkulov, S.A. Assessment of genetic diversity using microsatellite markers and milk productivity of Mugalzhar horses. [S Sejfullin Atyndaġy K̦az. Agroteh. Univ. Ġylym Žaršysy.] Her. Sci. S Seifullin Kazakh Agro Techn. Res. Univ. Vet. Sci. 2024, 3, 29–36. [Google Scholar] [CrossRef]
  33. Pozharskiy, A.; Abdrakhmanova, A.; Beishova, I.; Shamshidin, A.; Nametov, A.; Ulyanova, T.; Bekova, G.; Kikebayev, N.; Kovalchuk, A.; Ulyanov, V. Genetic structure and genome-wide association study of the traditional Kazakh horses. Animal 2023, 17, 100926. [Google Scholar] [CrossRef]
  34. Pozharskiy, A.; Beishova, I.; Nametov, A.; Shamshidin, A.; Ulyanova, T.; Kovalchuk, A.; Ulyanov, V.; Shamekova, M.; Bekova, G.; Gritsenko, D. Genetic composition of Kazakh horses of Zhabe type evaluated by SNP genotyping. Heliyon 2025, 11, e41173. [Google Scholar] [CrossRef]
  35. Dementeva, N.V.; Romanov, M.N.; Kudinov, A.A.; Mitrofanova, O.V.; Stanishevskaya, O.I.; Terletsky, V.P.; Fedorova, E.S.; Nikitkina, E.V.; Plemyashov, K.V. Studying the structure of a gene pool population of the Russian White chicken breed by genome-wide SNP scan. Sel’skokhozyaistvennaya Biol. (Agric. Biol.) 2017, 52, 1166–1174. [Google Scholar] [CrossRef]
  36. Romanov, M.N.; Dementyeva, N.V.; Terletsky, V.P.; Plemyashov, K.V.; Stanishevskaya, O.I.; Kudinov, A.A.; Perinek, O.Y.; Fedorova, E.S.; Larkina, T.A.; Pleshanov, N.V. Applying SNP Array Technology to Assess Genetic Diversity in Russian Gene Pool of Chickens. In Proceedings of the International Plant and Animal Genome XXV Conference, San Diego, CA, USA, 14–18 January 2017; Scherago International: San Diego, CA, USA, 2017. Abstract P0115. Available online: https://pag.confex.com/pag/xxv/webprogram/Paper23948.html (accessed on 5 July 2025).
  37. Dementieva, N.V.; Shcherbakov, Y.S.; Tyshchenko, V.I.; Terletsky, V.P.; Vakhrameev, A.B.; Nikolaeva, O.A.; Ryabova, A.E.; Azovtseva, A.I.; Mitrofanova, O.V.; Peglivanyan, G.K.; et al. Comparative analysis of molecular RFLP and SNP markers in assessing and understanding the genetic diversity of various chicken breeds. Genes 2022, 13, 1876. [Google Scholar] [CrossRef] [PubMed]
  38. Volkova, N.A.; German, N.Y.; Larionova, P.V.; Vetokh, A.N.; Romanov, M.N.; Zinovieva, N.A. Identification of SNPs and candidate genes associated with abdominal fat deposition in quails (Coturnix japonica). Sel’skokhozyaistvennaya Biol. (Agric. Biol.) 2023, 58, 1079–1087. [Google Scholar] [CrossRef]
  39. Deniskova, T.E.; Dotsev, A.V.; Koshkina, O.A.; Solovieva, A.D.; Churbakova, N.A.; Petrov, S.N.; Frolov, A.N.; Platonov, S.A.; Abdelmanova, A.S.; Vladimirov, M.A.; et al. Examination of runs of homozygosity distribution patterns and relevant candidate genes of potential economic interest in Russian goat breeds using whole-genome sequencing. Genes 2025, 16, 631. [Google Scholar] [CrossRef] [PubMed]
  40. Liu, Y.-H.; Wang, L.; Xu, T.; Guo, X.; Li, Y.; Yin, T.-T.; Yang, H.-C.; Hu, Y.; Adeola, A.C.; Sanke, O.J.; et al. Whole-genome sequencing of African dogs provides insights into adaptations against tropical parasites. Mol. Biol. Evol. 2018, 35, 287–298. [Google Scholar] [CrossRef]
  41. Tuttle, E.; Korody, M.; Lear, T.; Gonser, R.; Houck, M.; Ryder, O.; Romanov, M.; Balakrishnan, C.; Bergland, A.; Warren, W. Whole Genome Sequence of the Behaviorally Polymorphic White-throated Sparrow. 1: Mapping Genes for Sociogenomics. In Proceedings of the Evolution 2014 Conference, Raleigh, NC, USA, 20–24 June 2014; Society for the Study of Evolution (SSE); Society of Systematic Biologists (SSB). American Society of Naturalists (ASN): Raleigh, NC, USA, 2014. Abstract 628. p. 183. Available online: https://kar.kent.ac.uk/id/eprint/46698 (accessed on 5 July 2025).
  42. Ryder, O.; Chemnick, L.G.; Thomas, S.; Martin, J.; Romanov, M.; Ralls, K.; Ballou, J.D.; Mace, M.; Ratan, A.; Miller, W.; et al. Supporting California Condor Conservation Management through Analysis of Species-wide Whole Genome Sequence Variation. In Proceedings of the International Plant and Animal Genome XXII Conference, San Diego, CA, USA, 11–15 January 2014; Scherago International: San Diego, CA, USA, 2014. Abstract W635. Available online: https://pag.confex.com/pag/xxii/webprogram/Paper11851.html (accessed on 5 July 2025).
  43. Ryder, O.; Miller, W.; Ralls, K.; Ballou, J.D.; Steiner, C.C.; Mitelberg, A.; Romanov, M.; Chemnick, L.G.; Mace, M.; Schuster, S. Whole Genome Sequencing of California Condors Is Now Utilized for Guiding Genetic Management. In Proceedings of the International Plant and Animal Genome XXIV Conference, San Diego, CA, USA, 8–13 January 2016; Scherago International: San Diego, CA, USA, 2016. Abstract W741. Available online: http://kar.kent.ac.uk/61072 (accessed on 5 July 2025).
  44. Rehman, S.U.; Hassan, F.-U.; Luo, X.; Li, Z.; Liu, Q. Whole-genome sequencing and characterization of buffalo genetic resources: Recent advances and future challenges. Animals 2021, 11, 904. [Google Scholar] [CrossRef]
  45. Sun, J.; Chen, T.; Zhu, M.; Wang, R.; Huang, Y.; Wei, Q.; Yang, M.; Liao, Y. Whole-genome sequencing revealed genetic diversity and selection of Guangxi indigenous chickens. PLoS ONE 2022, 17, e0250392. [Google Scholar] [CrossRef]
  46. Chen, N.; Xia, X.; Hanif, Q.; Zhang, F.; Dang, R.; Huang, B.; Lyu, Y.; Luo, X.; Zhang, H.; Yan, H.; et al. Global genetic diversity, introgression, and evolutionary adaptation of indicine cattle revealed by whole genome sequencing. Nat. Commun. 2023, 14, 7803. [Google Scholar] [CrossRef]
  47. Alemu, S.W.; Kadri, N.K.; Harland, C.; Faux, P.; Charlier, C.; Caballero, A.; Druet, T. An evaluation of inbreeding measures using a whole-genome sequenced cattle pedigree. Heredity 2021, 126, 410–423. [Google Scholar] [CrossRef]
  48. Huo, J.L.; Zhang, L.Q.; Zhang, X.; Wu, X.W.; Ye, X.H.; Sun, Y.H.; Cheng, W.M.; Yang, K.; Pan, W.R.; Zeng, Y.Z. Genome-wide single nucleotide polymorphism array and whole-genome sequencing reveal the inbreeding progression of Banna minipig inbred line. Anim. Genet. 2022, 53, 146–151. [Google Scholar] [CrossRef]
  49. Chen, H.M.; Zhao, H.; Zhu, Q.Y.; Yan, C.; Liu, Y.Q.; Si, S.; Jamal, M.A.; Xu, K.X.; Jiao, D.L.; Lv, M.J.; et al. Genomic consequences of intensive inbreeding in miniature inbred pigs. BMC Genom. 2025, 26, 154. [Google Scholar] [CrossRef]
  50. Lou, R.N.; Jacobs, A.; Wilder, A.P.; Therkildsen, N.O. A beginner’s guide to low-coverage whole genome sequencing for population genomics. Mol. Ecol. 2021, 30, 5966–5993. [Google Scholar] [CrossRef] [PubMed]
  51. Wang, Q.; Lan, T.; Li, H.; Sahu, S.K.; Shi, M.; Zhu, Y.; Han, L.; Yang, S.; Li, Q.; Zhang, L.; et al. Whole-genome resequencing of Chinese pangolins reveals a population structure and provides insights into their conservation. Commun. Biol. 2022, 5, 821. [Google Scholar] [CrossRef] [PubMed]
  52. Zhu, Z.; Zhang, L.; Xin, Q.; Li, L.; Miao, Z.; Huang, Q.; Zheng, N. Whole-genome sequencing revealed the population structure of Fujian chicken breeds. Czech J. Anim. Sci. 2024, 69, 323–330. [Google Scholar] [CrossRef]
  53. Shi, H.; Li, T.; Su, M.; Wang, H.; Li, Q.; Lang, X.; Ma, Y. Whole genome sequencing revealed genetic diversity, population structure, and selective signature of Panou Tibetan sheep. BMC Genom. 2023, 24, 50. [Google Scholar] [CrossRef] [PubMed]
  54. Gebreselase, H.B.; Nigussie, H.; Wang, C.; Luo, C. Genetic diversity, population structure and selection signature in Begait goats revealed by whole-genome sequencing. Animals 2024, 14, 307. [Google Scholar] [CrossRef]
  55. Zhang, Y.; Wei, Z.; Zhang, M.; Wang, S.; Gao, T.; Huang, H.; Zhang, T.; Cai, H.; Liu, X.; Fu, T.; et al. Population structure and selection signal analysis of Nanyang cattle based on whole-genome sequencing data. Genes 2024, 15, 351. [Google Scholar] [CrossRef]
  56. Onogi, A.; Shirai, K.; Amano, T. Investigation of genetic diversity and inbreeding in a Japanese native horse breed for suggestions on its conservation. Anim. Sci. J. 2017, 88, 1902–1910. [Google Scholar] [CrossRef]
  57. Pokharel, K.; Weldenegodguad, M.; Reilas, T.; Kantanen, J. EquCab_Finn: A new reference genome assembly for the domestic horse, Finnhorse. Anim. Genet. 2024, 55, 766–771. [Google Scholar] [CrossRef]
  58. Tang, X.; Zhu, B.; Ren, R.; Chen, B.; Li, S.; Gu, J. Genome-wide copy number variation detection in a large cohort of diverse horse breeds by whole-genome sequencing. Front. Vet. Sci. 2023, 10, 1296213. [Google Scholar] [CrossRef]
  59. Chen, C.; Zhu, B.; Tang, X.; Chen, B.; Liu, M.; Gao, N.; Li, S.; Gu, J. Genome-wide assessment of runs of homozygosity by whole-genome sequencing in diverse horse breeds worldwide. Genes 2023, 14, 1211. [Google Scholar] [CrossRef]
  60. Maniatis, T.; Fritsch, E.F.; Sambrook, J. Molecular Cloning. A Laboratory Manual, 2nd ed.; Cold Spring Harbor Laboratories: Cold Spring Harbor, NY, USA, 1982. [Google Scholar]
  61. NCBI. Equus caballus Genome Assembly EquCab3.0; NCBI RefSeq assembly GCF_002863925.1; National Center for Biotechnology Information; National Library of Medicine: Bethesda, MD, USA, 2018. Available online: https://www.ncbi.nlm.nih.gov/datasets/genome/GCF_002863925.1/ (accessed on 5 July 2025).
  62. Li, H.; Durbin, R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics 2009, 25, 1754–1760. [Google Scholar] [CrossRef] [PubMed]
  63. Narasimhan, V.; Danecek, P.; Scally, A.; Xue, Y.; Tyler-Smith, C.; Durbin, R. BCFtools/RoH: A hidden Markov model approach for detecting autozygosity from next-generation sequencing data. Bioinformatics 2016, 32, 1749–1751. [Google Scholar] [CrossRef] [PubMed]
  64. Poplin, R.; Ruano-Rubio, V.; DePristo, M.A.; Fennell, T.J.; Carneiro, M.O.; Van der Auwera, G.A.; Kling, D.E.; Gauthier, L.D.; Levy-Moonshine, A.; Roazen, D. Scaling accurate genetic variant discovery to tens of thousands of samples. bioRxiv 2017, 201178. [Google Scholar] [CrossRef]
  65. McLaren, W.; Gil, L.; Hunt, S.E.; Riat, H.S.; Ritchie, G.R.S.; Thormann, A.; Flicek, P.; Cunningham, F. The Ensembl Variant Effect Predictor. Genome Biol. 2016, 17, 122. [Google Scholar] [CrossRef]
  66. Krzywinski, M.; Schein, J.; Birol, I.; Connors, J.; Gascoyne, R.; Horsman, D.; Jones, S.J.; Marra, M.A. Circos: An information aesthetic for comparative genomics. Genome Res. 2009, 19, 1639–1645. [Google Scholar] [CrossRef]
  67. Dyer, S.C.; Austine-Orimoloye, O.; Azov, A.G.; Barba, M.; Barnes, I.; Barrera-Enriquez, V.P.; Becker, A.; Bennett, R.; Beracochea, M.; Berry, A.; et al. Ensembl 2025. Nucleic Acids Res. 2025, 53, D948–D957. [Google Scholar] [CrossRef]
  68. Mushtaq, A. Ensembl 114 Has Been Released! Ensembl Blog. 2025. Available online: https://www.ensembl.info/2025/05/07/ensembl-114-has-been-released/ (accessed on 5 July 2025).
  69. Danecek, P.; Auton, A.; Abecasis, G.; Albers, C.A.; Banks, E.; DePristo, M.A.; Handsaker, R.E.; Lunter, G.; Marth, G.T.; Sherry, S.T. The variant call format and VCFtools. Bioinformatics 2011, 27, 2156–2158. [Google Scholar] [CrossRef]
  70. Yu, N.; Jensen-Seaman, M.I.; Chemnick, L.; Ryder, O.; Li, W.-H. Nucleotide diversity in gorillas. Genetics 2004, 166, 1375–1383. [Google Scholar] [CrossRef]
  71. Yang, J.; Lee, S.H.; Goddard, M.E.; Visscher, P.M. GCTA: A tool for genome-wide complex trait analysis. Am. J. Hum. Genet. 2011, 88, 76–82. [Google Scholar] [CrossRef] [PubMed]
  72. Wickham, H. ggplot2: Elegant Graphics for Data Analysis; Springer: New York, NY, USA, 2009; ISBN 978-0-387-98141-3. [Google Scholar] [CrossRef]
  73. R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2018; Available online: https://www.r-project.org/ (accessed on 5 July 2025).
  74. R Core Team. R-4; R Foundation for Statistical Computing: Vienna, Austria, 2020; Available online: https://cran.r-project.org/src/base/R-4/ (accessed on 5 July 2025).
  75. Nicholas, F.; Tammen, I.; Sydney Informatics Hub. Online Mendelian Inheritance in Animals (OMIA); The University of Sydney: Sydney, Australia, 1995. [Google Scholar] [CrossRef]
  76. Yi, W.; Hu, M.; Shi, L.; Li, T.; Bai, C.; Sun, F.; Ma, H.; Zhao, Z.; Yan, S. Whole genome sequencing identified genomic diversity and candidated genes associated with economic traits in Northeasern Merino in China. Front. Genet. 2024, 15, 1302222. [Google Scholar] [CrossRef]
  77. Scherf, B.D. (Ed.) World Watch List for Domestic Animal Diversity, 2nd ed.; FAO; UNEP: Rome, Italy, 1995; Available online: https://web.archive.org/web/20151126034309/http://www.fao.org/ag/againfo/programmes/en/lead/toolbox/Indust/wwl.pdf (accessed on 26 November 2015).
  78. Nikiforov, A.A.; Moiseeva, I.G.; Zakharov, I.A. Mesto russkikh porod kur v raznoobrazii porod Yevrazii. Position of Russian chicken breeds in the diversity of Eurasian breeds. Genetika 1998, 34, 850–851. Available online: https://pubmed.ncbi.nlm.nih.gov/9719931/ (accessed on 5 July 2025). (In Russian with English summary).
  79. Romanov, M.N.; Weigend, S. Genetic Diversity in Chicken Populations Based on Microsatellite Markers. In Proceedings of the Conference “From Jay Lush to Genomics: Visions for Animal Breeding and Genetics”, Ames, IA, USA, 16–18 May 1999; Dekkers, J.C.M., Lamont, S.J., Rothschild, M.F., Eds.; Iowa State University, Department of Animal Science: Ames, IA, USA, 1999; p. 174. Available online: https://web.archive.org/web/20050314091227/http://www.agbiotechnet.com/proceedings/jaylush.asp#34 (accessed on 14 March 2025).
  80. Rzabayev, T.; Rzabayev, S.; Rzabayev, K. A new intra-breed type, “Mamyr-Aktobe,” of the Kushum breed of horses of the Aktobe population. Arch. Razi Inst. 2022, 77, 2273. [Google Scholar] [CrossRef]
  81. Boettcher, P.; Martin, J.F.; Gandini, G.; Joshi, B.K.; Oldenbroek, J.K. In Vivo Conservation of Animal Genetic Resources. In FAO Animal Production and Health Guidelines; Commission on Genetic Resources for Food and Agriculture, FAO: Rome, Italy, 2013; Volume 14, Available online: https://www.fao.org/4/i3327e/i3327e.pdf (accessed on 5 July 2025).
  82. Woolliams, J.A.; Oldenbroek, J.K. Genetic diversity issues in animal populations in the genomic era. In Genomic Management of Animal Genetic Diversity; Oldenbroek, J.K., Ed.; Wageningen Academic Publishers: Wageningen, The Netherlands, 2017; pp. 13–47. [Google Scholar] [CrossRef]
  83. Eusebi, P.G.; Martinez, A.; Cortes, O. Genomic tools for effective conservation of livestock breed diversity. Diversity 2019, 12, 8. [Google Scholar] [CrossRef]
  84. Moiseeva, I.G. Vliyaniye inbridinga na kachestvo kurinykh yaits. The effect of inbreeding on the quality of fowl eggs. Genetika 1970, 6, 99–107, (In Russian with English Summary). [Google Scholar]
  85. Bondarenko, Y.V.; Popsuy, V.V. Vykorystannya inbrydynhu v suchasnomu svynarstvi. In The Use of Inbreeding in Modern Pig Farming; LLC Ahrar Mediyen Ukrayina: Kyiv, Ukraine, 2019; Available online: https://repo.snau.edu.ua/bitstream/123456789/6994/1/Бoндapeнкo%20Ю.%20Bикopистaння%20iнбpидингy.pdf (accessed on 5 July 2025). (In Ukrainian)
  86. Bailey, E.; Finno, C.J.; Cullen, J.N.; Kalbfleisch, T.; Petersen, J.L. Analyses of whole-genome sequences from 185 North American Thoroughbred horses, spanning 5 generations. Sci. Rep. 2024, 14, 22930. [Google Scholar] [CrossRef]
  87. Tozaki, T.; Ohnuma, A.; Kikuchi, M.; Ishige, T.; Kakoi, H.; Hirota, K.; Kusano, K.; Nagata, S. Rare and common variant discovery by whole-genome sequencing of 101 Thoroughbred racehorses. Sci. Rep. 2021, 11, 16057. [Google Scholar] [CrossRef]
  88. Al Abri, M.A.; Holl, H.M.; Kalla, S.E.; Sutter, N.B.; Brooks, S.A. Whole genome detection of sequence and structural polymorphism in six diverse horses. PLoS ONE 2020, 15, e0230899. [Google Scholar] [CrossRef] [PubMed]
  89. Brooks, S.A.; Gabreski, N.; Miller, D.; Brisbin, A.; Brown, H.E.; Streeter, C.; Mezey, J.; Cook, D.; Antczak, D.F. Whole-genome SNP association in the horse: Identification of a deletion in myosin Va responsible for Lavender Foal Syndrome. PLoS Genet. 2010, 6, e1000909. [Google Scholar] [CrossRef]
  90. Piro, M.; Benjouad, A.; Tligui, N.S.; Allali, K.E.; Kohen, M.E.; Nabich, A.; Ouragh, L. Frequency of the severe combined immunodeficiency disease gene among horses in Morocco. Equine Vet. J. 2008, 40, 590–591. [Google Scholar] [CrossRef]
  91. Wade, C.; Giulotto, E.; Sigurdsson, S.; Zoli, M.; Gnerre, S.; Imsland, F.; Lear, T.; Adelson, D.; Bailey, E.; Bellone, R.; et al. Genome sequence, comparative analysis, and population genetics of the domestic horse. Science 2009, 326, 865–867. [Google Scholar] [CrossRef]
  92. Zhang, Q.; Calus, M.P.; Guldbrandtsen, B.; Lund, M.S.; Sahana, G. Estimation of inbreeding using pedigree, 50k SNP chip genotypes and full sequence data in three cattle breeds. BMC Genet. 2015, 16, 88. [Google Scholar] [CrossRef]
  93. Najmabadi, H.; Hu, H.; Garshasbi, M.; Zemojtel, T.; Abedini, S.S.; Chen, W.; Hosseini, M.; Behjati, F.; Haas, S.; Jamali, P. Deep sequencing reveals 50 novel genes for recessive cognitive disorders. Nature 2011, 478, 57–63. [Google Scholar] [CrossRef]
  94. Carss, K.J.; Arno, G.; Erwood, M.; Stephens, J.; Sanchis-Juan, A.; Hull, S.; Megy, K.; Grozeva, D.; Dewhurst, E.; Malka, S. Comprehensive rare variant analysis via whole-genome sequencing to determine the molecular pathology of inherited retinal disease. Am. J. Hum. Genet. 2017, 100, 75–90. [Google Scholar] [CrossRef]
  95. Tatour, Y.; Bar-Joseph, H.; Shalgi, R.; Ben-Yosef, T. Male sterility and reduced female fertility in SCAPER-deficient mice. Hum. Mol. Genet. 2020, 29, 2240–2249. [Google Scholar] [CrossRef] [PubMed]
  96. Ghoreishifar, M.; Vahedi, S.M.; Salek Ardestani, S.; Khansefid, M.; Pryce, J.E. Genome-wide assessment and mapping of inbreeding depression identifies candidate genes associated with semen traits in Holstein bulls. BMC Genom. 2023, 24, 230. [Google Scholar] [CrossRef]
  97. Terefe, M.T. Identification of Adaptive Signatures in the Cattle Genome. Ph.D. Thesis, Seoul National University, Seoul, Republic of Korea, 2018. Available online: https://s-space.snu.ac.kr/handle/10371/140789 (accessed on 5 July 2025).
  98. Gao, J.; Lyu, Y.; Zhang, D.; Reddi, K.K.; Sun, F.; Yi, J.; Liu, C.; Li, H.; Yao, H.; Dai, J.; et al. Genomic characteristics and selection signatures in indigenous Chongming white goat (Capra hircus). Front. Genet. 2020, 11, 901. [Google Scholar] [CrossRef] [PubMed]
  99. Shang, P.; Li, W.; Tan, Z.; Zhang, J.; Dong, S.; Wang, K.; Chamba, Y. Population genetic analysis of ten geographically isolated Tibetan pig populations. Animals 2020, 10, 1297. [Google Scholar] [CrossRef]
  100. Yahyaoui, G.; Jemaa, S.B.; Kdidi, S.; Gaouar, S.B.S.; Yahyaoui, M.H. Identification of selection signatures in Algero-Tunisian sheep breeds using medium-density SNP chips. Genet. Biodiv. J. 2024, 8, 59–75. [Google Scholar]
  101. Elkjaer, M.L.; Frisch, T.; Reynolds, R.; Kacprowski, T.; Burton, M.; Kruse, T.A.; Thomassen, M.; Baumbach, J.; Illes, Z. Molecular signature of different lesion types in the brain white matter of patients with progressive multiple sclerosis. Acta Neuropathol. Commun. 2019, 7, 205. [Google Scholar] [CrossRef] [PubMed]
  102. van Kruistum, H.; van den Heuvel, J.; Travis, J.; Kraaijeveld, K.; Zwaan, B.J.; Groenen, M.A.M.; Megens, H.-J.; Pollux, B.J.A. The genome of the live-bearing fish Heterandria formosa implicates a role of conserved vertebrate genes in the evolution of placental fish. BMC Evol. Biol. 2019, 19, 156. [Google Scholar] [CrossRef]
  103. Dierks, C. Molecular Genetic Analysis of Quantitative Trait Loci (QTL) for Osteochondrosis in Hanoverian Warmblood Horses. Ph.D. Thesis, Institut für Tierzucht und Vererbungsforschung der Tierärztlichen Hochschule Hannover, Hannover, Germany, 2006. Available online: https://elib.tiho-hannover.de/receive/etd_mods_00002149 (accessed on 5 July 2025).
  104. Waddell, L.A.; Lefevre, L.; Bush, S.J.; Raper, A.; Young, R.; Lisowski, Z.M.; McCulloch, M.E.B.; Muriuki, C.; Sauter, K.A.; Clark, E.L.; et al. ADGRE1 (EMR1, F4/80) is a rapidly-evolving gene expressed in mammalian monocyte-macrophages. Front. Immunol. 2018, 9, 2246. [Google Scholar] [CrossRef] [PubMed]
  105. Lin, H.-H.; Faunce, D.E.; Stacey, M.; Terajewicz, A.; Nakamura, T.; Zhang-Hoover, J.; Kerley, M.; Mucenski, M.L.; Gordon, S.; Stein-Streilein, J. The macrophage F4/80 receptor is required for the induction of antigen-specific efferent regulatory T cells in peripheral tolerance. J. Exp. Med. 2005, 201, 1615–1625. [Google Scholar] [CrossRef]
  106. Al-Quraishy, S.; Dkhil, M.A.; Abdel-Baki, A.A.S.; Delic, D.; Santourlidis, S.; Wunderlich, F. Genome-wide screening identifies Plasmodium chabaudi-induced modifications of DNA methylation status of Tlr1 and Tlr6 gene promoters in liver, but not spleen, of female C57BL/6 mice. Parasitol. Res. 2013, 112, 3757–3770. [Google Scholar] [CrossRef]
  107. Dander, E.; Vinci, P.; Vetrano, S.; Recordati, C.; Piazza, R.; Fazio, G.; Bardelli, D.; Bugatti, M.; Sozio, F.; Piontini, A.; et al. The chemerin/CMKLR1 axis regulates intestinal graft-versus-host disease. JCI Insight. 2023, 8, e154440. [Google Scholar] [CrossRef]
  108. de Camargo, G.M.F.; Aspilcueta-Borquis, R.R.; Fortes, M.R.S.; Porto-Neto, R.; Cardoso, D.F.; Santos, D.J.A.; Lehnert, S.A.; Reverter, A.; Moore, S.S.; Tonhati, H. Prospecting major genes in dairy buffaloes. BMC Genom. 2015, 16, 872. [Google Scholar] [CrossRef]
  109. Bozaoglu, K.; Bolton, K.; McMillan, J.; Zimmet, P.; Jowett, J.; Collier, G.; Walder, K.; Segal, D. Chemerin is a novel adipokine associated with obesity and metabolic syndrome. Endocrinology 2007, 148, 4687–4694. [Google Scholar] [CrossRef]
  110. Sell, H.; Divoux, A.; Poitou, C.; Basdevant, A.; Bouillot, J.-L.; Bedossa, P.; Tordjman, J.; Eckel, J.; Clement, K. Chemerin correlates with markers for fatty liver in morbidly obese patients and strongly decreases after weight loss induced by bariatric surgery. J. Clin. Endocrinol. Metab. 2010, 95, 2892–2896. [Google Scholar] [CrossRef]
  111. Adhikari, M.; Kantar, M.B.; Longman, R.J.; Lee, C.N.; Oshiro, M.; Caires, K.; He, Y. Genome-wide association study for carcass weight in pasture-finished beef cattle in Hawai’i. Front. Genet. 2023, 14, 1168150. [Google Scholar] [CrossRef] [PubMed]
  112. Alam, M.Z.; Haque, M.A.; Iqbal, A.; Lee, Y.-M.; Ha, J.-J.; Jin, S.; Park, B.; Kim, N.-Y.; Won, J.I.; Kim, J.-J. Genome-wide association study to identify QTL for carcass traits in Korean Hanwoo cattle. Animals 2023, 13, 2737. [Google Scholar] [CrossRef]
  113. Atashi, H.; Chen, Y.; Wilmot, H.; Vanderick, S.; Hubin, X.; Soyeurt, H.; Gengler, N. Single-step genome-wide association for selected milk fatty acids in Dual-Purpose Belgian Blue cows. J. Dairy Sci. 2023, 106, 6299–6315. [Google Scholar] [CrossRef]
  114. de las Heras-Saldana, S.; Clark, S.A.; Duijvesteijn, N.; Gondro, C.; van der Werf, J.H.J.; Chen, Y. Combining information from genome-wide association and multi-tissue gene expression studies to elucidate factors underlying genetic variation for residual feed intake in Australian Angus cattle. BMC Genom. 2019, 20, 939. [Google Scholar] [CrossRef]
  115. Lindholm-Perry, A.K.; Freetly, H.C.; Oliver, W.T.; Rempel, L.A.; Keel, B.N. Genes associated with body weight gain and feed intake identified by meta-analysis of the mesenteric fat from crossbred beef steers. PLoS ONE 2020, 15, e0227154. [Google Scholar] [CrossRef]
  116. Ablondi, M.; Dadousis, C.; Vasini, M.; Eriksson, S.; Mikko, S.; Sabbioni, A. Genetic diversity and signatures of selection in a native Italian horse breed based on SNP data. Animals 2020, 10, 1005. [Google Scholar] [CrossRef] [PubMed]
  117. Sievers, J.; Distl, O. Genomic patterns of homozygosity and genetic diversity in the Rhenish German draught horse. Genes 2025, 16, 327. [Google Scholar] [CrossRef] [PubMed]
  118. Vahedi, S.M.; Salek Ardestani, S.; Karimi, K.; Banabazi, M.H. Weighted single-step GWAS for body mass index and scans for recent signatures of selection in Yorkshire pigs. J. Hered. 2022, 113, 325–335. [Google Scholar] [CrossRef] [PubMed]
  119. Wang, F.; Zha, Z.; He, Y.; Li, J.; Zhong, Z.; Xiao, Q.; Tan, Z. Genome-wide re-sequencing data reveals the population structure and selection signatures of Tunchang pigs in China. Animals 2023, 13, 1835. [Google Scholar] [CrossRef]
  120. Murgiano, L.; Becker, D.; Spector, C.; Carlin, K.; Santana, E.; Niggel, J.K.; Jagannathan, V.; Leeb, T.; Pearce-Kelling, S.; Aguirre, G.D.; et al. CCDC66 frameshift variant associated with a new form of early-onset progressive retinal atrophy in Portuguese Water Dogs. Sci. Rep. 2020, 10, 21162. [Google Scholar] [CrossRef]
  121. Dekomien, G.; Vollrath, C.; Petrasch-Parwez, E.; Boevé, M.H.; Akkad, D.A.; Gerding, W.M.; Epplen, J.T. Progressive retinal atrophy in Schapendoes dogs: Mutation of the newly identified CCDC66 gene. Neurogenetics 2010, 11, 163–174. [Google Scholar] [CrossRef]
  122. Volkova, N.A.; Kotova, T.O.; Vetokh, A.N.; Larionova, P.V.; Volkova, L.A.; Romanov, M.N.; Zinovieva, N.A. Genome-wide association study of testes development indicators in roosters (Gallus gallus L.). Sel’skokhozyaistvennaya Biol. (Agric. Biol.) 2024, 59, 649–657. [Google Scholar] [CrossRef]
  123. Lu, D.; Willard, D.; Patel, I.R.; Kadwell, S.; Overton, L.; Kost, T.; Luther, M.; Chen, W.; Woychik, R.P.; Wilkison, W.O. Agouti protein is an antagonist of the melanocyte-stimulating-hormone receptor. Nature 1994, 371, 799–802. [Google Scholar] [CrossRef]
  124. Cosso, G.; Carcangiu, V.; Luridiana, S.; Fiori, S.; Columbano, N.; Masala, G.; Careddu, G.M.; Sanna Passino, E.; Mura, M.C. Characterization of the Sarcidano Horse coat color genes. Animals 2022, 12, 2677. [Google Scholar] [CrossRef] [PubMed]
  125. Haase, B.; Brooks, S.A.; Schlumbaum, A.; Azor, P.J.; Bailey, E.; Alaeddine, F.; Mevissen, M.; Burger, D.; Poncet, P.-A.; Rieder, S.; et al. Allelic heterogeneity at the equine KIT locus in dominant white (W) horses. PLoS Genet. 2007, 3, e195. [Google Scholar] [CrossRef]
  126. Rönnstrand, L. Signal transduction via the stem cell factor receptor/c-Kit. Cell. Mol. Life Sci. 2004, 61, 2535–2548. [Google Scholar] [CrossRef] [PubMed]
  127. Henkel, J.; Lafayette, C.; Brooks, S.A.; Martin, K.; Patterson-Rosa, L.; Cook, D.; Jagannathan, V.; Leeb, T. Whole-genome sequencing reveals a large deletion in the MITF gene in horses with white spotted coat colour and increased risk of deafness. Anim. Genet. 2019, 50, 172–174. [Google Scholar] [CrossRef] [PubMed]
  128. Magdesian, K.G.; Tanaka, J.; Bellone, R.R. A de novo MITF deletion explains a novel splashed white phenotype in an American Paint Horse. J. Hered. 2020, 111, 287–293. [Google Scholar] [CrossRef]
  129. Hauswirth, R.; Haase, B.; Blatter, M.; Brooks, S.A.; Burger, D.; Drögemüller, C.; Gerber, V.; Henke, D.; Janda, J.; Jude, R.; et al. Mutations in MITF and PAX3 cause “splashed white” and other white spotting phenotypes in horses. PLoS Genet. 2012, 8, e1002653. [Google Scholar] [CrossRef]
  130. McFadden, A.; Martin, K.; Foster, G.; Vierra, M.; Lundquist, E.W.; Everts, R.E.; Martin, E.; Volz, E.; McLoone, K.; Brooks, S.A.; et al. Two novel variants in MITF and PAX3 associated with splashed white phenotypes in horses. J. Equine Vet. Sci. 2023, 128, 104875. [Google Scholar] [CrossRef]
  131. Bellone, R.R.; Tanaka, J.; Esdaile, E.; Sutton, R.B.; Payette, F.; Leduc, L.; Till, B.J.; Abdel-Ghaffar, A.K.; Hammond, M.; Magdesian, K.G. A de novo 2.3 kb structural variant in MITF explains a novel splashed white phenotype in a Thoroughbred family. Anim. Genet. 2023, 54, 752–762. [Google Scholar] [CrossRef]
  132. Negro, S.; Imsland, F.; Valera, M.; Molina, A.; Solé, M.; Andersson, L. Association analysis of KIT, MITF, and PAX3 variants with white markings in Spanish horses. Anim. Genet. 2017, 48, 349–352. [Google Scholar] [CrossRef]
  133. Promerová, M.; Andersson, L.S.; Juras, R.; Penedo, M.C.T.; Reissmann, M.; Tozaki, T.; Bellone, R.; Dunner, S.; Hořín, P.; Imsland, F.; et al. Worldwide frequency distribution of the ‘Gait keeper’ mutation in the DMRT3 gene. Anim. Genet. 2014, 45, 274–282. [Google Scholar] [CrossRef] [PubMed]
  134. Moazemi, I.; Mohammadabadi, M.R.; Mostafavi, A.; Esmailizadeh, A.K.; Babenko, O.I.; Bushtruk, M.V.; Tkachenko, S.V.; Stavetska, R.V.; Klopenko, N.I. Polymorphism of DMRT3 Gene and its association with body measurements in horse breeds. Russ. J. Genet. 2020, 56, 1232–1240. [Google Scholar] [CrossRef]
  135. Sonali; Bhardwaj, A.; Unnati; Nayan, V.; Legha, R.A.; Bhattacharya, T.K.; Pal, Y.; Giri, S.K. Identification and characterization of single nucleotide polymorphisms in DMRT3 gene in Indian horse (Equus caballus) and donkey (Equus asinus) populations. Anim. Biotechnol. 2023, 34, 4910–4920. [Google Scholar] [CrossRef] [PubMed]
  136. Kolberg, L.; Raudvere, U.; Kuzmin, I.; Adler, P.; Vilo, J.; Peterson, H. g:Profiler—Interoperable web service for functional enrichment analysis and gene identifier mapping (2023 update). Nucleic Acids Res. 2023, 51, W207–W212. [Google Scholar] [CrossRef]
  137. Nurmakhanbetov, D.M.; Sydykov, D.A.; Nasyrkhanova, B.K.; Kozhanov, Z.E. Selection and breeding work with Kazakh horse type Zhabe in peasant farms of Kazakhstan. [S Sejfullin Atyndaġy K̦az. Agroteh. Univ. Ġylym Žaršysy] Her. Sci. S Seifullin Kazakh Agro Techn. Res. Univ. Multidiscip. 2022, 3, 181–191. [Google Scholar] [CrossRef]
Figure 1. (A) The Mugalzhar horse breed (Photo by K. Iskhan); (B) The Mugalzhar horse breed with «zebra pattern» stripes on the legs; (C) and characteristic «dorsal stripe» (Photo by T. Assanbayev).
Figure 1. (A) The Mugalzhar horse breed (Photo by K. Iskhan); (B) The Mugalzhar horse breed with «zebra pattern» stripes on the legs; (C) and characteristic «dorsal stripe» (Photo by T. Assanbayev).
Animals 15 02667 g001
Figure 2. Violin plots for all identified variants in each of 32 horse chromosomes: (A) density of allele frequency, (B) density of BaseQRankSum statistics, and (C) shows density of sequencing depth. Data are pooled from all 20 Mugalzhar horses.
Figure 2. Violin plots for all identified variants in each of 32 horse chromosomes: (A) density of allele frequency, (B) density of BaseQRankSum statistics, and (C) shows density of sequencing depth. Data are pooled from all 20 Mugalzhar horses.
Animals 15 02667 g002
Figure 3. Principal component analysis (PCA) of the population structure and genetic relationships between 20 Mugalzhar individuals: (A) distribution of the individuals in the projections of two coordinates, i.e., the first (PC1; X-axis) and second (PC2; Y-axis) principal components, showing percentage of total genetic variation that can be explained by each of the two PCs (indicated within the parentheses); (B) same in the dimensions of two coordinates, i.e., PC1 (X-axis) and the third principal component (PC3; Y-axis).
Figure 3. Principal component analysis (PCA) of the population structure and genetic relationships between 20 Mugalzhar individuals: (A) distribution of the individuals in the projections of two coordinates, i.e., the first (PC1; X-axis) and second (PC2; Y-axis) principal components, showing percentage of total genetic variation that can be explained by each of the two PCs (indicated within the parentheses); (B) same in the dimensions of two coordinates, i.e., PC1 (X-axis) and the third principal component (PC3; Y-axis).
Animals 15 02667 g003
Figure 4. Circos (version 0.69-9; [66]) plot showing 32 chromosomes of Equus caballus. Chromosomes are divided into bins of 1,000,000 nucleotides. Description from the outermost to the innermost circle: I, rainbow scale colored bars are the chromosomes and black/white barcodes means gene concentration in the reference genome (EquCab3.0; [61]); II, concentration of variants detected in our data (red) and concentration of variants passing the allele frequency and sequencing depth filtering (blue); III, frequency of detected effects of variants (using Variant Effect Predictor (VEP) tool [65]) divided into four groups: intron (blue), intergenic (green), up- or downstream gene (orange), and other (purple); IV, concentration of detected consequences of variants obtained after the filtration step and divided into four groups: intron (blue), intergenic (green), up- or downstream gene (orange), and other (purple). Darker colors on heatmap indicate higher concentrations.
Figure 4. Circos (version 0.69-9; [66]) plot showing 32 chromosomes of Equus caballus. Chromosomes are divided into bins of 1,000,000 nucleotides. Description from the outermost to the innermost circle: I, rainbow scale colored bars are the chromosomes and black/white barcodes means gene concentration in the reference genome (EquCab3.0; [61]); II, concentration of variants detected in our data (red) and concentration of variants passing the allele frequency and sequencing depth filtering (blue); III, frequency of detected effects of variants (using Variant Effect Predictor (VEP) tool [65]) divided into four groups: intron (blue), intergenic (green), up- or downstream gene (orange), and other (purple); IV, concentration of detected consequences of variants obtained after the filtration step and divided into four groups: intron (blue), intergenic (green), up- or downstream gene (orange), and other (purple). Darker colors on heatmap indicate higher concentrations.
Animals 15 02667 g004
Table 1. Reads counts, sequence quality control (QC), and mapping statistics for each sample.
Table 1. Reads counts, sequence quality control (QC), and mapping statistics for each sample.
Horse IDRead CountRead (Raw/Trim)QC > 20, %QC > 35, %Alignment, %
1421,952,515150/14098.6390.9499.69
2420,140,194150/14098.6790.7899.77
3389,171,234150/14099.0193.0599.81
4562,962,552150/14098.8292.1099.74
5266,970,766150/14099.1193.0999.80
6466,017,248150/14099.0092.6599.79
7232,870,671150/14099.2594.8499.86
8502,411,868150/14098.8992.6399.80
9362,727,174150/14099.2694.5899.83
10429,967,714150/14098.9593.0599.78
11363,588,160150/14098.6490.7899.78
12400,995,480150/14098.9692.9999.82
13337,511,396150/14098.5790.6799.78
14335,204,785150/14098.9092.6599.80
15347,165,492150/14098.9592.7099.79
16525,540,100150/14098.7791.6199.75
17216,143,288150/14098.8792.8299.75
18417,983,452150/14099.1393.7099.82
19339,261,443150/14098.8792.5999.72
20271,018,576150/14099.1393.6099.86
Table 2. Distribution of variants over 32 horse chromosomes.
Table 2. Distribution of variants over 32 horse chromosomes.
ECC 1All VariantsBi-Allelic Variants
IndelsSNPsTotalIndelsSNPsTotal
1 (NC_009144.3 2)147,7221,357,4531,505,175126,3691,349,5651,475,934
2 (NC_009145.3)97,645899,301996,94683,781893,743977,524
3 (NC_009146.3)93,450868,813962,26380,354863,936944,290
4 (NC_009147.3)91,653852,027943,68078,769846,792925,561
5 (NC_009148.3)76,667690,517767,18465,553686,424751,977
6 (NC_009149.3)74,571674,296748,86763,869670,070733,939
7 (NC_009150.3)79,382727,068806,45068,070722,632790,702
8 (NC_009151.3)80,305776,161856,46669,441771,166840,607
9 (NC_009152.3)64,599603,074667,67355,591599,656655,247
10 (NC_009153.3)72,890649,547722,43762,265645,511707,776
11 (NC_009154.3)48,745419,897468,64241,227417,041458,268
12 (NC_009155.3)41,910427,686469,59637,294423,529460,823
13 (NC_009156.3)38,426363,283401,70932,834360,942393,776
14 (NC_009157.3)73,042668,303741,34562,609664,494727,103
15 (NC_009158.3)72,635679,471752,10662,273675,408737,681
16 (NC_009159.3)68,916635,961704,87758,733632,351691,084
17 (NC_009160.3)69,046642,029711,07559,593638,266697,859
18 (NC_009161.3)71,768664,346736,11461,759660,204721,963
19 (NC_009162.3)54,275508,176562,45146,645505,024551,669
20 (NC_009163.3)74,882693,542768,42466,477683,845750,322
21 (NC_009164.3)50,108484,203534,31143,275481,038524,313
22 (NC_009165.3)39,885385,028424,91334,402382,654417,056
23 (NC_009166.3)43,983398,492442,47537,631396,114433,745
24 (NC_009167.3)40,132376,887417,01934,381374,407408,788
25 (NC_009168.3)30,733291,433322,16626,275289,800316,075
26 (NC_009169.3)38,294381,680419,97433,366379,107412,473
27 (NC_009170.3)36,748348,404385,15231,760346,037377,797
28 (NC_009171.3)36,445347,184383,62931,480345,095376,575
29 (NC_009172.3)32,311313,570345,88128,037311,406339,443
30 (NC_009173.3)28,951269,626298,57725,025267,744292,769
31 (NC_009174.3)22,581212,724235,30519,245211,489230,734
X (NC_009175.3)97,952754,114852,06684,521749,848834,369
Total1,990,65218,364,29620,354,9481,712,90418,245,33819,958,242
1 ECC, Equus caballus chromosome; 2 NC_0091XX.X, NCBI Reference Sequence No. [61].
Table 3. Distribution of identified bi-allelic variants over functional annotation classes.
Table 3. Distribution of identified bi-allelic variants over functional annotation classes.
Functional Ontology ClassSNPsIndelsTotal
intergenic variant13,745,9731,243,96314,989,936
intron variant3,359,772351,9673,711,739
upstream gene variant557,18160,416617,597
downstream gene variant229,35624,047253,403
5′ UTR variant132,69114,854147,545
3′ UTR variant71,784807379,857
missense variant62,111062,111
synonymous variant42,319042,319
non coding transcript exon variant23,580128924,869
splice region variant13,331200915,340
frameshift variant044444444
splice donor variant26103232933
stop gained2336642400
start lost988491037
splice acceptor variant669156825
inframe deletion0801801
stop lost50912521
inframe insertion0403403
stop retained variant1267133
protein altering variant02222
coding sequence variant257
Total18,245,3381,712,90419,958,242
Table 4. Descriptive statistics of inbreeding coefficients calculated using three different methods.
Table 4. Descriptive statistics of inbreeding coefficients calculated using three different methods.
Inbreeding Estimation Method 1MinimumMaximumAverage
FGRM−0.1200.062−0.038
FHOM−0.1490.188−0.033
FUNI−0.0570.058−0.033
1 FGRM, inbreeding coefficient driven from genomic relationship matrix (GRM); FHOM, Wright’s inbreeding coefficient based on the proportion of the loci with higher observed homozygosity than expected homozygosity; FUNI, Wright’s inbreeding coefficient based on the correlation between alleles in uniting gametes.
Table 5. Prioritized candidate genes harboring high-impact homozygous exon variants fixed in the Mugalzhar horse.
Table 5. Prioritized candidate genes harboring high-impact homozygous exon variants fixed in the Mugalzhar horse.
Gene SymbolEnsembl Gene IDNo. of VariantsECA 1StartEndNo. of OrthologuesNo. of Paralogues
SCAPERENSECAG0000001727291117,976,410118,465,9522071
FHAD1ENSECAG000000251268237,672,05037,824,7681421
MMP15ENSECAG000000001966310,831,20110,851,18527322
ADGRE1ENSECAG00000017237574,879,4484,948,79210350
CMKLR1ENSECAG0000004938210814,730,55414,789,7103447
MRPL15ENSECAG0000001211015930,176,71030,221,285225
ZNF667ENSECAG0000001099561025,714,42625,740,6471757
CCDC66ENSECAG0000001866281633,029,04033,134,439179
LOC100055310ENSECAG000000358706236,312,9306,557,2003025
1 ECA, Equus caballus autosome.
Table 6. Variants associated with Mendelian traits and segregating in the Mugalzhar horse population.
Table 6. Variants associated with Mendelian traits and segregating in the Mugalzhar horse population.
Variants 1A1 2A2 2MAF 3No. HeterozygotesType of VariantGenePhenotype
ECA3:g.36979560C > TTC0.1004missenseMC1Rcoat color, chestnut
ECA3:g.79538738C > TTC0.0251missenseKITwhite spotting
ECA3:g.79548220T > CTC0.0251missenseKITcoat color, dominant white
ECA3:g.79566881T > CCT0.0251missenseKITincreased white spotting
ECA16:g.21555811delinsAAATAC0.0251deletionMITFsplashed white
ECA16:g.21608936C > TCA0.0753regulatoryMITFwhite splashing
ECA23:g.22391254C > AAC0.0251stop-gainDMRT3gaitedness
1 ECA: Equus caballus autosomes; 2 A1 and A2, alleles 1 and 2; 3 MAF, minor allele frequency.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Kassymbekova, S.N.; Bimenova, Z.Z.; Iskhan, K.Z.; Sobiech, P.; Jastrzebski, J.P.; Brym, P.; Babis, W.; Kalykova, A.S.; Otebayev, Z.M.; Kabylbekova, D.I.; et al. Uncovering Genetic Diversity and Adaptive Candidate Genes in the Mugalzhar Horse Breed Using Whole-Genome Sequencing Data. Animals 2025, 15, 2667. https://doi.org/10.3390/ani15182667

AMA Style

Kassymbekova SN, Bimenova ZZ, Iskhan KZ, Sobiech P, Jastrzebski JP, Brym P, Babis W, Kalykova AS, Otebayev ZM, Kabylbekova DI, et al. Uncovering Genetic Diversity and Adaptive Candidate Genes in the Mugalzhar Horse Breed Using Whole-Genome Sequencing Data. Animals. 2025; 15(18):2667. https://doi.org/10.3390/ani15182667

Chicago/Turabian Style

Kassymbekova, Shinara N., Zhanat Z. Bimenova, Kairat Z. Iskhan, Przemyslaw Sobiech, Jan P. Jastrzebski, Pawel Brym, Wiktor Babis, Assem S. Kalykova, Zhassulan M. Otebayev, Dinara I. Kabylbekova, and et al. 2025. "Uncovering Genetic Diversity and Adaptive Candidate Genes in the Mugalzhar Horse Breed Using Whole-Genome Sequencing Data" Animals 15, no. 18: 2667. https://doi.org/10.3390/ani15182667

APA Style

Kassymbekova, S. N., Bimenova, Z. Z., Iskhan, K. Z., Sobiech, P., Jastrzebski, J. P., Brym, P., Babis, W., Kalykova, A. S., Otebayev, Z. M., Kabylbekova, D. I., Baneh, H., & Romanov, M. N. (2025). Uncovering Genetic Diversity and Adaptive Candidate Genes in the Mugalzhar Horse Breed Using Whole-Genome Sequencing Data. Animals, 15(18), 2667. https://doi.org/10.3390/ani15182667

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop