Breed Distribution and Allele Frequencies of Base Coat Color, Dilution, and White Patterning Variants across 28 Horse Breeds

Since domestication, horses have been selectively bred for various coat colors and white spotting patterns. To investigate breed distribution, allele frequencies, and potential lethal variants for recommendations on genetic testing, 29 variants within 14 genes were investigated in 11,281 horses from 28 breeds. The recessive chestnut ea allele in melanocortin 1 receptor (MC1R) (p.D84N) was identified in four breeds: Knabstrupper, Paint Horse, Percheron, and Quarter Horse. After filtering for relatedness, ea allele frequency in Knabstruppers was estimated at 0.035, thus illustrating the importance of testing for mate selection for base coat color. The Rocky Mountain Horse breed had the highest allele frequency for two of the dilution variants under investigation (Za.f. = 0.32 and Cha.f. = 0.026); marker-assisted selection in this breed could aid in the production of horses with desirable dilute coats with less severe ocular anomalies caused by the silver (Z) allele. With regard to white patterning, nine horses homozygous for the paired box 3 (PAX3) splashed white 2 (SW2) allele (p.C70Y) and six horses homozygous for the KIT proto-oncogene, receptor tyrosine kinase (KIT) sabino 1 (SB1) allele (ECA3g.79544206A>T) were identified, thus determining they are rare and confirming that homozygosity for SW2 is not embryonic lethal. The KIT dominant white 20 (W20) allele (p.R682H) was identified in all but three breeds: Arabian (n = 151), Icelandic Horse (n = 66), and Norwegian Fjord Horse (n = 90). The role of W20 in pigmentation across breeds is not well understood; given the different selection regimes of the breeds investigated, these data provide justification for further evaluating the functional role of this allele in pigmentation. Here, we present the largest dataset reported for coat color variants in horses to date, and these data highlight the importance of breed-specific studies to inform on the proper use of marker-assisted selection and to develop hypotheses related to pigmentation for further testing in horses.


Introduction
Since domestication, horses have been selectively bred for a variety of pigmentation phenotypes [1]. Coat color is an important economic trait, and in some cases, it is a breeddefining phenotype. Coat color is also relevant from a health perspective, as some equine pigmentation variants have been connected to genetic disorders. One of the first variants discovered in the horse was a recessive mutation in the melanocortin 1 receptor (MC1R), which causes the chestnut phenotype, characterized by red-pigmented hair in the body and points (mane, tail, lower legs, and ear rims) [2]. Since then, over 60 variants contributing to equine pigmentation have been identified [3].
Coat color phenotypes can be divided into three categories: base coat color, pigmentation dilution, and white patterning. The base coat color for horses is typically characterized deafness [23] to ocular issues [24] to lethality [25]. In some cases, only homozygous individuals are affected, as in the case of lethal white overo [25], congenital stationary night blindness [26], and potentially other embryonic lethal white variants mentioned above. In other cases, homozygotes are more severely affected than heterozygotes; for example, multiple congenital ocular anomalies (MCOA) are associated with the same mutation in the PMEL gene that causes silver coat dilution [8,9]. Homozygotes have a more severe ocular disease that includes cataracts, iris stromal hypoplasia, abnormal pectinate ligaments, megaloglobus, and iridociliary cysts, whereas heterozygotes typically present only with iridociliary cysts [8,9]. Therefore, genetic testing can be a powerful selection tool for producing desirable traits while limiting pleiotropic anomalies in breeds in which such alleles are found. An understanding of breed-specific allelic and genotypic distribution can inform the proper utilization and interpretation of the results for this purpose. Here, we aim to further evaluate potential homozygous lethal variants and investigate the breed distribution and allele frequencies across 28 breeds of variants in 14 genes known to contribute to equine pigmentation phenotypes.
To determine allelic and genotypic distributions across breeds, all variants were first evaluated without considering relatedness among individuals within breeds. However, to estimate breed-specific allele frequencies (a.f.) of variants in this study, the initial dataset was filtered for within-breed relatedness by removing first-degree relatives based on the date of sample submission; i.e., genotypes for the oldest banked sample were included in the study, and all of its first-degree relatives subsequently tested were removed from the dataset. The filtered cohort comprised 6,878 individuals representing 28 breeds, with the number of horses per breed ranging from 31 (Hanoverian) to 3,193 (Quarter Horse) (mean = 246 horses/breed; median = 78 horses/breed; Supplementary Table S1). Allele frequencies were calculated using the GenAlex 6.502 add-in [30,31] for Microsoft Excel 2016 (Microsoft Corporation, Redmond, Washington, United States).
The dominant D allele causing the dun dilution phenotype with primitive marks was not detected in 9 breeds analyzed: Arabian, Dutch Warmblood, Gypsy Vanner, Hanoverian, Knabstrupper, Oldenburg, Percheron, Standardbred, and Thoroughbred. In breeds in which this allele was detected, it was estimated to be fixed (a.f. = 1.0) in the Norwegian Fjord Horse, whereas the lowest allele frequency was estimated for the Tennessee Walking Horse (a.f. = 0.0060). The nd1 (non-dun 1) allele, which causes primitive marks without diluting the coat, was identified in all breeds except for Gypsy Vanner and Norwegian Fjord Horse, and its frequency ranged from 0.79 in Pura Raza Española to 0.017 in Gypsy Cob. Similarly, the nd2 (non-dun 2) allele (no dun dilution and no primitive marks) was found in all breeds, except for the Norwegian Fjord Horse. Its allele frequencies were relatively high across all breeds investigated, ranging from 1.0 in Gypsy Vanner to 0.20 in Pura Raza Española.
The mushroom (Mu) allele, while only tested in Icelandic Horse, Miniature Horse, Quarter Horse, and Shetland Pony, was detected only in Shetland Ponies, with an allele frequency after filtering for relatedness of 0.23. Finally, the silver (Z) allele was found in 15 breeds, with the highest allele frequency estimated for Rocky Mountain Horse (a.f. = 0.32) and the lowest in Quarter Horse (a.f. = 0.0011). Unfiltered genotype counts and allele frequencies for the filtered dataset can be found in Supplementary Tables S3 and S4, respectively.

White Patterning Loci
Homozygotes for three of the four KIT dominant white variants investigated (W5, W10, W22) were not detected in our dataset ( Table 2). Compound heterozygotes were detected only for W20 (one W10/W20 and three W20/W22). The dominant white 20 (W20) variant was found to have a widespread distribution, detected in 25/28 breeds included in this study, with a total of 340 W20 homozygotes from 21 breeds ( Table 2). The allele frequency for this variant was lowest in the Rocky Mountain Horse (a.f. = 0.0090) and highest in the Gyspy Cob (a.f. = 0.33) and Thoroughbred (a.f. = 0.34). The W20 allele was not detected in Arabians (n = 180), Icelandic Horses (n = 73), and Norwegian Fjord Horses (n = 121). The dominant white 5 (W5) in KIT was only identified in two related heterozygous Thoroughbreds, with a population filtered allele frequency of 0.0070 (Supplementary Tables S2 and S3). Conversely, the W10 variant was found in two Paint Horses (N/W10) and two Quarter Horses (one N/W10 and one W10/W20), whereas the W22 mutation was identified in one Oldenburg (a.f. = 0.011), 4 Paint Horses (a.f. = 0.00096), 6 Quarter Horses (a.f. = 0.00054), and 2 Thoroughbreds (a.f. = 0.014). All horses with W22 can be traced back to the ancestor, originally described as the founder [22].   Table S4). SB1 is reported to follow an incompletely dominant inheritance pattern, with homozygotes displaying a more pronounced phenotype [32]. In this large dataset, only six individuals were homozygous for this variant (4 Paint Horses, 1 Shetland Pony, and 1 Tennessee Walking Horse), and photographic records were available for four of these ( Figure 1), all of which displayed an all-white phenotype without any other known white spotting variants (Table 3).
Tobiano (TO) [33] was more widely distributed and was identified in 787 horses from 15 breeds, with allele frequencies ranging from 0.22 in Shetland Ponies to 0.0033 in Arabian Horses (Supplementary Table S4). A total of 179 TO/TO homozygotes were identified in 9 breeds: Appaloosa, Gypsy Cob, Gypsy Vanner, Icelandic Horse, Miniature Horse, Missouri Fox Trotter, Paint Horse, Shetland Pony, and Tennessee Walking Horse (Supplementary Table S3). Investigating the distribution of horses with multiple variants at the KIT locus identified 140 horses with a combination of W20, SB1, and/or TO ( Table 4). The majority of individuals with both TO and W20 alleles were found in Paint Horse (n = 59) and Gypsy Cob (n = 12). There were 12 additional Paint Horses found to have both SB1 and W20 (Table 4).
The gray (G) variant had a wide distribution, as it was detected in all breeds studied, except for Knabstrupper (n = 91) and Norwegian Fjord Horse (n = 121). The highest frequency for this allele was observed in Connemara Pony (a.f. = 0.31), whereas the lowest was found in Icelandic and Morgan Horses (a.f. = 0.0080). The lethal white overo (O) variant was identified in 422 samples across 11 breeds: Andalusian, Appaloosa, Arabian, Miniature Horse, Missouri Fox Trotter, Mustang, Paint Horse, Quarter Horse, Shetland Pony, Tennessee Walking Horse, and Thoroughbred. Estimated allele frequencies after relatedness filtering ranged from 0.0020 in the Quarter Horse and 0.0025 in Appaloosa to 0.030 in the Thoroughbred and 0.10 in the Paint Horse.  Table S4). The appaloosa pattern-1 (PATN1) variant, a modifier of LP that controls for a high amount of white patterning when inherited with LP [34], was identified in all 13 breeds where LP was found, as well as Connemara Pony  Table S4). Moreover, the estimated allele frequency of PATN1 was higher than that of LP in three breeds: Gypsy Vanner, Shetland Pony, and Welsh Pony (Table 5).  The splashed white phenotype is controlled by at least six different mutations in two genes. Consistent with previous studies [35], the MITF SW1 allele was the most widespread of these variants, identified in 599 horses from 13 breeds with homozygotes (SW1/SW1, n = 49) identified in six of these: Appaloosa (n = 1), Icelandic Horse (n = 5), Miniature Horse (n = 5), Morgan Horse (n = 4), Paint Horse (n = 11), and Quarter Horse (n = 23). Allele frequencies were highest in the Miniature Horse (a.f. = 0.10) and lowest in the Shetland Pony (a.f. = 0.0070) ( Table 6).  Only the combinations of splashed white variants detected in the dataset are presented herein.
The PAX3 splashed white 2 (SW2) variant was found in 274 horses from only two breeds: Paint Horse (n = 66) and Quarter Horse (n = 208) ( Table 6). A total of three Paint Horses and six Quarter Horses were identified as homozygous for the SW2 variant. Photographic records were available for six Quarter Horses, with each displaying an all-white phenotype, although five of these also had other known white spotting alleles ( Figure 2).
After removing related individuals, the frequency was low in both breeds (a.f. = 0.0078 for Paint Horse and a.f. = 0.0083 for Quarter Horse, Table 6). No homozygotes for SW3, SW4, SW5, or SW6 were identified in this dataset. The PAX3 splashed white 4 (SW4) allele was not identified in any horses in this study. The MITF SW3 allele was identified in five Paint Horses and seven Quarter Horses, with a very low frequency after filtering for relatedness in each breed (a.f. ≤ 0.0010). Finally, SW5 and SW6 were restricted to Paint Horses (with one exception for SW6, a double registered AQHA/APHA Quarter Horse), with allele frequencies of 0.0020 and 0.00056, respectively. One Paint Horse SW1/SW6 compound heterozygote was identified. The only other combinations of splashed white variants identified in the dataset involved SW1 and SW2 in Paint Horses and Quarter Horses (Table 6, Figure 2). After removing related individuals, the frequency was low in both breeds (a.f. = 0.0078 for Paint Horse and a.f. = 0.0083 for Quarter Horse, Table 6). No homozygotes for SW3, SW4, SW5, or SW6 were identified in this dataset. The PAX3 splashed white 4 (SW4) allele was not identified in any horses in this study. The MITF SW3 allele was identified in five Paint Horses and seven Quarter Horses, with a very low frequency after filtering for relatedness in each breed (a.f. ≤ 0.0010). Finally, SW5 and SW6 were restricted to Paint Horses (with one exception for SW6, a double registered AQHA/APHA Quarter Horse), with allele frequencies of 0.0020 and 0.00056, respectively. One Paint Horse SW1/SW6 compound heterozygote was identified. The only other combinations of splashed white variants identified in the dataset involved SW1 and SW2 in Paint Horses and Quarter Horses (Table 6, Figure 2).

Discussion
The base coat color variants for chestnut (e, in the MC1R gene) and black (a, located in ASIP) were identified in all 28 breeds analyzed in this study (Supplementary Table S3). The a allele was found to be nearly fixed in the Percheron (a.f. = 0.99), a breed highly selected for black and gray coat phenotypes. The MC1R allele e a [6], previously reported in chestnut horses in Black Forest, Hungarian Coldblood, and Haflinger breeds [6,7], was identified for the first time here in Paint Horse, Percheron, and Quarter Horse. Moreover, since several Knabstruppers with the e a allele were detected in this study, we were able to report, for the first time, an estimated allele frequency for e a in this breed (a.f. = 0.035). Knabstruppers are selectively bred for leopard complex spotting patterns, and a particular base color is often desired. Therefore, genotyping that allows for the accurate detection of horses with the e a allele is important for marker-assisted selection. No homozygous e a /e a

Discussion
The base coat color variants for chestnut (e, in the MC1R gene) and black (a, located in ASIP) were identified in all 28 breeds analyzed in this study (Supplementary Table S3). The a allele was found to be nearly fixed in the Percheron (a.f. = 0.99), a breed highly selected for black and gray coat phenotypes. The MC1R allele e a [6], previously reported in chestnut horses in Black Forest, Hungarian Coldblood, and Haflinger breeds [6,7], was identified for the first time here in Paint Horse, Percheron, and Quarter Horse. Moreover, since several Knabstruppers with the e a allele were detected in this study, we were able to report, for the first time, an estimated allele frequency for e a in this breed (a.f. = 0.035). Knabstruppers are selectively bred for leopard complex spotting patterns, and a particular base color is often desired. Therefore, genotyping that allows for the accurate detection of horses with the e a allele is important for marker-assisted selection. No homozygous e a /e a individuals were identified in this study, although they have been reported in the Black Forest breed [6].
Dilution alleles, on the other hand, were found to be restricted in their breed distribution and show relatively low allele frequencies, with the exception of Cr. Nonetheless, some interesting observations can be made. Concerning the Ch allele, it was found at a low frequency in eight breeds, but interestingly, the breed in which the variant was first discovered, Tennessee Walking Horse [13], is among those breeds with the lowest estimated frequency (a.f. = 0.0080). Conversely, the breed with the highest frequency, Rocky Mountain Horse (a.f. = 0.026), has not previously been reported to have this variant. No known adverse health effects have been reported to be associated with champagne dilution. Additionally, given the high allele frequency of the silver variant in Rocky Mountain Horses (a.f. = 0.32) and its association with MCOA, the use of marker-assisted selection for silver heterozygosity and champagne may yield breed-desirable coat color dilutions with fewer potential ocular issues. Horses homozygous for the PMEL silver mutation (Z/Z) are reported to have a more severe form of disease that can impair vison or cause blindness, whereas heterozygotes (Z/N) are often reported to have a cyst-only phenotype. The Z allele was identified in 15 other breeds, and in addition to the Rocky Mountain Horse, it was previously reported in Miniature Horse, Missouri Fox Trotter, Icelandic Horse, Shetland Pony, and Morgan Horse. Here, we also identified the Z allele in Mustang, Tennessee Walking Horse, Gypsy Vanner, Welsh Pony, Dutch Warmblood, Gypsy Cob, Pony of the Americas, Lusitano, Quarter Horse, and Paint Horse for the first time (Supplementary  Table S3). Given the known pleiotropic effects of this variant, breeds in which it occurs are advised to utilize genetic testing for mate selection, as well as a tool to identify horses who should be examined by a veterinary ophthalmologist for MCOA.
In the case of Prl, while identified at a relatively low frequency in seven breeds, the highest frequencies were found in Iberian horses. Similar allele frequencies were estimated for the Andalusian and closely related Pura Raza Española breed (a.f. = 0.12 and 0.11, respectively), as well as the Lusitano (a.f. = 0.064) (Supplementary Table S3). To our knowledge, this is the first time that this variant has been reported in Lusitano and Pura Raza Española. Given that the highest frequency of Prl is in Iberian breeds, it is possible that it is undergoing positive selection because of the recent use of genetic testing and/or changes in studbook rules in the Pura Raza Española starting in 2002, which allowed the registration of additional color phenotypes other than black, bay, and gray. The Prl allele is recessive and is known to dilute only the base color when homozygous or when combined with the cream (Cr) allele. Iberian breeds were also among those with the highest allele frequencies for Cr, with the highest being in Lusitano (a.f. = 0.42), which again indicates selection for coat color dilution in these breeds. Given the relatively high frequency of Cr and Prl in these breeds, genetic testing for these variants can easily assist in the consistent production of these desirable phenotypes.
The Mu allele was originally reported in Shetland Ponies with an allele frequency of 0.12 (n = 177), and in that study, it was also identified in Miniature Horses at a low frequency (a.f. = 0.020; n = 129) [14]. In this study, the Mu allele was only identified in Shetland Pony, as genotyping data were only available for two Miniature Horses. Here, the Mu allele frequency in Shetland Pony after filtering for relatedness was estimated at 0.23 (n = 173). This was nearly double that previously reported by us [14]; however, this is unlikely to reflect a true increase in population frequency despite being a favorable trait in the breed. Since the discovery was reported in 2019, the Mu allele frequency estimated herein more likely represents a sampling bias for horses owned and tested by breeders who breed for this trait. Testing a larger randomized cohort is needed to investigate this further.
Concerning the dun dilution characterized by lightening of body hair and the presence of primitive markings, including a darker dorsal stripe, leg and/or shoulder stripes, and dark marks known as cobwebbing on the forehead, the Norwegian Fjord Horse was found to be fixed for the wild-type D allele (a.f. = 1.0, n = 121). To the best of our knowledge, this is the first documented molecular investigation of the frequency of this variant in the breed. However, it has been previously suggested that this allele was fixed, or nearly so, given the breed-defining dun phenotype characteristic of Norwegian Fjords [5]. The nd1 allele, which leads to the expression of primitive markings without dilution of the coat, was estimated to be at high frequency in Iberian horse breeds as well as in the Arabian: Pura Raza Española (a.f. = 0.79), Andalusian (a.f. = 0.78), Lusitano (a.f. = 0.57), and Arabian (a.f. = 0.68). These findings likely reflect the common ancestry of Iberian horse breeds, which were developed using Arabians brought to the Iberian Peninsula during the Muslim invasions of the 8th century. Given the frequency of nd1 in Iberian and Arabian breeds, investigating whether this variant contributes to the darker shade or countershading is worthy of further exploration and may help to better understand why the nd1 allele appears to be under positive selection.
Dominant white phenotypes and their associated alleles were originally named as such because they were believed to be lethal when homozygous. Consistent with this, no homozygotes for W5, W10, or W22 were identified in this study. However, due to their low allele frequencies and limited breed distribution (Supplementary Table S4), the probability of identifying homozygotes in the population is extremely low, regardless of lethality. Consistent with previous work [22,36], homozygotes were identified for the W20 allele in the 21 breeds included in this study. These breeds represent a variety of coat color selection preferences, ranging from horses with breed-defining phenotypes that include white patterning, such as the Paint Horse (a.f. = 0.21) and the Appaloosa (a.f. = 0.18), to breeds that prohibit excessive white markings, such as the Hanoverian (a.f. = 0.24). The role of W20 in pigmentation across breeds is not well understood. Previous research supports that W20 (p.Arg682His) leads only to a minor reduction in KIT function [36]. However, it has been documented that W20, in combination with other dominant white alleles, causes higher amounts of white patterning [22,36]. Based on our findings, compound heterozygotes for W10/W20 (one Quarter Horse) and W20/W22 (three Quarter Horses) are rare ( Table 2). It is important to note here that the W22 variant occurs on the sequence background of W20 [22]. A detailed investigation of the phenotypic effects of W20 in combination with other whitepatterning alleles has not yet been performed. Here, we identified 93 horses from nine breeds (Appaloosa, Gypsy Cob, Gypsy Vanner, Miniature Horse, Missouri Fox Trotter, Paint Horse, Shetland Pony, Tennessee Walking Horse, and Welsh Pony) with both the TO allele and W20 (Table 4). Thirty-three horses across nine breeds (Gypsy Cob, Gypsy Vanner, Miniature Horse, Missouri Fox Trotter, Mustang, Paint Horse, Quarter Horse, Shetland Pony, and Tennessee Walking Horse) had SB1 and W20 (Table 4, Supplementary Table S3). Photographic records were not available for most of the horses in this study but given the occurrence of these combinations of KIT mutations across several breeds, a formal investigation of their potential additive effects on white pattering is warranted. A recent study of American Paint Horses showed that, in the absence of any other known white patterning variants, W20 was associated with white spotting phenotypes defined by the American Paint Horse Association (APHA) [37]. However, the possibility that undiscovered variants contribute to the phenotypes, to our knowledge, has not yet been explored. The widespread across-breed distribution of W20, combined with high estimated allele frequencies in breeds not typically selected for white patterning in this study, provides further justification for evaluating the functional role of this allele in pigmentation across breeds. This knowledge will be essential in order to develop breed-specific recommendations for utilizing W20 genotypes to maximize breeding potential for desired coat color phenotypes.
Some of the splashed white variants were predicted to be homozygous lethal, and in this study, we only identified homozygotes for SW1 (n = 49) and SW2 (n = 9) ( Table 6). Previous studies have shown that SW1 homozygotes are viable and have an all-white or nearly all-white phenotype [35]. Two homozygous SW2/SW2 horses have been reported in the literature, with only one having a documented phenotype [37]. Having confirmed genotypes of nine SW2 homozygotes across two breeds (Paint Horse and Quarter Horse)-the most reported to date-and in evaluating photographic records of six of these horses, we confirm that this genotype is not lethal. All six SW2/SW2 horses had an all-white phenotype, but five of the six also had other white spotting variants ( Figure 2). Thus, further evaluation of SW2 homozygotes without other known white pattern alleles will substantiate the hypothesis that this genotype alone produces an all-white phenotype. One owner reported that their SW2/SW2 horse was deaf, but this remains to be clinically evaluated in this individual, as well as in other SW2 homozygotes.
Another rare non-lethal white patterning variant is SB1. The sabino 1 phenotype is described as extensive face and leg markings, along with white in the belly and roaning in the flanks [32]. The causal KIT variant was first reported in the Tennessee Walking Horse, American Miniature Horse, Paint Horse, Azteca, Missouri Foxtrotter, Shetland Pony, and Spanish Mustang [32]. A subsequent study investigated this variant in 899 horses across eight breeds and identified SB1 in three additional breeds (Haflinger, Noriker, and Lippizan), but no homozygotes were observed [38]. Here, we report the occurrence of the SB1 allele in Gypsy Cob, Gypsy Vanner, Pony of the Americas, and Quarter Horse for the first time. Furthermore, we identified the largest number of SB1 homozygotes to date, with six SB1/SB1 individuals across three breeds (Paint Horse, Shetland Pony, and Tennessee Walking Horse). Photographic records were available for four of these horses, and consistent with a previous report [37], they have an all-white phenotype (Figure 1). Given that SB1 and SW2 homozygotes are viable but rare, genetic testing can aid in the identification of these rare mates so that 100% of offspring have a white patterning allele.
Concerning leopard complex spotting, we report the first molecular detection and allele frequency of LP and PATN1 in breeds outside of those used to discover mutations [26,34] (Table 5). Furthermore, in those breeds that are selected for LP, this is the first time that the frequency of the modifying locus PATN1 has been concurrently evaluated. Consistent with photographic records and breeding schemes, the allele frequency for PATN1 is higher in the Knabstrupper than in Appaloosa, Pony of the Americas, or Miniature Horse. Knowing the frequency in these breeds can help guide selection strategies away from homozygosity for LP, which causes congenital stationary night blindness (CSNB) and an increased risk of insidious uveitis (ERU) [26,39,40]. Horses homozygous for LP are night blind due to premature polyadenylation of TRPM1, which in turn is predicted to lead to no functional TRPM1 protein in the bipolar cells of the retina. In the case of ERU, horses homozygous for LP are at a higher risk of this disease, but the biological mechanism for this increased risk remains unknown [39,40]. Genotyping for LP in breeds where the allele is present can help breeders select for LP heterozygotes, which, when in combination with the PATN1 variant, display the desirable leopard pattern without being afflicted with night-blindness [26]. Interestingly, PATN1 was identified in three breeds where no LP horses were detected (Connemara Pony, Missouri Fox Trotter, and Rocky Mountain Horse); moreover, PATN1 allele frequency was estimated to be higher than that of LP in three breeds: Gypsy Vanner, Shetland Pony, and Welsh Pony (Table 5). It is currently unknown whether PATN1 acts as a modifier for other white patterning loci; therefore, investigating white pattern levels in these breeds where PATN1 but not LP was identified, specifically in horses with other white patterning alleles, is warranted.
In conclusion, these data represent the largest study to date investigating pigmentation variants in horses across a large number of breeds. We reported estimated allele frequencies for base coat color, dilution, and white patterning genes in 28 breeds. While we restricted our dataset to breeds with 30 or more samples, a limitation of the study is that these samples were originally submitted for coat color testing. Therefore, it is possible that in those breeds with fewer individuals evaluated, estimates may be biased upwards, as noted above, for the mushroom variant. We identified the presence of several variants in breeds not previously reported. Additionally, we identified and reported nine SW2/SW2 homozygotes, the most to date, and confirmed that this genotype is not lethal. Here, we also report the largest number of SB1 homozygotes to date (n = 6), with photographic records for most to corroborate its effect on phenotype. Understanding breed distribution and allele frequencies can help guide recommendations on and utilization of genetic testing for marker-assisted selection. Furthermore, these findings will help guide future hypothesis-driven studies toward a better functional understanding of these pigmentation variants in and across breeds.
Supplementary Materials: The following supporting information can be downloaded at https: //www.mdpi.com/article/10.3390/genes13091641/s1, Table S1: Number of individuals tested per breed before and after filtering for relatedness. Table S2: Genes, reported alleles, and variant descriptions for the alleles investigated in this study. Table S3: Genotype counts for eact coat color locus analyzed in this study across breeds. Table S4: Allele frequencies for each coat color locus analyzed in this study after filtering for relatedness across breeds. References [41,42] are cited in the supplementary materials.