Next Article in Journal
Impact of Nutrients on the Hoof Health in Cattle
Previous Article in Journal
Effect of Breed on the Level of the Nutritional and Health-Promoting Quality of Semimembranosus Muscle in Purebred and Crossbred Bulls
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Detection of Selection Signatures Underlying Production and Adaptive Traits Based on Whole-Genome Sequencing of Six Donkey Populations

Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Xianyang 712100, China
*
Author to whom correspondence should be addressed.
Animals 2020, 10(10), 1823; https://doi.org/10.3390/ani10101823
Submission received: 27 August 2020 / Revised: 1 October 2020 / Accepted: 3 October 2020 / Published: 7 October 2020
(This article belongs to the Section Animal Genetics and Genomics)

Abstract

Simple Summary

After a long period of artificial selection, the donkey now presents a variety of body types and production performance values. In this experiment, we performed selective signal scanning on the second-generation resequencing data of six different varieties. The regions and candidate genes related to artificial selection were identified to provide reference for future breeding.

Abstract

Donkeys (Equus asinus) are an important farm animal. After long-term natural and artificial selection, donkeys now exhibit a variety of body sizes and production performance values. In this study, six donkey breeds, representing different regions and phenotypes, were used for second-generation resequencing. The sequencing results revealed more than seven million single nucleotide variants (SNVs), with an average of more than four million SNVs per species. We combined two methods, Z-transformed heterozygosity (ZHp) and unbiased estimates of pairwise fixation index (di) values, to analyze the signatures of selection. We mapped 11 selected regions and identified genes associated with coat color, body size, motion capacity, and high-altitude adaptation. These candidate genes included staining (ASIP and KITLG), body type (ACSL4, BCOR, CDKL5, LCOR, NCAPG, and TBX3), exercise (GABPA), and adaptation to low-oxygen environments (GLDC and HBB). We also analyzed the SNVs of the breed-specific genes for their potential functions and found that there are three varieties in the conserved regions with breed-specific mutation sites. Our results provide data to support the establishment of the donkey SNV chip and reference information for the utilization of the genetic resources of Chinese domestic donkeys.

1. Introduction

The donkey (Equus asinus) is an important livestock species used for farming that originates from about 5000 years ago. Therefore, the donkey has been a witness to the progress of human history and civilization. However, with the increasing use of modern agricultural machinery and efficient transportation, the value of the draft donkey has gradually declined, but the value of donkey meat and milk has gradually increased. Factors such as diverse ecological changes, human migration, and socio-economic impacts have influenced the phenotypic characteristics of donkeys; there are now 157 registered donkey varieties. The research of selection signature plays an important role in animal breeding and population evolution. As the cost of single nucleotide polymorphism arrays (e.g., SNV (single nucleotide variant) chips) and high-throughput sequencing continues to decrease, research on the selection signals of domesticated animals is becoming more common. For example, selection signal screens have been conducted in domesticated pigs [1,2,3], chickens [4], cattle [5,6,7], sheep [8,9], goats [10,11,12], and dogs [13]. Several statistical methods have also been developed to detect the genomic regions related to selection in domesticated animals, such as overall low heterozygosity [3,4], genetic diversity patterns [14], haplotype homozygosity [15], and integrated haplotype score (|iHS|) [16]. However, there have been few studies on the selection signals of Chinese domestic donkeys.
The purpose of this study was to combine whole-genome resequencing technology with selective scavenging analysis to study the selected regions and genes unique to six Chinese donkey breeds, as well as to reveal the molecular mechanisms of their germplasm formation. The selection signatures were investigated by the pool sequencing of six distinct donkey populations (Table S1), including a black coated breed (Dezhou donkey), a highland breed (Qinghai donkey), two large donkey breeds (Hetian Qing donkey (Guola donkey) and Guanzhong donkey), and two donkey breeds with a small body size (Kulun donkey and Xinjiang donkey) (Figure 1a,b). The obtained SNVs were analyzed by calculating the Z-transformed heterozygosity (ZHp) and di (unbiased estimates of pairwise fixation index) values. Our results contribute to a better understanding of the genomic features donkey selection and reveal the genomic regions and genes controlling the production and adaptation of traits.

2. Materials and Methods

2.1. Ethics Approval

We performed the experiment according to the Regulations of the Administration of Affairs Concerning Experimental Animals (Ministry of Science and Technology, Beijing, China, 2004), and our research was approved by the Institutional Animal Care and Use Committee of Northwest A&F University.

2.2. Animals and Whole Genome Sequencing

Approximately 10 animals from each breed were combined into a pool for high-throughput resequencing (Table S1). According to the improved phenol–chloroform method [17], the genomic DNA of 57 individuals, representing 6 breeds of donkeys (Dezhou donkey = 9; Qinghai donkey = 9; Guola donkey = 10; Guanzhong donkey = 10; Kulun donkey = 10; and Xinjiang donkey = 9) were extracted from donkeys’ blood. These donkeys are not related to each other up to three generations. Six paired-end libraries (insert sizes: 180, 300, and 500 bp) were constructed and sequenced on the Illumina HiSeq Xten platform (2×150 bp) at Novogene (https://www.ncbi.nlm.nih.gov/biosample/13284254, https://www.ncbi.nlm.nih.gov/biosample/13284255, https://www.ncbi.nlm.nih.gov/biosample/13284256, https://www.ncbi.nlm.nih.gov/biosample/13284257, https://www.ncbi.nlm.nih.gov/biosample/13284258, and https://www.ncbi.nlm.nih.gov/biosample/13284259). Library preparation and sequencing followed the manufacturer’s instructions. Variant detection was conducted with the Genome Analysis Toolkit (GATK-3.8, https://github.com/broadinstitute/gatk/). To filter the SNVs for flowing analysis, we required at least three reads with different start sites supporting the non-reference allele.

2.3. Reads Alignment and Variations Calling

Reads were aligned to the donkey reference genome ASM130575v1 (https://www.ncbi.nlm.nih.gov/assembly/GCF_001305755.1/) [18] using BWA (0.7.13-r1126 version), then SAMtools (https://github.com/samtools/samtools/releases/) was used to sort the generated BAM files, and these were finally followed by duplicate removal using Picard-Tools-2.16.0 (http://broadinstitute.github.io/picard/). The Genome Analysis Toolkit (GATK-3.8) [19] was utilized to perform local realignment around existing indels and basic quality score recalibration. To ensure the accuracy of the SNV information, strict detection conditions were set: (1) there was only one mutation type, (2) the minimum quality value was ≥ 30, and (3) the minimum coverage was ≥ 68. The SNVs were formatted and functionally annotated using the SnpEff software and the gff (general feature format) annotation file of the donkey reference genome. Through the vcf (variant call format) file obtained in the previous step, principal component analysis (PCA) was performed with the PLINK and smartPCA software to evaluate the genetic relationships between breeds. The minor allelic frequency (MAF) was calculated using the VCFTOOLS software, and the distribution of the MAF was viewed to determine the quality of the SNP.

2.4. Detection of Selected Loci

To identify the regions under selection, the Z-transformed heterozygosity (ZHp) approach was utilized, as previously described [1,2,3,4]. To calculate the results, the PLINK software was used. Briefly, in an overlapping sliding window, Hp was calculated as:
F s t = M S P M S G M S P + ( n 1 ) M S G
where Σnmaj is the sum of major allele frequencies and ΣnMin is the sum of the MAF within each window. Individual Hp values were Z-transformed as:
Z H p = ( H p μ H p ) σ H p
where μHp is the overall average heterozygosity and σHp is the standard deviation for all windows in each subpopulation. To detect putative selected loci in this study, we first calculated the pooled heterozygosity (Hp) and its Z-transformations, ZHp, in sliding 50 KB windows along the scaffolds [20,21]. Putatively selected loci were defined as genetic regions in overlapping windows with extremely low ZHp values (<−7) and extremely high di values (top 1%).

2.5. Bioinformatics Analysis of Breed Specific SNVs

After obtaining the selected genes, we identified breed-specific genes and analyzed the genes and their SNVs within 1000 bp upstream and downstream. We selected and downloaded the gene sequences of 11 mammals (Bos taurus, Ovis aries, Capra hircus, camels, Felis catus, Canis lupus familiaris, Equus caballus, Homo sapiens, Sus scrofa, whales, and dolphins) and the donkey. We determined the conserved regions of these genes by sequence alignment, and we analyzed the SNVs in these regions to predict breed-specific SNVs that may have certain functions.

3. Results

3.1. Genome Resequencing of Six Donkey Breeds

We utilized six genetically and phenotypically diverse domestic donkey breeds to systematically analyze the signatures of selection (Figure 1b; Table S1). Whole-genome sequencing (WGS) [18] was performed via Chicago HiRise [22] on an Illumina HiSeq Xten platform using pooled DNA from each breed. Genome sequencing yielded a total of 202 GB of raw data and produced 205–246 million sequence reads per breed (Table 1).
Over 99.8% of the generated sequence reads mapped to approximately 98.22% (98.18–98.38%) of the newly annotated donkey reference genome (ASM130575v1), thus indicating high-quality sequencing results. The sequence coverage averaged 11.5× per breed within a range of 10- to 12-fold. single nucleotide variants (SNVs) varied from 4.3 to 4.6 million in each population (Table 2).
PLINK and smartPCA were used to perform the principal component analysis (PCA) on the SNVs. The PCA results (shown in Figure 1c) classified the six donkey breeds according to their genetic differentiation. The Guanzhong donkey and Guola donkey were clustered together and close to the Kulun donkey, while the other three breeds were separate. This clustering pattern was consistent with the findings of a prior study on the genetic diversity of donkey breeds in China [23].

3.2. Identification of Coding SNVs and Short Insertions/Deletions

After the data were filtered, there were more than four million confirmed SNVs in each breed. Around 88–93% of the SNVs were heterozygous, and the Guanzhong, Guola, and Kulun breeds had the largest frequency of heterozygous SNVs. This suggests that these three breeds have not undergone recent intensive selection. The SNVs were rarely located in coding regions (~1.4%). Furthermore, non-synonymous and synonymous variants were identified in the donkey genome (Table 2); there were more synonymous variants than non-synonymous substitutions. The frequency of non-synonymous SNVs in each breed was stable (~47.1%; Figure S1) and is close to the frequency of non-synonymous SNVs in the horse [24]. SNVs of large impact were also identified in the donkey genome (premature stop codons, start codon to non-start codon, stop codon to non-stop, and splice site; Figure S2).
We observed the distribution of minor allelic frequency (MAF) with 10 continued classes from 0–0.05 to 0.45–0.50 for each breed (Figure 1d). The largest group of SNVs had an MAF in the range of 0.25–0.30 (16–18%), whereas the proportion of rare alleles (MAF < 0.05) accounted for <0.5% of the total SNVs (most rare alleles were removed during the SNV calling process).

3.3. Identification of Selected Loci and Candidate Genes

The ZHp value was calculated in sliding 50 KB windows along the scaffolds of sequence reads corresponding to the most and least frequently observed alleles at all SNV positions [3,4]. A total of 234,063 windows was obtained, and the ZHp and di values of each group’s sliding window were calculated (Figure 2 and Figure 3). Only the windows with more than 10 SNVs were used in the analyses. The selected regions and genes are marked in the figure. To evaluate population differentiation among specific breeds, the genetic differentiation between every pairwise comparison was measured using the fixation index (Fst), and di values were calculated [25]. Putatively selected loci were defined as genetic regions in overlapping windows with extremely low ZHp values (<−7) and extremely high di values (top 1%). Two selection signal analysis methods were used to determine the overlap region and find the candidate genes in the ZHp value upstream and di value downstream regions. The number of candidate genes identified in Dezhou, Quola, Qinghai, and Xinjiang donkeys was 4, 3, 5, and 3, respectively (Table 3).

4. Discussion

Pool sequencing is an efficient, low-time, and low-cost method for quickly and efficiently finding candidate gene regions associated with target traits using bulk segregant analysis (BSA). At present, many articles similar to our research have also chosen this method [22,26,27].

4.1. Coat Color

A total of 18 genomic regions were found within the top 1% (di > 7), and a list of candidate genes was generated (KITLG is also included in the list). By analyzing the heterozygosity of the Xinjiang breed, 26 distinct loci were identified (ZHp < −7), including the well-known coat color gene KITLG (Figure 2a; Tables S1 and S2). KITLG, which encodes for the ligand of c-Kit, plays a role in the melanocyte production pathway. KITLG variants cause different coat color phenotypes such as roan. The migration and survival of melanocytes derived from neural crest cells depend on the normal expression of the KIT gene and the presence of the mast cell growth factor/stem cell growth factor (MGF/SGF). If KIT is mutated or the SGF is absent, the melanocytes will not survive, and the resulting lack of melanocytes and melanin particles will produce a white coat color [28,29]. Variants at KIT are known to be associated with depigmentation or leucism.
Four loci overlapped between the genetic regions with the lowest ZHp and highest di values. One of the overlapping regions contained the strongest candidate gene for coat color (ASIP) in the Dezhou breed (Table 3). The ASIP gene can affect the production of brown melanin in several ways, and it has been implicated as a strong candidate controlling the coat color patterns of horses and donkeys [26,30,31]. Related studies have also shown that the ASIP gene is associated with the coat color phenotype of Dezhou donkeys [32,33].

4.2. Body Size

Of the six donkey breeds, the body sizes of the Dezhou and Guola donkeys were the largest; the average weight of the Dezhou and Goula donkeys was more than 200 kg, but the bodyweight of the smallest variety, the Qinghai donkey, was found to be only 110 kg for males and 120 kg for females. Advances in agriculture have gradually decreased the value of donkeys as servants, but the consumption of meat has gradually increased. Thus, the development of genetic breeding programs for donkey meat is particularly important.
Six of the 11 selected genes identified in this study were related to body size, namely TBX3, NCAPG, LOCR, BCOR, CDKL5, and ACSL4. These selected genes have also been reported in other species. SNV comparisons of the Debao pony, the Yili horse, and the Mongolian horse revealed that TBX3 was significantly associated with body size, and ACSL4 and CDKL5 were associated with growth traits.

4.3. Motion Capacity and High-Altitude Adaptation

Of the five breeds, the Guola and Qinghai breeds are the major livestock species that serve as sources of transportation for Xinjiang and Qinghai inhabitants. Both breeds have a strong motion capacity even though the Qinghai breed lives in a mountainous environment. An analysis of the sequence heterozygosity and divergence of the breeds revealed overlapping regions with the lowest ZHp and highest di values (ZHp < −9 and di > 9). These regions contained the strongest candidate gene, GABPA, which has also been identified in horses. Schroder et al. performed a genome-wide association study (GWAS) with the jumping ability of Hanover’s warm-blooded horse and found an association with the sport-related gene GABPA [32].
The Qinghai donkey is well-adapted to high altitudes between 3000 and 5000 m. In the genome of the Qinghai donkey, 31 loci were identified as selected regions based on the calculated ZHp values (Figure 3b; Tables S1 and S2), and a total of 24 regions were within the top 1% of the distribution (di > 7). We identified two regions that exhibited the strongest selection signature by an overlap of the genomic regions with high di and low ZHp values. Two genes within these regions were previously implicated in the adaptation of Tibetan dwellers. The HBB gene is associated with the high-altitude adaptation of Tibetan people [33], and the GLDC gene in Tibetan pigs represents a genetic adaptation for the low oxygen concentrations of high-elevation environments [34].
We further analyzed the functional importance of the SNVs in the seven breed-specific selected genes (ASIP, CDKL5, GLDC, HBB, KITLG, ACSL4, and NCAPG). The breed-specific SNVs in four of these genes (GLDC, ASIP, CDKL5, and ACSL4) are in evolutionarily conserved regions (Figure 4a–d), indicating that these SNVs may have certain functions. Consistent with a previous finding in goat domestication [27], none of these conserved SNVs were located in coding regions that lead to amino acid exchanges. This indicates that the genetic basis of donkey production and adaptive traits is complex and likely involves regulatory variants.

5. Conclusions

In this experiment, ZHp and di values were used to determine the genetic regions under selection in five domestic donkey breeds. Overlapping areas were identified and annotated, revealing a total of 11 selected genes. These genes are involved in hair color, body size, motion capacity, and adaptations for low-oxygen environments. Through the analysis of the mutation sites of each selected gene, possible functional breed-specific SNVs were also identified and revealed potential mechanisms of differentiation among several cultivars of Chinese domestic donkeys. In the age of large volume data from genomics and bioinformatics, the technology of whole-genome sequencing provides a new method to characterize the domestic donkey and provides a reference for future breeding practices.

Supplementary Materials

The following are available online at https://www.mdpi.com/2076-2615/10/10/1823/s1, Table S1: Pooled sample information for resequencing data from donkeys. Donkey DNA pools represent six separate breeds. The number of individuals from each breed (n) and the average sequence and assembly coverage per pool are indicated. Table S2: The number of candidate regions and genes identified by ZHp and di in different breeds. Figure S1: The number of synonymous and non-synonymous mutations in each donkey breed. DZ: Dezhou; GL: Guola; GZ: Guanzhong; KL: Kulun; QH: Qinghai; and XJ: Xinjiang. Figure S2: The number of SNVs with large effects (premature stop, stop codon to nonstop codon, start codon to non-start codon, and splicing sites) in each donkey breed. DZ: Dezhou; GL: Guola; GZ: Guanzhong; KL: Kulun; QH: Qinghai; and XJ: Xinjiang.

Author Contributions

Conceptualization, Z.Z.; methodology, Z.L. and Y.F.; software, G.W.; validation, F.W., formal analysis, Z.Z.; investigation, G.W; resources, C.L. and R.D.; data curation, R.D.; writing—original draft preparation, Y.G.; writing—review and editing, R.D.; supervision, C.L.; project administration, R.D. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Key Research & Development Plan of Shaanxi Province of China (2020NY-002), Dong-E-E-Jiao Co. Ltd. (20191001), and National Natural Science Foundation of China (81770514).

Acknowledgments

We would like to thank Editage (www.editage.com) for English language editing.

Conflicts of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Abbreviations

SNVssingle nucleotide variants
diunbiased estimates of pairwise fixation index
ZHpZ-transformed heterozygosity
WGSWhole-genome sequencing
PCAprincipal component analysis
MAFminor allelic frequency
Fstfixation index

References

  1. Diao, S.-Q.; Luo, Y.-Y.; Ma, Y.-L.; Deng, X.; He, Y.-T.; Gao, N.; Zhang, H.; Li, J.-Q.; Chen, Z.-M.; Zhang, Z. Genome-wide detection of selective signatures in a Duroc pig population. J. Integr. Agric. 2018, 17, 2528–2535. [Google Scholar] [CrossRef]
  2. Ma, Y.; Zhang, S.; Zhang, K.; Fang, C.; Xie, S.; Du, X.; Li, X.; Ni, D.; Zhao, S. Genomic Analysis To Identify Signatures of Artificial Selection and Loci Associated with Important Economic Traits in Duroc Pigs. G3 (BethesdaMd.) 2018, 8, 3617–3625. [Google Scholar] [CrossRef]
  3. Rubin, C.-J.; Megens, H.-J.; Martinez Barrio, A.; Maqbool, K.; Sayyab, S.; Schwochow, D.; Wang, C.; Carlborg, Ö.; Jern, P.; Jørgensen, C.B.; et al. Strong signatures of selection in the domestic pig genome. Proc. Natl. Acad. Sci. USA 2012, 109, 19529–19536. [Google Scholar] [CrossRef]
  4. Rubin, C.-J.; Zody, M.C.; Eriksson, J.; Meadows, J.R.S.; Sherwood, E.; Webster, M.T.; Jiang, L.; Ingman, M.; Sharpe, T.; Ka, S.; et al. Whole-genome resequencing reveals loci under selection during chicken domestication. Nature 2010, 464, 587–591. [Google Scholar] [CrossRef]
  5. Cheruiyot, E.K.; Bett, R.C.; Amimo, J.O.; Zhang, Y.; Mrode, R.; Mujibi, F.D.N. Signatures of Selection in Admixed Dairy Cattle in Tanzania. Front. Genet. 2018, 9, 607. [Google Scholar] [CrossRef] [PubMed]
  6. Gurgul, A.; Jasielczuk, I.; Semik-Gurgul, E.; Szmatoła, T.; Majewska, A.; Sosin-Bzducha, E.; Bugno-Poniewierska, M. Diversifying selection signatures among divergently selected subpopulations of Polish Red cattle. J. Appl. Genet. 2019, 60, 87–95. [Google Scholar] [CrossRef] [PubMed]
  7. Pitt, D.; Bruford, M.W.; Barbato, M.; Orozco-terWengel, P.; Martínez, R.; Sevane, N. Demography and rapid local adaptation shape Creole cattle genome diversity in the tropics. Evol. Appl. 2019, 12, 105–122. [Google Scholar] [CrossRef] [PubMed]
  8. Ahbara, A.; Bahbahani, H.; Almathen, F.; Al Abri, M.; Agoub, M.O.; Abeba, A.; Kebede, A.; Musa, H.H.; Mastrangelo, S.; Pilla, F.; et al. Genome-Wide Variation, Candidate Regions and Genes Associated With Fat Deposition and Tail Morphology in Ethiopian Indigenous Sheep. Front. Genet. 2018, 9, 699. [Google Scholar] [CrossRef]
  9. Lv, F.-H.; Agha, S.; Kantanen, J.; Colli, L.; Stucki, S.; Kijas, J.W.; Joost, S.; Li, M.-H.; Ajmone Marsan, P. Adaptations to climate-mediated selective pressures in sheep. Mol. Biol. Evol. 2014, 31, 3324–3343. [Google Scholar] [CrossRef]
  10. Bertolini, F.; Servin, B.; Talenti, A.; Rochat, E.; Kim, E.S.; Oget, C.; Palhière, I.; Crisà, A.; Catillo, G.; Steri, R.; et al. Signatures of selection and environmental adaptation across the goat genome post-domestication. Genet. Sel. Evol. Gse 2018, 50, 57. [Google Scholar] [CrossRef]
  11. Dong, Y.; Zhang, X.; Xie, M.; Arefnezhad, B.; Wang, Z.; Wang, W.; Feng, S.; Huang, G.; Guan, R.; Shen, W.; et al. Reference genome of wild goat (capra aegagrus) and sequencing of goat breeds provide insight into genic basis of goat domestication. BMC Genom. 2015, 16, 431. [Google Scholar] [CrossRef] [PubMed]
  12. Oget, C.; Servin, B.; Palhière, I. Genetic diversity analysis of French goat populations reveals selective sweeps involved in their differentiation. Anim. Genet. 2019, 50, 54–63. [Google Scholar] [CrossRef] [PubMed]
  13. Axelsson, E.; Ratnakumar, A.; Arendt, M.-L.; Maqbool, K.; Webster, M.T.; Perloski, M.; Liberg, O.; Arnemo, J.M.; Hedhammar, A.; Lindblad-Toh, K. The genomic signature of dog domestication reveals adaptation to a starch-rich diet. Nature 2013, 495, 360–364. [Google Scholar] [CrossRef] [PubMed]
  14. Wiener, P.; Pong-Wong, R. A regression-based approach to selection mapping. J. Hered. 2011, 102, 294–305. [Google Scholar] [CrossRef]
  15. Depaulis, F.; Veuille, M. Neutrality tests based on the distribution of haplotypes under an infinite-site model. Mol. Biol. Evol. 1998, 15, 1788–1790. [Google Scholar] [CrossRef][Green Version]
  16. Voight, B.F.; Kudaravalli, S.; Wen, X.; Pritchard, J.K. A map of recent positive selection in the human genome. PLoS Biol. 2006, 4, e72. [Google Scholar]
  17. Sambrook, P.N.; Champion, G.D.; Browne, C.D.; Cairns, D.; Cohen, M.L.; Day, R.O.; Graham, S.; Handel, M.; Jaworski, R.; Kempler, S. Corticosteroid injection for osteoarthritis of the knee: Peripatellar compared to intra-articular route. Clin. Exp. Rheumatol. 1989, 7, 609–613. [Google Scholar]
  18. Huang, J.; Zhao, Y.; Bai, D.; Shiraigol, W.; Li, B.; Yang, L.; Wu, J.; Bao, W.; Ren, X.; Jin, B.; et al. Donkey genome and insight into the imprinting of fast karyotype evolution. Sci. Rep. 2015, 5, 14106. [Google Scholar] [CrossRef][Green Version]
  19. McKenna, A.; Hanna, M.; Banks, E.; Sivachenko, A.; Cibulskis, K.; Kernytsky, A.; Garimella, K.; Altshuler, D.; Gabriel, S.; Daly, M.; et al. The Genome Analysis Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010, 20, 1297–1303. [Google Scholar] [CrossRef]
  20. Danecek, P.; Auton, A.; Abecasis, G.; Albers, C.A.; Banks, E.; DePristo, M.A.; Handsaker, R.E.; Lunter, G.; Marth, G.T.; Sherry, S.T.; et al. The variant call format and VCFtools. Bioinformatics 2011, 27, 2156–2158. [Google Scholar] [CrossRef]
  21. Sun, T.; Shen, J.; Achilli, A.; Chen, N.; Chen, Q.; Dang, R.; Zheng, Z.; Zhang, H.; Zhang, X.; Wang, S.; et al. Genomic analyses reveal distinct genetic architectures and selective pressures in buffaloes. GigaScience 2020, 9. [Google Scholar] [CrossRef] [PubMed]
  22. Renaud, G.; Petersen, B.; Seguin-Orlando, A.; Bertelsen, M.F.; Waller, A.; Newton, R.; Paillot, R.; Bryant, N.; Vaudin, M.; Librado, P.; et al. Improved de novo genomic assembly for the domestic donkey. Sci. Adv. 2018, 4, eaaq0392. [Google Scholar] [CrossRef] [PubMed]
  23. Lu, C.-J.; Xie, W.-M.; Su, R.; Ge, Q.-L.; Chen, H.; Shen, S.-Y.; Lei, C.-Z. African origin of Chinese domestic donkeys. Yi Chuan Hered. 2008, 30, 324–328. [Google Scholar] [CrossRef] [PubMed]
  24. Cheng, Z.; Pan, N.; Ishfaq, A.H.; Gemingguli, M.; Baizilaitibei, A.; Gulibaheti, D.; Yaping, F.; Haiyang, W.; Rasool, A.A.; Changyi, X. Detecting the Population Structure and Scanning for Signatures of Selection in Horses (Equus caballus) From Whole-Genome Sequencing Data. Evol. Bioinform. Online 2018, 14, 117693431877510. [Google Scholar]
  25. Akey, J.M.; Ruhe, A.L.; Akey, D.T.; Wong, A.K.; Connelly, C.F.; Madeoy, J.; Nicholas, T.J.; Neff, M.W. Tracking footprints of artificial selection in the dog genome. Proc. Natl. Acad. Sci. USA 2010, 107, 1160–1165. [Google Scholar] [CrossRef]
  26. Rieder, S.; Taourit, S.; Mariat, D.; Langlois, B.; Guérin, G. Mutations in the agouti (ASIP), the extension (MC1R), and the brown (TYRP1) loci and their association to coat color phenotypes in horses (Equus caballus). Mamm. Genome Off. J. Int. Mamm. Genome Soc. 2001, 12, 450–455. [Google Scholar] [CrossRef]
  27. Wang, X.; Liu, J.; Zhou, G.; Guo, J.; Yan, H.; Niu, Y.; Li, Y.; Yuan, C.; Geng, R.; Lan, X.; et al. Whole-genome sequencing of eight goat populations for the detection of selection signatures underlying production and adaptive traits. Sci. Rep. 2016, 6, 38932. [Google Scholar] [CrossRef]
  28. Bai, D.-Y.; Yang, L.-H.; Unerhu, U.; Zhao, Y.-P.; Zhao, Q.-N.; Hasigaowa, H.; Dugarjaviin, M. Effects of Kit gene on coat depigmentation in white horses. Yi Chuan Hered. 2011, 33, 1171–1178. [Google Scholar] [CrossRef]
  29. Shi, K.-R.; Wang, A.-G.; Li, N.; Deng, X.-M. Effect study of white locus (I) on coat color inheritance in Chinese native pig breeds. Yi Chuan Xue Bao Acta Genet. Sin. 2005, 32, 275–281. [Google Scholar]
  30. Abitbol, M.; Legrand, R.; Tiret, L. A missense mutation in the agouti signaling protein gene (ASIP) is associated with the no light points coat phenotype in donkeys. Genet. Sel. Evol. Gse 2015, 47, 28. [Google Scholar] [CrossRef]
  31. Li, B.; He, X.-L.; Zhao, Y.-P.; Wang, X.-J.; Manglai, D.; Zhang, Y.-R. Molecular basis and applicability in equine color genetics. Yi Chuan Hered. 2010, 32, 1133–1140. [Google Scholar]
  32. Schröder, W.; Klostermann, A.; Stock, K.F.; Distl, O. A genome-wide association study for quantitative trait loci of show-jumping in Hanoverian warmblood horses. Anim. Genet. 2012, 43, 392–400. [Google Scholar] [CrossRef] [PubMed]
  33. Simonson, T.S.; Yang, Y.; Huff, C.D.; Yun, H.; Qin, G.; Witherspoon, D.J.; Bai, Z.; Lorenzo, F.R.; Xing, J.; Jorde, L.B.; et al. Genetic evidence for high-altitude adaptation in Tibet. Science 2010, 329, 72–75. [Google Scholar] [CrossRef] [PubMed]
  34. Li, M.; Tian, S.; Jin, L.; Zhou, G.; Li, Y.; Zhang, Y.; Wang, T.; Yeung, C.K.L.; Chen, L.; Ma, J.; et al. Genomic analyses identify distinct patterns of selection in domesticated pigs and Tibetan wild boars. Nat. Genet. 2013, 45, 1431–1438. [Google Scholar] [CrossRef]
Figure 1. Summary of six donkey breeds. (a) The six donkey breeds included in this study. (b) Geographic map indicating the distribution of the donkeys sampled in this study. Each point represents the location of sampling. The map was generated using the ‘ggmap’ package in R (version 3.5.2) (R Core Team 2018). (c) Principal components analysis (PCA) of six donkey breeds using the PC1 and PC2 components. DZ: Dezhou; GL: Guola; GZ: Guanzhong; KL: Kulun; QH: Qinghai; and XJ: Xinjiang. (d) A schematic representation of minor allelic frequency (MAF) plotted as a function of distance for each donkey population.
Figure 1. Summary of six donkey breeds. (a) The six donkey breeds included in this study. (b) Geographic map indicating the distribution of the donkeys sampled in this study. Each point represents the location of sampling. The map was generated using the ‘ggmap’ package in R (version 3.5.2) (R Core Team 2018). (c) Principal components analysis (PCA) of six donkey breeds using the PC1 and PC2 components. DZ: Dezhou; GL: Guola; GZ: Guanzhong; KL: Kulun; QH: Qinghai; and XJ: Xinjiang. (d) A schematic representation of minor allelic frequency (MAF) plotted as a function of distance for each donkey population.
Animals 10 01823 g001
Figure 2. Overview of selective sweeps in the Dezhou and Xinjiang breeds plotted by Z-transformed heterozygosity (ZHp) and unbiased estimates of pairwise fixation index (di) values. (a) Dezhou donkey breed. (b) Xinjiang donkey. Functional genes are highlighted, and red and bold characters represent overlapping genes that were generated using the two methods. Absolute values of ZHp were used for plotting.
Figure 2. Overview of selective sweeps in the Dezhou and Xinjiang breeds plotted by Z-transformed heterozygosity (ZHp) and unbiased estimates of pairwise fixation index (di) values. (a) Dezhou donkey breed. (b) Xinjiang donkey. Functional genes are highlighted, and red and bold characters represent overlapping genes that were generated using the two methods. Absolute values of ZHp were used for plotting.
Animals 10 01823 g002
Figure 3. Overview of selective sweeps in the Qinghai, Guola, and Kulun donkeys plotted by ZHp and di values. (a) Qinghai breed. (b) Guola breed. (c) Kulun breed.
Figure 3. Overview of selective sweeps in the Qinghai, Guola, and Kulun donkeys plotted by ZHp and di values. (a) Qinghai breed. (b) Guola breed. (c) Kulun breed.
Animals 10 01823 g003
Figure 4. The conservation analyses identified candidate functional mutations that are specific to donkey breeds. Breed-specific SNVs within or close to the key selected genes. (a) GLDC, (b) ASIP, (c) CDKL5, and (d) ACSL4 were screened, and the SNVs localized to evolutionary conserved sites were retained. The locations and positions of the candidate SNVs are indicated.
Figure 4. The conservation analyses identified candidate functional mutations that are specific to donkey breeds. Breed-specific SNVs within or close to the key selected genes. (a) GLDC, (b) ASIP, (c) CDKL5, and (d) ACSL4 were screened, and the SNVs localized to evolutionary conserved sites were retained. The locations and positions of the candidate SNVs are indicated.
Animals 10 01823 g004
Table 1. Number of reads.
Table 1. Number of reads.
BreedRaw Data (G)Clean Data (G)Reads Number (M)Reads for Alignment (%)Sequence Coverage
DZD33.3033.19214.63214.38(99.88)11.23×
GLD31.7031.60210.92210.71(99.90)11.15×
GZD37.1236.99246.83246.57(99.89)12.88×
KLD33.8633.76225.34225.11(99.90)11.81×
QHD32.0931.95213.04212.82(99.90)11.18×
XJD34.0733.93205.54205.30(99.88)10.74×
Note: DZD, Dezhou donkey; GLD, Guola donkey; GZD, Guanzhong donkey; KLD, Kulun donkey; QHD Qinghai donkey; and XJD, Xinjiang donkey.
Table 2. Summary and annotation of single nucleotide variants (SNVs) and indels in the donkey genome.
Table 2. Summary and annotation of single nucleotide variants (SNVs) and indels in the donkey genome.
BreedNumber of AnimalsSNVs NumberSNVsCoding SNVs
HomozygoteHeterozygote (%)Nonsynonymous Mutation (%)Synonymous Mutation
DZD94,372,377490,6313,881,746(88.78)21,996(47.76)24,057
GLD104,548,452336,0324,212,420(92.61)23,036(46.55)26,449
GZD104,658,120320,4974,337,623(93.12)23,446(47.91)25,489
KLD104,607,547339,5694,267,978(92.63)22,821(47.51)25,213
QHD94,480,050390,0004,090,050(91.29)23,589(46.0)27,688
XJD94,455,917397,3064,058,611(91.08)22,963(46.96)25,937
Note: DZD, Dezhou donkey; GLD, Guola donkey; GZD, Guanzhong donkey; KLD, Kulun donkey; QHD Qinghai donkey; and XJD, Xinjiang donkey.
Table 3. Overlapping genes identified by both ZHp and di for different donkey breeds.
Table 3. Overlapping genes identified by both ZHp and di for different donkey breeds.
Candidate GeneScaffoldAnnotationZHpdi
Dezhou donkey
TBX3NW_014638181.1T-box 3−7.348.55
NCAPGNW_014637278.1non-SMC (spondylo-metaphyseal chondrodysplasia) condensin I complex subunit G−7.349.77
LCORNW_014638419.1ligand dependent nuclear receptor corepressor−7.349.60
ASIPNW_014638605.1agouti signaling protein−7.209.50
Guola donkey
BCORNW_014638463.1B-cell CLL/lymphoma 6 corepressor−9.909.89
GABPANW_014638188.1GA binding protein transcription factor subunit alpha−9.909.66
LCORNW_014638419.1ligand dependent nuclear receptor corepressor−9.909.54
Kulun donkey
BCORNW_014638463.1B-cell CLL/lymphoma 6 corepressor−9.989.95
TBX3NW_014638181.1T-box 3−9.989.89
LCORNW_014638419.1ligand dependent nuclear receptor corepressor−9.989.88
Qinghai donkey
GABPANW_014638188.1GA binding protein transcription factor subunit alpha−9.609.86
CDKL5NW_014638744.1cyclin dependent kinase like 5−9.609.85
LOC106830211(HBB)NW_014637421.1hemoglobin subunit beta−9.609.88
BCORNW_014638463.1B-cell CLL/lymphoma 6 corepressor−9.609.98
GLDCNW_014638041.1glycine decarboxylase−9.609.98
Xinjiang donkey
KITLGNW_014638016.1Receptor tyrosine kinase(KIT) ligand−9.609.95
LCORNW_014638419.1ligand dependent nuclear receptor corepressor−9.609.88
ACSL4NW_014638105.1acyl-CoA synthetase long chain family member 4−9.609.60

Share and Cite

MDPI and ACS Style

Zhou, Z.; Fan, Y.; Wang, G.; Lai, Z.; Gao, Y.; Wu, F.; Lei, C.; Dang, R. Detection of Selection Signatures Underlying Production and Adaptive Traits Based on Whole-Genome Sequencing of Six Donkey Populations. Animals 2020, 10, 1823. https://doi.org/10.3390/ani10101823

AMA Style

Zhou Z, Fan Y, Wang G, Lai Z, Gao Y, Wu F, Lei C, Dang R. Detection of Selection Signatures Underlying Production and Adaptive Traits Based on Whole-Genome Sequencing of Six Donkey Populations. Animals. 2020; 10(10):1823. https://doi.org/10.3390/ani10101823

Chicago/Turabian Style

Zhou, Zihui, Yingzhi Fan, Gang Wang, Zhenyu Lai, Yuan Gao, Fei Wu, Chuzhao Lei, and Ruihua Dang. 2020. "Detection of Selection Signatures Underlying Production and Adaptive Traits Based on Whole-Genome Sequencing of Six Donkey Populations" Animals 10, no. 10: 1823. https://doi.org/10.3390/ani10101823

APA Style

Zhou, Z., Fan, Y., Wang, G., Lai, Z., Gao, Y., Wu, F., Lei, C., & Dang, R. (2020). Detection of Selection Signatures Underlying Production and Adaptive Traits Based on Whole-Genome Sequencing of Six Donkey Populations. Animals, 10(10), 1823. https://doi.org/10.3390/ani10101823

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop