Next Article in Journal
Focal Recurrent Copy Number Alterations Characterize Disease Relapse in High Grade Serous Ovarian Cancer Patients with Good Clinical Prognosis: A Pilot Study
Previous Article in Journal
Whole Genome Diversity, Population Structure, and Linkage Disequilibrium Analysis of Chickpea  (Cicer arietinum L.) Genotypes Using Genome-Wide DArTseq-Based SNP Markers
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

The Draft Genome of the Endangered Sichuan Partridge (Arborophila rufipectus) with Evolutionary Implications

1
Key Laboratory of Bioresources and Ecoenvironment (Ministry of Education), College of Life Sciences, Sichuan University, Chengdu 610064, China
2
College of Life Sciences, Leshan Normal University, Leshan 614004, China
3
Institute of Qinghai-Tibetan Plateau, Southwest Minzu University, Chengdu 610064, China
*
Author to whom correspondence should be addressed.
Genes 2019, 10(9), 677; https://doi.org/10.3390/genes10090677
Submission received: 25 July 2019 / Revised: 30 August 2019 / Accepted: 2 September 2019 / Published: 5 September 2019
(This article belongs to the Section Animal Genetics and Genomics)

Abstract

:
The Sichuan partridge (Arborophila rufipectus, Phasianidae, Galliformes) is distributed in south-west China, and classified as endangered grade. To examine the evolution and genomic features of Sichuan partridge, we de novo assembled the Sichuan partridge reference genome. The final draft assembly consisted of approximately 1.09 Gb, and had a scaffold N50 of 4.57 Mb. About 1.94 million heterozygous single-nucleotide polymorphisms (SNPs) were detected, 17,519 protein-coding genes were predicted, and 9.29% of the genome was identified as repetitive elements. A total of 56 olfactory receptor (OR) genes were found in Sichuan partridge, and conserved motifs were detected. Comparisons between the Sichuan partridge genome and chicken genome revealed a conserved genome structure, and phylogenetic analysis demonstrated that Arborophila possessed a basal phylogenetic position within Phasianidae. Gene Ontology (GO) enrichment analysis of positively selected genes (PSGs) in Sichuan partridge showed over-represented GO functions related to environmental adaptation, such as energy metabolism and behavior. Pairwise sequentially Markovian coalescent analysis revealed the recent demographic trajectory for the Sichuan partridge. Our data and findings provide valuable genomic resources not only for studying the evolutionary adaptation, but also for facilitating the long-term conservation and genetic diversity for this endangered species.

Graphical Abstract

1. Introduction

The Sichuan partridge (Arborophila rufipectus, Phasianidae, and Galliformes) is endemic to the mountains in the south-west China [1]. It has been classified as an endangered species (IUCN 2007) and a nationally protected animal in China because of its largely restricted range, very small population size, and severely fragmented habitat (Figure 1) [2,3]. Owing to the habitat fragmentation of endemic birds in subtropical forest in the mountains of Southwestern China, the populations of Sichuan partridge have decreased dramatically [4], which has led to governmental protection of the Sichuan partridge in much of their range. To date, few studies have been conducted for the investigation associated with the genetic mechanisms of the environmental adaption of the Sichuan partridge, and the whole genome of Sichuan partridge is currently not available. Further research is essential to clarify the environmental adaption and evolutionary history of Sichuan partridge and to elucidate functional genomic regions that underlie ecological adaptation. To help achieve the goal, we have sequenced and assembled the Sichuan partridge reference genome, and performed comparative genomics analysis. The whole genome of Sichuan partridge will be a valuable genomic resource for answering evolutionary questions associated with this endangered species, and for the development of genetic tools for Sichuan partridge research and conservation.

2. Materials and Methods

2.1. Sequencing, Assembly, and Genome Data

Muscle tissue was collected from a wild male Sichuan partridge which was preserved in the Natural History Museum of Sichuan University. Collected muscle tissue was used for genomic DNA extraction, isolation, and sequencing. Three paired-end libraries with insert sizes of 250 base pairs (bp), 500 bp, and 800 bp, and five mate-paired libraries with insert sizes of 2 kb, 5 kb, 10 kb, 15 kb, and 20 kb were constructed. The libraries were sequenced on an Illumina 2000 platform (Illumina, San Diego, CA, USA).
Before assembly, 17-Kmer analysis was performed for the genome size estimation of the Sichuan partridge, and the assembly was first analyzed by SOAPdenovo2 [5]. After using SSPACE [6] to build the super-scaffolds, intra-scaffold gaps were then filled using Gapcloser with reads from short-insert libraries. CEGMA (v2.5) [7] and BUSCO (v3.1.0) [8] were employed to evaluate the genome completeness. Gene prediction and functional annotation were conducted according to our previous study [9]. The raw data is available as a Bioproject at NCBI (PRJNA419836).
To conduct comparative genomic analysis, we collected complete genomes of seven other birds from NCBI (turkey, Meleagris gallopavo, GCF_000146605.2; chicken, Gallus gallus, GCF_000002315.6; mallard, Anas platyrhynchos, GCF_003850225.1; ostrich, Struthio camelus, GCF_000698965.1; peregrine falcon, Falco peregrinus, GCF_000337955.1; zebra finch, Taeniopygia guttata, GCF_000151805.1; rock pigeon, Columba livia, GCF_000337935.1). The complete genome of Hainan partridge (Arborophila ardens) and Chinese monal (Lophophorus lhuysii) were sequenced and assembled by our lab [9,10].

2.2. Characterization of Repeat Content

Repetitive elements in Sichuan partridge, Hainan partridge, Chinese monal, turkey, and chicken were identified using RepeatMasker (http://www.repeatmasker.org/). Perl script “calcDivergence-fromAlign.pl” in the RepeatMasker package was used to calculate the Kimura 2-parameter [11] distance between a transposable element insertion and the assumed ancestral sequence. Microsatellites (SSRs) in the genome of the above five Phasianidae species were identified by the software Krait [12] with default settings.

2.3. Analyses of Olfactory Receptor (ORs) Genes

For the identification of OR genes in birds, we employed a consensus approach, because it was reported that different approaches and/or cutoffs would possibly cause systematic bias [13]. A homology search was conducted to identify OR genes in these ten avian genomes. The method to detect functional OR genes in the ten bird genome sequences was similar to our previous studies [13,14], but we improved it to be applicable to any bird species. We aligned a total of 477 functional OR amino acid sequences obtained from the ten avian species using MAFFT 7 with auto strategy parameters. An unrooted phylogenetic tree was constructed after 1000 rounds of bootstrapping using IQ-TREE (v1.6.10) [15] with maximum likelihood (ML) approaches on the basis of the best-fit model JTT+F+R8 of amino acid evolution. We classified the identified OR genes into families and subfamilies based on the results of phylogenetic analyses and clustering analysis, and the multispecies OR gene clustering analysis was performed using CD-HIT software (v4.8.1) [16] (cutoff value of 40% and 60% amino acid similarity were set as the thresholds to discriminate between families and subfamilies, respectively) as previously described [17]. Sequence logos were generated from the alignment of the functional OR protein sequences for the identification of the conserved motifs in the predicted OR amino acid sequences with the program Multiple Expectation Maximization for Motif Elicitation (MEME) [18]. Only the top five conserved motifs were detected, with the motif length ranging from 5 to 50.

2.4. Gene Family and Positive Selection

The software orthoMCL (v2.0.9) [19] was used to define the orthologous genes from ten studied avian genomes (Sichuan partridge, Hainan partridge, Chinese monal, turkey, chicken, mallard, peregrine falcon, zebra finch, ostrich, and rock pigeon). Based on 1:1 orthologous genes aligned by PRANK (v.170427) [20] and concatenated to one sequence for each species, we constructed the phylogenetic tree using RAxML (v8.2.12) [21] with 1000 bootstrap replicates. The phylip dataset has been deposited at Figshare under the link https://figshare.com/articles/Sichuan_partridge/9751982. Divergence time estimation was performed by PAML MCMCTREE (v4.4) [22] as described in previous studies [23,24]. We used two calibrations for this analysis: (1) Divergence of the chicken and zebra finch was set to an age of 58.2–92.2 million years ago [10]; and (2) divergence of the chicken and duck was set to an age of 53.6–72 million years ago [23]. The alignments of 1:1 orthologous genes and phylogenetic tree were employed to estimate the ratio of the rates of non-synonymous-to-synonymous substitutions per gene (ω) by ML with the “codeml” program from PAML package [22] under the branch-site model. Two models were conducted to test the statistical significance of selective pressure specifically on the Sichuan partridge branch. One was the one-ratio model acting as the null model (NSsites = 0, model = 0), and the other was model 2 (NSsites = 2). The two models were compared with the likelihood ratio test (LRT), which was calculated from the log likelihood (lnL) values for both models. The p-values were obtained by calculating twice the difference between lnL (model2) and lnL (one-ratio) and compared with a χ-square distribution. We then identified positively selected genes (PSGs) of the Sichuan partridge by means of FDR adjustment with Q-values < 0.05. Gene Ontology (GO) enrichment of the PSGs of Sichuan partridge was performed via KOBAS 3.0 [25,26].

2.5. Demography Reconstruction

The software SAMtools was employed to detect single-nucleotide polymorphisms (SNPs) between diploid chromosomes for the Sichuan partridge [27]. Then we used the pairwise sequentially Markovian coalescent (PSMC) to infer the demographic history of the Sichuan partridge [28].

3. Results

3.1. Genome Sequencing, Assembly, and Quality Assessment

A total of 296.74 Gb (~272-fold coverage) of high-quality reads were generated for the Sichuan partridge after filtering out low quality and duplicated reads (Table 1). Based on the K-mer analysis, the genome size of Sichuan partridge was estimated to be 1.09 Gb, which was similar to other reported avian genomes. The final Sichuan partridge genome assembly consisted of approximately 1.09 Gb, with the contig N50 and scaffold N50 of 0.93 Kb and 4.57 Mb, respectively. CEGMA revealed 77.42% complete and 85.89% partial gene set for the assembled Sichuan partridge genome (Table S1), while BUSCO indicated the presence of 86.5% of the eukaryotic single-copy genes (Table S2).
The high synteny and assembly correctness were illustrated through the alignment of Sichuan partridge scaffolds to the chicken reference genome (Figure S1). Inversions and chromosomal rearrangements were evident like chromosome 4 of the chicken (Figure 2a), and some of the chicken microchromosomes such as chromosome 26 can be covered almost entirely by a small number of Sichuan partridge scaffolds (Figure 2b).

3.2. Genome Characterization

The Sichuan partridge genome had a GC content of approximately 41.95%, which was similar to other bird species such as chicken and zebra finch. Through gene prediction, we obtained a total of 17,519 protein-coding genes (PCGs) for the Sichuan partridge, of which 15,891 (90.71%) were well supported by public protein databases: TrEMBL (https://www.uniprot.org/statistics/TrEMBL), Swissprot (https://www.uniprot.org/), Interpro (https://www.ebi.ac.uk/interpro/), Nr (https://www.ncbi.nlm.nih.gov/refseq/about/nonredundantproteins/), GO (http://geneontology.org/), and KEGG (https://www.genome.jp/kegg/) (Figure 3a and Table S3).
About 101.22 Mb sequences (approximately 9.29% of the genome assembly) were attributed to repeats in Sichuan partridge genome. The percentage of long interspersed nuclear elements (LINEs), long terminal repeats (LTRs), short interspersed nuclear elements (SINEs), and DNA transposons were 6.35%, 1.77%, 0.05%, and 1.05% in the Sichuan partridge genome (Table 2). The divergence rate distribution of four major types of transposable elements in Sichuan partridge, Hainan partridge, Chinese monal, turkey, and chicken is shown in Figure 4. The repeat estimation of Sichuan partridge was larger than that of Hainan partridge, while smaller than those of Chinese monal, turkey, and chicken. However, read-based scaffolding which was involved in the insertion of “N’s” into gaps was demonstrated to lead to the underestimation of genome-wide repetitive content [29]. Even so, there is a common feature of the Sichuan partridge, turkey, scarlet macaw, zebra finch, and northern bobwhite genomes: the high proportion of L3/CR1 interspersed repeats [29,30,31,32] which are conserved across these divergent avian lineages.
For further evaluation of the repetitive content of the Sichuan partridge genome, we employed Krait to predict and characterize genome-wide SSR loci, which can identify the loci that could be used for population genetic studies. Imperfect SSRs were the most frequent type, followed by the perfect SSRs, and the least was the compound SSRs in the Sichuan partridge genome (Table S4). In total, we identified 367,513 perfect SSR loci containing 1 to 6 bp sequence motifs (Table 3). The total number of perfect SSR loci in the Sichuan partridge was larger than that in turkey and Chinese monal, while lower than that in Hainan partridge and chicken. The most frequent perfect SSRs were mononucleotide SSRs, with the highest frequencies of 256.49 loci/Mb and the highest densities of 4619.86 bp/Mb, accounting for 71.58% of the total number of SSRs of the Sichuan partridge genome. The second most frequent SSRs were tetranucleotide SSRs with a proportion of 11.49% for the Sichuan partridge. In contrast, dinucleotide, trinucleotide, and pentanucleotide SSRs were less frequent, and the least was hexanucleotide SSRs, only accounted for 0.58% of all of the SSRs for the Sichuan partridge. The most abundant motif categories found in the Sichuan partridge genome was revealed in Table S5. Importantly, microsatellite genotyping can be utilized to assess the population structure, gene flow, and covey composition within and between Sichuan partridge populations, and thus, the resources described herein can be used for development of genetic markers for the Sichuan partridge.

3.3. Olfactory Receptors (ORs): Composition, Classification, and Conserved Motifs

The total number of OR genes varied from 34 to 92, with the average number of functional genes (47.7) considerably greater than truncated genes (9.5) or pseudogenes (2.0) (Figure 5a). The total number of OR genes was greatest in zebra finch, while the least in peregrine falcon. It was reported that OR genes with more than 60% identity in protein sequence are suggested to recognize odorants with related structures [33,34]. To evaluate the diversity in the OR gene repertoires of the ten studied birds, the identified avian OR functional genes were classified into families and subfamilies on the basis of phylogenetic analyses and their sequence similarity. The OR genes of the ten avian species were divided into 10 families and 80 subfamilies. The comparison of OR gene subfamilies of the five Phasianidae species is illustrated in Figure 5b. Phylogenetic comparison of OR repertoires suggested the obvious species-specific clustering pattern (Figure 5c), which was in line with previous studies [35,36]. Previous studies have indicated that some conserved amino acid motif features are found in mammalian OR genes [33,34,37]. To characterize the conserved motifs of OR protein sequences belonging to the five Phasianidae species, the five most conserved motifs were identified by the MEME program. As illustrated in Figure 6, the conserved motifs were strikingly similar among the Phasianidae species. The existence of the conserved motifs of OR genes has also been detected in other taxa (e.g., fish [38]).

3.4. Bird Phylogeny, Divergence and Evolution of Gene Families

A total of 14,668 gene families were identified for 10 bird species (Sichuan partridge, Hainan partridge, Chinese monal, turkey, chicken, mallard, peregrine falcon, zebra finch, ostrich) of which 5050 represented 1:1 orthologous gene families. We compared the orthologous gene clusters among five Phasianidae species (Sichuan partridge, Hainan partridge, Chinese monal, turkey, and chicken), which is shown in Figure 3b. The phylogenetic tree constructed based on the 1:1 orthologous genes suggested the basal phylogenetic position of the genus Arborophila within Phasianidae, and was most likely derived from a common ancestor approximately 49.1 million years ago (Mya) (Figure 3c).

3.5. Positive Selection in the Sichuan Partridge

We found that 234 of the 5050 one-to-one orthologous genes were under positive selection in the Sichuan partridge. The Gene Ontology (GO) annotation classified the PSGs into three categories: molecular functions, cellular components, and biological processes (Figure 7a). Molecular functions included genes mainly involved in binding (132 genes; GO:0005488) and catalytic activity (76 genes; GO:0003824). Genes related to cellular components were primarily cell (193 genes; GO:0005623), cell part (192 genes; GO:0044464), and organelle (166 genes; GO:0043226). Biological process genes were mainly involved in cellular process (177 genes; GO:0009987), metabolic process (131 genes; GO:0008152), biological regulation (126 genes; GO:0065007), and regulation of biological process (114 genes; GO:0050789). The distribution of GO annotations in different functional categories showed a substantial diversity of PSGs. We identified biochemical pathways represented by the (PSGs). The KEGG annotation of the PSGs suggested that they were distributed in 40 pathways related to metabolism (24 genes), genetic information processing (19 genes), environmental information processing (19 genes), cellular processes (22 genes), organismal systems (17 genes), and human diseases (27 genes) (Figure 7b). We further performed GO enrichment with all the PSGs. GO enrichment identified significant overrepresentation of genes involved in environmental adaptation of Sichuan partridge (Table S6). Several pathways related to energy metabolism and behavior were found in Sichuan partridge, such as mitochondrion (GO:0005739), nitrogen compound metabolic process (GO:0006807), and adult locomotory behavior (GO:0008344).

3.6. Demography Reconstruction

A total of 1,943,364 heterozygous SNPs in the Sichuan partridge genome were detected, and the genome-wide SNP density distribution is showed in Figure 8a. Pairwise sequentially Markovian coalescent modeling (PSMC) analysis was conducted based on local SNP densities to model the demographic history of Sichuan partridge from 10 million years ago to 10,000 years ago (Figure 8b). PSMC showed that the effective size of the Sichuan partridge population had experienced one sharp decrease from approximately 195,000 individuals to a minimum of 2500 individuals around 15,000 years ago. PSMC plots from a variety of birds [39] have revealed several taxa (e.g., the Rifleman) that underwent relatively drastic declines during the last glacial period. The Sichuan Partridge has a similar drastic decline. However, the Sichuan Partridge had a relatively small and stable population size for a long period of time prior to the increase to 195,000 individuals, which was then followed by a drastic decrease. This differs from the taxa that were observed to have a decrease in the prior study [39], which had fluctuating population sizes prior to the decline.

4. Discussion and Conclusions

The Sichuan partridge reference genome is an important resource for studying the genetic mechanisms and environmental adaptation facilitating more effective protection for this endangered species. The synteny analysis between the Sichuan partridge and chicken revealed the relatively conserved genome structures, which was consistent with previous reports of conserved overall synteny between ground tit and zebra finch [23], zebra finch and chicken [31], and also between turkey and chicken [40], suggesting a conserved genome structure among these avian species. However, further confirmation with more sequenced avian genomes is needed to be conducted for this inference.
Olfactory receptors (ORs) are expressed in sensory neurons within the olfactory epithelium and play a pivotal role in the sense of smell among vertebrates. To date, avian OR genes have been largely unexplored. In this study, an average number of 59 OR genes in the ten studied avian species were identified, and the total number of OR genes varied greatly from 34 to 92 across the bird species. The total number of OR functional genes in bird species was considerably fewer than in the mammals like pig (1113), cattle (881), the yak (981), and the forest musk deer (864) [37]. Among the five Phasianidae species, the total number of OR genes in the chicken was obviously greater than Sichuan partridge, Hainan partridge, turkey, and Chinese monal, which suggested that the olfaction capability of chicken was better than other Phasianidae birds. The enhanced olfaction capability of chicken can possibly be attributed to the domestication, which was in line with previous studies [37]. Pigs and cattle have been domesticated for thousands of years and selective breeding could have favored or enhanced olfaction capabilities. Previous study indicated that domesticated pig has highly developed noses for foraging in the soil [41], which revealed the important role of olfaction in a pig’s life. With a range of odiferous glands over their body and a strong sense of smell, cattle have been reported to be capable of recognizing companions, distinguishing gonads and receiving pheromones secreted by the skin [42]. The proportion of OR pseudogenes in the ten bird species was on average 3.90%, which was apparently less than in mammals like pig (14.5%), cattle (13.4%), yaks (34.8%), and forest musk deer (26.0%). The difference in the number of OR genes and the percentage of OR pseudogene may be attributed to the difference of survival strategies, the quality of the genomes and/or different bioinformatics search strategies used. Furthermore, the genome quality was reported to possibly contribute to the difference in the proportion of OR pseudogenes [36]. Therefore, a more accurate analysis of OR gene repertoires in these ten species, and whether the olfaction capability of chicken was more highly developed than other birds needs verification.
The phylogenetic analysis revealed the monophyly of the Phasianidae and the genus Arborophila was given a basal phylogenetic position, branching apparently earlier than other genera within Phasianidae. This result was consistent with the results of previous phylogenetic analyses that have also placed Arborophila basal to the balance of phasianines that we included in our analysis [43,44,45,46]. The genus Arborophila diverged from the other lineages in the Phasianidae around 49.1 Mya, which was much earlier than other genera [43].
The altitude of Laojunshan National Nature Reserve ranges from 1100 to 2008 m and has an annual average temperature of 12.5 °C [4]. Previous studies reported that Sichuan partridge eggs experienced an average of 4.2 h per day below 26 °C (the ‘Physiological Zero Temperature’) at which the embryo cannot develop [47]. Although the temperature is fairly more than that reported for most birds and likely to cause death of the embryo, the hatching success of Sichuan partridge was 88.4%, which is approaching the average hatchability rate for birds generally (89.1%) [47]. The positively selected genes, found in this study, related to energy metabolism possibly illustrated the molecular genetic mechanisms of adaptation to cold montane environments and embryonic tolerance of hypothermia of Sichuan partridge. Compared to birds tending to nest in relatively quieter areas, Sichuan partridge was reported to nest in more disturbed areas, but hatching success was not reduced [47]. Nesting located close to forest trails is likely to gain some advantages because disturbance along trails could reduce predation to some extent. At the very least, reference [47] has illustrated the resilience of the Sichuan partridge to disturbance along forest trails and revealed that increased visitor numbers to Laojunshan National Nature Reserve did not necessarily result in reduced breeding success of Sichuan partridge, and that the reserve is there to protect. Energy and behavior related pathways analyzed in this study will provide pivotal information for understanding the molecular genetic mechanism of the special reproduction of Sichuan partridge.
In summary, this is the first report describing the complete Sichuan partridge genome. We illustrated the conserved genome structure between the Sichuan partridge and chicken and the basal phylogenetic position of the genus Arborophila within Phasianidae. We used the PSMC to infer the demographic history of the Sichuan partridge on the basis of the distribution of heterozygote sites across the genome. Positively selected genes related to energy metabolism and behavior possibly hint the molecular genetic mechanisms of the environmental adaptation of the Sichuan partridge. Further research is needed to further explore the significance of the findings presented here. The Sichuan partridge reference genome will play an instrumental role in the future examination of adaptive evolution to the environment and aiding the long-term conservation of this endangered species and their genetic diversity. Our de novo assembled genome presented here will provide a resource for the future examination of evolution and adaptation of Sichuan Partridge, and the genome will eventually be useful in aiding the long-term conservation of Sichuan Partridge.

Supplementary Materials

The following are available online at https://www.mdpi.com/2073-4425/10/9/677/s1, Table S1: Statistics of the genome completeness of the Sichuan partridge based on 248 CEGs, Table S2: Statistics of the genome completeness of Sichuan partridge based on BUSCO benchmark, Table S3: Functional annotation statistics of the Sichuan partridge genome, Table S4: The number and frequency of the SSRs in the Sichuan Partridge genome, Table S5: The most abundant motif categories found in the Sichuan partridge genome, Table S6: GO enrichment of the PSGs in the Sichuan partridge, Figure S1: The alignment of Sichuan partridge scaffolds to the chicken reference genome.

Author Contributions

Conceptualization, C.Z., B.Y. and Y.M.; methodology, C.Z., H.T., H.Y., S.Z. and B.D.; formal analysis, C.Z., H.T., H.Y., S.Z. and B.D.; investigation, C.Z., H.T., H.Y., S.Z. and B.D.; writing—original draft preparation, C.Z.; writing—review and editing, M.P., Y.W., N.Y., B.Y. and Y.M.; project administration, C.Z., B.Y. and Y.M.

Funding

This research was supported by the National Natural Science Foundation of China (31970395, 31702017).

Acknowledgments

The authors would like to thank Wujiao Li, Chaochao Yan, Kai Cui, and Qinchao Wen for their valuable advice.

Conflicts of Interest

The authors declare that they have no conflicts of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

References

  1. He, L.; Dai, B.; Zeng, B.; Zhang, X.; Chen, B.; Yue, B.; Li, J. The complete mitochondrial genome of the Sichuan Hill Partridge (Arborophila rufipectus) and a phylogenetic analysis with related species. Gene 2009, 435, 23–28. [Google Scholar] [CrossRef] [PubMed]
  2. Dai, B.; Dowell, S.D.; Martins, R.P.; Williams, S.R. Conservation status of the Sichuan Hill Partridge. Bird Conserv. Int. 1998, 8, 349–359. [Google Scholar]
  3. Li, C.; Hu, J.; Yu, Z. Distribution and habitat selection of the Sichuan Hill-Partridge. Chin. J. Zool. 2003, 38, 46–51. [Google Scholar]
  4. Liao, W.B.; Li, C.; Hu, J.C.; Lu, X. Habitat utilization of Sichuan Hill Partridge (Arborophila rufipectus) in the non-breeding period in Laojunshan nature reserve. Zool. Res. 2007, 28, 172–178. [Google Scholar]
  5. Luo, R.; Liu, B.; Xie, Y.; Li, Z.; Huang, W.; Yuan, J.; He, G.; Chen, Y.; Pan, Q.; Liu, Y.; et al. SOAPdenovo2: An empirically improved memory-efficient short-read de novo assembler. Gigascience 2012, 1, 18. [Google Scholar] [CrossRef] [PubMed]
  6. Boetzer, M.; Henkel, C.V.; Jansen, H.J.; Butler, D.; Pirovano, W. Scaffolding pre-assembled contigs using SSPACE. Bioinformatics 2011, 7, 578–579. [Google Scholar] [CrossRef] [PubMed]
  7. Parra, G.; Bradnam, K.; Korf, I. CEGMA: A pipeline to accurately annotate core genes in eukaryotic genomes. Bioinformatics 2007, 23, 1061–1067. [Google Scholar] [CrossRef] [PubMed]
  8. Simão, F.A.; Waterhouse, R.M.; Ioannidis, P.; Kriventseva, E.V.; Zdobnov, E.M. BUSCO: Assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 2015, 31, 3210–3212. [Google Scholar] [CrossRef]
  9. Zhou, C.; Zheng, S.; Jiang, X.; Liang, W.; Price, M.; Fan, Z.; Meng, Y.; Yue, B. First complete genome sequence in Arborophila and comparative genomics reveals the evolutionary adaptation of Hainan Partridge (Arborophila ardens). Avian Res. 2018, 9, 45. [Google Scholar] [CrossRef]
  10. Cui, K.; Li, W.; James, J.G.; Peng, C.; Jin, J.; Yan, C.; Fan, Z.; Du, L.; Price, M.; Wu, Y.; et al. The first draft genome of Lophophorus: A step forward for Phasianidae genomic diversity and conservation. Genomics 2018, 29. [Google Scholar] [CrossRef]
  11. Kimura, M. A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences. J. Mol. Evol. 1980, 16, 111–120. [Google Scholar] [CrossRef] [PubMed]
  12. Du, L.; Zhang, C.; Liu, Q.; Zhang, X.; Yue, B. Krait: An ultrafast tool for genome-wide survey of microsatellites and primer design. Bioinformatics 2017, 34, 681–683. [Google Scholar] [CrossRef] [PubMed]
  13. Zhou, C.; Wang, G.; Yu, H.; Geng, Y.; Wu, W.; Tu, H.; Price, M.; Fan, Z.; Meng, Y.; Yue, B. Genome-wide analysis reveals the genomic features of the turkey vulture (Cathartes aura) as a scavenger. Mol. Genet. Genom. 2019, 5, 1–4. [Google Scholar] [CrossRef] [PubMed]
  14. Zhou, C.; Jin, J.; Peng, C.; Wen, Q.; Wang, G.; Wei, W.; Jiang, X.; Price, M.; Cui, K.; Meng, Y.; et al. Comparative genomics sheds light on the predatory lifestyle of accipitrids and owls. Sci. Rep. 2019, 9, 2249. [Google Scholar] [CrossRef] [PubMed]
  15. Trifinopoulos, J.; Nguyen, L.T.; von Haeseler, A.; Minh, B.Q. W-IQ-TREE: A fast online phylogenetic tool for maximum likelihood analysis. Nucleic Acids Res. 2016, 44, W232–W235. [Google Scholar] [CrossRef] [PubMed]
  16. Li, W.; Godzik, A. Cd-hit: A fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics 2006, 22, 1658–1659. [Google Scholar] [CrossRef] [PubMed]
  17. Glusman, G.; Bahar, A.; Sharon, D.; Pilpel, Y.; White, J.; Lancet, D. The olfactory receptor gene superfamily: Data mining, classification, and nomenclature. Mamm. Genome 2000, 11, 1016–1023. [Google Scholar] [CrossRef] [PubMed]
  18. Bailey, T.L.; Elkan, C. Fitting a mixture model by expectation maximization to discover motifs in biopolymers. In Proceedings of the Second International Conference on Intelligent Systems for Molecular Biology, Stanford, CA, USA, 14–17 August 1994. [Google Scholar]
  19. Li, L.; Stoeckert, C.J.; Roos, D.S. OrthoMCL: Identification of ortholog groups for eukaryotic genomes. Genome Res. 2003, 13, 2178. [Google Scholar] [CrossRef] [PubMed]
  20. Nick, G.; Ari, L. WebPRANK: A phylogeny-aware multiple sequence aligner with interactive alignment browser. BMC Bioinform. 2010, 11, 1–7. [Google Scholar]
  21. Stamatakis, A. RAxML version 8: A tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 2014, 30, 1312–1313. [Google Scholar] [CrossRef] [PubMed]
  22. Yang, Z. PAML 4: Phylogenetic analysis by maximum likelihood. Mol. Biol. Evol. 2007, 24, 1586. [Google Scholar] [CrossRef] [PubMed]
  23. Cai, Q.; Qian, X.; Lang, Y.; Luo, Y.; Xu, J.; Pan, S.; Hui, Y.; Gou, C.; Cai, Y.; Hao, M.; et al. Genome sequence of ground tit Pseudopodoces humilis and its adaptation to high altitude. Genome Biol. 2013, 14, R29. [Google Scholar] [CrossRef] [PubMed]
  24. Lu, L.; Chen, Y.; Wang, Z.; Li, X.; Chen, W.; Tao, Z.; Shen, J.; Tian, Y.; Wang, D.; Li, G.; et al. The goose genome sequence leads to insights into the evolution of waterfowl and susceptibility to fatty liver. Genome Biol. 2015, 16, 89. [Google Scholar] [CrossRef] [PubMed]
  25. Wu, J.; Mao, X.; Cai, T.; Luo, J.; Wei, L. KOBAS server: A web-based platform for automated annotation and pathway identification. Nucleic Acids Res. 2006, 34, W720–W724. [Google Scholar] [CrossRef] [PubMed]
  26. Xie, C.; Mao, X.; Huang, J.; Ding, Y.; Wu, J.; Dong, S.; Kong, L.; Gao, G.; Li, C.Y.; Wei, L. KOBAS 2.0: A web server for annotation and identification of enriched pathways and diseases. Nucleic acids Res. 2011, 39, W316–W322. [Google Scholar] [CrossRef] [PubMed]
  27. Li, H.; Handsaker, B.; Wysoker, A.; Fennell, T.; Ruan, J.; Homer, N.; Marth, G.; Abecasis, G.; Durbin, R. The sequence alignment/map format and SAMtools. Bioinformatics 2009, 25, 2078–2079. [Google Scholar] [CrossRef] [PubMed]
  28. Li, H.; Durbin, R. Inference of human population history from individual whole-genome sequences. Nature 2011, 475, 493. [Google Scholar] [CrossRef]
  29. Seabury, C.M.; Dowd, S.E.; Seabury, P.M.; Raudsepp, T.; Brightsmith, D.J.; Liboriussen, P.; Halley, Y.; Fisher, C.A.; Owens, E.; Viswanathan, G.; et al. A multi-platform draft de novo genome assembly and comparative analysis for the scarlet macaw (Ara macao). PLoS ONE 2013, 8, e62415. [Google Scholar] [CrossRef]
  30. Dalloul, R.A.; Long, J.A.; Zimin, A.V.; Aslam, L.; Beal, K.; Blomberg, L.A.; Bouffard, P.; Burt, D.W.; Crasta, O.; Crooijmans, R.P.; et al. Multi-platform next-generation sequencing of the domestic turkey (Meleagris gallopavo): Genome assembly and analysis. PLoS Biol. 2010, 8, e1000475. [Google Scholar] [CrossRef]
  31. Warren, W.C.; Clayton, D.F.; Ellegren, H.; Arnold, A.P.; Hillier, L.W.; Künstner, A.; Searle, S.; White, S.; Vilella, A.J.; Fairley, S.; et al. The genome of a songbird. Nature 2010, 464, 757. [Google Scholar] [CrossRef]
  32. Halley, Y.A.; Dowd, S.E.; Decker, J.E.; Seabury, P.M.; Bhattarai, E.; Johnson, C.D.; Rollins, D.; Tizard, I.R.; Brightsmith, D.J.; Peterson, M.J.; et al. A draft de novo genome assembly for the northern bobwhite (Colinus virginianus) reveals evidence for a rapid decline in effective population size beginning in the Late Pleistocene. PLoS ONE 2014, 9, e90240. [Google Scholar] [CrossRef] [PubMed]
  33. Nguyen, D.T.; Lee, K.; Choi, H.; Choi, M.K.; Le, M.T.; Song, N.; Kim, J.H.; Seo, H.G.; Oh, J.W.; Lee, K.; et al. The complete swine olfactory subgenome: Expansion of the olfactory gene repertoire in the pig genome. BMC Genom. 2012, 13, 584. [Google Scholar] [CrossRef] [PubMed]
  34. Lee, K.; Nguyen, D.T.; Choi, M.; Cha, S.Y.; Kim, J.H.; Dadi, H.; Seo, H.G.; Seo, K.; Chun, T.; Park, C. Analysis of cattle olfactory subgenome: The first detail study on the characteristics of the complete olfactory receptor repertoire of a ruminant. BMC Genom. 2013, 14, 596. [Google Scholar] [CrossRef] [PubMed]
  35. Zhan, X.; Pan, S.; Wang, J.; Dixon, A.; He, J.; Muller, M.G.; Ni, P.; Hu, L.; Liu, Y.; Hou, H.; et al. Peregrine and saker falcon genome sequences provide insights into evolution of a predatory lifestyle. Nat. Genet. 2013, 45, 563. [Google Scholar] [CrossRef] [PubMed]
  36. Le Duc, D.; Renaud, G.; Krishnan, A.; Almén, M.S.; Huynen, L.; Prohaska, S.J.; Ongyerth, M.; Bitarello, B.D.; Schiöth, H.B.; Hofreiter, M.; et al. Kiwi genome provides insights into evolution of a nocturnal lifestyle. Genome Biol. 2015, 16, 147. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  37. Bu, P.; Jian, Z.; Koshy, J.; Shen, Y.; Yue, B.; Fan, Z. The olfactory subgenome and specific odor recognition in forest musk deer. Anim. Genet. 2019, 50, 358–366. [Google Scholar] [CrossRef] [PubMed]
  38. Lv, L.Y.; Liang, X.F.; He, S. Genome-wide identification and characterization of olfactory receptor genes in Chinese perch, Siniperca chuatsi. Genes 2019, 10, 178. [Google Scholar] [CrossRef]
  39. Nadachowska-Brzyska, K.; Li, C.; Smeds, L.; Zhang, G.; Ellegren, H. Temporal dynamics of avian populations during Pleistocene revealed by whole-genome sequences. Curr. Biol. 2015, 18, 1375–1380. [Google Scholar] [CrossRef] [PubMed]
  40. Yang, Z. Likelihood and Bayes estimation of ancestral population sizes in hominoids using data from multiple loci. Genetics 2002, 162, 1811–1823. [Google Scholar]
  41. Jensen, P. Behaviour of pigs. In The Ethology of Domestic Animals; Jensen, P., Ed.; CABI Publishing: Wallingford, UK, 2002. [Google Scholar]
  42. Bouissou, M.F.; Boissy, A.; Le Neindre, P.; Veissier, I. The social behaviour of cattle. In Social Behaviour in Farm Animals; Leeling, L.J., Gonyou, H.W., Eds.; CABI Publishing: Wallingford, UK, 2001. [Google Scholar]
  43. Crowe, T.M. Phylogenetics, biogeography and classification of, and character evolution in, gamebirds (Aves: Galliformes): Effects of character exclusion, data partitioning and missing data. Cladistics 2006, 22, 495–532. [Google Scholar] [CrossRef]
  44. Chen, D.; Braun, E.L.; Forthman, M.; Kimball, R.T.; Zhang, Z. A simple strategy for recovering ultraconserved elements, exons, and introns from low coverage shotgun sequencing of museum specimens: Placement of the partridge genus Tropicoperdix within the galliformes. Mol. Phylogenet. Evol. 2018, 129, 304–314. [Google Scholar] [CrossRef] [PubMed]
  45. Hosner, P.A.; Faircloth, B.C.; Glenn, T.C.; Braun, E.L.; Kimball, R.T. Avoiding missing data biases in phylogenomic inference: An empirical study in the landfowl (Aves: Galliformes). Mol. Biol. Evol. 2016, 33, 1110–1125. [Google Scholar] [CrossRef] [PubMed]
  46. Hosner, P.A.; Tobias, J.A.; Braun, E.L.; Kimball, R.T. How do seemingly non-vagile clades accomplish trans-marine dispersal? Trait and dispersal evolution in the landfowl (Aves: Galliformes). Proc. R. Soc. B Biol. Sci. 2017, 284, 1854. [Google Scholar] [CrossRef] [PubMed]
  47. Fu, Y.; Dai, B.; Wen, L.; Chen, B.; Dowell, S.; Zhang, Z. Unusual incubation behavior and embryonic tolerance of hypothermia in the Sichuan partridge (Arborophila rufipectus). J. Ornithol. 2017, 158, 707–715. [Google Scholar] [CrossRef]
Figure 1. Distribution and photos of the Sichuan partridge.
Figure 1. Distribution and photos of the Sichuan partridge.
Genes 10 00677 g001
Figure 2. Alignments of Sichuan partridge scaffolds to chicken chromosomes. (a) Several chromosomal rearrangements between chicken and Sichuan partridge are evident in the chicken chromosome 4, while (b) the alignment of Sichuan Partridge scaffolds to chicken chromosome 26 is an example of scaffolds approaching chromosome size, high synteny and high assembly correctness. Forward alignments are in blue, reverse alignments are in red.
Figure 2. Alignments of Sichuan partridge scaffolds to chicken chromosomes. (a) Several chromosomal rearrangements between chicken and Sichuan partridge are evident in the chicken chromosome 4, while (b) the alignment of Sichuan Partridge scaffolds to chicken chromosome 26 is an example of scaffolds approaching chromosome size, high synteny and high assembly correctness. Forward alignments are in blue, reverse alignments are in red.
Genes 10 00677 g002aGenes 10 00677 g002b
Figure 3. Comparative genomics in avian species studied. (a) Functional annotation of Sichuan partridge genes. (b) Orthologous gene clusters in five Phasianidae species. (c) Phylogenetic tree constructed using 1:1 orthologous genes. The time lines indicate divergence times among the species.
Figure 3. Comparative genomics in avian species studied. (a) Functional annotation of Sichuan partridge genes. (b) Orthologous gene clusters in five Phasianidae species. (c) Phylogenetic tree constructed using 1:1 orthologous genes. The time lines indicate divergence times among the species.
Genes 10 00677 g003aGenes 10 00677 g003b
Figure 4. Divergence rate distribution of four major types of transposable elements (TEs) in (a) Sichuan partridge (Arborophila rufipectus), (b) Hainan partridge (Arborophila ardens), (c) Chinese monal (Lophophorus lhuysii), (d) turkey (Meleagris gallopavo), and (e) chicken (Gallus gallus) genomes. The divergence rate was calculated between their identified TE elements in the genome and the consensus sequences (Repbase).
Figure 4. Divergence rate distribution of four major types of transposable elements (TEs) in (a) Sichuan partridge (Arborophila rufipectus), (b) Hainan partridge (Arborophila ardens), (c) Chinese monal (Lophophorus lhuysii), (d) turkey (Meleagris gallopavo), and (e) chicken (Gallus gallus) genomes. The divergence rate was calculated between their identified TE elements in the genome and the consensus sequences (Repbase).
Genes 10 00677 g004
Figure 5. Identification of olfactory receptor (OR) genes and phylogenetic analysis. (a) Composition of OR genes in the ten avian species (b) Comparison of OR gene clusters among the five Phasianidae species. (c) The phylogenetic tree of OR functional genes from the ten bird genomes. Functional OR genes of each bird are represented by the same color as the species in (a).
Figure 5. Identification of olfactory receptor (OR) genes and phylogenetic analysis. (a) Composition of OR genes in the ten avian species (b) Comparison of OR gene clusters among the five Phasianidae species. (c) The phylogenetic tree of OR functional genes from the ten bird genomes. Functional OR genes of each bird are represented by the same color as the species in (a).
Genes 10 00677 g005
Figure 6. Logo representation of the five best conserved motifs identified for Phasianidae OR genes. The degree of conservation was represented by the height of the amino acid code.
Figure 6. Logo representation of the five best conserved motifs identified for Phasianidae OR genes. The degree of conservation was represented by the height of the amino acid code.
Genes 10 00677 g006
Figure 7. Functional distribution of positively selected genes (PSGs). (a) Functional distribution of PSGs according to the gene ontology (GO) database. The y-axis reveals the GO functional categories, while the number of genes in each category is plotted on the x-axis. (b) Functional distribution of PSGs according to the Kyoto Encyclopedia of Genes and Genomes (KEGG)pathway database. The y-axis illustrates the KEGG functional categories, while the number of genes in each category is plotted on the x-axis.
Figure 7. Functional distribution of positively selected genes (PSGs). (a) Functional distribution of PSGs according to the gene ontology (GO) database. The y-axis reveals the GO functional categories, while the number of genes in each category is plotted on the x-axis. (b) Functional distribution of PSGs according to the Kyoto Encyclopedia of Genes and Genomes (KEGG)pathway database. The y-axis illustrates the KEGG functional categories, while the number of genes in each category is plotted on the x-axis.
Genes 10 00677 g007
Figure 8. Single-nucleotide polymorphism (SNP) density distribution and demography reconstruction of Sichuan partridge. (a) Distribution of SNP density across Sichuan partridge genome. Heterozygous SNPs between the chromosomes were annotated, and heterozygosity density was observed in non-overlapping 50-kb windows. (b) Pairwise sequentially Markovian coalescent (PSMC) inference of Sichuan partridge population history on basis of autosomal data. The central bold lines represent inferred population sizes. The 100 thin curves surrounding each line are the PSMC estimates which were generated using 100 sequences randomly resampled from the original sequence. The mutation rate on autosomes used in time scaling, was estimated using chicken autosome data.
Figure 8. Single-nucleotide polymorphism (SNP) density distribution and demography reconstruction of Sichuan partridge. (a) Distribution of SNP density across Sichuan partridge genome. Heterozygous SNPs between the chromosomes were annotated, and heterozygosity density was observed in non-overlapping 50-kb windows. (b) Pairwise sequentially Markovian coalescent (PSMC) inference of Sichuan partridge population history on basis of autosomal data. The central bold lines represent inferred population sizes. The 100 thin curves surrounding each line are the PSMC estimates which were generated using 100 sequences randomly resampled from the original sequence. The mutation rate on autosomes used in time scaling, was estimated using chicken autosome data.
Genes 10 00677 g008
Table 1. Genome sequencing information of the Sichuan partridge.
Table 1. Genome sequencing information of the Sichuan partridge.
Insert Size (bp)Read Length (bp)Raw DataClean Data
Read NumberTotal Bases (Gb)Sequencing Depth (x)Read NumberTotal Bases (Gb)Sequencing Depth (x)
250150485,619,97672.8466.80476,794,84071.5265.62
500150267,992,81840.2036.88263,165,59039.4736.22
800150423,275,62863.4958.25414,846,83462.2357.09
2000150207,573,89231.1428.57167,760,37625.1623.09
5000150433,978,93665.1059.73408,121,68261.2256.17
10,000150265,968,60039.9036.60231,043,68834.6631.80
15,00015096,333,41214.4513.2696,333,41214.4513.26
20,000150138,791,49420.8219.1098,033,40614.7113.49
Estimated genome size of the Sichuan partridge is 1.09 Gb.
Table 2. Statistics of repetitive elements in five Phasianidae species.
Table 2. Statistics of repetitive elements in five Phasianidae species.
TypeA. rufipectusA. ardensG. gallusM. gallopavoL. lhuysii
Length (bp)Percentage (%)Length (bp)Percentage (%)Length (bp)Percentage (%)Length (bp)Percentage (%)Length (bp)Percentage (%)
SINEs531,0880.05532,3920.05611,3380.05565,2620.05557,0530.06
LINEs69,173,0646.3567,925,5436.5081,486,6936.6282,214,2587.2970,584,1676.97
LTR elements19,342,7631.7713,540,2231.3026,993,5862.1923,893,3832.1214,107,3591.39
DNA elements11,459,4891.0511,554,6221.1113,381,4361.0911,677,7201.0311,403,8961.13
Unclassified716,2930.07767,6250.071,772,3120.141,182,6960.10632,5860.06
Total101,222,6979.2994,320,4059.02124,245,36510.10119,533,31910.5997,285,0619.61
Table 3. The summary of perfect microsatellite types in five Phasianidae species.
Table 3. The summary of perfect microsatellite types in five Phasianidae species.
TypesA. rufipectusA. ardensG. gallusM. gallopavoL. lhuysii
CountsLength (bp)Percentage (%)CountsLength (bp)Percentage (%)CountsLength (bp)Percentage (%)CountsLength (bp)Percentage (%)CountsLength (bp)Percentage (%)
Mono-263,0504,738,05071.58317,5805,538,02374.56216,4193,462,41352.44140,8791,959,74713.91209,8303,535,26071.75
Di-23,602427,2926.4224,677457,4405.7931,485590,7127.6326,237584,46622.2820,669376,9447.07
Tri-22,277402,6336.0623,886445,9655.6127,148487,4616.5820,303446,12721.9718,649335,7426.38
Tetra-42,231896,88811.4942,527968,8969.9847,3271,114,11611.4737,434814,94421.7729,203611,5689.99
Penta-14,233460,6853.8715,035572,3803.5355,6162,187,01513.4813,247418,52031.5911,498500,6153.93
Hexa-212084,4020.58225192,4660.5334,6791,610,1128.43390113,01033.342581105,4200.88

Share and Cite

MDPI and ACS Style

Zhou, C.; Tu, H.; Yu, H.; Zheng, S.; Dai, B.; Price, M.; Wu, Y.; Yang, N.; Yue, B.; Meng, Y. The Draft Genome of the Endangered Sichuan Partridge (Arborophila rufipectus) with Evolutionary Implications. Genes 2019, 10, 677. https://doi.org/10.3390/genes10090677

AMA Style

Zhou C, Tu H, Yu H, Zheng S, Dai B, Price M, Wu Y, Yang N, Yue B, Meng Y. The Draft Genome of the Endangered Sichuan Partridge (Arborophila rufipectus) with Evolutionary Implications. Genes. 2019; 10(9):677. https://doi.org/10.3390/genes10090677

Chicago/Turabian Style

Zhou, Chuang, Hongmei Tu, Haoran Yu, Shuai Zheng, Bo Dai, Megan Price, Yongjie Wu, Nan Yang, Bisong Yue, and Yang Meng. 2019. "The Draft Genome of the Endangered Sichuan Partridge (Arborophila rufipectus) with Evolutionary Implications" Genes 10, no. 9: 677. https://doi.org/10.3390/genes10090677

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop