Next Article in Journal
Polyphenol Profile and Pharmaceutical Potential of Quercus spp. Bark Extracts
Next Article in Special Issue
Characterization and Expression Analysis of the Ca2+/Cation Antiporter Gene Family in Tomatoes
Previous Article in Journal
Effect of Elevated CO2 Concentration on the Disease Severity of Compatible and Incompatible Interactions of Brassica napusLeptosphaeria maculans Pathosystem
Previous Article in Special Issue
Complete Chloroplast Genomes of Ampelopsis humulifolia and Ampelopsis japonica: Molecular Structure, Comparative Analysis, and Phylogenetic Analysis
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Comparison of the Complete Eragrostis pilosa Chloroplast Genome with Its Relatives in Eragrostideae (Chloridoideae; Poaceae)

1
College of Life Sciences, Shaanxi Normal University, Xi’an 710062, China
2
College of Life Sciences, Northwest A & F University, Yangling 712100, China
3
College of Life Sciences, Yan'an University, Yan'an 716000, China
*
Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
Plants 2019, 8(11), 485; https://doi.org/10.3390/plants8110485
Submission received: 22 September 2019 / Revised: 30 October 2019 / Accepted: 6 November 2019 / Published: 9 November 2019
(This article belongs to the Special Issue Plant Bioinformatics)

Abstract

:
Eragrostis of the tribe Eragrostideae is a taxonomically complex genus, because of its polyploid nature and the presence of similar morphological characters among its species. However, the relationship between these morphologically indistinguishable species at the genomic level has not yet been investigated. Here, we report the complete chloroplast genome of E. pilosa and compare its genome structures, gene contents, simple sequence repeats (SSRs), sequence divergence, codon usage bias, and Kimura 2-parameter (K2P) interspecific genetic distances with those of other Eragrostideae species. The E. pilosa chloroplast genome was 134,815 bp in length and contained 132 genes and four regions, including a large single-copy region (80,100 bp), a small single-copy region (12,661 bp), and a pair of inverted repeats (21,027 bp). The average nucleotide diversity between E. pilosa and E. tef was estimated to be 0.011, and 0.01689 among all species. The minimum and maximum K2P interspecific genetic distance values were identified in psaA (0.007) and matK (0.029), respectively. Of 45 SSRs, eight were shared with E. tef, all of which were in the LSC region. Phylogenetic analysis resolved the monophyly of the sampled Eragrostis species and confirmed the close relationship between E. pilosa and E. tef. This study provides useful chlorophyll genomic information for further species identification and phylogenetic reconstruction of Eragrostis species.

1. Introduction

Eragrostis is a large genus comprising more than 350 species in the tribe Eragrostideae (subfamily Chloridoideae; family Poaceae), which includes three subtribes (Cotteinae (4 genera), Eragrostidinae (11 genera), and Uniolinae (5 genera) [1,2]. Eragrostis species are distributed in weedy disturbed areas and dry habitats of the world’s tropical and subtropical regions [3]. It is commonly called love grass or cane grass, and has significant uses in different areas. Most species belonging to this genus are used as animal feed. However, E. tef is a staple food in some countries, such as Ethiopia. E. amabilis is grown as an ornamental plant, and E. cynosuroides has religious meaning and is used in Hindu temples. Growing E. bahiensis can remove highly toxic radioactive atoms from the environment, and E. curvula is widely planted to prevent soil erosion.
Previously, all Eragrostideae species were thought to use the C4 photosynthetic pathway. Leaf blade anatomies common to the C4 pathway, including NAD-ME, PCK-like, and intermediate forms, are observed in the genus Eragrostis. Similar to other Chloridoideae, chloridoid-, panicoid-, Pappophorum-type, and intermediate bicellular microhairs are observed in Eragrostis species [4,5]. However, Ellis (1984) [6] reported non-C4 leaf anatomy in E. walteri. Moreover, carbon isotope analyses confirmed that E. walteri is a C3 plant, while phylogenetic analyses positioned it outside Chloridoideae in Arundinoideae, a subfamily comprised entirely of C3 species [7,8].
Wolf (1776) [9] described the first Eragrostis species, E. minor. However, despite its identification long ago, which species actually belong in this genus remains debatable. Due to its large size and wide geographic distribution, there has been no comprehensive taxonomic analysis of Eragrostis. Ingram et al. (2007) [3] suggested that Eragrostis is monophyletic, with the inclusion of Acamptoclados, Diandrochloa, and Neeragrostis. In 2003, Ingram and Doyle described the close relationship between E. tef and E. pilosa using the nuclear gene waxy and the plastid locus rps16 [10].
E. pilosa is a common weed, native to Eurasia and Africa and distributed throughout tropical and subtropical areas of the world. The morphology of E. pilosa is very similar to E. tef, making them indistinguishable [11]. However, E. pilosa has been adapted to many climates, while E. tef has not. E. pilosa is an allotetraploid of hybrid origin that contains genes from other species. Ayele et al. (1999) [11] showed that E. pilosa and E. tef share many identical amplified fragment-length polymorphism markers. Therefore, they suggested that E. tef is a domesticate of E. pilosa, with few agronomic features altered during the domestication process.
The plastid, which is maternally inherited and functions in photosynthesis, carbon fixation, and stress response, provides additional information for species identification and evolutionary study [12]. With the development of advanced technologies, the amplification and sequencing of plastid loci has been possible for many years. The availability of complete chloroplast (cp) genome sequencing data has expanded rapidly with the onset of next-generation sequencing (NGS) technology. To date, eight complete cp genome sequences belonging to subtribe Eragrostideae (Eragrostis (five species), Enneapogon (two species), and Uniola (one species)) are available in the National Center for Biotechnology Information (NCBI) database. However, comparison of these cp genomes has not been previously reported. Here, we present the complete cp genome of E. pilosa, and compare its structural variations, gene arrangements, and simple sequence repeat (SSR) distributions with the complete cp genomes of E. tef and seven other species from tribe Eragrostideae. Additionally, we introduce several SSR primers for future screening of E. pilosa. Our results demonstrate the diversity of species in tribe Eragrostideae. These data are consistent with previous studies suggesting close relationships between E. pilosa and other Eragrostis species.

2. Results

2.1. Characteristic cp Genome Features of E. pilosa among Eragrostideae Species

The complete cp genome of E. pilosa was 134,815 bp long, and displayed a typical quadripartite structure containing a large single-copy (LSC) region (80,100 bp), a short single-copy (SSC) region (12,661 bp) and a pair of inverted repeat (IR) regions (21,027 bp). It consisted of 85 protein-coding genes, 39 tRNA genes, and eight rRNA genes, for a total of 132. Seven protein-coding genes (rps15, ycf68, rps7, ndhB, rpl23, rpl2, and rps19), eight tRNA genes (trnN-GUU, trnR-ACG, trnA-UGC, trnI-GAU, trnV-GAC, trnL-CAA, trnI-CAU, and trnH-GUG), and four rRNA genes were duplicated in the IR regions. In addition, trnM-CAU and trnfM-CAU were repeated in the LSC region. Ten protein-coding genes, including two duplicated genes (rpl2 and ndhB) and eight tRNA genes, including two duplicated genes (trnI-GAU and trnA-UGC) contained one intron, whereas the protein-coding gene ycf3 had two introns. Similar to most of the other angiosperms, the 5' end of rps12 was located in the LSC region and 3' end was duplicated in the IR regions (Figure 1).
The E. pilosa plastome sequence generated in this study was compared to all available complete plastomes from the tribe Eragrostideae. Six plastomes belonged to Eragrostis, two to Enneapogon, and one to Uniola. Of the Eragrostideae species, E. tenellula had the smallest cp genome (130,773 bp) while E. walteri had the largest (137,056 bp). The total number of genes varied among species, but E. pilosa and E. tef encoded an identical number of total genes, including the same protein-coding, tRNA, and rRNA genes. E. tenellula encoded the lowest total number of genes (124) due to loss of one copy of trnfM-CAU and the protein-coding genes rps19, ycf68, rps15, and ndhH. A copy of trnfM-CAU also lost in En. caerulescens, E. setifolia, and E. walteri. The additional trnT-GGU gene clustered with trnM-CAU in U. paniculata, En. oblongus, and E. minor. The protein-coding genes rps19 and ycf68 were also lost in E. setifolia and En. caerulescens, and loss of ycf68 was also observed in U. paniculata and E. walteri (Table 1 and Table S1).
The IR/SSC boundary regions of Eragrostideae species displayed some significant differences, even among species in the genus Eragrostis. E. pilosa, E. tef, and E. minor had similar IR/SSC boundary patterns, except for length variations in ndhF and ndhH. In E. pilosa and E. minor, ndhF was 2211 bp, but was 2217 bp in E. tef. This gene crossed the IRB/SSC boundary region by 23 bp in all three species. Furthermore, ndhH crossed the IRA/SSC region by 4 bp, and was 1188 bp long in E. pilosa and 1182 bp long in E. tef and E. minor. The trnH gene was located in the IRA region and duplicated in the IRB region, and was 75 bp in all species. The 282-bp rps19 gene was present in the IRA region and repeated in IRB in all species except E. setifolia, E. tenellula, and En. caerulescens (Figure 2).

2.2. Codon Usage

The GC contents of the entire cp genomes, the coding sequences (CDS), and each codon position were similar in all species. The GC content in the plastomes and CDS were lower (approximately 38.3% and 39%, respectively) suggesting an abundance of AT regions. The mean GC content at the first codon position (GC1, 48.0%) was higher than the mean values at the second and third positions (GC2 and GC3; 39.1% and 30.0% respectively) in all species (Table 2). The average number of codons ranged between 14,407 (in En. caerulescens) and 14,431 (in E. walteri). In all species, leucine and cysteine were the most and least abundant amino acids, respectively. Methionine and tryptophan were each represented by only one codon, while all other amino acids were encoded by more than one codon in all Eragrostideae species. In E. pilosa, leucine was encoded by six synonymous codons (UUA (522), UUG (281), CUU (328), CUC (95), CUA (225), and CUG (77)), with relative synonymous codon usage (RSCU) values of 2.05, 1.1, 1.29, 0.37, 0.88, and 0.3, respectively (Table S2).

2.3. Simple Sequence Repeats

SSRs in cp genomes are widely used in species identification, plant population genetics, and evolutionary studies [13,14]. In E. pilosa, 45 SSRs > 10 bp in length were detected, all of which were A/T and C/G mononucleotide repeats. The highest number of SSRs was detected in E. walteri (52), including 51 mononucleotide repeats and one dinucleotide repeat. The lowest number (41) was identified in E. minor. In all species, the SSRs were both mononucleotide or dinucleotide repeats, most were located in the LSC region, and A/T repeats were the most abundant type. All Eragrostis species except E. walteri contained SSRs incorporating C/G nucleotides. Compound SSRs also contained remarkably high A/T content, and in E. pilosa, only one compound SSR contained G nucleotides. E. tef and E. walteri had the highest number of compound SSRs containing A or T. The length of compound SSRs ranged from 22 to 200 bp in all species (Table 3).
Shared SSRs among Eragrostis species were calculated by aligning homologous regions in each pair of cp genomes. E. pilosa and E. minor shared the most SSRs (9), while E. tef and E. walteri shared the least (2). E. pilosa shared eight, four, and five SSRs with E. tef, E. setifolia, and E. tenellula, respectively, while E. tef and E. minor shared five (Figure 3 and Table S3). Ten E. pilosa SSRs were randomly selected, and primers were designed that can be used in future studies of the genetic diversity and population structure of these species (Table S4).

2.4. Sequence Divergence and Kimura 2-Parameter (K2P) Genetic Distance

Coding and non-coding regions that are highly variable between cp genomes are useful as potential DNA barcodes for species identification and phylogenetic reconstruction [15]. According to DNA Sequence Polymorphism (DnaSP), the estimated average nucleotide diversity (pi) value between E. pilosa and E. tef was 0.011. The highest variation was found in the LSC region (atpI-atpH) with a value of 0.125. Five regions in the LSC (rps16-trnQ, atpI-atpH, ycf3-trnS, trnL-UAA, and rbcL-psaI) showed high variability, with pi values > 0.06. The highest nucleotide diversity in the IR regions was in rrn23 (pi=0.117) while rrn16, which clusters with rrn23, had a pi value > 0.063. For the nine cp genomes, pi values ranged from 0-0.05872, with an average of 0.01689. The loci rps16-trnQ, trnG-trnM-trnT, trnL, rbcL-psaI, and ndhF-rpl32 were highly divergent among Eragrostideae species (pi > 0.04; Figure 4). The highest nucleotide diversity was observed in rps16-trnQ (0.05872). Furthermore, we performed a multiple sequence alignment between the nine complete cp genomes in mVISTA, using E. pilosa as a reference. (Figure 5). Similar to the estimated nucleotide diversity, the most divergent regions were located in the intergenic spacer regions of the LSC.
Using the sequences of 44 protein-coding genes, we calculated the interspecific K2P genetic distances among the nine species. Minimum and maximum K2P values were observed in psaA (0.007) and matK (0.029), respectively. The average K2P interspecific genetic distance was estimated at 0.014 (Figure 6). Pairwise comparisons of genetic divergence were estimated using the same protein-coding sequences. The lowest genetic divergence value was between E. pilosa and E. minor (0.003), while the highest divergence values (0.03) were between E. walteri and En. caerulescens/En. oblongus. Genetic divergence values between E. pilosa and E. minor, E. tef, E. walteri, E. setifolia, and E. tenellula were 0.003, 0.006, 0.028, 0.007, and 0.008, respectively, with the highest divergence observed between E. pilosa and E. walteri. When Enneapogon species were included, genetic divergence values with Eragrostis species were always high (ranging from 0.014 to 0.030). U. paniculata also showed relatively high genetic divergence with Eragrostis (ranging from 0.008 to 0.026) and Enneapogon (0.013) species. The genetic distance between En. caerulescens and En. oblongus was 0.002 (Table S5).

2.5. Phylogeny

The nine Eragrostideae species and the outgroup species Eleusine indica (NC_030486) from Cynodonteae were used for phylogenetic analysis. Similar topologies were observed in maximum likelihood (ML) and Bayesian inference (BI) generated trees based on complete cp genome sequences, with high bootstrap values for almost all relationships. E. pilosa was in the same clade as E. minor, and both were sisters of E. tef (Figure 7). BI trees based on highly divergent regions (rps16-trnQ + trnL+ rbcL-psaI) and 30 protein-coding genes of length > 600 bp were identical with high posterior probability values, and E. pilosa, E minor, and E. tef clustered together (Figure 8). All Eragrostis species except E. walteri had close relationships in all analysis. E. walteri was in the basal position followed by the Enneapogon species.

3. Discussion

Chloroplasts are essential plant cell organelles that convert solar energy to usable chemical energy [12]. They are involved in the synthesis of most leaf proteins, fatty acids, amino acids, and secondary metabolites. The chloroplast enzyme ribulose-1,5-bisphosphate carboxylase (RbcL) is the most abundant protein on earth, and it is widely used in evolutionary studies of plants [16]. During evolution, specific cp genes have been lost in various species, genera, and families [17]. Therefore, comparing the cp genomes of plant species is useful to understand their evolutionary processes over time. The development of advanced technologies for cp genome sequencing has allowed many research groups to produce complete cp DNA sequences. In addition, no single DNA barcode exists that can distinguish all angiosperms from each other. Therefore, many recent studies have been conducted to detect appropriate barcodes to identify species, genera, and families using complete cp sequences [18].
All cp genomes used in this study had a typical quadripartite structure containing two copies of the IR region separating the SSC and LSC regions, similar to most other angiosperms [19,20]. However, the size and gene content varied even with the Eragrostis genus. Sequence alignment indicated that specific genes were lost in some cp genomes compared to E. pilosa. E. tenellula, for example, displayed loss of trnfM, rps19, rps15, ndhH, and ycf68.
IR contraction and expansion causes length variation in angiosperm cp genomes [21]. Various gene migrations are common in most Poaceae species, such as the ndhF and ndhH genes, which are located at opposite ends of the SSC region and migrate toward the IR regions. The tribe Eragrostideae is classified in the PACMAD clade. Davis et al. (2010) [22] reported that in this clade, ndhF extended into the IR by < 30 nucleotides. Similar patterns of overlap were observed in this study, with the exception of E. tenellula, in which ndhF extended 53 nucleotides into the IRB region and ndhH was not present. The pattern in En. oblongus differed from other Eragrostideae species, with ndhF located entirely in the SSC.
The phenomenon of codon usage bias refers to the unequal use of synonymous codons in an organism [23]. It is associated with the gene expression level and accuracy of translation [24], gene length [25], gene translation initiation signals, protein amino acid composition, protein structure, tRNA abundance, mutation frequency and patterns, and GC composition [26,27]. Consistent with previously reported cp genomes [28], the GC content of the Eragrostideae plastid genomes was reduced, suggesting an abundance of AT-rich intergenic regions in cp genomes. The AT-rich bias was strongest in the third codon position, also consistent with previously reported plastomes.
As mentioned above, E. pilosa and E. tef are closely related species with similar morphology. We aligned their cp genome sequences to observe similarities and divergences at the genomic level. Between these two species, we observed variable regions that were mainly found in intergenic regions. We plotted the sequence identity using mVISTA using E. pilosa as a reference. The divergence was mostly located in the LSC region, with less divergence observed in the IR regions. However, divergence in this region was associated with the rrn cluster, containing rrn16 and rrn23. Regions of high sequence divergence identified in this study could be used in deep phylogenetic analysis and as potential DNA barcodes.
Taking the E. walteri to outside of the Chloridoideae, low genetic divergence was observed in Eragrostis compared to inter- generic divergence in Eragrostideae. Within the Eragrostis genus, comparatively low genetic divergence was observed between E. tef and E. pilosa, suggesting this inter-specific variation may account for E. pilosa's many climatic adaptations. Evidence of chloroplast genes contribute to plant adaptation to the different environment has been discussed previously in Najas flexilis [29] and the Hawaiian genus Schiedea [30].
The proper taxonomic classification of Eragrostis species within Chloridoideae has been debated for many years, due to their similar morphologies, polyploid natures, and broad geographic distributions. However, this study does not suggest the need for a new infrageneric classification, but rather adds additional molecular information on the cp genomes for existing infrageneric classifications. Hilu and Alice (2001) [31] demonstrated that Eragrostideae species do not appear to be monophyletic, according to their matK sequences. However, Ingram and Doyle (2007) [3] used the plastid locus rps16 and the nuclear gene waxy to demonstrate that Eragrostis species form a monophyletic group when a few segregate genera, including Acamptoclados, Diandrochloa, and Neeragrostis are included. In our data, Eragrostis species formed a monophyletic group, while Enneapogon and Uniola clustered separately. Meanwhile, Ingram et al. (2011) [7] demonstrated that E. walteri was misclassified in the Chloridoideae. Therefore, the placement of E. walteri in our phylogenetic tree is not surprising.
Alignment of the complete cp genomes demonstrated that E. pilosa and E. minor formed a clade and were sisters to E. tef. Additionally, it recovered a major Eragrostis clade and other genera. However, it is challenging to construct phylogenies for Eragrostis, because most known species are polyploid and the ploidy levels of many others remain unknown [3]. In BI trees based on highly divergent regions and 30 protein-coding genes, E. pilosa, E. tef, and E. minor clustered together. The similarities between E. pilosa and E. tef have been reported previously using morphological [32,33], cytological/biochemical [34,35] and molecular methods [10]. Based on cytological examinations, Tavassoli (1986) [35] suggested that E. minor and E. pilosa are close relatives of E. tef. Ingram and Doyle (2003) [10] showed that E. minor clustered with E. pilosa and E.tef with different progenitors. Therefore, extended sampling with several landraces of these close relatives will be necessary to place E. pilosa in its correct position. Our study highlights the utility of complete cp genome sequences, several protein-coding genes, and highly divergent regions for the phylogenetic analysis of Eragrostideae.

4. Materials and Methods

4.1. Plant Materials and DNA Sequencing

Fresh E. pilosa leaves were collected at Yan’an, Shaanxi, China (179° 36°39′20″ N 109° 24′26″ E). Total genomic DNA was extracted from silica-dried leaves using the modified cetyltrimethylammonium bromide (CTAB) method [36]. Purified DNA was used for paired-end library preparation with Illumina Paired-End DNA library Kit according to the manufacturer’s protocol (Illumina, CA, USA). Genomic DNA was sequenced on a HiSeq X Ten platform. Eight other species of Eragrostideae were used for comparative analysis (E. tef (NC_029413), E. minor (NC_029412), E. setifolia (NC_042832), E. tenellula (NC_042833), E. walteri (NC_035528), En. caerulescens (NC_042837), En. oblongus (NC_036682), U. paniculata (NC_036709)).

4.2. Genome Assembly and Annotation

We obtained high-quality reads from raw reads using the Trimmomatic tool [37]. The high-quality reads were then assembled using the GetOrganelle software (https://github.com/Kinggerm/GetOrganelle/blob/master/get_organelle_from_reads.py). Assembly was polished by mapping raw reads back to the retrieved sequence using the map reference tool in Geneious 10.0.5 (http://www.geneious.com). The generated consensus sequence was further adjusted by alignments with related species. Dual Organellar GenoMe Annotator online tool (https://dogma.ccbb.utexas.edu/) was used to annotate all protein-coding sequences and tRNA and mRNA genes [38]. Annotations of tRNA genes were confirmed with tRNAscan-SE [39]. Finally, the circular cp genome of E. pilosa was drawn with the online tool Organellar Genome DRAW (ORDRAW, https://chlorobox.mpimp-golm.mpg.de/OGDraw.html) [40]. The complete chloroplast genome of E. pilosa was deposited in GenBank with the accession number MN268502.

4.3. Codon Usage Bias, Sequence Divergence, and K2P Genetic Distance Analyses

Codon usage bias and GC1, GC2, and GC3 contents were analyzed using 44 common protein-coding sequences. Each selected CDS met the following criteria: (1) a length > 300 bp; (2) presence of an ATG start codon and TAA, TGA, or TAG stop codon [41]. RSCU values and GC1, GC2, and GC3 contents were calculated using MEGA 6.0. [42].
Alignments of whole cp genomes were visualized with mVISTA [43]. Sequence divergence values were determined using the sliding window method in DnaSP v5.10. [44] with a 200-bp step size and an 800-bp window length. We estimated the K2P genetic distance for each protein-coding gene and the pairwise distance between each species using the K2P model in MEGA 6.0.

4.4. SSR Analysis

The MISA tool [45] was used to detect SSRs, with the parameters set to ten repeated units for mononucleotide SSRs, six repeated units for dinucleotide SSRs, and five repeated units for tri-, tetra-, penta-, and hexanucleotide SSRs. Shared repeats among Eragrostis species were determined by aligning the species to detect identical repeats located in homologous regions. The Primer3 online tool (http://primer3.ut.ee/) was used to design E. pilosa SSR primers for future studies [46]. IR border regions were visualized using the IRscope online program (https://irscope.shinyapps.io/irapp/).

4.5. Phylogenetic Analysis

All nine Eragrostideae species and Eleusine indica from Cynodonteae were used for phylogenetic analysis. Phylogenetic trees were derived from the following three data sets: (1) the complete cp genome sequences; (2) 30 protein-coding genes of > 600 bp length, to avoid sampling bias; and (3) a concatenation of highly divergent regions (rps16-trnQ + trnL+ rbcL-psaI). The analyses were based on the ML and BI methods. Coding sequences extracted from each sequence were aligned using MAFFT [47] and concatenated using Geneious software. The ML tree was constructed in MEGA 6.0 based on the Tamura-Nei model using a heuristic search for initial trees. Bootstrap analysis was performed with 1,000 replicates. A discrete gamma distribution was used with four categories (+G, parameter = 0.3797). Initial ML tree(s) were obtained using the neighbor joining and BIONJ algorithms. BI analysis was conducted using MrBayes 3.2 [48] with a General Time Reversible substitution model. Two independent Markov chain Monte Carlo chains were run for 1.5 million generations, with a subsampling frequency of 200 generations.

5. Conclusions

In this study, we sequenced the complete cp genome sequence of E. pilosa, a grass belonging to tribe Eragrostideae. Specific genome features and SSRs were identified and compared with those of other Eragrostideae species. This study is the first complete cp genome comparison of Eragrostideae species, using all complete Eragrostideae cp genomes available in the NCBI database to date. Phylogenetic analysis demonstrates that cp genome data sets can be used to resolve Eragrostideae phylogeny through broad sampling. This study makes a significant contribution to the literature because it provides new information on the genetic relationships between Eragrostis species that could be used for further species identification and phylogenetic analysis.

Supplementary Materials

Author Contributions

Conceptualization, S.-Q.X., D.-L.G., and Y.S.; Methodology, L.Z., Y.S., and N.N.A.; Software and formal analysis, Y.S., and D.-L.G.; Writing—Original Draft Preparation, Y.S.; Writing—Review & Editing, S.-Q.X., D.-L.G.; Supervision, S.-Q.X., and W.-Q.W.; Funding Acquisition, S.-Q.X., and W.-Q.W.

Funding

This work was financially supported by the Excellent Doctor Innovation Project of Shaanxi Normal University (S2015YB03), the Fundamental Research Funds for the Central Universities (2016TS057), and the Fundamental Research Funds for the Central Universities (GK201604008; GK201702010; GK201701006); The key R & D plan of Shaanxi province (2018ZDXMSF-079), This work was supported in part by the National Natural Science Foundation of China (No.61673251, No. 61760101).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Watson:, I.; Dallwitz, M.J. The Grass Genera of the World; CAB International: Oxford, UK, 1992; p. 1038. [Google Scholar]
  2. Zappacosta, D.; Meier, M.; Carrera, A.; Pacheco, G.; Cardone, S.; Selva, J.P.; Echenique, V. Molecular markers to study the variability within the Eragrostis curvula complex. Phyton Int. J. Exp. Bot. 2011, 80, 211–220. [Google Scholar]
  3. Ingram, A.L.; Doyle, J.J. Eragrostis (Poaceae): Monophyly and Infrageneric Classification. Aliso A J. Syst. Evol. Bot. 2007, 23, 595–604. [Google Scholar] [CrossRef] [Green Version]
  4. Amarasinghe, V.; Watson, L. Taxonomic significance of microhair morphology in the genus eragrostis beauv. (poaceae). Taxon 1990, 39, 59–65. [Google Scholar] [CrossRef]
  5. Szczepanik, J.; Sowiński, P. The occurrence of chloroplast peripheral reticulum in grasses: A matter of phylogeny or a matter of function? Acta Physiol. Plant. 2014, 36, 1133–1142. [Google Scholar] [CrossRef]
  6. Ellis, R.P. Eragrostis walteri—A 1st record of non-Kranz anatomy in the subfamily Chloridoideae (Poaceae). S. Afr. J. Bot. 1984, 3, 380–386. [Google Scholar] [CrossRef]
  7. Ingram, A.L.; Christin, P.A.; Osborne, C.P. Molecular phylogenies disprove a hypothesized C4 reversion in Eragrostis walteri (Poaceae). Ann. Bot. 2011, 107, 321–325. [Google Scholar] [CrossRef]
  8. Teisher, J.K.; McKain, M.R.; Schaal, B.A.; Kellogg, E.A. Polyphyly of Arundinoideae (Poaceae) and evolution of the twisted geniculate lemma awn. Ann. Bot. 2017, 120, 725–738. [Google Scholar] [CrossRef]
  9. Wolf, N.M. Genera Plantarum Vocabulis Characteristicis Definite; Müllerus: Königsberg, Germany, 1776; p. 326. [Google Scholar] [CrossRef]
  10. Ingram, A.L.; Doyle, J.J. The origin and evolution of Eragrostis tef (Poaceae) and related polyploids: Evidence from nuclear waxy and plastid rps16. Am. J. Bot. 2003, 90, 116–122. [Google Scholar] [CrossRef]
  11. Ayele, M.; Tefera, H.; Assefa, K.; Nguyen, H.T. Genetic characterization of two Eragrostis species using AFLP and morphological traits. Hereditas 1999, 130, 33–40. [Google Scholar] [CrossRef]
  12. Sabater, B. Evolution and Function of the Chloroplast. Current Investigations and Perspectives. Int. J. Mol. Sci. 2018, 19, 3095. [Google Scholar] [CrossRef]
  13. Yang, A.-H.; Zhang, J.-J.; Yao, X.-H.; Huang, H.-W. Chloroplast microsatellite markers in Liriodendron tulipifera (Magnoliaceae) and cross-species amplification in L. chinense. Am. J. Bot. 2011, 98, e123–e126. [Google Scholar] [CrossRef] [PubMed]
  14. Xue, J.; Wang, S.; Zhou, S.-L. Polymorphic chloroplast microsatellite loci in Nelumbo (Nelumbonaceae). Am. J. Bot. 2012, 99, 240–244. [Google Scholar] [CrossRef] [PubMed]
  15. Ma, P.-F.; Zhang, Y.-X.; Zeng, C.-X.; Guo, Z.-H.; Li, D.-Z. Chloroplast Phylogenomic Analyses Resolve Deep-Level Relationships of an Intractable Bamboo Tribe Arundinarieae (Poaceae). Syst. Boil. 2014, 63, 933–950. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  16. Wildman, S.G. Aspects of Fraction I protein evolution. Arch. Biochem. Biophys. 1979, 196, 580–610. [Google Scholar] [CrossRef]
  17. Martin, M.; Sabater, B. Plastid ndh genes in plant evolution. Plant Physiol. Biochem. 2010, 48, 636–645. [Google Scholar] [CrossRef]
  18. Huang, Y.-Y.; Cho, S.-T.; Haryono, M.; Kuo, C.-H. Correction: Complete chloroplast genome sequence of common bermudagrass (Cynodon dactylon (L.) Pers.) and comparative analysis within the family Poaceae. PLoS ONE 2017, 12, e0184409. [Google Scholar] [CrossRef]
  19. Yang, J.; Yue, M.; Niu, C.; Ma, X.-F.; Li, Z.-H. Comparative Analysis of the Complete Chloroplast Genome of Four Endangered Herbals of Notopterygium. Genes 2017, 8, 124. [Google Scholar] [CrossRef]
  20. Zhang, Y.; Du, L.; Liu, A.; Chen, J.; Wu, L.; Hu, W.; Zhang, W.; Kim, K.; Lee, S.-C.; Yang, T.-J.; et al. The Complete Chloroplast Genome Sequences of Five Epimedium Species: Lights into Phylogenetic and Taxonomic Analyses. Front. Plant Sci. 2016, 7, 10. [Google Scholar] [CrossRef]
  21. Hong, S.-Y.; Cheon, K.-S.; Yoo, K.-O.; Lee, H.-O.; Cho, K.-S.; Suh, J.-T.; Kim, S.-J.; Nam, J.-H.; Sohn, H.-B.; Kim, Y.-H. Complete Chloroplast Genome Sequences and Comparative Analysis of Chenopodium quinoa and C. album. Front. Plant Sci. 2017, 8, 8. [Google Scholar] [CrossRef]
  22. Davis, J.I.; Soreng, R.J. Migration of endpoints of two genes relative to boundaries between regions of the plastid genome in the grass family (Poaceae). Am J Bot. 2010, 97, 874–892. [Google Scholar] [CrossRef] [Green Version]
  23. Gun, L.; Yumiao, R.; Haixian, P.; Liang, Z. Comprehensive Analysis and Comparison on the Codon Usage Pattern of Whole Mycobacterium tuberculosis Coding Genome from Different Area. BioMed Res. Int. 2018, 2018, 3574976. [Google Scholar] [CrossRef]
  24. Sharp, P.M.; Li, W.-H. The codon adaptation index-a measure of directional synonymous codon usage bias, and its potential applications. Nucleic Acids Res. 1987, 15, 1281–1295. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  25. Wan, X.; Xu, D.; Zhou, J. A new informatics method for measuring synonymous codon usage bias. In Intelligent Engineering Systems through Artificial Neural Networks; Dagli, C.H., Buczak, A.L., Enke, D.L., Embrechts, M., Ersoy, O., Eds.; ASME Press: New York, NY, USA, 2003; Volume 13, pp. 1101–1118. [Google Scholar]
  26. Tao, X.; Dafu, D. The relationship between synonymous codon usage and protein structure. FEBS Lett. 1998, 434, 93–96. [Google Scholar] [CrossRef] [Green Version]
  27. Wan, X.-F.; Xu, D.; Kleinhofs, A.; Zhou, J. Quantitative relationship between synonymous codon usage bias and GC composition across unicellular genomes. BMC Evol. Boil. 2004, 4, 19. [Google Scholar]
  28. Li, Y.; Xu, W.; Zou, W.; Jiang, D.; Liu, X. Complete chloroplast genome sequences of two endangered Phoebe (Lauraceae) species. Bot. Stud. 2017, 58, 37. [Google Scholar] [CrossRef] [PubMed]
  29. Peredo, E.L.; King, U.M.; Les, D.H. The Plastid Genome of Najas flexilis: Adaptation to Submersed Environments Is Accompanied by the Complete Loss of the NDH Complex in an Aquatic Angiosperm. PLoS ONE 2013, 8, e68591. [Google Scholar] [CrossRef] [PubMed]
  30. Kapralov, M.V.; Filatov, D.A. Molecular Adaptation during Adaptive Radiation in the Hawaiian Endemic Genus Schiedea. PLoS ONE 2006, 1, e8. [Google Scholar] [CrossRef] [PubMed]
  31. Hilu, K.W.; Alice, L.A. A phylogeny of Chloridoideae (Poaceae) based on matK sequences. Syst. Bot. 2001, 26, 386–405. [Google Scholar]
  32. Jones, B.M.G.; Ponti, J.; Tavassoli, A.; Dixon, P.A. Relationships of the Ethiopian Cereal T′ef (Eragrostis tef (Zucc.) Trotter): Evidence from Morphology and Chromosome Number. Ann. Bot. 1978, 42, 1369–1373. [Google Scholar] [CrossRef]
  33. Costanza, S.H.; Dewet, J.M.J.; Harlan, J.R. Literature review and numerical taxonomy ofEragrostis tef (T’ef). Econ. Bot. 1979, 33, 413–424. [Google Scholar] [CrossRef]
  34. Bekele, E.; Lester, R.N. Biochemical Assessment of the Relationships of Eragrostis tef (Zucc.) Trotter with some Wild Eragrostis Species (Gramineae). Ann. Bot. 1981, 48, 717–725. [Google Scholar] [CrossRef]
  35. Tavassoli, A. The Cytology of Eragrostis with Special Reference to E. Tef and Its Relatives. Ph.D. Thesis, London University, London, UK, 1986. [Google Scholar]
  36. Doyle, J.J.; Doyle, J.L. A rapid DNA isolation procedure for small quantities of fresh leaf tissue. Phytochem. Bull. 1987, 19, 11–15. [Google Scholar]
  37. Bolger, A.M.; Lohse, M.; Usadel, B. Trimmomatic: A flexible trimmer for Illumina sequence data. Bioinformatics 2014, 30, 2114–2120. [Google Scholar] [CrossRef] [PubMed]
  38. Wyman, S.K.; Jansen, R.K.; Boore, J.L. Automatic annotation of organellar genomes with DOGMA. Bioinformatics 2004, 20, 3252–3255. [Google Scholar] [CrossRef] [Green Version]
  39. Lowe, T.M.; Chan, P.P. tRNAscan-SE On-line: Integrating search and context for analysis of transfer RNA genes. Nucleic Acids Res. 2016, 44, W54–W57. [Google Scholar] [CrossRef]
  40. Lohse, M.; Drechsel, O.; Bock, R. OrganellarGenomeDRAW (OGDRAW): A tool for the easy generation of high-quality custom graphical maps of plastid and mitochondrial genomes. Curr. Genet. 2007, 52, 267–274. [Google Scholar] [CrossRef]
  41. Yang, Y.; Zhu, J.; Feng, L.; Zhou, T.; Bai, G.; Yang, J.; Zhao, G. Plastid Genome Comparative and Phylogenetic Analyses of the Key Genera in Fagaceae: Highlighting the Effect of Codon Composition Bias in Phylogenetic Inference. Front. Plant Sci. 2018, 9, 82. [Google Scholar] [CrossRef]
  42. Tamura, K.; Stecher, G.; Peterson, D.; Filipski, A.; Kumar, S. MEGA6: Molecular Evolutionary Genetics Analysis Version 6.0. Mol. Boil. Evol. 2013, 30, 2725–2729. [Google Scholar] [CrossRef] [Green Version]
  43. Frazer, K.A.; Pachter, L.; Poliakov, A.; Rubin, E.M.; Dubchak, I. VISTA: Computational tools for comparative genomics. Nucleic Acids Res. 2004, 32, W273–W279. [Google Scholar] [CrossRef]
  44. Librado, P.; Rozas, J. DnaSP v5: A software for comprehensive analysis of DNA polymorphism data. Bioinformatics 2009, 25, 1451–1452. [Google Scholar] [CrossRef]
  45. Beier, S.; Thiel, T.; Münch, T.; Scholz, U.; Mascher, M. MISA-web: A web server for microsatellite prediction. Bioinformatics 2017, 33, 2583–2585. [Google Scholar] [CrossRef] [PubMed]
  46. Untergasser, A.; Cutcutache, I.; Koressaar, T.; Ye, J.; Faircloth, B.C.; Remm, M.; Rozen, S.G. Primer3-new capabilities and interfaces. Nucleic Acids Res. 2012, 40, e115. [Google Scholar] [CrossRef] [PubMed]
  47. Katoh, K. MAFFT: A novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res. 2002, 30, 3059–3066. [Google Scholar] [CrossRef] [PubMed]
  48. Ronquist, F.; Teslenko, M.; Van Der Mark, P.; Ayres, D.L.; Darling, A.; Höhna, S.; Larget, B.; Liu, L.; Suchard, M.A.; Huelsenbeck, J.P. MrBayes 3.2: Efficient Bayesian phylogenetic inference and model choice across a large model space. Syst. Boil. 2012, 61, 539–542. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Gene map of E. pilosa chloroplast genomes. Genes on the inside of the map are transcribed in the clockwise direction and genes on the outside of the map are transcribed in the counterclockwise direction. The darker gray in the inner circle represents to GC content whereas the light gray corresponds to AT content. Different functional group of genes are shown in different colors.
Figure 1. Gene map of E. pilosa chloroplast genomes. Genes on the inside of the map are transcribed in the clockwise direction and genes on the outside of the map are transcribed in the counterclockwise direction. The darker gray in the inner circle represents to GC content whereas the light gray corresponds to AT content. Different functional group of genes are shown in different colors.
Plants 08 00485 g001
Figure 2. Comparison of LSC, SSC and IR borders among nine chloroplast genome of Eragrostideae.
Figure 2. Comparison of LSC, SSC and IR borders among nine chloroplast genome of Eragrostideae.
Plants 08 00485 g002
Figure 3. Shared SSRs among Eragrostis species.
Figure 3. Shared SSRs among Eragrostis species.
Plants 08 00485 g003
Figure 4. Sliding window analysis of the whole chloroplast genomes. (top) Pi among E. pilosa and E. tef. (bottom) Pi among E. pilosa and other eight species. Window length: 800 bp; step size: 200 bp. X-axis: position of the midpoint of a window. Y-axis: nucleotide diversity of each window.
Figure 4. Sliding window analysis of the whole chloroplast genomes. (top) Pi among E. pilosa and E. tef. (bottom) Pi among E. pilosa and other eight species. Window length: 800 bp; step size: 200 bp. X-axis: position of the midpoint of a window. Y-axis: nucleotide diversity of each window.
Plants 08 00485 g004
Figure 5. Sequence alignment plot comparing four cp genomes with E. pilosa as a reference. Genome regions are color coded as protein coding, rRNA coding, tRNA coding or conserved noncoding sequences. The vertical scale indicates the percentage identity, ranging from 50% to 100%.
Figure 5. Sequence alignment plot comparing four cp genomes with E. pilosa as a reference. Genome regions are color coded as protein coding, rRNA coding, tRNA coding or conserved noncoding sequences. The vertical scale indicates the percentage identity, ranging from 50% to 100%.
Plants 08 00485 g005
Figure 6. The K2P values for individual protein coding genes.
Figure 6. The K2P values for individual protein coding genes.
Plants 08 00485 g006
Figure 7. Phylogenetic trees based on ML and BI analysis on whole cp genome sequences Bootstrap values and posterior values are shown in each node.
Figure 7. Phylogenetic trees based on ML and BI analysis on whole cp genome sequences Bootstrap values and posterior values are shown in each node.
Plants 08 00485 g007
Figure 8. Phylogenetic trees based on divergence region and protein coding genes. (A) BI tree based on rps16-trnQ + trnL+ rbcL-psaI variable region; (B) BI analysis based on 30 protein coding genes. Posterior values are shown in each node.
Figure 8. Phylogenetic trees based on divergence region and protein coding genes. (A) BI tree based on rps16-trnQ + trnL+ rbcL-psaI variable region; (B) BI analysis based on 30 protein coding genes. Posterior values are shown in each node.
Plants 08 00485 g008
Table 1. Chloroplast genome features of E. pilosa and other Eragrostideae species.
Table 1. Chloroplast genome features of E. pilosa and other Eragrostideae species.
SpeciesAcc. NoSizeLSCSSCIRNumber of GenesNumber Of Protein Coding GenesNumber of tRNA GenesNumber of rRNA Genes
Eragrostis pilosaMN268502134,81580,10012,66121,02713285398
Eragrostis tefNC_029413134,43579,80212,58121,02613285398
Eragrostis minorNC_029412135,02380,31612,57721,06513385408
Eragrostis setifoliaNC_042832134,92880,41612,49421,00912781388
Eragrostis tenellulaNC_042833130,77379,38712,59819,39412478388
Eragrostis walteriNC_035528137,05681,76312,69721,29812983388
Enneapogon caerulescensNC_042837133,23178,88312,41020,96912781388
Enneapogon oblongusNC_036682133,43378,83912,54621,02413385408
Uniola paniculataNC_036709135,32280,64312,59521,04213183408
Table 2. GC content of the Eragrostideae cp genome sequences. GC content (g) whole cp genome (c) 44 analyzed CDS (1) (2) (3) first, second, and third codon positions, respectively.
Table 2. GC content of the Eragrostideae cp genome sequences. GC content (g) whole cp genome (c) 44 analyzed CDS (1) (2) (3) first, second, and third codon positions, respectively.
SpeciesAnalyzed CDSGC (g)GC(c)GC1GC2GC3
E. pilosa4438.33948.039.029.9
E. tef4438.33948.039.129.9
E. minor4438.23948.039.130.0
E. setifolia4438.339.148.139.130.3
E. tenellula4438.43947.939.130.0
E. walteri4438.339.148.139.030.3
En. caerulescens4438.33948.039.130.0
En.oblongus4438.33948.039.129.8
U. paniculata4438.339.148.039.130.0
Table 3. SSR distribution among Eragrostideae cp genomes. CSSR; compound SSR.
Table 3. SSR distribution among Eragrostideae cp genomes. CSSR; compound SSR.
SpeciesTotal SSRCSSRA/TC/GAT/TALSCSSCIRaIRb
E. pilosa456423-40122
E. tef507472147111
E. setifolia403382-33322
E. tenellula492453142322
E. walteri52751-147311
E. minor415382136122
En. caerulescens44241-339122
En. oblongus45242-340122
U. paniculata50849-150---

Share and Cite

MDPI and ACS Style

Somaratne, Y.; Guan, D.-L.; Abbood, N.N.; Zhao, L.; Wang, W.-Q.; Xu, S.-Q. Comparison of the Complete Eragrostis pilosa Chloroplast Genome with Its Relatives in Eragrostideae (Chloridoideae; Poaceae). Plants 2019, 8, 485. https://doi.org/10.3390/plants8110485

AMA Style

Somaratne Y, Guan D-L, Abbood NN, Zhao L, Wang W-Q, Xu S-Q. Comparison of the Complete Eragrostis pilosa Chloroplast Genome with Its Relatives in Eragrostideae (Chloridoideae; Poaceae). Plants. 2019; 8(11):485. https://doi.org/10.3390/plants8110485

Chicago/Turabian Style

Somaratne, Yamuna, De-Long Guan, Nibras Najm Abbood, Liang Zhao, Wen-Qiang Wang, and Sheng-Quan Xu. 2019. "Comparison of the Complete Eragrostis pilosa Chloroplast Genome with Its Relatives in Eragrostideae (Chloridoideae; Poaceae)" Plants 8, no. 11: 485. https://doi.org/10.3390/plants8110485

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop