Abstract
Wild teas are valuable genetic resources for studying evolution and breeding. Here, we report the complete chloroplast genome of the ancient Korean tea ‘Hadong Cheon-nyeon Cha’ (C. sinensis var. sinensis), which is known as the oldest tea tree in Korea. This study determined seven Camellia sinensis var. sinenesis, including Hadong Cheon-nyeon Cha (HCNC) chloroplast genome sequences, using Illumina sequencing technology via de novo assembly. The chloroplast genome sizes ranged from 157,019 to 157,114 bp and were organized into quadripartite regions with the typical chloroplast genomes. Further, differences in SNPs and InDels were detected across the seven chloroplast genomes through variance analysis. Principal component and phylogenetic analysis suggested that regional constraints, rather than functional constraints, strongly affected the sequence evolution of the cp genomes in this study. These genomic resources provide evolutionary insight into Korean tea plant cultivars and lay the foundation for a better understanding of the ancient Korean tea plant HCNC.
1. Introduction
Tea (extract from the leaves of tea plant Camellia sinensis (L.) O. Kuntze) is one of the most popular beverages worldwide as it features attractive flavors and provides substantial economic value. The demand for tea production has recently increased due to its characteristic secondary metabolites, such as catechins, polyphenols, and caffeine, which have numerous human health benefits [1]. Based on morphological and physiological characteristics, the three main natural hybrids that make up the cultivated taxa of tea are C. sinensis (L.) O. Kuntze (also named China type), C. assamica (Masters) (also named Assam type), and C. assamica subsp. lasiocalyx (Planchon ex Watt.) (also called Cambod or Southern type) [2,3]. In Asia, tea plants have been grown for thousands of years. It was said to have originated in the Assam region of India and the Yunnan Province of China [4].
The origin of Korean tea, however, is not clear. Many Korean historical accounts state that tea seeds were first brought from China to Korea in the early 9th century, although widespread tea cultivation did not begin until the 12th century [5]. There is still an argument about whether there were wild tea plants native to Korea. Two types of green tea have been produced in Korea: the native variety, which comes naturally in the surrounding area of Mountain Jiri, and the cultivated type, whose breeding lines come from China and Japan [5]. Korean tea is mainly produced in Gyeongsang Province, Jeolla Province, and Jeju Island. Among these regions, the tea cultivation area near Mt. Jiri in Hadong in south Gyeongsang province and Boseong in south Jeolla province are known to be the most prominent areas. A study comparing and analyzing the different genetic resources of the wild green tea population was carried out because, among these, there is room for discussion about the genetic background of the wild tea population flourishing in the Hadong region. Consequently, the cultivated and wild populations of green tea in China and Japan originated from distinct sources compared to the cultivated variety [6]. The wild tea tree of the Hadong region named ‘Hadong Cheonnyeon Cha’ (‘Cheonnyeon’ means thousand years, while ‘Cha’ means tea in Korean), used in the previous study, is known to be the oldest tea tree in Korea, is estimated to be over 800 years old. Due to its high historical value, the ‘Hadong Cheonnyeon Cha (HCNC)’ population is managed and protected in Korea. Moreover, the genetic resources found in the native tea plants remain crucial for the breeding of green tea.
Functional genomics is now essential to understanding tea plant biology thanks to the development of sequencing technology [7,8]. The advances in next-generation sequencing technology, especially third-generation sequencing technology that produces reads longer than 10 kb, have recently led to decoding several cp genomes [9]. Because of this, even within tiny taxonomic groups, phylogenetic analyses based on cp genome data are becoming more accepted [10]. The chloroplast DNA, sometimes called the chloroplast genome, is commonly referred to as cp DNA [11]. According to Jansen et al. [12] and Palmer [13], it has the usual quadripartite structure, which is typically composed of four parts: a large single copy (LSC), a small single copy (SSC), and two inverted repeats (IRs) [12]. Plant cells or leaves typically contain 400–1600 copies of the chloroplast genome [14]. Numerous species’ taxonomic and phylogenetic analyses have profited from the effective application of the chloroplast genome sequence [15]. Additionally, there is a significant difference in the molecular evolution rate between the coding and non-coding regions [13]. However, cp DNA does not undergo genetic recombination, is highly conserved [16], and exhibits maternal inheritance [17]. Environmental variables and the chloroplast’s developmental program impact the regulation of gene expression, which occurs at multiple stages, namely transcription, post-transcription, translation, and post-translation [18]. Most of the regulation over chloroplast gene expression occurs at the post-transcriptional stage [18]. As a result, the plant chloroplast genome offers several benefits for illuminating species relationships and yielding a wealth of crucial information regarding chloroplast genetic change [19]. Because all the cp genomes have a low mutation rate and limited recombination, they are dependable sources of information for determining phylogeny and evolutionary history based on highly conserved gene content and structure. Several cp genomes have been used extensively recently, particularly in the Camellia genus, for phylogenetic reconstruction. Five SSR markers were found by Liu et al. [20] to be a core marker collection that is suggested for fingerprinting tea plant cultivars or accessions [20]. Consequently, whole-genome sequencing and resequencing are now valuable techniques for determining the differences between various tea kinds, mining functional genes, and researching the origins of tea plants [21]. Lee et al. [22] completed chloroplast genome sequence of the chloroplast genome sequence of the Korean C. sinensis cultivar Sangmok. According to their phylogenetic analysis, the C. sinensis L. cultivar Sangmok is closely related to KJ806277 Camellia pubicosta [22]. By combining the RNA-Seq data of 217 different tea accessions with high-quality chromosome-scale reference genome for an ancient tea tree (DASZ), Zhang et al. [23] were able to clarify the lineage of tea cultivars and identify the major players in the breeding of Chinese tea [23]. Furthermore, the C. sinensis (CSS) populations underwent a stronger selection for flavor and disease resistance during domestication than the C. sinensis (CSA) populations, according to the whole-genome resequencing of 139 tea accessions worldwide [21].
In this study, we complete cp genome sequences of seven C. sinensis var. cultivars to investigate the origin and genetic comparison between HCNC and other C. sinensis cultivars. Our genetic analyses suggested that HCNC might have evolved differently from the Chinese or the Japanese C. sinensis cultivars.
2. Methods and Materials
2.1. Plant Materials and DNA Sequencing
Fresh leaves of seven (four Korean, one Chinese, and two Japanese) specimens of C. sinensis cultivars were collected in the experimental field of the Institute of Hadong Green Tea in Korea. The leaves were immediately preserved in liquid nitrogen before DNA extraction. The total genomic DNA was extracted with the DNeasy Plant Mini Kit (Qiagen, Valencia, CA, USA), following the manufacturer’s instructions. The final DNA concentration was measured using a NanoDrop and Qubit spectrophotometer. Genome libraries (631 bp) were constructed using the Covaris S series (Covaris, MS, USA), following the manufacturer’s instructions. After purification, the extracted DNA was used to generate paired-end sequencing libraries according to the Illumina standard protocol (Illumina, San Diego, CA, USA). Genome sequencing was carried out on the Illumina Hiseq 2500 platform, following the manufacturer’s protocol (Illumina, San Diego, CA, USA). After sequencing and data treatment, 157,511,170,132–196,874,632,856 clean reads were retrieved for the seven C. sinensis chloroplast genomes.
2.2. Genome Assembly
FastQC and Sickle trimmed all of the raw reads. Next, we performed genome assembly using Trimmomatic0.38, BWA (v0.6.1), Picard 2.9.0, GATK (v4.1.3.0), and SnpEff v4.3t. The low-quality nucleotides and adapter sequences in each read were removed using Trimmomatic. The draft genome sequence of the tea plant (Camellia sinensis), downloaded from the database of the NCBI (https://www.ncbi.nlm.nih.gov/sra, accessed on 15 October 2023), was used as a reference genome. Using the default parameters, paired-end reads were mapped to the tea reference genome with BWA (v0.6.1) software. The Picard package was used to filter the duplicated reads. After removing duplicated reads with Picard 2.9.0, the variants in each sample were called using a GATK HaplotypeCaller. After alignment, SNP calling was conducted per individual using SAMtools. The CoverageBed program in BEDtools (v2.17.0) was used to calculate the coverage of sequence alignments. SAMtools software (v1.10.2) was used to convert mapping results into the BAM format and to filter the unmapped and nonunique reads.
2.3. Functional Annotation of Genetic Variants
SNP annotation was conducted based on the draft genome of C. sinensis using the SnpEff program. According to the annotation information, SNPs were distributed in the upstream regions, downstream regions, intergenic regions, exonic regions, 3′ UTRs, 5′ UTRs, splicing sites (which were distributed in 1 kb regions away from the transcription start site), and intergenic regions. Moreover, SNPs in exonic regions were further divided into synonymous SNPs (sSNPs) or non-synonymous SNPs (nsSNPs). The phylogenetic constructions were applied using two methods: neighbor joining and maximum likelihood based on a distance matrix calculated with MAFFT (v7.123b), MUSCLE (v3.8.31), Gblocks (0.91b), and FastTree (2.1.11). The workflow of the annotation procedure is described in Figure 1. PCA was also used to evaluate the genetic differentiation of the seven C. sinensis populations using the R package (RColorBrew, ggplot2, phylogram; v3.5.3). The aligned results were trimmed by trimAI v1.4. The phylogenetic analyses were implemented using maximum likelihood (ML) and neighbor-joining analysis methods based on the complete cp genome data. SNP data of the seven cp genomes sequenced in this study were used to perform PCA using GCTA v1.25.2, and the first two components were plotted.
Figure 1.
Workflow of annotation procedure. Overview of the data and workflow of the computational annotation and manual annotation.
2.4. Comparative Analysis and Gene Prediction
The chloroplast genome size and organization were compared, and the differences in the IR border of seven C. sinensis chloroplast genomes were analyzed. First, genome masking was conducted through RepeatMasker (version open 4.0.7) and RepeatModeler (version open 1.0.11). Gene prediction was performed with three categories—ab initio prediction, transcript alignments, and related protein alignments—as the tools of ab initio prediction used Augustus (Augustus 3.3) and GlimmerHMM (GlimmerHMM 3.0.4). Also, PASA (PASApipeline 2.2.0) was used for transcript alignments. Related protein alignments were performed with Exonerate (exonerate 2.2.0 x86_64) and GenomeThreader (gth 1.6.6). For consensus gene structure, they used EVidenceModeler (EvidenceModeler1.1.1). Finally, there is UTR annotation with PASA (PASApipeline2.2.0).
2.5. Principal Component Analysis (PCA) and Phylogenetic Analysis
Twenty-two complete chloroplast genome sequences were used in the phylogenetic analysis, including seven C. sinensis genome samples and fifteen completed genomes of C. sinensis from GenBank. All chloroplast genome sequences were aligned using the MAFFT algorithm on the MAFFT v7.123b and adjusted manually as needed gene maps of the seven C. sinenesis chloroplast genomes. The circular maps of seven C. sinensis chloroplast’s complete genomes were drawn using OrganellarGenomeDRAW v. 1.3.1 (OGDRAW) [24]. The phylogenetic reconstructions were applied using four methods: neighbor joining (NJ), maximum likelihood (ML), UPGMA, and minimum evolution (ME). Phylogenetic trees of seven C. sinensis were constructed using MEGA-X (Version 10.0.5) through the four methods described above.
3. Results
3.1. Chloroplast Genome Sequencing and Assembly
The chloroplast genomes of seven specimens of the C. sinensis cultivars were sequenced using the Illumina HiSeq 2500 system, producing clean data ranging from 157 to 196 Giga base pairs (Table 1). After achieving clean reads (98.5%), they were mapped to the complete genome, respectively. The genome of HCNC was produced using 162 Gbp clean reads, representing a total of 927 Gbp paired-end reads (Table 1). The mapped reads of seven C. sinensis were from 888,862,712 to 1,109,091,777 bp (Table 1). More than 24.3 billion bases from high-throughput sequencing (Q20 was 99.99%), along with the chloroplast genome of seven C. sinensis, were assembled according to “depth range” (≥510) and used to align with the reference GCF_004153795.1.
Table 1.
Sequencing data used for HCNC genome assembly.
3.2. Chloroplast Genome Features of HCNC
The total length of seven C. sinensis chloroplast genomes ranged from 157,012 bp to 157,104 bp (Table 1, Figure 2 and Figure 3). All these chloroplast genomes exhibited the typical quadripartite structure, consisting of a pair of IRs separated by the LSC and SSC regions (Figure 3). The entire chloroplast genome of HCNC has four standard structural regions, which include a large single copy (LSC), a small single copy (SSC), and two reverse repeat regions (IRa and IRb) (Figure 2). The total GC content of the HCNC cp genome was 38.3%. The chloroplast genomic characteristics of HCNC were compared with six other C. sinensis cultivars (Figure 3). The whole genome length of HCNC was not much different from that of the other C. sinensis cultivars. Among them, the whole chloroplast genome size of HCNC and Chinese wild type was the closest (Figure 2 and Figure 3).
Figure 2.
Chloroplast genome map of HCNC. Genes outside the main circle are transcribed clockwise, while genes on the inside are transcribed counterclockwise. The Organellar Genome Draw (OGDraw) online software (v 1.3.1) was used to draw this map. Different colors represent genes with other functions. The inner circle’s gray portion indicates the chloroplast genome’s GC content. *: Gene containing a single intron.
Figure 3.
Chloroplast genome maps of six C. sinensis cultivars. Genes outside the main circle are transcribed clockwise, while genes on the inside are transcribed counterclockwise. The Organellar Genome Draw (OGDraw) online software was used to draw this map. Different colors represent genes with other functions. The inner circle’s gray portion indicates the chloroplast genome’s GC content. (a) Beachwisull; (b) Keumsull; (c) Chinese wild type; (d) Saemidori; (e) Fushun; (f) Hadong wild type. *: Gene containing a single intron.
3.3. Gene Annotation
Each of the genomes encoded unique genes. These genes belong to several categories with different functions. They can be divided into self-replication (tRNA, rRNA, ribosome subunits, and DNA-dependent RNA polymerase), photosynthesis related (NADH oxidoreductase, photosystem subunits, rubisco, cytochrome, and ATP synthase), transcription and translation related (transcription, ribosomal proteins, translation initiation factor), biosynthesis (maturase, envelope membrane protein, acetyl-CoA subunits, translational initiation factor and protease), and conserved open reading frames. We identified 14 genes in SSCs, 29 in IRA, 22 in IRB, and 101 in LSCs in the HCNC cp genome (Table 2). Also, we detected 45 photosynthesis-related genes, 17 transcription and translation-related genes, 16 protein-coding genes, 14 tRNA genes, 4 rRNA genes, and 32 tRNA genes in the HCNC cp genome (Table 3).
Table 2.
List of HCNC cp genome genes organized according to their location.
Table 3.
List of genes encoded in the chloroplast genome of HCNC.
3.4. Chloroplast Genome Sequence Variations in HCNC
3.4.1. Variations in the Chloroplast Genome of HCNC
The seven C. sinensis cp genome sequences were aligned to understand the characteristics of variations through variant analysis. As a result, alternative genotypes of InDel and SNP were detected for each position (Table 4 and Table 5). InDel variations were detected to be the most numerous in the Chinese wild type, while HCNC has the most Homozygous type variations number. The SNP variation analyses revealed that the Chinese wild type was the most frequent, and HCNC has 74,542,948 SNP variations in the whole cp genome (Table 5). Table 6 and Figure 3 show a complete genome analysis, including the number and distribution of variations in each region. The region with the most distribution of variations, except for the intergenic region, was the upstream region. Variations in the exon region accounted for 1.43% of the total account (Table 6 and Figure 4).
Table 4.
List of InDels in cp genomes of seven C. sinensis var. sinensis.
Table 5.
List of SNPs in cp genomes of seven C. sinensis var. sinensis.
Table 6.
The distribution of the identified SNP in the different genomic regions (intergenic, upstream, downstream, intron, exon, and others) of the HCNC.
Figure 4.
The distribution of the identified SNP in the different genomic regions (intergenic, upstream, downstream, intron, exon, and others) of the HCNC.
3.4.2. Comparison of Chloroplast Genome Sequence Variations
The SNP analysis was performed to compare and analyze genetic similarity between seven C. sinensis cp genome sequences. There are two types of SNP in the exon region: synonymous SNPs and non-synonymous SNPs. Synonymous SNPs usually produce the same protein even if the one base changes, whereas non-synonymous SNPs produce missense and nonsense mutations. Non-synonymous SNP analyses were performed in two ways: constructing a phylogenetic tree using obtained non-synonymous SNP data from variant annotation analysis and principal component analysis (PCA). The phylogenetic tree was constructed by neighbor-joining methods (Figure 5). According to the findings, two distinct groupings can be made up of seven cultivars: three Korean cultivars clustered into one group, while China and Japanese cultivars clustered into another. The Hadong wild type was the first to be separated from the sister clade. Using SNP data, we performed PCA to evaluate the relationships between the seven C. sinensis cp genomes (Figure 6). Findings suggested that there was a substantial genetic diversity among the genomes. In the PCA plot, three Korean cultivars, Beachwisull, Keumsull, and Hadong wild type, were grouped according to their geographical origin. Interestingly, HCNC is only separated by far distance from other cultivars. The Japanese cultivar, Saemidori, is divided in the opposite direction with HCNC. The Fushun, which is the Japanese cultivar, and the Chinese wild type were slightly separated from the Korean group.
Figure 5.
Phylogenetic trees of seven C. sinenesis cultivars. Phylogenetic trees were constructed using complete cp genome data with the maximum likelihood method. The red color character indicates HCNC.
Figure 6.
Principal component analysis (PCA) plots of the seven C. sinensis individuals. The legend at the right indicates the cultivars of each circle.
3.5. Phylogenetic Analysis
Past research has shown that terrestrial plants’ chloroplast genome has been a valuable source among related species, which is applied in phylogenetic studies [25,26]. This paper aligned all chloroplast genomes of seven cultivars, including three Korean cultivars, one Chinese cultivar, three Japanese cultivars, and Korean HCNC. The phylogenetic tree was constructed using neighbor-joining, maximum likelihood, and UPGMA methods. Firstly, we identified the relationships of seven C. sinensis cultivars complete in this study (Figure 7a–c). The phylogenetic tree shows that HCNC was closely related to the Hadong wild type and the two Japanese cultivars in the neighbor-joining method. In maximum likelihood analysis, HCNC was more closely associated with Chinese wild type and two Korean cultivars than Japanese cultivars. The result of the UPGMA analysis was similar to the neighbor-joining method.
Figure 7.
Phylogenetic trees of seven C. sinenesis var. sinensis cultivars. (a) Phylogenetic trees constructed using complete cp genome data with neighbor-joining method; (b) phylogenetic trees constructed using complete cp genome data with maximum likelihood analysis method; (c) phylogenetic trees constructed using maximum likelihood with UPGMA method. The red color character indicates HCNC.
4. Discussion
It has been proposed that the cp genome can be helpful for low taxonomic-level phylogenetic reconstructions [25,27,28,29]. Using cp genome data, the evolutionary relationships between the several species within Camellia (Theaceae) were clearly resolved [30,31]. In the present study, we constructed three phylogenetic trees to identify relationships between HCNC and 22 C. sinensis, the complete cp genomic tree, and GenBank. According to the phylogenetic relationships, the 22 C. sinensis were mainly clustered into two clades with the neighbor-joining method (Figure 8a–c). HCNC grouped the same clade with C. assamica cultivars from China and India (Figure 8a). In the minimum evolution method, HCNC tends to be closely related to the Chinese C. sinensis cultivar, Anhua. However, it was difficult to close relationships due to their low bootstrap value (33%). Also, C. sinensis cultivars are more closely related than C. assamica cultivars with HCNC. The Saemidori, the Japanese cultivar, has a relationship with HCNC. This result is similar to the PCA result we indicated above in Section 3.4.2. Lastly, the UPGMA method is closely related to C. sinensis cultivars with HCNC. Likewise, HCNC tends to be closely associated with the Chinese C. sinensis cultivar, Anhua. However, it was difficult to close relationships due to their low bootstrap value (54%).
Figure 8.
Phylogenetic trees of fifteen C. sinenesis varieties. (a) Phylogenetic trees constructed using complete cp genome data with neighbor-joining method; (b) phylogenetic trees constructed using complete cp genome data with maximum likelihood analysis method; (c) phylogenetic trees constructed using maximum likelihood with UPGMA method. The red color character indicates HCNC.
Comparative analysis results indicate that seven cp genome sequences of C. sinensis showed highly conserved genomic structures [26,32,33]. Genes of ycf1 and infA were found to be pseudogenes in HCNC. The pseudogenizations of ycf1 and locations of ycf1 copies are commonly found in other plants [34,35,36]. Although it was formerly believed that the pseudogene had lost its capacity to code for proteins, it is now understood to be an evolutionary remnant of the functional gene [37]. Comparative study results showed that CDS and IR areas were more conserved within cp genomes than IGS and SCs. Comparative study results showed that, within cp genomes, CDS and IRs areas were more conserved than IGS and SCs, respectively [32,38]. Because of their remarkable conservation, the IR regions help stabilize the chloroplast genome’s structure. The IR region may have expanded or contracted depending on the species, according to a comparison of the IR/SC boundary areas [39]. CP genome length changes are frequently caused by the IR regions’ expansion and contraction. According to other studies, the conservatism caused the IR regions to exhibit a lower degree of sequence divergence than LSC and SSC areas in Camellia cp genomes. Numerous studies have examined species identification and molecular phylogeny at the interspecific level using the polymorphic cp DNA non-coding regions [6]. Three variable areas that can be utilized for phylogenetic analysis and species identification have been identified: petA-psbJ, psbI-trnS, and ccsA-ndhD [6]. Chloroplast genomes have become popular in taxonomy research for assessing the relationships between closely related species [37,40]. For instance, the cp genomes of 35 species from the Ranunculaceae family, which comprise 31 genera, were sequenced and used to shed light on the long-standing systematic disputes within this family [35].
From our results and previous reports [41], we propose the following hypothesis. Considering that HCNC is close to the Chinese species, it originated in China and has since evolved and differentiated independently in Korea. In addition, it can be assumed that HCNC was differentiated first, as it shows different types from Hadong wild type and cultivated species. Our phylogenetic analyses based on seven cp genomes successfully resolved intergeneric relationships within C. sinensis. As demonstrated by earlier research, even with the full chloroplast genome material available for analysis, all phylogenetic relationships could not be fully resolved. Furthermore, the comparison object we selected were no plant groups involved in the other genera of the family Theaceae, which might provide useful information for the evolutionary study of HCNC. Future research on evolution and differentiation will be required, and investigations such as molecular biological age estimation and the examination of morphological differences in leaves are thought to be essential.
In summary, our research can promote the exchange of information between the nuclear genomes of Camellia species and provide valuable genomic resources for phylogenetic studies.
5. Conclusions
Our study provides a valuable resource for understanding ancient tea plants’ chloroplast structure, variation information, and phylogenetic relationships in Korea. The significant SNPs associated with favorable variants, selection signals, and candidate genes are a valuable resource for the further improvement of leaf traits and plant types in ancient tea plants in Korea. The complete cp genome sequence will contribute to additional molecular identification, genetic diversity, and phylogeny studies.
Author Contributions
Conceptualization, D.S. and J.C.K.; methodology, D.S. and J.C.K.; software, D.S. and S.H.J.; validation, D.S. and D.-K.Y.; formal analysis, D.S. and S.H.J.; investigation, D.S. and D.-K.Y.; resources, D.S. and J.C.K.; data curation, D.S., S.H.J. and D.-K.Y.; writing—original draft preparation, D.-K.Y.; writing—review and editing, D.S. and D.-K.Y.; visualization, D.S. and D.-K.Y.; supervision, J.C.K. and D.-K.Y.; project administration, D.S., S.H.J. and J.C.K. All authors have read and agreed to the published version of the manuscript.
Funding
This research was funded by a research promotion program (no. 416-82-14326) at Sunchon National University in Republic of Korea.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
Data are contained within the article.
Conflicts of Interest
The authors declare no conflicts of interest.
References
- Zhang, X.; Chen, S.; Shi, L.; Gong, D.; Zhang, S.; Zhao, Q.; Zhan, D.; Vasseur, L.; Wang, Y.; Yu, J.; et al. Haplotype-resolved genome assembly provides insights into evolutionary history of the tea plant Camellia sinensis. Nat. Genet. 2021, 53, 1250–1259. [Google Scholar] [CrossRef]
- Li, L.; Hu, Y.; He, M.; Zhang, B.; Wu, W.; Cai, P.; Huo, D.; Hong, Y. Comparative chloroplast genomes: Insights into the evolution of the chloroplast genome of Camellia sinensis and the phylogeny of Camellia. BMC Genom. 2021, 22, 138. [Google Scholar] [CrossRef]
- Kumarihami, H.P.C.; Oh, E.U.; Nesumi, A.; Song, K.J. Comparative study on cross-compatibility between Camellia sinensis var. sinensis (China type) and C. sinensis var. assamica (Assam type) tea. Afr. J. Agric. Res. 2016, 11, 1092–1101. [Google Scholar] [CrossRef][Green Version]
- Meegahakumbura, M.K.; Wambulwa, M.C.; Li, M.M.; Thapa, K.K.; Sun, Y.S.; Moller, M.; Xu, J.C.; Yang, J.B.; Liu, J.; Liu, B.Y.; et al. Domestication Origin and Breeding History of the Tea Plant (Camellia sinensis) in China and India Based on Nuclear Microsatellites and cpDNA Sequence Data. Front. Plant Sci. 2017, 8, 2270. [Google Scholar] [CrossRef]
- Cho, K.H.; Lee, E.J.; Tsuge, T.; Jo, A.; Kim, J.C.; Cheong, G.W.; Yoon, H.S.; Kim, G.T. Comparative genomic analysis of Korean and Japanese green tea. Can. J. Plant Sci. 2010, 90, 293–298. [Google Scholar] [CrossRef]
- Choi, K.S.; Chung, M.G.; Park, S. The Complete Chloroplast Genome Sequences of Three Veroniceae Species (Plantaginaceae): Comparative Analysis and Highly Divergent Regions. Front. Plant Sci. 2016, 7, 355. [Google Scholar] [CrossRef]
- Xia, E.H.; Tong, W.; Wu, Q.; Wei, S.; Zhao, J.; Zhang, Z.Z.; Wei, C.L.; Wan, X.C. Tea plant genomics: Achievements, challenges and perspectives. Hortic. Res. 2020, 7, 7. [Google Scholar] [CrossRef] [PubMed]
- Daniell, H.; Jin, S.; Zhu, X.G.; Gitzendanner, M.A.; Soltis, D.E.; Soltis, P.S. Green giant-a tiny chloroplast genome with mighty power to produce high-value proteins: History and phylogeny. Plant Biotechnol. J. 2021, 19, 430–447. [Google Scholar] [CrossRef]
- Zhu, B.; Qian, F.; Hou, Y.; Yang, W.; Cai, M.; Wu, X. Complete chloroplast genome features and phylogenetic analysis of Eruca sativa (Brassicaceae). PLoS ONE 2021, 16, e0248556. [Google Scholar] [CrossRef]
- Cui, Y.; Chen, X.; Nie, L.; Sun, W.; Hu, H.; Lin, Y.; Li, H.; Zheng, X.; Song, J.; Yao, H. Comparison and Phylogenetic Analysis of Chloroplast Genomes of Three Medicinal and Edible Amomum Species. Int. J. Mol. Sci. 2019, 20, 4040. [Google Scholar] [CrossRef]
- Li, H.; Song, K.; Zhang, X.; Wang, D.; Dong, S.; Liu, Y.; Yang, L. Application of Multi-Perspectives in Tea Breeding and the Main Directions. Int. J. Mol. Sci. 2023, 24, 2643. [Google Scholar] [CrossRef]
- Jansen, R.K.; Raubeson, L.A.; Boore, J.L.; dePamphilis, C.W.; Chumley, T.W.; Haberle, R.C.; Wyman, S.K.; Alverson, A.J.; Peery, R.; Herman, S.J.; et al. Methods for obtaining and analyzing whole chloroplast genome sequences. Methods Enzym. 2005, 395, 348–384. [Google Scholar] [CrossRef]
- Olmstead, R.G.; Palmer, J.D. Chloroplast dna systematics: A review of methods and data analysis. Am. J. Bot. 1994, 81, 1205–1224. [Google Scholar] [CrossRef]
- Zhang, W.; Zhao, Y.; Yang, G.; Peng, J.; Chen, S.; Xu, Z. Determination of the evolutionary pressure on Camellia oleifera on Hainan Island using the complete chloroplast genome sequence. PeerJ 2019, 7, e7210. [Google Scholar] [CrossRef] [PubMed]
- Shen, X.; Guo, S.; Yin, Y.; Zhang, J.; Yin, X.; Liang, C.; Wang, Z.; Huang, B.; Liu, Y.; Xiao, S.; et al. Complete Chloroplast Genome Sequence and Phylogenetic Analysis of Aster tataricus. Molecules 2018, 23, 2426. [Google Scholar] [CrossRef] [PubMed]
- Wicke, S.; Schneeweiss, G.M.; dePamphilis, C.W.; Muller, K.F.; Quandt, D. The evolution of the plastid chromosome in land plants: Gene content, gene order, gene function. Plant Mol. Biol. 2011, 76, 273–297. [Google Scholar] [CrossRef] [PubMed]
- Corriveau, J.L.; Coleman, A.W. Rapid screening method to detect potential biparental inheritance of plastid dna and results for over 200 angiosperm species. Am. J. Bot. 1988, 75, 1443–1458. [Google Scholar] [CrossRef]
- Sugita, M.; Sugiura, M. Regulation of gene expression in chloroplasts of higher plants. Plant Mol. Biol. 1996, 32, 315–326. [Google Scholar] [CrossRef]
- Zoclanclounon, Y.A.B.; Thamilarasan, S.K.; Mo, Y.; Ahn, B.O.; Kim, J.G.; Lee, K. Insights into chloroplast genome structure and phylogenetic relationships within the Sesamum species complex (Pedaliaceae). Front. Genet. 2023, 14, 1207306. [Google Scholar] [CrossRef]
- Liu, S.; Liu, H.; Wu, A.; Hou, Y.; An, Y.; Wei, C. Construction of fingerprinting for tea plant (Camellia sinensis) accessions using new genomic SSR markers. Mol. Breed. 2017, 37, 1–14. [Google Scholar] [CrossRef]
- Wang, C.; Han, J.; Pu, Y.; Wang, X. Tea (Camellia sinensis): A Review of Nutritional Composition, Potential Applications, and Omics Research. Appl. Sci. 2022, 12, 5874. [Google Scholar] [CrossRef]
- Lee, D.J.; Kim, C.K.; Lee, T.H.; Lee, S.J.; Moon, D.G.; Kwon, Y.H.; Cho, J.Y. The complete chloroplast genome sequence of economical standard tea plant, Camellia sinensis L. cultivar Sangmok, in Korea. Mitochondrial DNA B Resour. 2020, 5, 2835–2836. [Google Scholar] [CrossRef]
- Zhang, W.; Zhang, Y.; Qiu, H.; Guo, Y.; Wan, H.; Zhang, X.; Scossa, F.; Alseekh, S.; Zhang, Q.; Wang, P.; et al. Genome assembly of wild tea tree DASZ reveals pedigree and selection history of tea varieties. Nat. Commun. 2020, 11, 3719. [Google Scholar] [CrossRef] [PubMed]
- Greiner, S.; Lehwark, P.; Bock, R. OrganellarGenomeDRAW (OGDRAW) version 1.3.1: Expanded toolkit for the graphical visualization of organellar genomes. Nucleic Acids Res. 2019, 47, W59–W64. [Google Scholar] [CrossRef] [PubMed]
- Ma, P.F.; Zhang, Y.X.; Zeng, C.X.; Guo, Z.H.; Li, D.Z. Chloroplast phylogenomic analyses resolve deep-level relationships of an intractable bamboo tribe Arundinarieae (poaceae). Syst. Biol. 2014, 63, 933–950. [Google Scholar] [CrossRef] [PubMed]
- Rawal, H.C.; Borchetia, S.; Bera, B.; Soundararajan, S.; Ilango, R.V.J.; Barooah, A.K.; Sharma, T.R.; Singh, N.K.; Mondal, T.K. Comparative analysis of chloroplast genomes indicated different origin for Indian tea (Camellia assamica cv TV1) as compared to Chinese tea. Sci. Rep. 2021, 11, 110. [Google Scholar] [CrossRef] [PubMed]
- Zhang, Y.; Du, L.; Liu, A.; Chen, J.; Wu, L.; Hu, W.; Zhang, W.; Kim, K.; Lee, S.C.; Yang, T.J.; et al. The Complete Chloroplast Genome Sequences of Five Epimedium Species: Lights into Phylogenetic and Taxonomic Analyses. Front. Plant Sci. 2016, 7, 306. [Google Scholar] [CrossRef]
- Zhang, Y.J.; Ma, P.F.; Li, D.Z. High-throughput sequencing of six bamboo chloroplast genomes: Phylogenetic implications for temperate woody bamboos (Poaceae: Bambusoideae). PLoS ONE 2011, 6, e20596. [Google Scholar] [CrossRef]
- Yang, J.B.; Tang, M.; Li, H.T.; Zhang, Z.R.; Li, D.Z. Complete chloroplast genome of the genus Cymbidium: Lights into the species identification, phylogenetic implications and population genetic analyses. BMC Evol. Biol. 2013, 13, 13. [Google Scholar] [CrossRef]
- Yang, M.; Zhang, X.; Liu, G.; Yin, Y.; Chen, K.; Yun, Q.; Zhao, D.; Al-Mssallem, I.S.; Yu, J. The complete chloroplast genome sequence of date palm (Phoenix dactylifera L.). PLoS ONE 2010, 5, e12762. [Google Scholar] [CrossRef]
- Huang, C.; Zhang, J.; Zhang, X.; Yu, Y.; Bian, W.; Zeng, Z.; Sun, X.; Li, X. Two New Polyphenol Oxidase Genes of Tea Plant (Camellia sinensis) Respond Differentially to the Regurgitant of Tea Geometrid, Ectropis obliqua. Int. J. Mol. Sci. 2018, 19, 2414. [Google Scholar] [CrossRef] [PubMed]
- Park, I.; Yang, S.; Kim, W.J.; Song, J.H.; Lee, H.S.; Lee, H.O.; Lee, J.H.; Ahn, S.N.; Moon, B.C. Sequencing and Comparative Analysis of the Chloroplast Genome of Angelica polymorpha and the Development of a Novel Indel Marker for Species Identification. Molecules 2019, 24, 1038. [Google Scholar] [CrossRef] [PubMed]
- Zan, T.; He, Y.T.; Zhang, M.; Yonezawa, T.; Ma, H.; Zhao, Q.M.; Kuo, W.Y.; Zhang, W.J.; Huang, C.H. Phylogenomic analyses of Camellia support reticulate evolution among major clades. Mol. Phylogenet Evol. 2023, 182, 107744. [Google Scholar] [CrossRef] [PubMed]
- Song, W.; Chen, Z.; Shi, W.; Han, W.; Feng, Q.; Shi, C.; Engel, M.S.; Wang, S. Comparative Analysis of Complete Chloroplast Genomes of Nine Species of Litsea (Lauraceae): Hypervariable Regions, Positive Selection, and Phylogenetic Relationships. Genes 2022, 13, 1550. [Google Scholar] [CrossRef] [PubMed]
- Zhai, W.; Duan, X.; Zhang, R.; Guo, C.; Li, L.; Xu, G.; Shan, H.; Kong, H.; Ren, Y. Chloroplast genomic data provide new and robust insights into the phylogeny and evolution of the Ranunculaceae. Mol. Phylogenet Evol. 2019, 135, 12–21. [Google Scholar] [CrossRef] [PubMed]
- Petersen, G.; Aagesen, L.; Seberg, O.; Larsen, I.H. When is enough, enough in phylogenetics? A case in point from Hordeum (Poaceae). Cladistics 2011, 27, 428–446. [Google Scholar] [CrossRef] [PubMed]
- Lu, Q.X.; Chang, X.; Gao, J.; Wu, X.; Wu, J.; Qi, Z.C.; Wang, R.H.; Yan, X.L.; Li, P. Evolutionary Comparison of the Complete Chloroplast Genomes in Convallaria Species and Phylogenetic Study of Asparagaceae. Genes 2022, 13, 1724. [Google Scholar] [CrossRef]
- Hazra, A.; Mahadani, P.; Das, S.; Bhattacharya, S.; Kumar, R.; Sengupta, C.; Das, S. Insight to the ancestral relations and varietal diversity of Indian tea [Camellia sinensis (L.) Kuntze] through plastid and nuclear phylogenetic markers. Genet. Resour. Crop Evol. 2020, 68, 773–783. [Google Scholar] [CrossRef]
- Chen, X.; Li, B.; Zhang, X. Comparison of chloroplast genomes and phylogenetic analysis of four species in Quercus section Cyclobalanopsis. Sci. Rep. 2023, 13, 18731. [Google Scholar] [CrossRef]
- Wu, Q.; Tong, W.; Zhao, H.; Ge, R.; Li, R.; Huang, J.; Li, F.; Wang, Y.; Mallano, A.I.; Deng, W.; et al. Comparative transcriptomic analysis unveils the deep phylogeny and secondary metabolite evolution of 116 Camellia plants. Plant J. 2022, 111, 406–421. [Google Scholar] [CrossRef]
- Ryu, Y.; Kim, I.R.; Su, M.H.; Jung, J.; Choi, H.-K.; Kim, C. Phylogeographical Study of Camellia japonica Inferred from AFLP and Chloroplast DNA Haplotype Analyses. J. Plant Biol. 2019, 62, 14–26. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).