Chloroplast Genome-Based Insights into Variety Identification in Toona sinensis
Abstract
1. Introduction
2. Materials and Methods
2.1. Plant Materials and DNA Sequencing
2.2. Chloroplast Genome Assembly and Annotation
2.3. Repeat Sequence Analysis
2.4. Chloroplast Genome Comparison and Sequence Divergence Analysis
2.5. Phylogenetic Tree Construction
2.6. Development and Validation of Molecular Markers for Species Discrimination
2.7. Read-Level Heteroplasmy Detection
2.8. Statistical Analysis
3. Results
3.1. General Features of the T. sinensis Chloroplast Genomes
3.2. Identification and Features of Chloroplast Repeats
3.3. Structural Dynamics and Hypervariable Sequence Landscapes
3.4. Phylogenetic Analysis
3.5. Molecular Marker Development and Sequence-Level Validation
3.6. Read-Level Heteroplasmy at Candidate Marker Regions
4. Discussion
4.1. Structural Conservation and Intraspecific Variation in the T. sinensis Chloroplast Genomes
4.2. The Utility of Repetitive Elements in T. sinensis Variety Identification
4.3. The Role of Hypervariable Regions and Conventional Markers in T. sinensis Variety Identification
4.4. The Utility of Whole Chloroplast Genomes in T. sinensis Variety Discrimination
4.5. Limitations, Future Perspectives, and Applications
5. Conclusions
Supplementary Materials
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Abbreviations
| SSR | Simple sequence repeats |
| RSCU | Relative synonymous codon usage |
| PCGs | Protein-coding genes |
| IR | inverted repeat |
| SSC | Small single-copy |
| LSC | Large single-copy |
| ML | Maximum likelihood |
| Pi | Nucleotide diversity |
| SVs | Structural variations |
References
- Zhao, Q.; Zhong, X.L.; Zhu, S.H.; Wang, K.; Tan, G.F.; Meng, P.H.; Zhang, J. Research advances in Toona sinensis, a traditional Chinese medicinal plant and popular vegetable in China. Diversity 2022, 14, 572. [Google Scholar] [CrossRef]
- Dobrogojski, J.; Adamiec, M.; Luciński, R. The chloroplast genome: A review. Acta Physiol. Plant. 2020, 42, 98. [Google Scholar] [CrossRef]
- Li, X.; Yang, Y.; Henry, R.J.; Rossetto, M.; Wang, Y.; Chen, S. Plant DNA barcoding: From gene to genome. Biol. Rev. 2015, 90, 157–166. [Google Scholar] [CrossRef]
- Nock, C.J.; Waters, D.L.; Edwards, M.A.; Bowen, S.G.; Rice, N.; Cordeiro, G.M.; Henry, R.J. Chloroplast genome sequences from total DNA for plant identification. Plant Biotechnol. J. 2010, 9, 328–333. [Google Scholar] [CrossRef] [PubMed]
- Wang, M.; Lin, H.; Lin, H.; Du, P.; Zhang, S. From species to varieties: How modern sequencing technologies are shaping medicinal plant identification. Genes 2024, 16, 16. [Google Scholar] [CrossRef] [PubMed]
- Xiao, Y.; Wang, X.; He, Z.H.; Lv, Y.W.; Zhang, C.H.; Hu, X.S. Assessing the phylogenetic relationship among varieties of Toona ciliata (Meliaceae) in sympatry with chloroplast genomes. Ecol. Evol. 2023, 13, e10828. [Google Scholar] [CrossRef]
- Yuan, L.; Ni, Y.; Chen, H.; Li, J.; Lu, Q.; Wang, L.; Zhang, X.; Yue, J.; Yang, H.; Liu, C. Comparative chloroplast genomes study of five officinal Ardisia Species: Unraveling interspecific diversity and evolutionary insights in Ardisia. Gene 2024, 912, 148349. [Google Scholar] [CrossRef]
- Zhang, L.; Song, Y.; Li, J.; Liu, J.; Zhang, Z.; Xu, Y.; Fan, D.; Liu, M.; Ren, Y.; He, J.; et al. Identification, comparative and phylogenetic analysis of eight Vitis species based on the chloroplast genome revealed their contribution to heat tolerance in grapevines. Sci. Hortic. 2024, 327, 112833. [Google Scholar] [CrossRef]
- Liu, B.; Zhang, J.; Shi, Y. Complete chloroplast genome of Toona sinensis (Meliaceae), a goluptious ‘tree vegetables’. Mitochondrial DNA B Resour. 2019, 4, 3025–3026. [Google Scholar] [CrossRef]
- Li, Y.; Gu, M.; Lin, J.; Jiang, H.; Xiao, X.; Zhou, W. Comparative analysis of the complete chloroplast genomes in Toona sinensis and Toona ciliata: Phylogenetic relationship of Toona. Res. Sq. 2022, preprint. [Google Scholar] [CrossRef]
- Duan, Y.; Wang, Y.; Ding, W.; Wang, C.; Meng, L.; Meng, J.; Chen, N.; Liu, Y.; Xing, S. Comparative and phylogenetic analysis of the chloroplast genomes of four commonly used medicinal cultivars of Chrysanthemums morifolium. BMC Plant Biol. 2024, 24, 992. [Google Scholar] [CrossRef]
- Guo, Q.; Xue, X.; Wang, D.; Zhang, L.; Liu, W.; Wang, E.; Cui, X.; Hou, X. Genetic diversity and population genetic structure of Paeonia suffruticosa by chloroplast DNA simple sequence repeats (cpSSRs). Hortic Plant J. 2024, 11, 367–376. [Google Scholar] [CrossRef]
- Ping, J.; Feng, P.; Li, J.; Zhang, R.; Su, Y.; Wang, T. Molecular evolution and SSRs analysis based on the chloroplast genome of Callitropsis funebris. Ecol. Evol. 2021, 11, 4786–4802. [Google Scholar] [CrossRef] [PubMed]
- Li, D.M.; Pan, Y.G.; Wu, X.Y.; Zou, S.P.; Wang, L.; Zhu, G.F. Comparative chloroplast genomics, phylogenetic relationships and molecular markers development of Aglaonema commutatum and seven green cultivars of Aglaonema. Sci. Rep. 2024, 14, 11820. [Google Scholar] [CrossRef]
- Li, W.; Liu, Y.; Yang, Y.; Xie, X.; Lu, Y.; Yang, Z.; Jin, X.; Dong, W.; Suo, Z. Interspecific chloroplast genome sequence diversity and genomic resources in Diospyros. BMC Plant Biol. 2018, 18, 210. [Google Scholar] [CrossRef] [PubMed]
- Li, H.; Xiao, W.; Tong, T.; Li, Y.; Zhang, M.; Lin, X.; Zou, X.; Wu, Q.; Guo, X. The specific DNA barcodes based on chloroplast genes for species identification of Orchidaceae plants. Sci. Rep. 2021, 11, 1424. [Google Scholar] [CrossRef] [PubMed]
- Yu, J.; Fu, J.; Fang, Y.; Xiang, J.; Dong, H. Complete chloroplast genomes of Rubus species (Rosaceae) and comparative analysis within the genus. BMC Genom. 2022, 23, 32. [Google Scholar] [CrossRef]
- Zhang, Y.M.; Han, L.J.; Yang, C.W.; Yin, Z.L.; Tian, X.; Qian, Z.G.; Li, G.D. Comparative chloroplast genome analysis of medicinally important Veratrum (Melanthiaceae) in China: Insights into genomic characterization and phylogenetic relationships. Plant Divers. 2022, 44, 70–82. [Google Scholar] [CrossRef]
- Doyle, J.J.; Doyle, J.L. A rapid DNA isolation procedure for small quantities of fresh leaf tissue. Phytochem. Bull. 1987, 19, 11–15. [Google Scholar]
- Chen, S. Ultrafast one-pass FASTQ data preprocessing, quality control, and deduplication using fastp. iMeta 2023, 2, e107. [Google Scholar] [CrossRef]
- Bioinformatics, B. FastQC: A Quality Control Tool for High Throughput Sequence Data; Babraham Institute Cambridge: Cambridge, UK, 2011; Available online: http://www.bioinformatics.babraham.ac.uk/projects/fastqc (accessed on 12 January 2025).
- Dierckxsens, N.; Mardulyn, P.; Smits, G. NOVOPlasty: De novo assembly of organelle genomes from whole genome data. Nucleic Acids Res. 2017, 45, e18. [Google Scholar]
- Huang, D.I.; Cronk, Q.C. Plann: A command-line application for annotating plastome sequences. Appl. Plant Sci. 2015, 3, 1500026. [Google Scholar] [CrossRef] [PubMed]
- Greiner, S.; Lehwark, P.; Bock, R. OrganellarGenomeDRAW (OGDRAW) version 1.3. 1: Expanded toolkit for the graphical visualization of organellar genomes. Nucleic Acids Res. 2019, 47, 59–64. [Google Scholar] [CrossRef]
- Shields, D.C.; Sharp, P.M. Synonymous codon usage in Bacillus subtilis reflects both translational selection and mutational biases. Nucleic Acids Res. 1987, 15, 8023–8040. [Google Scholar] [CrossRef] [PubMed]
- Lenz, H.; Hein, A.; Knoop, V. Plant organelle RNA editing and its specificity factors: Enhancements of analyses and new database features in PREPACT 3.0. BMC Bioinform. 2018, 19, 255. [Google Scholar] [CrossRef]
- Kalia, R.K.; Rai, M.K.; Kalia, S.; Singh, R.; Dhawan, A.K. Microsatellite markers: An overview of the recent progress in plants. Euphytica 2011, 177, 309–334. [Google Scholar] [CrossRef]
- Beier, S.; Thiel, T.; Münch, T.; Scholz, U.; Mascher, M. MISA-web: A web server for microsatellite prediction. Bioinformatics 2017, 33, 2583–2585. [Google Scholar] [CrossRef] [PubMed]
- Kurtz, S.; Choudhuri, J.V.; Ohlebusch, E.; Schleiermacher, C.; Stoye, J.; Giegerich, R. REPuter: The manifold applications of repeat analysis on a genomic scale. Nucleic Acids Res. 2001, 29, 4633–4642. [Google Scholar] [CrossRef]
- Benson, G. Tandem repeats finder: A program to analyze DNA sequences. Nucleic Acids Res. 1999, 27, 573–580. [Google Scholar] [CrossRef]
- Zhu, A.; Guo, W.; Gupta, S.; Fan, W.; Mower, J.P. Evolutionary dynamics of the plastid inverted repeat: The effects of expansion, contraction, and loss on substitution rates. New Phytol. 2016, 209, 1747–1756. [Google Scholar] [CrossRef]
- Li, H.; Guo, Q.; Xu, L.; Gao, H.; Liu, L.; Zhou, X. CPJSdraw: Analysis and visualization of junction sites of chloroplast genomes. PeerJ 2023, 11, e15326. [Google Scholar] [CrossRef]
- Kearse, M.; Moir, R.; Wilson, A.; Stones-Havas, S.; Cheung, M.; Sturrock, S.; Buxton, S.; Cooper, A.; Markowitz, S.; Duran, C.; et al. Geneious Basic: An integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics 2012, 28, 1647–1649. [Google Scholar] [CrossRef]
- Grant, J.R.; Enns, E.; Marinier, E.; Mandal, A.; Herman, E.K.; Chen, C.Y.; Graham, M.; Van Domselaar, G.; Stothard, P. Proksee: In-depth characterization and visualization of bacterial genomes. Nucleic Acids Res. 2023, 51, 484–492. [Google Scholar] [CrossRef] [PubMed]
- Frazer, K.A.; Pachter, L.; Poliakov, A.; Rubin, E.M.; Dubchak, I. VISTA: Computational tools for comparative genomics. Nucleic Acids Res. 2004, 32, 273–279. [Google Scholar] [CrossRef]
- Librado, P.; Rozas, J. DnaSP v5: A software for comprehensive analysis of DNA polymorphism data. Bioinformatics 2009, 25, 1451–1452. [Google Scholar] [CrossRef] [PubMed]
- Katoh, K.; Rozewicki, J.; Yamada, K.D. MAFFT online service: Multiple sequence alignment, interactive sequence choice and visualization. Brief. Bioinform. 2018, 20, 1160–1166. [Google Scholar] [CrossRef]
- Stamatakis, A. RAxML version 8: A tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 2014, 30, 1312–1313. [Google Scholar] [CrossRef]
- Li, H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. arXiv 2013, arXiv:1303.3997. [Google Scholar] [CrossRef]
- Li, H.; Handsaker, B.; Wysoker, A.; Fennell, T.; Ruan, J.; Homer, N.; Marth, G.; Abecasis, G.; Durbin, R. The sequence alignment/map format and SAMtools. Bioinformatics 2009, 25, 2078–2079. [Google Scholar] [CrossRef]
- Wilm, A.; Aw, P.P.; Bertrand, D.; Yeo, G.H.; Ong, S.H.; Wong, C.H.; Khor, C.C.; Petric, R.; Hibberd, M.L.; Nagarajan, N. LoFreq: A sequence-quality aware, ultra-sensitive variant caller for uncovering cell-population heterogeneity from high-throughput sequencing datasets. Nucleic Acids Res. 2012, 40, 11189–11201. [Google Scholar] [CrossRef]
- Daniell, H.; Lin, C.S.; Yu, M.; Chang, W.J. Chloroplast genomes: Diversity, evolution, and applications in genetic engineering. Genome Biol. 2016, 17, 134. [Google Scholar] [CrossRef]
- Wang, G.; Ren, Y.; Su, Y.; Zhang, H.; Li, J.; Han, J. Molecular marker development and phylogenetic analysis of Aconitum species based on chloroplast genomes. Ind. Crops Prod. 2024, 221, 119386. [Google Scholar] [CrossRef]
- Xue, H.; Xing, Y.; Bian, C.; Hou, W.; Men, W.; Zheng, H.; Yang, Y.; Ying, X.; Kang, T.; Xu, L. Comparative analysis of chloroplast genomes of Pulsatilla species reveals evolutionary and taxonomic status of newly discovered endangered species Pulsatilla saxatilis. BMC Plant Biol. 2024, 24, 293. [Google Scholar] [CrossRef]
- Miao, H.; Bao, J.; Li, X.; Ding, Z.; Tian, X. Comparative analyses of chloroplast genomes in ‘Red Fuji’ apples: Low rate of chloroplast genome mutations. PeerJ 2022, 10, e12927. [Google Scholar] [CrossRef]
- Li, Z.; Duan, B.; Zhou, Z.; Fang, H.; Yang, M.; Xia, C.; Zhou, Y.; Wang, J. Comparative analysis of medicinal plants Scutellaria baicalensis and common adulterants based on chloroplast genome sequencing. BMC Genom. 2024, 25, 39. [Google Scholar] [CrossRef] [PubMed]
- Li, Y.C.; Korol, A.B.; Fahima, T.; Beiles, A.; Nevo, E. Microsatellites: Genomic distribution, putative functions and mutational mechanisms: A review. Mol. Ecol. 2002, 11, 2453–2465. [Google Scholar] [CrossRef] [PubMed]
- Kapustová, V.; Tulpová, Z.; Toegelová, H.; Novák, P.; Macas, J.; Karafiátová, M.; Hřibová, E.; Doležel, J.; Šimková, H. The dark matter of large cereal genomes: Long tandem repeats. Int. J. Mol. Sci. 2019, 20, 2483. [Google Scholar] [CrossRef]
- Kumar, P.; Gupta, V.K.; Misra, A.K.; Modi, D.R.; Pandey, B.K. Potential of molecular markers in plant biotechnology. Plant Omics 2009, 2, 141–162. [Google Scholar]
- Hollingsworth, P.M.; Graham, S.W.; Little, D.P. Choosing and using a plant DNA barcode. PLoS ONE 2011, 6, e19254. [Google Scholar] [CrossRef] [PubMed]
- Hollingsworth, P.M.; Forrest, L.L.; Spouge, J.L.; Hajibabaei, M.; Ratnasingham, S.; van der Bank, M.; Chase, M.W.; Cowan, R.S.; Erickson, D.L.; Fazekas, A.J.; et al. A DNA barcode for land plants. Proc. Natl. Acad. Sci. USA 2009, 106, 12794–12797. [Google Scholar]
- Mehmood, F.; Shahzadi, I.; Waseem, S.; Mirza, B.; Ahmed, I.; Waheed, M.T. Chloroplast genome of Hibiscus rosa-sinensis (Malvaceae): Comparative analyses and identification of mutational hotspots. Genomics 2020, 112, 581–591. [Google Scholar] [CrossRef] [PubMed]
- Ramesh, G.A.; Mathew, D.; John, K.J.; Ravisankar, V. Chloroplast gene matK holds the barcodes for identification of Momordica (Cucurbitaceae) species from Indian subcontinent. Hortic Plant J. 2022, 8, 89–98. [Google Scholar] [CrossRef]
- Abouseada, H.H.; Mohamed, A.S.; Teleb, S.S.; Badr, A.; Tantawy, M.E.; Ibrahim, S.D.; Ellmouni, F.Y.; Ibrahim, M. Genetic diversity analysis in wheat cultivars using SCoT and ISSR markers, chloroplast DNA barcoding and grain SEM. BMC Plant Biol. 2023, 23, 193. [Google Scholar] [CrossRef]
- Park, J.; Min, J.; Kim, Y.; Chung, Y. The comparative analyses of six complete chloroplast genomes of morphologically diverse Chenopodium album L. (Amaranthaceae) collected in Korea. Int. J. Genom. 2021, 2021, 6643444. [Google Scholar] [CrossRef] [PubMed]







| Accession Number | Total Size (bp) | LSC Length (bp) | SSC Length (bp) | IR Length (bp) | Overall GC Content (%) | LSC GC Content (%) | SSC GC Content (%) | IR GC Content (%) |
|---|---|---|---|---|---|---|---|---|
| JZ1 | 159,278 | 86,970 | 18,346 | 26,981 | 37.88 | 36.04 | 32.22 | 42.79 |
| JZ2 | 159,278 | 86,922 | 18,346 | 27,005 | 37.88 | 36.03 | 32.22 | 42.79 |
| JZ3 | 159,300 | 86,922 | 18,346 | 27,016 | 37.88 | 36.03 | 32.22 | 42.79 |
| LQ1 | 159,286 | 86,916 | 18,334 | 27,018 | 37.89 | 36.04 | 32.24 | 42.78 |
| LQ2 | 159,286 | 86,978 | 18,334 | 26,987 | 37.89 | 36.05 | 32.24 | 42.78 |
| LQ3 | 159,286 | 86,914 | 18,334 | 27,019 | 37.89 | 36.04 | 32.24 | 42.78 |
| LW1 | 159,252 | 86,894 | 18,334 | 27,012 | 37.89 | 36.05 | 32.22 | 42.78 |
| LW2 | 159,252 | 86,898 | 18,334 | 27,010 | 37.89 | 36.05 | 32.22 | 42.78 |
| LW3 | 159,252 | 86,890 | 18,334 | 27,014 | 37.89 | 36.05 | 32.22 | 42.78 |
| QZ1 | 159,290 | 86,978 | 18,334 | 26,989 | 37.89 | 36.04 | 32.24 | 42.78 |
| QZ2 | 159,311 | 87,007 | 18,334 | 26,989 | 37.89 | 36.04 | 32.24 | 42.78 |
| QZ3 | 159,286 | 86,962 | 18,334 | 26,995 | 37.89 | 36.04 | 32.24 | 42.78 |
| HB1 | 159,281 | 86,947 | 18,332 | 27,001 | 37.89 | 36.04 | 32.24 | 42.78 |
| HB2 | 159,281 | 86,985 | 18,332 | 26,982 | 37.89 | 36.05 | 32.24 | 42.78 |
| HB3 | 159,281 | 86,911 | 18,332 | 27,019 | 37.89 | 36.04 | 32.24 | 42.78 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Zhang, S.; Du, P.; Lin, H.; Wang, M.; Li, R. Chloroplast Genome-Based Insights into Variety Identification in Toona sinensis. Agronomy 2026, 16, 127. https://doi.org/10.3390/agronomy16010127
Zhang S, Du P, Lin H, Wang M, Li R. Chloroplast Genome-Based Insights into Variety Identification in Toona sinensis. Agronomy. 2026; 16(1):127. https://doi.org/10.3390/agronomy16010127
Chicago/Turabian StyleZhang, Shuqiao, Panyue Du, Hongqiang Lin, Mingcheng Wang, and Rui Li. 2026. "Chloroplast Genome-Based Insights into Variety Identification in Toona sinensis" Agronomy 16, no. 1: 127. https://doi.org/10.3390/agronomy16010127
APA StyleZhang, S., Du, P., Lin, H., Wang, M., & Li, R. (2026). Chloroplast Genome-Based Insights into Variety Identification in Toona sinensis. Agronomy, 16(1), 127. https://doi.org/10.3390/agronomy16010127

